We are leaving Russia. We are against aggression and war with Ukraine. It is a tragedy for our nations, it is a nightmare

Loss of SQL connection causes Job ID to increase 10k

Once a month we patch our servers. When the SQL Server that hangfire is using becomes unavailable I’ll see errors like:

2022-01-27 08:03:31,014 [ServerHeartbeatProcess #1] WARN LogExtensions.WarnException - Server :3996:8847430c encountered an exception while sending heartbeat
System.Data.SqlClient.SqlException (0x80131904): SHUTDOWN is in progress.
Login failed for user '
Cannot continue the execution because the session is in the kill state.
A severe error occurred on the current command. The results, if any, should be discarded.
Error Number:6005,State:1,Class:14

2022-01-27 08:03:55,793 [Worker #6] ERROR LogExtensions.ErrorException - Execution Worker is in the Failed state now due to an exception, execution will be retried no more than in 00:02:49
System.InvalidOperationException: Timeout expired. The timeout period elapsed prior to obtaining a connection from the pool. This may have occurred because all pooled connections were in use and max pool size was reached.

2022-01-27 08:03:59,669 [DelayedJobScheduler #1] ERROR LogExtensions.ErrorException - Execution DelayedJobScheduler is in the Failed state now due to an exception, execution will be retried no more than in 00:00:16
System.Data.SqlClient.SqlException (0x80131904): A network-related or instance-specific error occurred while establishing a connection to SQL Server. The server was not found or was not accessible. Verify that the instance name is correct and that SQL Server is configured to allow remote connections. (provider: Named Pipes Provider, error: 40 - Could not open a connection to SQL Server) —> System.ComponentModel.Win32Exception (0x80004005): The network path was not found

Eventually followed by:

2022-01-27 08:10:16,843 [CountersAggregator #1] INFO BackgroundExecution.ToRunningState - Execution CountersAggregator recovered from the Faulted state after 00:05:31.0423512 and is in the Running state now

2022-01-27 08:10:16,875 [ServerWatchdog #1] INFO BackgroundExecution.ToRunningState - Execution ServerWatchdog recovered from the Faulted state after 00:05:23.2092033 and is in the Running state now

The system seems to recover fine but my concern is that the Job IDs seem to jump dramatically. Today they went from 31242 → 41211 in one environment and 28668 → 38640 in another, an increase of about 10k.

My questions are:

  • What causes this jump?
  • Aside from shutting down the background job processor during patching, is there something I can do to prevent this?

Thank you kindly for any and all feedback.

Packages Used:
Hangfire.Console version=“1.4.2”
Hangfire.Core version=“1.7.28”
Hangfire.Ninject version=“1.2.0”
Hangfire.RecurringJobExtensions version=“1.1.6”
Hangfire.SqlServer version=“1.7.28”
Hangfire.Tags version=“1.8.1”
Hangfire.Tags.SqlServer version=“1.8.0”

Web Service queues background jobs
Web UI runs the background job server and processes jobs. Configured to always be running.

I’ve also posted these issue here for extra exposure:

I believe this articles explains what I’m seeing:

We plan to try disabling IDENTITY CACHE at the database level to see if it fixes the issues. I will report back on the outcome.