Multiple workers processing one job

mrattray · September 11, 2015, 4:31pm

One of my jobs is showing odd behaviour, the job beings processing properly on a worker, then 30 min later another worker picks up the job, this keeps happening every 30 min until all my workers are processing the same job. I have many different types of jobs, some of which run for longer than 30 min without any problems. I am at a loss as to why this is happening, has anyone encountered anything similar?

dradovic · September 12, 2015, 7:04pm

I guess you are hitting the default timeout for SQL Server stored jobs. See: http://docs.hangfire.io/en/latest/configuration/using-sql-server-with-msmq.html.

mrattray · September 14, 2015, 2:16pm

Thanks for pointing me in the correct direction, this page seems to describe my scenario exactly, http://docs.hangfire.io/en/latest/configuration/using-sql-server.html#invisibility-timeout.

dmehta · September 14, 2015, 2:23pm

1.5 has removed the restriction of invisibility time out., see if you could upgrade your project.

mrattray · September 14, 2015, 2:25pm

excellent, I will be updating as soon as it is out of beta

dradovic · September 14, 2015, 6:37pm

@dmehta: do you know if the new 1.5 “instant re-queue” feature also makes the start of a job instant (previously there was a delay of a second or more without MSMQ)?

dmehta · September 14, 2015, 7:00pm

When using SQL server without MSMQ, the queue polling interval (QueuePollInterval) determines when the job will be processed. I think the default is 15 seconds, so the job may need to wait a max of 15 seconds before it will be picked up for processing.

With MSMQ, there is no polling so jobs will be picked up instantly.

mrattray · October 12, 2015, 3:31pm

Ok so I updated to 1.5, instead of hitting a 30 min invisibility timeout, the same jobs are now exhibiting the same behaviour after 1-3 minutes. This is a much bigger problem, most of my jobs run in under 30 minutes so was only seeing the first behaviour occasionally, now almost all my jobs take close to 5 minutes to run so having them switch workers after 1 minute means they are never completing and eating up all the workers so nothing else completes.
Anyone know what the hell is going on?

mrattray · October 12, 2015, 6:02pm

Turning off MSMQ appears to have resolved this for now.

NeerajGupta33 · March 27, 2023, 1:25pm

I am having a issue. where hangfire run muliple process for one server.

Topic		Replies	Views
Sql+MSMQ Long Running Jobs Invisibility question sql-server	2	828	February 1, 2018
Multiple workers on one job bug?	11	4987	January 30, 2020
Automatic retry when job is not completed yet? bug?	7	10842	August 4, 2015
Job keep processed, never succeed nor failed bug?	3	2567	July 27, 2015
Workers seems to hang and number of active workers slowly descrease to 0 question	2	3080	August 11, 2015

Multiple workers processing one job

Related topics