Today we had a job retry - even though the job was still running.
The second attempt was only 3 minutes after the initial start - basically resulting in the same job overlapping.
On the dashboard and in the database, this looked like any other “job was interrupted, so start it again” - with only one “Enqueued” entry, several “processing” entries and only one “succeeded” entry.
However, this was not due to an application pool/server crash, and in fact, the serverid of the job was identical, only the workerid was different indicating the hangfire server was not interrupted.
All hangfire settings are default - except each IIS Site instance has its own queue and the AutomaticRetryAttribute is set to 0.
Should also add - we are just using the enqueue call to instantly execute our jobs - we aren’t doing anything fancy like scheduling jobs to execute later or execute after another job at this point.
Anyone else experienced this and/or fixed it?
As far as we’re aware this hasn’t happened before, and in fact the only reason we were aware of it this time is due to an admittedly poor legacy design choice which caused incorrect data output due to the overlapping processes.