Have PkgXferMgr set *all* jobs destined to a given Pull DP to retry after the configured interval on failure
Currently, if there are 100 jobs in DistributionJobs destined for a given Pull DP and PkgXferMgr fails to connect to the DP for the first one, only that job is marked with a retry time 20 minutes (by default) in the future. PkgXferMgr then moves on to the next jobs, all destined for the same DP, and marks each of them for retry individually. In a very busy environment with lots of Pull DPs and jobs, just 5 or 6 offline DPs are enough for this looping to prevent PkgXferMgr from scheduling most of the jobs destined for DPs that are online.
The ask is that the update to DistributionJobs set the retry time for all jobs destined for the unreachable DP (matched by DPID) to 20 minutes (or the currently configured interval) in the future. This would align with how Sender stops trying to connect to an offline remote site for an hour, and it would allow jobs for online DPs to be actioned much more efficiently.
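To make the requested behavior concrete, here is a rough sketch in Python. It is only an in-memory illustration, not PkgXferMgr's actual logic or the real DistributionJobs schema: the `Job` fields, `schedule_pass`, and the `can_connect` callback are all hypothetical names made up for this example. The idea is that the first failed connection to a DP pushes the retry time out for every job sharing that DPID, so the rest of the pass skips them without another connection attempt.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Callable, List, Optional, Tuple

# 20 minutes is the default retry interval mentioned in the request.
RETRY_INTERVAL = timedelta(minutes=20)


@dataclass
class Job:
    # Hypothetical stand-in for a DistributionJobs row; field names are assumptions.
    job_id: int
    dp_id: int
    retry_after: Optional[datetime] = None


def schedule_pass(
    jobs: List[Job],
    can_connect: Callable[[int], bool],
    now: datetime,
    retry_interval: timedelta = RETRY_INTERVAL,
) -> Tuple[List[Job], int]:
    """Run one scheduling pass; return (jobs started, connection attempts made)."""
    started: List[Job] = []
    attempts = 0
    for job in jobs:
        # Jobs already deferred into the future are skipped without connecting.
        if job.retry_after is not None and job.retry_after > now:
            continue
        attempts += 1
        if can_connect(job.dp_id):
            started.append(job)
            continue
        # Requested change: defer *all* jobs for this DP, not only the current one.
        retry_at = now + retry_interval
        for other in jobs:
            if other.dp_id == job.dp_id:
                other.retry_after = retry_at
    return started, attempts
```

With 100 jobs queued for an offline DP and 3 for an online one, this pass makes 4 connection attempts (1 failed, 3 successful) instead of 103. In a real implementation the inner loop would presumably be a single update keyed on DPID rather than a Python loop.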

1 comment
Anonymous commented
This is critical for larger environments to allow for further infrastructure consolidation and elimination of primaries.