Change how modulus is computed

https://github.com/CERTCC/labyrinth/blob/207dbce66127b762c390ad13f101459ad95cad80/labyrinth/repo_processor.py#L201

This line uses the repo id and a modulus to decide how to split repos across parallel runs of the script. The problem is that sometimes individual runs can fail repeatedly, meaning that the same block of repos never gets worked on.

We can't just randomize it, because then we will have more than one process handling a repo.

So I'm thinking we need to add in some other factor that is constant for an individual run, but changes between runs.
Could be hour of the day, or maybe there's some run ID that can be converted to an int? The former can come from within the Python code directly, whereas the latter might require modification to the workflow scripts, unless there is some environment variable already there for the python code to use.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Change how modulus is computed #6

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Change how modulus is computed #6

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions