Node.js servers being down fails the CI workflows #20

Closed
sheerlox opened this issue Dec 18, 2024 · 6 comments · Fixed by #23
Labels: bug, released on @alpha

Comments

@sheerlox (Owner)

When using Nodelix in CI (e.g. through semantic_release), sometimes the Node.js servers are down and the whole CI fails.

We should consider implementing an exponential backoff retry to mitigate that issue and eliminate the need to manually relaunch the workflow (sometimes multiple times).
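For illustration, here is a minimal sketch of the kind of exponential backoff retry this suggests. The module name, function name, and delay values are hypothetical, not Nodelix's actual API.

```elixir
# Minimal exponential backoff sketch. Retries `fun` up to `max_attempts`
# times, doubling the delay between attempts (500 ms, 1 s, 2 s, ...).
defmodule RetryExample do
  @base_delay_ms 500

  def with_backoff(fun, max_attempts \\ 3, attempt \\ 1) do
    case fun.() do
      {:ok, _} = ok ->
        ok

      {:error, _reason} when attempt < max_attempts ->
        Process.sleep(@base_delay_ms * Integer.pow(2, attempt - 1))
        with_backoff(fun, max_attempts, attempt + 1)

      {:error, _} = error ->
        error
    end
  end
end

# Wrapping the flaky download (download_node/0 is a placeholder):
# RetryExample.with_backoff(fn -> download_node() end)
```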

@sheerlox added the bug label on Dec 18, 2024
@Lucassifoni (Collaborator) commented Jan 2, 2025

WIP here: alzo-archi@9ab9452

The idea would be to make the backoff optional (the default max_attempts \\ 1 param makes it opt-in), so people can control this behavior from the outside (personally, I would launch the install in an Oban job as part of a workflow DAG).

Would that be a direction that suits you?

In my next commits I'll separate the actual downloading from this, so the backoff can be unit tested on its own.
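As a rough sketch of that shape (the module name is hypothetical, and RetryExample.with_backoff/2 is the illustrative helper from earlier in this thread, not Nodelix's actual code): with the default of 1 the download runs exactly once, so callers that already manage retries (for instance an Oban job in a workflow DAG) keep full control, while others can opt in to the backoff.

```elixir
# Opt-in backoff: the default `max_attempts \\ 1` means "no retries"
# unless the caller asks for them. The download itself is passed in as
# a function so the retry logic can be unit tested without the network.
defmodule InstallerSketch do
  def install(download_fun, max_attempts \\ 1) do
    RetryExample.with_backoff(download_fun, max_attempts)
  end
end

# From an Oban job / workflow DAG (the caller handles retries itself):
#   InstallerSketch.install(fn -> download_node() end)
# From a plain CI run that wants the backoff:
#   InstallerSketch.install(fn -> download_node() end, 5)
```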

@sheerlox (Owner, Author) commented Jan 9, 2025

Looks great, I like the direction this is taking!

Regarding the default for max_attempts, I'd personally put a sane number as the default (3 or 5) and leave specific use cases (like running in a DAG) to override it to 1.
I think setting the default to 1 would only make sense if async installation were the main use case, which I don't think it is for now.

What do you think?

@Lucassifoni (Collaborator)

I think you're right. In that case I'd write a tighter loop that sleeps for at most 50 ms at a time and checks whether a time boundary has been exceeded, to decide whether to sleep again or trigger the retry; I seem to remember that sleeping for more than a few tens of milliseconds is suboptimal for the BEAM's preemptive scheduling.
I'll update both code and this ticket after looking that up.
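For reference, a rough sketch of that chunked-sleep idea (the module name and the 50 ms slice are just illustrative):

```elixir
# Sleep in short slices and re-check a monotonic deadline each time,
# instead of one long Process.sleep/1 call.
defmodule ChunkedWaitExample do
  @slice_ms 50

  # `deadline` is a monotonic timestamp in milliseconds.
  def wait_until(deadline) do
    now = System.monotonic_time(:millisecond)

    if now < deadline do
      Process.sleep(min(@slice_ms, deadline - now))
      wait_until(deadline)
    else
      :ok
    end
  end
end

# Wait roughly two seconds, 50 ms at a time:
# ChunkedWaitExample.wait_until(System.monotonic_time(:millisecond) + 2_000)
```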

@Lucassifoni (Collaborator)

Looks like I was wrong on that one: after searching, sleeping is generally discouraged because it's usually the wrong tool compared to message passing, but it's fine here.
I'll switch max attempts to 3 and add tests.
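A minimal ExUnit sketch of how the backoff could be unit tested once the download step is passed in as a function (RetryExample.with_backoff/2 is the hypothetical helper sketched earlier in this thread, not Nodelix's actual code):

```elixir
defmodule RetryExampleTest do
  use ExUnit.Case, async: true

  test "succeeds once a flaky download stops failing" do
    {:ok, counter} = Agent.start_link(fn -> 0 end)

    # Fail the first two calls, succeed on the third.
    flaky = fn ->
      case Agent.get_and_update(counter, fn n -> {n, n + 1} end) do
        n when n < 2 -> {:error, :servers_down}
        _ -> {:ok, :node_archive}
      end
    end

    # With the sketch's fixed delays this sleeps ~1.5 s; a real
    # implementation would likely make the delay configurable.
    assert {:ok, :node_archive} = RetryExample.with_backoff(flaky, 3)
  end

  test "gives up after max_attempts failed attempts" do
    always_down = fn -> {:error, :servers_down} end
    assert {:error, :servers_down} = RetryExample.with_backoff(always_down, 3)
  end
end
```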

@Lucassifoni (Collaborator)

Now working on that in #23.

@sheerlox (Owner, Author)

🎉 This issue has been resolved in version 1.0.0-alpha.15 🎉

The release is available on:

Your semantic-release bot 📦🚀
