-
Notifications
You must be signed in to change notification settings - Fork 1.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
IMPORTANT: Metrics issue (Abnormal status of task) - master node automatically restarting, each worker running tasks generate abnormal
tasks
#1539
Comments
@tikazyq please take a look on that. We have created similar issue few months ago, but it has been unfortunately forgotten. |
Hi @KrystianJanas , thanks for your feedback. Thanks for using Crawlab Pro and I really appreciate your invaluable feedback. I noticed the issue as well but unfortunately there is no quick solution to solve the performance issue potentially caused by the metrics module, as the engine behind is prometheus. If you can, please record the resource consumption metrics (memory, cpu, disk io) for main processes such as In the meantime, we are near a new major release (0.7.0) which is under the final stage of testing before the formal announcement. It is supposed to have addressed the issue you mentioned, given that we have got rid of most 3rd-party middleware dependencies such as Prometheus and SeaweedFS, which are replaced with native Golang code. If you are interested in the EA, please let me know and I'll push to the latest "test" version for your trial. |
Thanks @tikazyq for your reply. |
@tikazyq any answer? we will be really glad of fastest answer us, we're using this tool each day with issues. |
@KrystianJanas please refer below the instructions.
docker pull crawlabteam/crawlab-pro:test
...
image: crawlabteam/crawlab-pro:test
...
docker compose down
docker compose up -d |
@tikazyq I tried to configure it, I successfully configured master-node, but:
Can you please check your configuration? We're missing some environments you didn't updated in |
@tikazyq in logs Master Node we don't have also update after init new worker-node / existing worker node. |
@tikazyq I solved problem related with |
@tikazyq Okay, I see you modified the ENV from But the problem related with left sidebar is still actually.
|
Describe the bug
For a long time now we have noticed a problem resulting from refreshing metrics that are collected by the main master-node from the worker-node. We are currently operating on Dockerfile, on the AWS cloud. We have 1 master-node and 8 worker-nodes.
The problem is that the master-node often restarts without any problem. After longer analyses, it turned out that the problem is "metrics", which cannot be turned off in any way, because you have not implemented such a method. It would be very useful in the application.
Sometimes it is possible to "bug" them, restarting the entire infrastructure or adding one more worker-node. But this is not a permanent solution, because by bugging the metrics, the problem is solved for 1-2 days.
The problem is that because of metrics, the worker-node often loses connection with the master-node when the task is started, which is why we get the task status "abnormal", and we have to manually check whether the task has already been completed or is still running. At the moment this is very burdensome for us, as each worker has at least 4-5 tasks running.
We're running master-node and each worker-node on the
crawlab-pro:latest
image.Master-node configuration:
Worker-node configuration:
Expected behavior
Add possibility to disable/enable metrics flag, or fix this issue.
Screenshots
The text was updated successfully, but these errors were encountered: