Suppress most cli errors, but still warn if we get too many #416

plars · 2024-11-20T20:35:25Z

Description

When running a job while polling with the cli, there may be an occasional network delay that can cause you to get errors like this:

ERROR: 2024-11-20 19:04:11 client.py:64 -- Timeout while trying to communicate with the server.
WARNING: 2024-11-20 19:04:11 __init__.py:874 -- Unable to retrieve job state.
unknown

These can usually be ignored because the cli will try again, but users often interpret them as a sign that something is wrong when it isn't. However, there's also a possibility that the server is unreachable for a long time, and we don't want to hide that from the user if it's happening.

I think this takes a pretty balanced approach: it silences most of these warnings and errors (except when an error is going to be fatal) while keeping a counter of consecutive timeout/connection errors. It warns the user that something could be wrong after every $TESTFLINGER_ERROR_THRESHOLD (default 3) consecutive errors, and also indicates that it will keep retrying.
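As a rough illustration of the counting behaviour described above, here is a minimal sketch. It is not the code from this PR: the `ConsecutiveErrorTracker` class, the `poll` helper, and the warning text are hypothetical names made up for the example, and only the `TESTFLINGER_ERROR_THRESHOLD` variable and its default of 3 come from the description.

```python
import os


class ConsecutiveErrorTracker:
    """Hypothetical helper: counts consecutive failures and says when to warn."""

    def __init__(self, threshold=None):
        if threshold is None:
            # Environment variable name and default taken from the PR description.
            threshold = int(os.environ.get("TESTFLINGER_ERROR_THRESHOLD", "3"))
        if threshold < 1:
            raise ValueError("threshold must be at least 1")
        self.threshold = threshold
        self.count = 0

    def record_success(self):
        """A successful request breaks the streak of consecutive errors."""
        self.count = 0

    def record_failure(self):
        """Return True on every `threshold`-th consecutive failure."""
        self.count += 1
        return self.count % self.threshold == 0


def poll(fetch_state, tracker, attempts):
    """Call fetch_state up to `attempts` times, staying quiet below the threshold.

    Returns (state, warnings); state is None if every attempt failed.
    """
    warnings = []
    for _ in range(attempts):
        try:
            state = fetch_state()
        except (TimeoutError, ConnectionError):
            if tracker.record_failure():
                warnings.append(
                    f"{tracker.count} consecutive server errors; still retrying"
                )
            continue
        tracker.record_success()
        return state, warnings
    return None, warnings
```

With a threshold of 3, the first two consecutive failures stay silent, the third produces a warning, and so does the sixth if the server is still unreachable. Any successful request resets the counter.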

Resolved issues

CERTTF-283

Documentation

Added a reference section to the documentation about the testflinger config command, and the supported configuration settings.

Web service API changes

N/A

Tests

Additional unit tests added

boukeas

The changes are very useful as it is indeed the case that users may not know how to interpret these messages.

I do believe that a requirement for 10 consecutive failures will effectively suppress all messages, even when there are actual networking issues, thus making these messages ineffective as a diagnostic tool. So my suggestions are to:

retrieve the number of messages required to display a warning from the config file, so that we are able to control it more easily
reduce the number to 5 or even 3

cli/testflinger_cli/client.py

tang-mm

Thanks for adding the reference! I just need a few clarifications on the command usage.
Also, in the previous how-to guide on authentication, we asked users to reload an .env file to update their environment variables. Should we also change that doc to recommend the method using the cli?

docs/reference/cli-config.rst

Co-authored-by: tang-mm <[email protected]>

plars requested a review from a team December 3, 2024 14:08

boukeas requested changes Dec 9, 2024

plars force-pushed the suppress-cli-server-errors branch from 6132a08 to 7cd7fe0 Compare December 9, 2024 17:10

plars requested review from boukeas and tang-mm December 9, 2024 17:15

tang-mm reviewed Dec 10, 2024

tang-mm reviewed Dec 10, 2024

plars added 3 commits January 6, 2025 11:57

Suppress most cli errors, but still warn if we get too many

f488d9c

Add a config option for setting the error threshold

caeaf7f

Add some clarification about config resolution order

cf521ec

plars force-pushed the suppress-cli-server-errors branch from 3cabd62 to 13effc5 Compare January 6, 2025 17:57

plars and others added 2 commits January 6, 2025 12:33

Fix typo in docs/reference/cli-config.rst

a4fb1a0

Co-authored-by: tang-mm <[email protected]>

Workaround strange pylint issue on github

30f2ab3

plars force-pushed the suppress-cli-server-errors branch from 13effc5 to 30f2ab3 Compare January 6, 2025 18:33

plars requested a review from tang-mm January 6, 2025 18:35

boukeas approved these changes Jan 23, 2025

plars merged commit 4ba3eee into main Jan 23, 2025
5 checks passed

plars deleted the suppress-cli-server-errors branch January 23, 2025 15:03
