Skip to content

Reduce firmware read timeout from 60s to 10s to prevent port breakout failure#12

Merged
PJHsieh merged 1 commit intoedge-core:202311.Xfrom
PJHsieh:Reduce_firmware_read_timeout_from_60s_to_10s_to_prevent_port_breakout_failure
Apr 18, 2025
Merged

Reduce firmware read timeout from 60s to 10s to prevent port breakout failure#12
PJHsieh merged 1 commit intoedge-core:202311.Xfrom
PJHsieh:Reduce_firmware_read_timeout_from_60s_to_10s_to_prevent_port_breakout_failure

Conversation

@PJHsieh
Copy link

@PJHsieh PJHsieh commented Apr 18, 2025

Issue:
Reading the firmware version on specific CMIS transceivers in xcvrd can take up to 60 seconds, causing delays that lead to port breakout failures.

Solution:
Reduce the maximum firmware read timeout from 60 seconds to 10 seconds to improve responsiveness and avoid port breakout issues.

Note:

  1. The firmware read timeout mentioned above is also called CDB which is defined in CMIS.
  2. The issue happens under the condition that the xcvr's CDB takes long time to response, and several dynamic port breakout mode commands are issued with a short internal.
  3. The CDB timeout 60 seconds will block the xcvrd thread and result in the situation that the work for the previous port breakout mode is not done yet but the new port breakout mode is applied.
  4. It takes too much effort and is not worthy to handle such condition. So simply reduce the timeout to a more reasonable value to avoid the error.

Description

Motivation and Context

How Has This Been Tested?

Additional Information (Optional)

…ent port breakout failure

Issue:
Reading the firmware version on specific CMIS transceivers in xcvrd can take up to 60 seconds,
causing delays that lead to port breakout failures.

Solution:
Reduce the maximum firmware read timeout from 60 seconds to 10 seconds to improve responsiveness
and avoid port breakout issues.

Note:
1. The firmware read timeout mentioned above is also called CDB which is
   defined in CMIS.
2. The issue happens under the condition that the xcvr's CDB takes long
   time to response, and several dynamic port breakout mode commands are
   issued with a short internal.
3. The CDB timeout 60 seconds will block the xcvrd thread and result in
   the situation that the work for the previous port breakout mode is not
   done yet but the new port breakout mode is applied.
4. It takes too much effort and is not worthy to handle such condition.
   So simply reduce the timeout to a more reasonable value to avoid
   the error.
@PJHsieh PJHsieh merged commit a07e618 into edge-core:202311.X Apr 18, 2025
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant