Skip to content

[ERROR] client reporting Error CQE, when running gpunetio_verbs_put_bw example on two node. #2

@foraxe

Description

@foraxe

[ERROR] client reporting Error CQE, when running gpunetio_verbs_put_bw example on two node.
Hi, everyone.

[Description]
Run commands:
Server:

cd /workspace/gpunetio/examples/gpunetio_verbs_put_bw
LD_LIBRARY_PATH=${LD_LIBRARY_PATH}:/workspace/gpunetio/lib DOCA_GPUNETIO_LOG=6 ./gpunetio_verbs_put_bw -g 08:00.0 -d mlx5_bond_0

Client:

cd /workspace/gpunetio/examples/gpunetio_verbs_put_bw
LD_LIBRARY_PATH=${LD_LIBRARY_PATH}:/workspace/gpunetio/lib DOCA_GPUNETIO_LOG=6 ./gpunetio_verbs_put_bw -g 08:00.0 -d mlx5_bond_0 -c x.x.x.x

When run on single node in tmux, the example returns Sample finished successfully and give the BW results.

But when run on two node, the client return Error CQE. And i also tried -p 1 mode, same errors.

ps: I am trying this example for solving the issue in NIXL gpunetio backend ai-dynamo/nixl#952

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions