This directory contains the following tracing-related files and directories:
- npkit_dump: Contains npkit trace dump files.
- npkit_trace
  - npkit_event_trace.json: A trace dump file generated by npkit_trace_generator.py.
  - npkit_event_trace.html: Generated by the trace viewer from npkit_event_trace.json.
- example_2.cu: Performs an all-reduce collective communication across 8 GPUs.
- job_example_2.sh: Runs example_2.cu on 4 nodes with the following configuration:
  - 2 GPUs per node
  - 2 tasks per node
  - 1 GPU per task
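
The job configuration above accounts for the 8 GPUs used by example_2.cu: 4 nodes x 2 tasks per node x 1 GPU per task. The short Python sketch below only illustrates that arithmetic. It is not part of this directory, the function name task_layout is hypothetical, and it assumes ranks are assigned node by node, which the job script may or may not do.

```python
# Values taken from the job_example_2.sh description above.
NODES = 4
TASKS_PER_NODE = 2
GPUS_PER_TASK = 1


def task_layout(nodes, tasks_per_node, gpus_per_task):
    """Return one (rank, node, local_gpu_ids) tuple per task.

    Assumption: ranks are numbered node by node (block distribution).
    The actual launcher used by job_example_2.sh may order them differently.
    """
    layout = []
    for rank in range(nodes * tasks_per_node):
        node = rank // tasks_per_node          # node hosting this task
        local_task = rank % tasks_per_node     # task index within the node
        first_gpu = local_task * gpus_per_task
        gpus = list(range(first_gpu, first_gpu + gpus_per_task))
        layout.append((rank, node, gpus))
    return layout


layout = task_layout(NODES, TASKS_PER_NODE, GPUS_PER_TASK)
total_gpus = sum(len(gpus) for _, _, gpus in layout)

for rank, node, gpus in layout:
    print(f"rank {rank}: node {node}, local GPU(s) {gpus}")
print(f"total GPUs: {total_gpus}")
```

Under these assumptions each of the 8 tasks owns exactly one GPU, so the all-reduce in example_2.cu would involve one participant per GPU.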