Skip to content
This repository was archived by the owner on May 13, 2022. It is now read-only.
This repository was archived by the owner on May 13, 2022. It is now read-only.

Implement same-thread execution model for stages #14

@j-stephan

Description

@j-stephan

While having each stage run in its own thread was all fun and games when working with CUDA, recent examinations have shown that this approach possibly downgrades performance when doing work on the host (with OpenMP for example). This is especially true for stages which have different workloads.

We should therefore be able to run multiple stages in the same thread of execution.

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions