@BingooYang BingooYang commented Jan 19, 2026

📌 Description

Add support for the AllReduce fusion kernel on Paddle. The main changes are:

  1. Fix the behavioral inconsistency of the all_gather_object interface between Paddle and Torch.
  2. Fix the behavioral inconsistency between Paddle and Torch when tensor(device=torch.device("cuda")) is called without an explicitly set rank.
  3. Fix the inconsistent ProcessGroup paths.
  4. Add tests for the AllReduce fusion kernel based on Paddle distributed.
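
For reviewers unfamiliar with item 1: the description does not say how the all_gather_object behavior differs. One commonly encountered difference is that torch.distributed.all_gather_object fills a pre-sized output list in place, while paddle.distributed.all_gather_object appends results to the list it is given. Assuming that is the mismatch meant here, a compatibility wrapper could look roughly like the sketch below. The names gather_objects, make_fill_backend, and make_append_backend are hypothetical and are not taken from this PR; the two backends are single-process stand-ins, not real library calls.

```python
from typing import Any, Callable, List

# Hypothetical signature shared by both stand-in backends: (output_list, obj) -> None
GatherFn = Callable[[List[Any], Any], None]


def gather_objects(all_gather_fn: GatherFn, obj: Any, world_size: int) -> List[Any]:
    """Return exactly one object per rank, whichever list convention the backend uses."""
    # Pre-size the list, as a fill-in-place backend expects.
    out: List[Any] = [None] * world_size
    all_gather_fn(out, obj)
    if len(out) == 2 * world_size:
        # An appending backend left the placeholders in front; drop them.
        out = out[world_size:]
    if len(out) != world_size:
        raise RuntimeError(f"expected {world_size} objects, got {len(out)}")
    return out


def make_fill_backend(rank_objs: List[Any]) -> GatherFn:
    """Stand-in for a backend that overwrites a pre-sized list (assumed Torch-style)."""
    def all_gather(object_list: List[Any], obj: Any) -> None:
        for i, item in enumerate(rank_objs):
            object_list[i] = item
    return all_gather


def make_append_backend(rank_objs: List[Any]) -> GatherFn:
    """Stand-in for a backend that appends to the given list (assumed Paddle-style)."""
    def all_gather(object_list: List[Any], obj: Any) -> None:
        object_list.extend(rank_objs)
    return all_gather


# Simulated per-rank payloads for a world size of 2.
rank_objs = [{"rank": 0}, {"rank": 1}]
from_fill = gather_objects(make_fill_backend(rank_objs), rank_objs[0], 2)
from_append = gather_objects(make_append_backend(rank_objs), rank_objs[0], 2)
```

With a wrapper of this shape, callers get a list of length world_size from either convention, so downstream code does not need a framework-specific branch.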

🔍 Related Issues

🚀 Pull Request Checklist

Thank you for contributing to FlashInfer! Before we review your pull request, please make sure the following items are complete.

✅ Pre-commit Checks

  • I have installed pre-commit by running pip install pre-commit (or used your preferred method).
  • I have installed the hooks with pre-commit install.
  • I have run the hooks manually with pre-commit run --all-files and fixed any reported issues.

If you are unsure about how to set up pre-commit, see the pre-commit documentation.

🧪 Tests

  • Tests have been added or updated as needed.
  • All tests are passing (unittest, etc.).

Reviewer Notes

@SigureMo SigureMo merged commit 062990f into PFCCLab:paddle Jan 19, 2026
1 check failed