Parallel Mesh Collection #22297

aevyrie · 2025-12-29T07:20:09Z

Objective

Speed up collect_meshes_for_gpu_building, a bottleneck for scenes with many moving meshes.

Solution

Parallelize the gather step for mesh collection.
Immediately start up a task for serial collection of meshes, which cannot be parallelized.
Spawn many tasks for gathering meshes, and send batches of these to the collection task
This allows the serial collection step to start immediately, instead of being delayed until after all collection is finished.

Testing

Built a new bevymark_3d stress test for benchmarking dynamic 3d mesh scenes. This is not currently covered by our stress tests. Bevymark 3D #22298
With 200k meshes, this drops total frame times from 16.4ms to 12.3ms (-4.1ms)

Mesh collection itself drops from 7.9ms to 3.6ms (-4.3ms)

alice-i-cecile · 2025-12-29T19:34:25Z

CI failures are real, but should be easy to fix.

…ve docs

# Objective - Add a stress test that exercises the 3d mesh pipeline for dynamic scenes. - Large static scenes like caldera hotel don't expose performance issues when many meshes are moving. - Give us a way to benchmark PRs like - #22297 - #22281 - #22226 ## Solution - Make a 3d version of `bevymark`, sticking to the existing patterns as closely as possible. ## Testing <img width="1072" height="684" alt="image" src="https://github.com/user-attachments/assets/41214ba9-ffad-471d-a320-1f007490dead" /> --------- Co-authored-by: Carter Anderson <mcanders1@gmail.com>

aevyrie · 2025-12-30T03:37:47Z

@alice-i-cecile g2g now

aevyrie · 2025-12-30T03:39:14Z

Added to the milestone as it seems about equivalent to my others perf PRs that were also added.

aevyrie · 2025-12-30T03:56:15Z

~~This PR needs more thorough testing before I'd feel comfortable merging. Parallelizing isn't always a speedup and can increase the total amount of CPU work needed even if throughput increases.~~

So far, things are still looking promising after my latest round of commits

cargo rer bevymark_3d --features=debug,trace_tracy -- --benchmark --waves 250 --per-wave 1000

comparing this branch to main

frametime

collect_meshes_for_gpu_building

pcwalton

The logic looks fine, but I think with some different factoring this would be easier to maintain and check. I'm not 100% sure the refactoring is viable, but I'd like to see if we can try.

crates/bevy_pbr/src/render/mesh.rs

pcwalton

OK, I love this. This will help so much with making other parts of the system parallel, and addons and apps should be able to use this for increased parallelism too. In fact, it's essentially a big upgrade for the ECS, allowing easy parallelism in situations where par_iter() on a query isn't enough.

Thanks a bunch for taking the time to refactor it!

aevyrie · 2026-01-02T22:40:57Z

Revisiting benches after visibility optimizations merged, the improvements are still reproducible, and overall frametimes are improved thanks to the optimizations on main.

Aceeri · 2026-01-16T13:11:50Z

Profiled the changes here vs not and got some decent results on my project. The main important peaks are those 2 at the end which this PR reduces from ~22ms to ~10ms.

The faster portions are a result of flying out to lower LODs so not super representative, but maybe shows a bit of the overhead for smaller amounts of meshes (left 2 peaks are 18 meshes), increasing it from 25µs to ~35-50µs.

Aceeri

I like this, the buffered channel seems useful for my own parallel code and the recycling is something I have been doing as well but making it generic is very nice.

Solid performance improvement on my project as well: 22ms -> 10ms in the expected case.

aevyrie · 2026-01-17T01:53:58Z

Thanks for testing in your project - that gives me way more confidence this is a positive change. I'm pretty happy with how the buffered channel turned out!

github-actions · 2026-01-17T01:58:21Z

You added a new example but didn't add metadata for it. Please update the root Cargo.toml file.

aevyrie added 3 commits December 28, 2025 20:11

Channel-based mesh collection and prep.

6aafecb

Tweak chunk size

1ba8491

Revert toml formatter changes

3eba77c

aevyrie mentioned this pull request Dec 29, 2025

Bevymark 3D #22298

Merged

Fix bug in mesh material indexing

a2cfded

IceSentry added A-Rendering Drawing game state to the screen C-Performance A change motivated by improving speed, memory usage or compile times S-Needs-Review Needs reviewer attention (from anyone!) to move forward labels Dec 29, 2025

github-project-automation bot added this to Rendering Dec 29, 2025

alice-i-cecile added S-Waiting-on-Author The author needs to make changes or address concerns before this can be merged and removed S-Needs-Review Needs reviewer attention (from anyone!) to move forward labels Dec 29, 2025

aevyrie added 3 commits December 29, 2025 13:15

fix clippy lints

cd40ac6

more lints

d360eb9

reduce diff

8e4780f

tychedelia self-requested a review December 29, 2025 21:27

aevyrie added 2 commits December 29, 2025 16:17

better handle mesh re-extraction, fix single threaded deadlock, impro…

860d3cf

…ve docs

Fix doc links

8917e4d

Merge branch 'main' into par-mesh-collection

04de8fc

aevyrie added this to the 0.18 milestone Dec 30, 2025

aevyrie added 2 commits December 29, 2025 20:05

Tune chunk size

6f785cb

Refactor order and comments for readability

eca0c30

alice-i-cecile removed this from the 0.18 milestone Jan 1, 2026

pcwalton reviewed Jan 1, 2026

View reviewed changes

crates/bevy_pbr/src/render/mesh.rs Show resolved Hide resolved

crates/bevy_pbr/src/render/mesh.rs Outdated Show resolved Hide resolved

crates/bevy_pbr/src/render/mesh.rs Outdated Show resolved Hide resolved

crates/bevy_pbr/src/render/mesh.rs Show resolved Hide resolved

aevyrie added 2 commits January 1, 2026 18:16

Refactor buffered channel logic into a struct.

d1cd1e5

Async buffered channel

7735924

aevyrie commented Jan 2, 2026

View reviewed changes

crates/bevy_pbr/src/render/mesh.rs Show resolved Hide resolved

aevyrie commented Jan 2, 2026

View reviewed changes

crates/bevy_pbr/src/render/mesh.rs Show resolved Hide resolved

Fix vecs not being reused when consumed by intoiterator

69f235e

pcwalton approved these changes Jan 2, 2026

View reviewed changes

Merge branch 'main' into par-mesh-collection

23c8f08

aevyrie mentioned this pull request Jan 2, 2026

Parallel GPU buffer writes #22314

Merged

alice-i-cecile added S-Needs-Review Needs reviewer attention (from anyone!) to move forward and removed S-Waiting-on-Author The author needs to make changes or address concerns before this can be merged labels Jan 3, 2026

alice-i-cecile requested review from IceSentry, NthTensor, atlv24 and hymm January 3, 2026 03:04

Merge branch 'main' into par-mesh-collection

a3aa5a7

alice-i-cecile added this to the 0.19 milestone Jan 14, 2026

alice-i-cecile added A-Tasks Tools for parallel and async work X-Uncontroversial This work is generally agreed upon D-Modest A "normal" level of difficulty; suitable for simple features or challenging fixes labels Jan 14, 2026

Aceeri approved these changes Jan 16, 2026

View reviewed changes

Merge branch 'main' into par-mesh-collection

ae85d1f

aevyrie force-pushed the par-mesh-collection branch from b3cb07b to ae85d1f Compare January 17, 2026 02:00

aevyrie added 2 commits January 16, 2026 18:25

Fix bevy_tasks dep missing

9e4f42f

Merge branch 'main' into par-mesh-collection

f732eca

alice-i-cecile added S-Ready-For-Final-Review This PR has been approved by the community. It's ready for a maintainer to consider merging it and removed S-Needs-Review Needs reviewer attention (from anyone!) to move forward labels Jan 17, 2026

alice-i-cecile enabled auto-merge January 17, 2026 02:48

alice-i-cecile disabled auto-merge January 17, 2026 03:18

Uh oh!

Parallel Mesh Collection #22297

Are you sure you want to change the base?

Parallel Mesh Collection #22297

Uh oh!

Conversation

aevyrie commented Dec 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Objective

Solution

Testing

Uh oh!

alice-i-cecile commented Dec 29, 2025

Uh oh!

aevyrie commented Dec 30, 2025

Uh oh!

aevyrie commented Dec 30, 2025

Uh oh!

aevyrie commented Dec 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

frametime

collect_meshes_for_gpu_building

Uh oh!

pcwalton left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pcwalton left a comment

Choose a reason for hiding this comment

Uh oh!

aevyrie commented Jan 2, 2026

Uh oh!

Aceeri commented Jan 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Aceeri left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

aevyrie commented Jan 17, 2026

Uh oh!

github-actions bot commented Jan 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

aevyrie commented Dec 29, 2025 •

edited

Loading

aevyrie commented Dec 30, 2025 •

edited

Loading

Aceeri commented Jan 16, 2026 •

edited

Loading

Aceeri left a comment •

edited

Loading