Shader Generator ND_mix_bsdf optimization by ld-kerley · Pull Request #2499 · AcademySoftwareFoundation/MaterialX

ld-kerley · 2025-07-22T21:38:44Z

Taking inspiration from #2459, which is a significant win in terms of HW shader performance. Instead of optimizing the node graph directly, here we apply the same idea, but programmatically at shader generation time.

If we meet the following condition.

"A mix node that has BSDF type inputs, and if those two nodes have weight inputs themselves."

Then we can replace the mix node with an add and modify the weight inputs, by multiplying by the mix input, and the inverse of the mix input.

Old nodegraph

---
config:
      theme: redux
---
flowchart TD
    weightA{{"weight_A"}}
    weightB{{"weight_B"}}
    mix{{"mix"}}
    BsdfA["BSDF_A"]
    BsdfB["BSDF_B"]
    Mix["Mix"]
    weightA --> BsdfA
    weightB --> BsdfB
    BsdfA --> Mix
    BsdfB --> Mix
    mix --> Mix

New nodegraph

flowchart TD
    weightA{{"weight_A"}}
    weightB{{"weight_B"}}
    mix{{"mix"}}
    Invert["Invert"]
    MultA["Multiply"]
    MultB["Multiply"]
    BsdfA["BSDF_A"]
    BsdfB["BSDF_B"]
    Add["Add"]
    mix --> Invert
    weightA --> MultA
    mix --> MultA
    weightB --> MultB
    Invert --> MultB

    MultA --> BsdfA
    MultB --> BsdfB

    BsdfA --> Add
    BsdfB --> Add

This allows the BSDF nodes to early out if their contribution is not necessary in the mix.

Taking this approach has a couple of advantages:

We programmatically match all instances of this possible optimization, not just the few that are hand crafted.
We can leave the uber shader nodegraphs, in their original state. The use of the mix node better conveys the idea behind the nodegraph.
Its possible that some backends might be able to optimize the original graph more effectively than the current hand-crafted representation.

jstone-lucasfilm · 2025-07-23T01:59:55Z

I'm very excited about the future of this work, @ld-kerley, and my main recommendation would be that we consider it a new investment of research and engineering effort for MaterialX 1.39.5, so that we have the time to validate this approach for all MaterialX shading models, from both a performance and visual parity perspective.

Once we're confident that this gives us the same or better performance as #2483, #2459, and #2467, we can replace all of those manual optimizations with this automated approach, with the expectation that other shading models (even custom studio models) will benefit in the same way.

kwokcb · 2025-07-23T17:36:10Z

Hi @ld-kerley ,
What are your thoughts on integrations that don't use shader generation and hence won't ever see / use this optimization.
It feels like this can be a pre-processing step or even a stand-alone optimization utility.
It can still be integrated here to automatically perform these (or other optimizations) but is not directly tied to code generation.

fpsunflower · 2025-08-19T18:15:13Z

I'm not super familiar with the details of codegen here, but can you explain what is different between the two implementations?

What can the add/mult nodes do that the mix node cannot ?

ld-kerley · 2025-08-20T20:37:41Z

Related to #2480

ld-kerley · 2025-08-20T20:50:33Z

@fpsunflower Its not so much a feature of codegen, as much as function of the HW shader language backends here. The implementation of the BSDF nodes that accept a weight input are written to early out if the weight is zero. In the mix case the weight is often not zero, but unfolding the mix math and moving the weight to be combined with the weight input to the BSDF node does often end up with zero weights in the BSDF nodes. In a language like OSL I would expect the OSL optimizer to be able to fold away the unused branches if the mix value is set to 0 or 1.

@bernardkwok today anyone not using codegen wouldn't have access to the optimization in this form. We could pre-process nodegraphs during build with this logic if we wanted. Personally I'm not a fan of optimizing the nodegraphs, when we've only validated it a performance improvement in subset of the backend languages.

I actually wonder if we should put this optimization behind a code gen option switch to make it easier for downstream consumers to profile either way and make their own decision if they want it.

If we grow the number of these sorts of optimizations, which I hope we do, there are a number of other really valuable ones we used at Imageworks, then I can imagine a world where perhaps downstream consumers could run "code gen" to generate an optimized version of the graph - but not go all the way to shader code. I think this might actually be an important part of any future overhaul of the code gen system, to centralize the graph based construction and optimization, and allow an "exit-point" here. But this is all me just shooting from the hip.

fpsunflower · 2025-08-21T16:26:24Z

Ah I see now -- the issue is the mix happens after the BSDFs are already evaluated, so there's no chance to short-circuit them. In the new layout they can be. This makes sense.

ld-kerley · 2025-08-29T18:25:24Z

Just for some more context - Slack conversations with Kai Rohmer here, suggest that at least in MDL this optimization might not be ideal. So we should refactor things here to be constrained to the HW shading languages for now.

ld-kerley · 2025-09-22T18:19:10Z

I've added a GenOption based option to toggle this behavior on and off. It's also enabled for the GLSL/MSL code generators used in MaterialXView and MaterialXGraphEditor.

Before this is merged the specialized versions of the graph should be removed from (libraries/bxdf/genglsl) - but I'm leaving them here for now to allow easier A/B testing between the modified nodegraph approach and the programmatic approach in this PR.

jstone-lucasfilm · 2025-11-20T02:31:49Z

@ld-kerley Following up on our last MaterialX TSC meeting, I'd love to bring this PR back into focus again, so that we can make it available as a platform for #2680 and further generator optimizations in the future.

When you have a moment, would you mind resolving the handful of merge conflicts in MaterialXGenShader?

I can then take the next step and compare the performance of these latest generator optimizations against the manually optimized shading model graphs, and we can work through any minor differences that may remain.

…n mixed with 2 nodes. Add possible refactor of gltf_pbr that allows it to match the existing optimization rule Add checkbox to MaterialXView to control bsdf mix optimization add GenOptions option to control the optimization remove trailing underscores from node names - because these still make variables with double underscores. Fix asan crash. remove debug print remove double underscores from node names - appears to be illegal. Refactor OpenPBR nodegraph optimization in to a ShaderGenerator optimization step for ND_mix_bsdf. Replace the mix with an add and modify the upstream weight inputs accordingly to perform the same math as a mix. This allows the BSDF nodes to early out if their contribution is not necessary in the mix.

ld-kerley · 2025-11-20T18:47:54Z

I rebased and pushed - and re-ran the test suite locally - but didn't have time to validate the optimizations again - I think they should still be valid, but it would be really interesting to find a way to add a unit test to verify they are applied correctly.

source/MaterialXGenShader/ShaderGraph.cpp

Disconnect outputs when removing a node

jstone-lucasfilm

This is looking good, @ld-kerley! In my local tests, I was not able to detect any performance differences between the manual and automatic optimizations, which is a great sign, and I had just two minor recommendations to match existing behavior and conventions.

source/MaterialXGraphEditor/RenderView.cpp

jstone-lucasfilm · 2025-12-03T00:24:33Z

source/MaterialXGenShader/GenOptions.h


+    /// Analyse the graph of ShaderNodes and replace any ND_mix_bsdf nodes
+    /// with a linear combination of their weighted inputs
+    bool optReplaceBsdfMixWithLinearCombination;


Since many of our existing generator options also influence performance, I'd recommend omitting the opt prefix here, and just naming the flag replaceBsdfMixWithLinearCombination.

I think we're going to be adding a number of additional "optimization" steps - so I think it would be helpful to catagorize them as such. Especially as it hints as where the code lives in the shader generator as well.

Are there other existing options that are really steps that optimize the generated material in some way? or are they more options where performance level can be tuned to user need?

One that fits this category is elideConstantNodes. An optimization to remove any nodes classified as constants, which is also done in ShaderGraph::optimize().

we should make a note to prefix this with opt as well at some point when we can break the interface then.

I'm not sure an opt prefix for all optimization-related flags is a good naming convention, since so many of our shader generation flags influence performance, but also have subtle impacts on visual fidelity and other properties of rendering.

For now, I'd recommend following the pattern in @niklasharrysson's example of elideConstantNodes, where the name of the flag compactly describes what it does, without aiming to group all of our flags into optimization, visual fidelity, or other categories.

I interpret opt as the later definition here. ShaderGraph changes that may or may not affect performance, but are explicitly intended not to affect the final image. Something that very well may need to be tuned, depending on render destination.

I think that elideConstants is the only existing option that would need to be reclassified. Because there is only one, perhaps we can just add an optElideContants option that takes precedence if it's enabled. This would allow for a graceful deprecation of the old name, and allow us to move forward with the proposed classification definition Niklas outlines above.

This sounds good to me 👍

@ld-kerley @ppenenko @niklasharrysson I like this direction forward, and the idea of a prefix that purely represents graph refactoring steps seems like a natural evolution of language-specific prefixes such as hw and osl. Graph refactoring steps can be enabled in any generated shading language, though naturally their value may be greater in some environments and languages than others.

In order to create an intuitive connection between the new option prefix and its expected behavior, what would you think about the new prefix string being graph? To my mind, this seems like a self-documenting convention that users would quickly understand, and it allows no ambiguity over whether any future added option belongs in this category or not.

@jstone-lucasfilm TBH graph sounds ambiguous to me, mostly because there are concepts of Node Graph and Shader Graph. Could it be shaderGraph or sg?

Sure, I would be open to any variation on this idea, as long as we're communicating the intended meaning of a "shader graph refactoring step".

I'll leave the specific choice between graph, shaderGraph, and sg to the group, though in the world of VFX and real-time rendering, I usually think of sg as referring to "scene graph" rather than shader graph.

ppenenko · 2026-01-22T21:47:12Z

source/MaterialXGenShader/ShaderGraph.cpp

        else if (node->hasClassification(ShaderNode::Classification::DOT))
        {
            // Filename dot nodes must be elided so they do not create extra samplers.
            ShaderInput* in = node->getInput("in");
            if (in && in->getType() == Type::FILENAME)
            {
                bypass(node, 0);
                ++numEdits;
            }
        }


By the way, this elision seems essential not just for optimization, but also for correctness: filenames or samplers can't be local variables in GLSL and HLSL. So there can't be a Dot node whose result is cached in a local.

ppenenko · 2026-01-26T14:47:37Z

Should a test run be added with the optimization enabled?

ld-kerley requested a review from niklasharrysson July 22, 2025 22:27

ld-kerley force-pushed the openpbr-graph-optimization-as-shader-generator-step branch from c2ae4e4 to 206c44d Compare September 22, 2025 18:10

ld-kerley force-pushed the openpbr-graph-optimization-as-shader-generator-step branch from 206c44d to 7724d4f Compare September 22, 2025 20:08

ppenenko mentioned this pull request Nov 14, 2025

Lobe Pruning optimization in MaterialX proper #2680

Open

ld-kerley force-pushed the openpbr-graph-optimization-as-shader-generator-step branch from e6c2f0b to e459f73 Compare November 20, 2025 18:47

ppenenko added 2 commits November 29, 2025 12:37

Disconnect outputs when removing a node

b434dcd

Revert other attempted fixes

cf1e038

ppenenko reviewed Nov 29, 2025

View reviewed changes

source/MaterialXGenShader/ShaderGraph.cpp Show resolved Hide resolved

Merge pull request #3 from ppenenko/ppenenko/fix_remove_node

b3f9742

Disconnect outputs when removing a node

jstone-lucasfilm reviewed Dec 3, 2025

View reviewed changes

ppenenko mentioned this pull request Jan 8, 2026

Add Perfetto tracing and a test output directory option #2742

Merged

ppenenko reviewed Jan 22, 2026

View reviewed changes

Conversation

ld-kerley commented Jul 22, 2025

Old nodegraph

New nodegraph

Uh oh!

jstone-lucasfilm commented Jul 23, 2025

Uh oh!

kwokcb commented Jul 23, 2025

Uh oh!

fpsunflower commented Aug 19, 2025

Uh oh!

ld-kerley commented Aug 20, 2025

Uh oh!

ld-kerley commented Aug 20, 2025

Uh oh!

fpsunflower commented Aug 21, 2025

Uh oh!

ld-kerley commented Aug 29, 2025

Uh oh!

ld-kerley commented Sep 22, 2025

Uh oh!

jstone-lucasfilm commented Nov 20, 2025

Uh oh!

ld-kerley commented Nov 20, 2025

Uh oh!

Uh oh!

jstone-lucasfilm left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ppenenko commented Jan 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants