Fix mixed attention KV cache sizing and add max sequence length results

466631b
Open

Add Trinity model family (AfmoeForCausalLM) contrib #55

Workflow runs completed with no jobs