qwen3next : fix chunking #19321

ggerganov · 2026-02-04T08:09:16Z

rel #19305

Based on my analysis in #19305 (comment) I am just restoring the old chunking logic. This seems to resolve the reported issue.

Note that I am blindly restoring the old code, and still haven't understood in details what this logic actually does. So extra look, logprobs tests and validation would be needed before we merge this. Draft for now.

ggerganov · 2026-02-04T11:26:19Z

Superseded by #19324

ggerganov · 2026-02-04T11:28:28Z

src/models/qwen3next.cpp

            ggml_row_size(core_attn_out->type, S_v),
            ggml_row_size(core_attn_out->type, S_v * chunk_size * n_chunks),
            ggml_row_size(core_attn_out->type, S_v * chunk_size * n_chunks * H_v), 0);
-    output_tokens = ggml_cont(ctx0, output_tokens);


@ngxson Btw, this cont seems redundant still.

Anyway, not very important. I think there is a lot to improve in this graph - will take a look in the next days.

qwen3next : fix chunking

1213a03

ggerganov mentioned this pull request Feb 4, 2026

Eval bug: Qwen3-Coder-Next Poor Outputs #19305

Closed

github-actions bot added the model Model specific label Feb 4, 2026

ggerganov closed this Feb 4, 2026

ggerganov commented Feb 4, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

qwen3next : fix chunking #19321

qwen3next : fix chunking #19321

ggerganov commented Feb 4, 2026

Uh oh!

ggerganov commented Feb 4, 2026

Uh oh!

ggerganov Feb 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

qwen3next : fix chunking #19321

qwen3next : fix chunking #19321

Conversation

ggerganov commented Feb 4, 2026

Uh oh!

ggerganov commented Feb 4, 2026

Uh oh!

ggerganov Feb 4, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant