[WebGPU] Plug memory leaks and free resources on shutdown #19315

nikhilJain17 · 2026-02-04T04:19:59Z

This diff frees wgpu::Buffers, including those held in buffer pools, on shutdown to prevent GPU memory leaks. It also fixes heap leaks: backend, backend_ctx, buffer_ctx, and decisions were allocated on the heap but never deleted. These are now either deleted explicitly (in line with the ggml lifecycle) or converted to smart pointers.

We implement destructors for our buffer pool structs, the webgpu_context struct, and the webgpu_global_context struct. Since webgpu_global_context is held through a reference-counted smart pointer, it is destroyed automatically once all thread contexts have been destroyed.
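A minimal sketch of that lifetime rule, assuming nothing about the real structs: `global_context_sketch` and `thread_context_sketch` are hypothetical stand-ins, and the flag only exists so the teardown point can be observed.

```cpp
#include <cassert>
#include <memory>
#include <vector>

// Hypothetical stand-in for webgpu_global_context (not the real struct).
struct global_context_sketch {
    int * destroyed_flag;  // set to 1 by the destructor so teardown can be observed
    explicit global_context_sketch(int * flag) : destroyed_flag(flag) {}
    ~global_context_sketch() { *destroyed_flag = 1; }  // real code would release GPU resources here
};

// Hypothetical stand-in for a per-thread context that shares the global one.
struct thread_context_sketch {
    std::shared_ptr<global_context_sketch> global;
};

// Creates n thread contexts sharing one global context, destroys them one by one,
// and returns how many contexts were alive just before the global destructor ran.
inline int contexts_alive_at_global_teardown(int n) {
    assert(n > 0);
    int destroyed = 0;
    std::vector<thread_context_sketch> contexts;
    {
        auto global = std::make_shared<global_context_sketch>(&destroyed);
        for (int i = 0; i < n; ++i) {
            contexts.push_back({ global });
        }
    }  // the local handle is gone; only the thread contexts keep the global alive
    int alive_before_teardown = -1;
    while (!contexts.empty() && !destroyed) {
        alive_before_teardown = static_cast<int>(contexts.size());
        contexts.pop_back();  // dropping the last context runs ~global_context_sketch
    }
    return alive_before_teardown;
}
```

With three contexts, the helper reports that the global destructor ran only when the last remaining context was dropped.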

We call free on all the buffers we allocate, and we explicitly free our buffer pools and debug/error/staging buffers.
Also, since we already wait explicitly on all our callbacks, shutdown does not need any extra waiting for pending callbacks.
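The pool cleanup described above could look roughly like the following. This is a sketch under stated assumptions: `fake_buffer` is a hypothetical stand-in that only models an explicit `Destroy()` call (it is not `wgpu::Buffer`), and `buf_pool_sketch` is not the PR's pool struct.

```cpp
#include <cassert>
#include <vector>

// Hypothetical stand-in for a GPU buffer handle; only models an explicit Destroy().
struct fake_buffer {
    int * live_count;         // shared counter of buffers that still hold memory
    bool  destroyed = false;
    explicit fake_buffer(int * counter) : live_count(counter) { ++*live_count; }
    void Destroy() {
        if (!destroyed) {     // destroying twice is harmless in this sketch
            destroyed = true;
            --*live_count;
        }
    }
};

// Hypothetical pool that owns its buffers and destroys them in its destructor.
struct buf_pool_sketch {
    std::vector<fake_buffer> free_bufs;

    buf_pool_sketch(int * counter, int n) {
        assert(n >= 0);
        for (int i = 0; i < n; ++i) {
            free_bufs.emplace_back(counter);
        }
    }
    buf_pool_sketch(const buf_pool_sketch &) = delete;              // one owner per pool
    buf_pool_sketch & operator=(const buf_pool_sketch &) = delete;

    ~buf_pool_sketch() {
        for (auto & buf : free_bufs) {
            buf.Destroy();    // release every pooled buffer on shutdown
        }
        free_bufs.clear();
    }
};

// Returns the number of live buffers after a pool of n buffers goes out of scope.
inline int live_buffers_after_pool_shutdown(int n) {
    int live = 0;
    {
        buf_pool_sketch pool(&live, n);
    }
    return live;
}
```

Deleting the copy operations keeps a single owner per pool, so the destructor is the only place where pooled buffers are destroyed.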

Memory leak on the heap before.

No memory leak on the heap after.


nikhilJain17 · 2026-02-04T04:21:20Z

ggml/src/ggml-webgpu/ggml-webgpu-shader-lib.hpp

    ggml_webgpu_processed_shader result;
    result.wgsl                                         = preprocessor.preprocess(shader_src, defines);
    result.variant                                      = variant;
-    ggml_webgpu_flash_attn_shader_decisions * decisions = new ggml_webgpu_flash_attn_shader_decisions();


I changed this into a shared_ptr: the raw pointer was never deallocated, so the object leaked on the heap.
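For illustration, here is one way the ownership change can work. The excerpt does not show the new field type, so the type-erased `std::shared_ptr<void>` below is an assumption, as are the names `decisions_sketch`, `processed_shader_sketch`, and the `kv_tile` field.

```cpp
#include <cassert>
#include <memory>
#include <string>

// Hypothetical decisions struct; the field and the live counter are illustrative only.
struct decisions_sketch {
    static int live;  // number of instances currently alive
    int kv_tile = 0;
    decisions_sketch() { ++live; }
    ~decisions_sketch() { --live; }
};
int decisions_sketch::live = 0;

// Hypothetical result struct holding an owning, type-erased pointer (an assumption).
struct processed_shader_sketch {
    std::string           wgsl;
    std::string           variant;
    std::shared_ptr<void> decisions;
};

inline processed_shader_sketch make_shader_sketch(int kv_tile) {
    processed_shader_sketch result;
    auto decisions = std::make_shared<decisions_sketch>();  // replaces a raw `new`
    decisions->kv_tile = kv_tile;
    result.decisions = decisions;  // converts to shared_ptr<void> and keeps the typed deleter
    return result;
}

inline int read_kv_tile(const processed_shader_sketch & shader) {
    assert(shader.decisions != nullptr);
    return std::static_pointer_cast<decisions_sketch>(shader.decisions)->kv_tile;
}
```

Because the control block remembers the concrete type, the destructor still runs when the last copy of the result goes away, even though the field itself is `void`.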

nikhilJain17 · 2026-02-04T04:23:23Z

ggml/src/ggml-webgpu/ggml-webgpu.cpp

+        this->get_tensor_staging_buf.Destroy();
+#ifdef GGML_WEBGPU_DEBUG
+        debug_host_buf.Destroy();
+        debug_dev_buf.Destroy();


I believe other wgpu members, like Instance, Device, Queue, and Pipeline, are refcounted and are released automatically once the last reference to them is dropped. But Buffers need to be explicitly destroyed.

Also, since webgpu_global_context is a shared_ptr, its destructor is called automatically once all references to it are gone.

nikhilJain17 · 2026-02-04T04:24:43Z

ggml/src/ggml-webgpu/ggml-webgpu.cpp

 #endif
+
+    delete ctx;
+    delete backend;


These are both allocated on the heap and leak if we don't delete them. They could perhaps become shared_ptr, but I don't know how that would behave once the pointer is passed around in the ggml lifecycle.
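A sketch of the explicit-delete pattern for a C-style handle with an opaque context pointer. All names here (`backend_sketch`, `backend_ctx_sketch`, and the init/free helpers) are hypothetical and do not mirror the ggml API.

```cpp
#include <cassert>

// Hypothetical backend context; the live counter only makes the leak observable.
struct backend_ctx_sketch {
    static int live;
    backend_ctx_sketch() { ++live; }
    ~backend_ctx_sketch() { --live; }
};
int backend_ctx_sketch::live = 0;

// Hypothetical backend handle storing its context as an opaque pointer.
struct backend_sketch {
    void * context;
};

inline backend_sketch * backend_init_sketch() {
    return new backend_sketch{ new backend_ctx_sketch() };
}

inline void backend_free_sketch(backend_sketch * backend) {
    if (backend == nullptr) {
        return;
    }
    assert(backend->context != nullptr);
    // Cast back to the concrete type first so the context's destructor actually runs.
    delete static_cast<backend_ctx_sketch *>(backend->context);
    delete backend;
}
```

Keeping raw pointers with a single free function means ownership stays with whoever calls the free callback, which sidesteps the question of how a shared_ptr would interact with code that only sees the raw pointer.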

nikhilJain17 added 6 commits January 30, 2026 12:15

Merge

df60497

Merge

5ae7583

Fix memory leaks in shader lib, backend, backend_context, buffer_context, and webgpu_buf_pool

4a619f5

Free pools

f2eb8b9

Cleanup

2395b8a

More cleanup

98cbfd2


github-actions bot added the ggml label (changes relating to the ggml tensor library for machine learning) Feb 4, 2026

Run clang-format

3b6a596
