Skip to content

Conversation

@gdies
Copy link

@gdies gdies commented Jan 2, 2026

declare index as Int() when used in < operations later.

ehsanmok and others added 30 commits September 22, 2025 11:16
Update platform name and note on puzzle 9 and 10 nvidia-only
modular#86)

dtype can be passed form python to mojo as a parameter in the same way as in attention.mojo and TPB can be removed because its use in determining the block dimensions for both layernorm_kernel and add_bias_kernel unecessarily restricts the puzzle to using values of HIDDEN_DIM and OUTPUT_DIM <= TPB.
* revise p09 formatting for consistency

* renane kernal parameters for consistency

* revise p09 formatting for consistency

* explain buffer_cache_size cmd
* revise p09 formatting for consistency

* renane kernal parameters for consistency

* revise p09 formatting for consistency

* explain buffer_cache_size cmd

* fix p11 typo

* define dot product in p12

* revise p14 cmd to align with puzzle

* revise p14 cmd argument to complete

* Note accurate dot color in p16 visualization

* Revise p16 text to prevent overlap

* define softmax in p18

* Revise phrasing in p18

* Add filename to code completion steps in p18

* Add filename to code completion step in p17

* Fix p19 typo
* revise p14 cmd to align with puzzle

* revise p14 cmd argument to complete

* Update arguments for p23 BenchConfig

* Revise p24 SIMD comment for clarity

* Tweak p26 formatting

* Correct file path of p26 solution code snippets

* Correct file path of p29 solution code snippets
…lization which is currently dominating the results. (modular#102)

Updated kernels to write to different memory locations to avoid race condition and allow testing (previously functional benchmarking was only running a single warp)
* revise p14 cmd to align with puzzle

* revise p14 cmd argument to complete

* Revise p27 code organization

* Define Stencil operation

* Define SAXPY
aahrun and others added 28 commits November 6, 2025 09:35
Added a SECURITY.md file with content based on the relevant
Modular web page.
…odular#127)

* Begin migration of enqueue_function to enqueue_function_checked.

* Fix typo.

* Update to MutAnyOrigin, ImmutAnyOrigin.

* Formatting fix.
* Fix DeviceContext enqueue_fill() no longer returning self

This is a change due to
modular/modular@ce7e4d6#diff-e53a900e59316a16c5793137123d7dc10021feb176cdcc3987153ca8be53f7b8

Updated all locations of enqueue_fill()

Before:
```
    var output_buffer = ctx.enqueue_create_buffer[DType.int](
        buffer_size
    ).enqueue_fill(9)
 ```
After:
```
    var output_buffer =
ctx.enqueue_create_buffer[DType.int](buffer_size)
    output_buffer.enqueue_fill(9)
```

* Fix Typo
* Migrate p13 and p14.

* Migrate p15 and p16 to checked functions.
…tion_checked` (modular#140)

* Migrate p18 and p22 to enqueue_function_checked.

* Migrate p25 and p26 to enqueue_function_checked.

* Migrating p09, p10, and p19 to enqueue_function_checked.

* Migrate p31, p32 to enqueue_function_checked.

* Migrate p33, p34, and the one remaining case in p09 to enqueue_function_checked.

* Migrate p21 to compile_function_checked.
Co-authored-by: raju <raju.ptvs@gmail.com>
…A A2000 to test script (modular#154)

* add nvidia A2000 (Ada Generation) to compute capability 8.9 group in GPU test script

* fix: modular#152

* ran `pixi run format`

---------

Co-authored-by: David Meaux <dmeaux@geomatys.com>
Co-authored-by: David Meaux <dmeaux@geomatys.com>
…bers to match code (modular#158)

Co-authored-by: David Meaux <dmeaux@geomatys.com>
* Embed YouTube videos for puzzles 1-3

* Increase video player margin

* Add breakpoint support
Added special note for WSL users regarding CUDA debugging tools.
* Fix command syntax in third_case.md

Corrected the command syntax for running the third case.

* Fix command syntax in third_case.md

Corrected command syntax for running the third case.
Updated shared memory representation and access patterns for Block 1, including zero padding details and condition evaluations.
ehsanmok

This comment was marked as duplicate.

Copy link
Collaborator

@ehsanmok ehsanmok left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just noticed, please rebase. There's a lot of upstream commits here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.