Skip to content

hendrych-upenn/Project0-Getting-Started

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

43 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Project 0 Getting Started

University of Pennsylvania, CIS 5650: GPU Programming and Architecture, Project 0

  • Oliver Hendrych
  • Tested on: Windows 11, i7-14700K @ 3.4GHz 32GB, RTX 4070 SUPER 12GB (Personal)

Assignment

Include screenshots, analysis, etc. (Remember, this is public, so don't put anything here that you don't want to share with the world.)

2.1.1.1

Compute Capability: 8.9 (GeForce RTX 4070)

2.1

2.1.2

Modified CUDA Project Window:

Screenshot with Oliver Hendrych in title

2.1.3

I set to the conition to index == 1621. With a width of 800, that would mean the pixel is at x=21, y=2. With block sizes of 16x16, then the block index should be x=1, y=0, and the thread index should be x=5, y=2, which is what we observe.

Screenshot with index == 1621

2.1.4

In the analysis summary, we can see details about the configuration, and we can see that most of the CPU utilization comes from the cuda-gl-check.exe, followed by Idle

Screenshot showing configuration and cpu utilization

We can see the cuda-gl-check.exe process, and if we look at the kernel calls, we can see the createVersionVisualization kernel invokes in the event view.

Screenshot with timeline and events view

2.1.5

Note: there is a known bug relating to Nsight Compute when trying to use the kernel replay. For that reason, rather than trying to downgrade Nsight Compute (which would also involve downgrading my CUDA Toolkit version), I used the WAR which is to use Application for the replay mode. I'm not sure if it makes a difference. https://forums.developer.nvidia.com/t/error-failed-to-profile-createversionvisualization-in-process-12840/343411/16

In the summary view, we can see the createVersionVisualization kernel calls.

Screenshot of summary view

In the detailed view, we can see details about the throughput and launch statistics.

Screenshot of detailed view

2.2

In Chrome, we can see that both WebGL versions (1 and 2) are available.

Screenshot of enabled WebGL1

Screenshot of enabled WebGL2

2.3

In Chrome, we can see that WebGPU is enabled with an nvidia vendor.

Screenshot of enabled WebGPU

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C++ 48.6%
  • Cuda 33.9%
  • CMake 13.9%
  • C 2.5%
  • Makefile 1.1%