SPOOKY

             ____________
           --            --
         /                  \\
        /                    \\
       /     __               \\
      |     /  \       __      ||
      |    |    |     /  \     ||
            \__/      \__/
     |             ^            ||
     |                          ||
     |                          ||
    |                            ||
    |                            ||
    |                            ||
     \__         ______       __//
        \       //     \_____//
         \_____//

Description

Pseudospectral code to do HD/MHD simulations on a triply-periodic box on (one) NVidia GPU with CUDA. Largely inspired by the Snoopy code (https://ipag.osug.fr/~lesurg/snoopy). Work in progress.

Prerequisites

The current implementation of SPOOKY requires:

a CUDA compiler (tested with cuda-11.8 and cuda-12.0)
CUDA toolkit
cmake (minimum 3.24)
Python 3.+ with numpy, matplotlib, argparse (necessary for some tests)
libconfig and HDF5 libraries (can be installed automatically if not present)

Installation

git clone git@github.com:LorenzoLMP/spooky-git.git
cd spooky-git

Compiling with cmake (instructions to compile and run on newton cluster to follow)

Create build directory if not already present for out-of-source build (recommended)

$ mkdir build
$ cd build

A typical build command looks like this:

$ cmake -DBUILD_TESTS=ON -DCMAKE_CUDA_COMPILER=/path/to/cuda/bin/nvcc -DHDF5_ROOT=/path/to/hdf5/ -DLIBCONFIG_ROOT=/path/to/libconfig/ -DCMAKE_CUDA_ARCHITECTURES="XX" ..

The cuda architectures have to be chosen based on the hardware that is available. 75 for NVIDIA Quadro RTX 8000, 80 for A100.
Depending on the version of your default g++ compiler, it might be incompatible with the .... If so, add the option -DCMAKE_CUDA_FLAGS="-ccbin /path/to/g++" with the path to a compatible version of g++
If you don't want to build the tests, simply do -DBUILD_TESTS=OFF or omit.
If you don't have libconfig or hdf5 installed, omit the option -DLIBCONFIG_ROOT or -DHDF5_ROOT and CMake will attempt to automatically donwload and build the appropriate version of the libraries.

If the configuration step was successful, now simply compile as:

$ make clean && make -j 8

The SPOOKY executable can be run as

$ ./src/spooky --input-dir /path/to/input/dir

Running tests

If you want to run the tests (-DBUILD_TESTS=ON) do instead: (NOTE: to verify the sts scalings you need to comment out certain parts in the code, replacing the RK3 with Forward Euler)

$ ctest -V -R "spooky" -E "sts"

which will run all the spooky tests and show the output.

On local laptop (last update: 2025-01-05)

cmake -DBUILD_TESTS=ON ..

On Newton (last update: 2024-11-17)

For interactive jobs:

srun -p a100 --gres=gpu:1 --job-name "GpuInteractiveJob" --time=04:00:00 --pty bash

$ source load_modules

cd build
rm -rf *

$ cmake -DBUILD_TESTS=ON -DCMAKE_CUDA_COMPILER=/usr/local/cuda-12.5/bin/nvcc -DCMAKE_CUDA_FLAGS="" -DCMAKE_CXX_FLAGS="-O3 -std=c++2a" -DHDF5_ROOT=/home/lperrone/myhdf5/hdf5/ -DLIBCONFIG_ROOT=/home/lperrone/mylibconfig/libconfig/ -DCMAKE_CUDA_ARCHITECTURES="80" ..

$ make clean && make -j 8

or just for one executable:

$ make clean && make spooky -j 8

For the generic test problem:

./problems/generic/spooky --input-dir ../problems/generic/ --output-dir /lustre/lperrone/spooky/tests/tmp --stats 100

You can also submit a job using slurm as follows:

#!/bin/bash
#! Which partition (queue) should be used
#SBATCH -p a100
#SBATCH -J SPOOKY_job
#SBATCH -o job.%j.out
#SBATCH -e job.%j.err

### compute nodes
#SBATCH --nodes=1
###  MPI ranks
#SBATCH --ntasks=1
###  MPI ranks per node
#SBATCH --ntasks-per-node=1
###  tasks per MPI rank(eg OMP tasks)
#SBATCH --cpus-per-task=1
###  gpu per node
#SBATCH --gres=gpu:1

#!How much wallclock time will be required (HH:MM:SS)
#SBATCH --time=04:00:00
#SBATCH --mail-type=END,FAIL
#SBATCH --mail-user=lperrone@aip.de
##SBATCH --begin=now+16hours

source /home/lperrone/spooky-git/load_modules

./spooky-mti3d --input-dir ./ --output-dir /lustre/lperrone/spooky/tests/Pm4_beta5e5T_Re_6400_Rm_12800_Pe_1600_N2_2e-1_new --stats 1000

Steps for profiling

$ nsys start --stop-on-exit=false
$ nsys launch --trace=cuda,nvtx spooky
$ nsys stop

$ sudo -E /opt/nvidia/hpc_sdk/Linux_x86_64/23.1/profilers/Nsight_Systems/bin/nsys-ui &

File -> Open .nsys-rep

$ sudo /opt/nvidia/hpc_sdk/Linux_x86_64/23.1/profilers/Nsight_Compute/ncu --target-processes all spooky

for a single kernel

$ sudo /opt/nvidia/hpc_sdk/Linux_x86_64/23.1/profilers/Nsight_Compute/ncu --export "/home/lorenzolmp/Documents/NVIDIA Nsight Compute/report%i" --force-overwrite --target-processes all --kernel-name axpyDouble --launch-count 1 spooky

Name		Name	Last commit message	Last commit date
Latest commit History 162 Commits
CMakeFiles		CMakeFiles
libs		libs
problems		problems
src		src
tests		tests
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
Makefile		Makefile
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SPOOKY

Description

Prerequisites

Installation

Compiling with cmake (instructions to compile and run on newton cluster to follow)

Running tests

On local laptop (last update: 2025-01-05)

On Newton (last update: 2024-11-17)

Steps for profiling

About

Uh oh!

Releases

Packages

Languages

LorenzoLMP/spooky-git

Folders and files

Latest commit

History

Repository files navigation

SPOOKY

Description

Prerequisites

Installation

Compiling with cmake (instructions to compile and run on newton cluster to follow)

Running tests

On local laptop (last update: 2025-01-05)

On Newton (last update: 2024-11-17)

Steps for profiling

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages