Skip to content

FYI: Here are some benchmarks on my laptop #2

@magland

Description

@magland

This is not an issue, just a report for your info. Below is the result of running the run_all_tests.sh script 4 times on my laptop -- Dell Precision 7530.

(This excludes MATLAB)

(weirdly if I run fortran test on its own, rather than in the context of the script, I generally get higher rate - ranging between 1.7 and 2 Gpair/sec)

vscode@dc13b6d0654a:/workspaces/floatingspeed/lap3dkernel$ ./run_all_tests.sh 
C++SIMD:
5 repetitions each run...
N=10000, M=10000. manual VCL SIMD avx512, ans: 470638087.690212
                                time: 0.189 s           2.64 Gpair/sec
N=10000, M=10000. manual VCL SIMD avx2 (256), ans: 470638087.690212
                                time: 0.183 s           2.74 Gpair/sec
N=10000, M=10000. target outer loop, ans: 470638087.690212
                                time: 0.424 s           1.18 Gpair/sec
N=10000, M=10000. source outer loop, ans: 470638087.690212
                                time: 0.641 s           0.78 Gpair/sec
FORTRAN:
./run_all_tests.sh: line 10: gfortran: command not found
 ns=       10000    nt=       10000
 ntest =           20  ...
 tot=   7549471.0676897569     
   100000000 src-targ pairs in   1.26053715     s:   1.58662510     Gpair/s
JULIA:
targ-vec: 100000000 src-targ pairs in 1.96 s: 0.0511 Gpair/s
devec: 100000000 src-targ pairs in 0.28 s: 0.357 Gpair/s
devec par: 100000000 src-targ pairs in 0.0532 s: 1.88 Gpair/s
PYTHON:
native: 100000000 src-targ pairs in 2.04 s: 0.049 Gpair/s
numba:  100000000 src-targ pairs in 0.047 s: 2.13 Gpair/s
pot err numba vs native:  6.82e-13
MATLAB:
./run_all_tests.sh: line 22: matlab: command not found
OCTAVE:
octave: X11 DISPLAY environment variable not set
octave: disabling GUI features
test 1/r kernel in 3D. N=10000, M=10000...
targ-outer:     u(test)=-204.925905525712        t=1.58 s        0.0635 Gpair/s
src-outer:      u(test)=-204.925905525712        t=1.47 s        0.0681 Gpair/s
dense-matvec:   u(test)=-204.9259055257115       t=3.51 s        0.0285 Gpair/s




vscode@dc13b6d0654a:/workspaces/floatingspeed/lap3dkernel$ ./run_all_tests.sh 
C++SIMD:
5 repetitions each run...
N=10000, M=10000. manual VCL SIMD avx512, ans: 470638087.690212
                                time: 0.131 s           3.82 Gpair/sec
N=10000, M=10000. manual VCL SIMD avx2 (256), ans: 470638087.690212
                                time: 0.137 s           3.64 Gpair/sec
N=10000, M=10000. target outer loop, ans: 470638087.690212
                                time: 0.297 s           1.68 Gpair/sec
N=10000, M=10000. source outer loop, ans: 470638087.690212
                                time: 0.711 s           0.703 Gpair/sec
FORTRAN:
./run_all_tests.sh: line 10: gfortran: command not found
 ns=       10000    nt=       10000
 ntest =           20  ...
 tot=   7453247.5122789210     
   100000000 src-targ pairs in   1.23374486     s:   1.62108076     Gpair/s
JULIA:
targ-vec: 100000000 src-targ pairs in 1.97 s: 0.0507 Gpair/s
devec: 100000000 src-targ pairs in 0.258 s: 0.388 Gpair/s
devec par: 100000000 src-targ pairs in 0.0501 s: 2 Gpair/s
PYTHON:
native: 100000000 src-targ pairs in 1.97 s: 0.0508 Gpair/s
numba:  100000000 src-targ pairs in 0.0437 s: 2.29 Gpair/s
pot err numba vs native:  6.82e-13
MATLAB:
./run_all_tests.sh: line 22: matlab: command not found
OCTAVE:
octave: X11 DISPLAY environment variable not set
octave: disabling GUI features
test 1/r kernel in 3D. N=10000, M=10000...
targ-outer:     u(test)=-287.8825547996272       t=1.46 s        0.0683 Gpair/s
src-outer:      u(test)=-287.8825547996272       t=1.46 s        0.0687 Gpair/s
dense-matvec:   u(test)=-287.8825547996273       t=3.53 s        0.0283 Gpair/s




vscode@dc13b6d0654a:/workspaces/floatingspeed/lap3dkernel$ ./run_all_tests.sh 
C++SIMD:
5 repetitions each run...
N=10000, M=10000. manual VCL SIMD avx512, ans: 470638087.690212
                                time: 0.161 s           3.11 Gpair/sec
N=10000, M=10000. manual VCL SIMD avx2 (256), ans: 470638087.690212
                                time: 0.17 s            2.93 Gpair/sec
N=10000, M=10000. target outer loop, ans: 470638087.690212
                                time: 0.425 s           1.18 Gpair/sec
N=10000, M=10000. source outer loop, ans: 470638087.690212
                                time: 0.663 s           0.755 Gpair/sec
FORTRAN:
./run_all_tests.sh: line 10: gfortran: command not found
 ns=       10000    nt=       10000
 ntest =           20  ...
 tot=   7421530.2827616464     
   100000000 src-targ pairs in   1.27976596     s:   1.56278574     Gpair/s
JULIA:
targ-vec: 100000000 src-targ pairs in 1.93 s: 0.0519 Gpair/s
devec: 100000000 src-targ pairs in 0.264 s: 0.379 Gpair/s
devec par: 100000000 src-targ pairs in 0.0523 s: 1.91 Gpair/s
PYTHON:
native: 100000000 src-targ pairs in 1.96 s: 0.0511 Gpair/s
numba:  100000000 src-targ pairs in 0.0512 s: 1.95 Gpair/s
pot err numba vs native:  5.12e-13
MATLAB:
./run_all_tests.sh: line 22: matlab: command not found
OCTAVE:
octave: X11 DISPLAY environment variable not set
octave: disabling GUI features
test 1/r kernel in 3D. N=10000, M=10000...
targ-outer:     u(test)=-143.9898175208911       t=1.45 s        0.0688 Gpair/s
src-outer:      u(test)=-143.9898175208911       t=1.47 s        0.0679 Gpair/s
dense-matvec:   u(test)=-143.9898175208914       t=3.53 s        0.0283 Gpair/s



vscode@dc13b6d0654a:/workspaces/floatingspeed/lap3dkernel$ ./run_all_tests.sh 
C++SIMD:
5 repetitions each run...
N=10000, M=10000. manual VCL SIMD avx512, ans: 470638087.690212
                                time: 0.121 s           4.13 Gpair/sec
N=10000, M=10000. manual VCL SIMD avx2 (256), ans: 470638087.690212
                                time: 0.115 s           4.36 Gpair/sec
N=10000, M=10000. target outer loop, ans: 470638087.690212
                                time: 0.298 s           1.68 Gpair/sec
N=10000, M=10000. source outer loop, ans: 470638087.690212
                                time: 0.593 s           0.843 Gpair/sec
FORTRAN:
./run_all_tests.sh: line 10: gfortran: command not found
 ns=       10000    nt=       10000
 ntest =           20  ...
 tot=   7469534.6971662715     
   100000000 src-targ pairs in   1.24486721     s:   1.60659707     Gpair/s
JULIA:
targ-vec: 100000000 src-targ pairs in 1.96 s: 0.0511 Gpair/s
devec: 100000000 src-targ pairs in 0.259 s: 0.386 Gpair/s
devec par: 100000000 src-targ pairs in 0.0506 s: 1.98 Gpair/s
PYTHON:
native: 100000000 src-targ pairs in 1.98 s: 0.0504 Gpair/s
numba:  100000000 src-targ pairs in 0.0494 s: 2.02 Gpair/s
pot err numba vs native:  4.55e-13
MATLAB:
./run_all_tests.sh: line 22: matlab: command not found
OCTAVE:
octave: X11 DISPLAY environment variable not set
octave: disabling GUI features
test 1/r kernel in 3D. N=10000, M=10000...
targ-outer:     u(test)=120.4067525416816        t=1.43 s        0.0699 Gpair/s
src-outer:      u(test)=120.4067525416816        t=1.46 s        0.0687 Gpair/s
dense-matvec:   u(test)=120.4067525416806        t=3.55 s        0.0282 Gpair/s

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions