Skip to content

RDNA3 not being utilised to its full potential #5

@muziqaz

Description

@muziqaz

HI,
I'm nearly done testing all of my AMD GPUs comparing them between OpenCL and HIP environments, and today it was 7900xtx turn. Here are the results and comparison vs 6900xt:

6900xt MBA (23.04TFLOPS)
  OpenCL (ns/day) HIP (ns/day) Diff.%
gbsa 967.65 1644.17 69.91%
rf 831.189 1410.187 69.66%
pme 398.526 1046.064 162.48%
apoa1rf 342.568 505.241 47.49%
apoa1pme 183.49 381.036 107.66%
apoa1ljpme 127.05 300.048 136.17%
amoebagk 2.4 37.444 1460.17%
amoebapme 12.021 16.261 35.27%
7900xtx Nitro+ (61+TFLOPS)
  OpenCL (ns/day) HIP (ns/day) Diff.% HIP (6900xt/7900xtx)
gbsa 1075.82 1812.23 68.45% 10.22%
rf 912.438 1503.63 64.79% 6.63%
pme 415.988 1103.77 165.34% 5.52%
apoa1rf 437.261 645.718 47.67% 27.8%
apoa1pme 231.098 521 125.45% 36.73%
apoa1ljpme 164.816 400.924 143.26% 33.62%
amoebagk 4.22695 42.958 916.29% 14.73%
amoebapme 17.0797 23.0998 35.25% 42.06%

Not much of the improvement going from 6900xt. I'll try to get AMD's attention to this.
Will post the rest of the GPU test results in other hip/openmm area monday most likely.
conda env built was standard. Have no knowledge on how to play around with fft backends, but I think that wouldn't change the outcome too much compared to vkfft

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions