improve latency test by drossetti · Pull Request #112 · NVIDIA/gdrcopy

drossetti · 2020-03-03T00:44:08Z

print estimated bw, useful for large buffer sizes
add -d param
add warmup extra iterations and -w param

This helps when comparing performance for large buffer sizes.
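For readers following along, here is a minimal sketch of how an estimated bandwidth can be derived from a measured per-copy latency, with untimed warmup iterations run first. It is not the code from this PR: `elapsed_us`, `estimate_bw_mbps`, and `timed_copy_lat_us` are hypothetical names, and plain `memcpy` stands in for whatever copy call the test actually times.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <time.h>

// Elapsed time between two timespec samples, in microseconds.
static double elapsed_us(const timespec &beg, const timespec &end)
{
    return (end.tv_sec - beg.tv_sec) * 1e6 + (end.tv_nsec - beg.tv_nsec) / 1e3;
}

// Estimated bandwidth in MB/s (1 MB = 1e6 bytes) from a per-copy latency.
// Bytes per microsecond is numerically equal to MB per second.
static double estimate_bw_mbps(size_t copy_size, double lat_us)
{
    assert(lat_us > 0.0);
    return static_cast<double>(copy_size) / lat_us;
}

// Hypothetical helper: run `warmup` untimed copies, then time `iters` copies
// and return the average latency per copy in microseconds.
static double timed_copy_lat_us(void *dst, const void *src, size_t size,
                                int iters, int warmup)
{
    assert(iters > 0);
    for (int i = 0; i < warmup; ++i)
        std::memcpy(dst, src, size);   // warmup, not measured

    timespec beg, end;
    clock_gettime(CLOCK_MONOTONIC, &beg);
    for (int i = 0; i < iters; ++i)
        std::memcpy(dst, src, size);   // measured copies
    clock_gettime(CLOCK_MONOTONIC, &end);

    return elapsed_us(beg, end) / iters;
}
```

The estimate is only as good as the latency average, which is why it is most meaningful for large buffers where the per-call overhead is small relative to the transfer time.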

drossetti · 2020-07-31T23:43:48Z

@pakmarkthub mind having a look?

pakmarkthub · 2020-08-01T14:54:35Z

tests/copylat.cpp

                break;
            case 'h':
-                printf("syntax: %s -s <buf size> -d <gpu dev id> -w <write iters> -r <read iters> -h[help] -c[do-cuMemcpy]\n", argv[0]);
+                printf("syntax: %s [-s <buf size>][-d <gpu dev id>][-w <write iters>][-r <read iters>][-h][-c][-w]\n"


The last option should be [-W <# iterations>]. You forgot to capitalize the letter.

pakmarkthub · 2020-08-01T14:55:14Z

tests/copylat.cpp

-                printf("syntax: %s -s <buf size> -d <gpu dev id> -w <write iters> -r <read iters> -h[help] -c[do-cuMemcpy]\n", argv[0]);
+                printf("syntax: %s [-s <buf size>][-d <gpu dev id>][-w <write iters>][-r <read iters>][-h][-c][-w]\n"
+                       "-c                   benchmark cuMemcpy\n"
+                       "-w <# iterations>    modify warmup (default %d)\n",


Capitalize the letter W.
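To illustrate why the case matters: `getopt` treats `-w` and `-W` as distinct options, so the help text has to match the option string. A minimal sketch, assuming `-w` keeps its meaning of write iterations and `-W` selects warmup iterations; `LatOptions`, `parse_lat_options`, and the warmup default of 0 are hypothetical and not taken from this PR.

```cpp
#include <cstdlib>
#include <unistd.h>

// Hypothetical option container; the write/read defaults mirror the diff,
// the warmup default is an assumption.
struct LatOptions {
    int write_iters = 10000;
    int read_iters = 100;
    int warmup_iters = 0;
};

// Parse -w <write iters>, -r <read iters>, and -W <warmup iters>.
// Returns false when getopt reports an unknown option or a missing argument.
static bool parse_lat_options(int argc, char *argv[], LatOptions &opts)
{
    optind = 1; // restart scanning so the function can be called repeatedly
    int c;
    while ((c = getopt(argc, argv, "w:r:W:")) != -1) {
        switch (c) {
        case 'w': opts.write_iters = std::atoi(optarg); break;  // lowercase: write iterations
        case 'r': opts.read_iters = std::atoi(optarg); break;
        case 'W': opts.warmup_iters = std::atoi(optarg); break; // uppercase: warmup iterations
        default:  return false;
        }
    }
    return true;
}
```

With this option string, `copylat -w 500 -W 20` sets 500 write iterations and 20 warmup iterations, while a help message that advertises a second `-w` would point users at the wrong flag.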

pakmarkthub · 2020-08-01T15:12:57Z

tests/copylat.cpp

 // manually tuned...
 int num_write_iters = 10000;
 int num_read_iters = 100;
+int small_size_iter_factor = 1000;


I understand the intention and its usefulness for small sizes. However, it changes the number of iterations that users specify. Is there a better way to do this, or could you print an explanatory message? Currently, users need to read the code to learn that small sizes and large sizes use different numbers of iterations.
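One possible shape for such a message, as a sketch only: the helper below scales the requested count for small sizes and prints a note whenever it does. How the PR actually applies `small_size_iter_factor` is not shown in this hunk, so the multiplication, the `small_size_threshold` cutoff, and the `effective_iters` name are all assumptions.

```cpp
#include <cstddef>
#include <cstdio>

static const int small_size_iter_factor = 1000;   // value from the diff
static const size_t small_size_threshold = 4096;  // hypothetical cutoff in bytes

// Hypothetical helper: return the iteration count actually used for a given
// copy size, and tell the user when it differs from what they requested.
static long long effective_iters(size_t copy_size, int requested_iters)
{
    if (copy_size >= small_size_threshold)
        return requested_iters; // large sizes run exactly what was requested

    long long scaled = static_cast<long long>(requested_iters) * small_size_iter_factor;
    std::printf("note: size %zu is below %zu bytes, running %lld iterations "
                "(%d requested x factor %d)\n",
                copy_size, small_size_threshold, scaled,
                requested_iters, small_size_iter_factor);
    return scaled;
}
```

Printing the effective count next to each result would let users see the adjustment without reading the source.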

pakmarkthub · 2020-08-01T15:22:37Z

tests/copylat.cpp

    bool do_cumemcpy = false;
    struct timespec beg, end;
    double lat_us;
+    double bw;


Isn’t this redundant with copybw?

If you want to do a shmoo for bw, would it be better to rename the test? “copylat” doesn’t sound right anymore in that case.

print estimated bw in copylat

4f66be0

That helps comparing performance for large buffer sizes

drossetti requested a review from spotluri March 3, 2020 00:44

drossetti self-assigned this Mar 3, 2020

add param -d gpuid to sanity

0647993

drossetti force-pushed the fixlat branch from 53bce75 to a6115ca Compare March 3, 2020 00:47

add extra warmup iterations to latency test

a6115ca

pakmarkthub requested changes Aug 1, 2020


drossetti added this to the next milestone Aug 4, 2020

cxz66666 mentioned this pull request May 9, 2024

feat: support assign gpu id for sanity test #297

Merged

drossetti wants to merge 3 commits into master from fixlat
