Skip to content

Fix the algorithm is not correct when the number of elements is not evenly divisible by 10 or 2#1

Open
nguyenvuduc wants to merge 1 commit intocarol-coded:masterfrom
nguyenvuduc:master
Open

Fix the algorithm is not correct when the number of elements is not evenly divisible by 10 or 2#1
nguyenvuduc wants to merge 1 commit intocarol-coded:masterfrom
nguyenvuduc:master

Conversation

@nguyenvuduc
Copy link

Hi Carol-He,

I think it is a brilliant idea with a minor flaw.
I found the CUDA algorithm will miss the remainder part when the number of elements is not evenly divisible by the factor (10 or 2) that was hardcoded in the CUDA kernel. I have fixed it, and I would like to contribute to your original work.

Thank you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant