Parallel-primitive based quad-tree construction on GPU and CPU using Thrust.

Reference: https://adms-conf.org/2019-camera-ready/zhang_adms19.pdf

Note: Building as a standalone library for linking into pure C++ projects still needs to be fixed.

Issues

Using the Nsight Compute and Nsight Systems tools, the following observations can be made:

Total kernel computation time is in the tens of ms; the dominant kernel operation is the sort, which took ~4 ms.
Overall execution is dominated by cudaMalloc, cudaFree, and synchronization between streams.

The code contains many allocations and deallocations of temporary vectors that the algorithm requires.
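One common way to reduce this kind of allocation overhead is to keep a scratch buffer alive and reuse it. The following is a minimal host-side sketch in plain C++, not the repository's code: `ScratchBuffer` and its members are hypothetical names, and on the GPU the same idea would have to be applied to device memory (for example through a pooled or caching allocator) rather than a `std::vector`.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper: a scratch buffer that is allocated once and then
// reused, growing only when a larger request arrives.
class ScratchBuffer {
public:
    // Returns storage for at least n floats. The contents are unspecified,
    // so callers must treat the memory as uninitialized temporary space.
    float* acquire(std::size_t n) {
        if (n > storage_.size()) {
            storage_.resize(n);  // the only place where memory is allocated
            ++allocations_;
        }
        assert(storage_.size() >= n);
        return storage_.data();
    }

    // Number of times the buffer had to grow (for illustration only).
    std::size_t allocations() const { return allocations_; }

private:
    std::vector<float> storage_;
    std::size_t allocations_ = 0;
};
```

If the largest request comes first, which would be the case when the level with the most elements is processed first, every later `acquire` call reuses the same storage and no further allocation happens.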

Summary of level on finest level

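At the lowest level, the center of mass can be computed directly from the points. For the x axis:

    x_com = SUM_i (m_i * x_i) / SUM_i (m_i)

So there are two sums to compute, and they have to be computed for each quadrant. Suppose quadrant k covers the points from i_{begin_k} to i_{end_k}. With a scan (prefix sum) over all points, the sum over a quadrant is the difference of two scan values:

    SUM(points in quadrant k) = SCAN[i_{end_k}] - SCAN[i_{begin_k}]

As written, this holds for an exclusive scan with the total appended and a half-open range [i_{begin_k}, i_{end_k}); with an inclusive scan the indices shift by one.

One possible issue is numerical precision. The points are normalized to the [0, 1] range, but for a large number of points the scan values can become quite large, and subtracting two large values before dividing may lose precision. Whether this matters in practice has not been checked.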

If masses are used, the mass-weighted values m_i * x_i should be passed to the inclusive sum, for example through a transform iterator:

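// Transform iterator yielding m_i * x_i for each point. Unit mass is used here
// as a placeholder; a real mass iterator would replace the constant iterator.
// Note: __host__ __device__ lambdas require nvcc's --extended-lambda flag.
auto begin = thrust::make_transform_iterator(
    thrust::make_zip_iterator(
        thrust::make_tuple(
            x.begin(),
            thrust::make_constant_iterator<float>(1.0f)
        )
    ),
    [] __host__ __device__ (thrust::tuple<float, float> t)
    {
        float xi = thrust::get<0>(t);  // coordinate
        float mi = thrust::get<1>(t);  // mass
        return xi * mi;
    }
);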
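As a host-side illustration of the prefix-sum approach in this section, here is a minimal sketch in plain C++ (standard library only, not Thrust). The names `prefix_weighted`, `prefix_mass`, and `quadrant_com` are hypothetical. The sketch assumes half-open quadrant ranges and accumulates in double precision, which is one possible way to limit the precision loss mentioned above, not a measured fix.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Exclusive prefix sum of m[i] * x[i] with the total appended:
// s[k] is the sum over all i < k, so s.size() == x.size() + 1.
// Accumulation is done in double (an assumption, to reduce rounding error).
std::vector<double> prefix_weighted(const std::vector<float>& x,
                                    const std::vector<float>& m) {
    assert(x.size() == m.size());
    std::vector<double> s(x.size() + 1, 0.0);
    for (std::size_t i = 0; i < x.size(); ++i) {
        s[i + 1] = s[i] + static_cast<double>(m[i]) * static_cast<double>(x[i]);
    }
    return s;
}

// Same layout for the masses alone.
std::vector<double> prefix_mass(const std::vector<float>& m) {
    std::vector<double> s(m.size() + 1, 0.0);
    for (std::size_t i = 0; i < m.size(); ++i) {
        s[i + 1] = s[i] + static_cast<double>(m[i]);
    }
    return s;
}

// Center of mass (one axis) of the points in the half-open range [begin, end).
double quadrant_com(const std::vector<double>& weighted,
                    const std::vector<double>& mass,
                    std::size_t begin, std::size_t end) {
    assert(weighted.size() == mass.size());
    assert(begin < end && end < weighted.size());
    const double total_mass = mass[end] - mass[begin];
    assert(total_mass > 0.0);
    return (weighted[end] - weighted[begin]) / total_mass;
}
```

Both prefix arrays are computed once for all points, after which each quadrant needs only two subtractions and one division per axis.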

Summary of level if we have prev

We have nlen of each child as well as the COM of each child, so the parent's COM is simply the weighted average of the children's COMs.
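The weighted average can be sketched as follows in plain C++. This is an illustration under stated assumptions, not the repository's implementation: `NodeSummary` and `combine_children` are hypothetical names, `nlen` is assumed to be the number of points in a node, and every point is assumed to have unit mass so that `nlen` can serve as the weight.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

// Hypothetical per-node summary: point count and center of mass.
struct NodeSummary {
    std::size_t nlen;  // assumed to be the number of points in the node
    double com_x;
    double com_y;
};

// Combines the four children of a quad-tree node into the parent's summary.
// Assumes unit mass per point, so each child's COM is weighted by its nlen.
NodeSummary combine_children(const std::array<NodeSummary, 4>& children) {
    NodeSummary parent{0, 0.0, 0.0};
    for (const NodeSummary& c : children) {
        if (c.nlen == 0) {
            continue;  // empty children carry no meaningful COM
        }
        // A non-empty child must carry a valid (non-NaN) COM.
        assert(c.com_x == c.com_x && c.com_y == c.com_y);
        parent.nlen += c.nlen;
        parent.com_x += static_cast<double>(c.nlen) * c.com_x;
        parent.com_y += static_cast<double>(c.nlen) * c.com_y;
    }
    if (parent.nlen > 0) {
        parent.com_x /= static_cast<double>(parent.nlen);
        parent.com_y /= static_cast<double>(parent.nlen);
    }
    return parent;
}
```

If per-point masses are used instead of unit masses, the total mass of each child would take the place of `nlen` as the weight.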
