Skip to content

NVIDIA/dl-cuda-graph-doc

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CUDA Graph Best Practice For PyTorch

A comprehensive guide to using CUDA Graphs effectively with PyTorch, covering CUDA fundamentals, PyTorch integration, Megatron-LM implementations, and practical troubleshooting.

📚 Documentation

View the documentation at: https://docs.nvidia.com/dl-cuda-graph/

📖 What's Inside

  • CUDA Graph Basics: Fundamentals, constraints, and quantitative benefits
  • PyTorch CUDA Graphs: Integration, Transformer Engine & Megatron-LM, best practices, and handling dynamic patterns
  • Examples: Real-world implementations from MLPerf Training (Llama 3.1 405B, GPT-3 175B, Stable Diffusion v2, RNN-T)
  • Troubleshooting: Capture failures, numerical errors, memory issues, and performance debugging

🚀 Quick Start

# Clone and serve locally (auto-installs dependencies)
git clone https://github.com/NVIDIA/dl-cuda-graph-doc.git
cd dl-cuda-graph-doc
./scripts/sphinx-serve.sh

Visit http://127.0.0.1:8000 to view the documentation.

Manual build:

uv sync --group docs
uv run --group docs sphinx-build docs docs/_build/html

📝 Contributing

We welcome contributions! Please see CONTRIBUTING.md for guidelines.

📜 License

This project uses dual licensing:

  • Documentation (Markdown files): CC-BY-4.0 - Creative Commons Attribution 4.0 International
  • Code (Python, shell scripts, configuration): MIT

See THIRD-PARTY-NOTICES.md for third-party dependencies and their licenses.

🔗 Resources

About

A comprehensive guide to using CUDA Graphs effectively with PyTorch, covering CUDA fundamentals, PyTorch integration, Megatron-LM implementations, and practical troubleshooting.

Topics

Resources

License

Unknown, Unknown licenses found

Licenses found

Unknown
LICENSE
Unknown
LICENSE-DOCS

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages