    Repositories list

    • Enterprise-grade orchestration for high-density LLM inference on Ampere computing. Real-time monitoring of aggregate throughput, session peak TPS, and special…
      TypeScript
      MIT License
      0000 Updated Mar 16, 2026
    • Python
      0000 Updated Mar 11, 2026
    • Python
      MIT License
      1202 Updated Mar 4, 2026
    • AML's goal is to make benchmarking of various AI architectures on Ampere CPUs a pleasurable experience :)
      Python
      Apache License 2.0
      723810 Updated Feb 26, 2026
    • Gradio-based launcher for Ampere-optimized AI demos: LLM Chat with RAG (Ollama), YOLOv11 Object Detection, and Whisper Speech-to-Text. Runs in Docker containers…
      Python
      MIT License
      3420 Updated Feb 10, 2026
    • llama.cpp

      Public
      Ampere optimized llama.cpp
      Python
      53652 Updated Jan 30, 2026
    • 0000 Updated Jan 30, 2026
    • AI models trained by Google to classify species in images from motion-triggered wildlife cameras.
      Python
      Apache License 2.0
      55000 Updated Jan 30, 2026
    • ampere-ai-agents

      Public
      Build and automate Agentic AI workflows. This demo leverages n8n, Ollama, and SearXNG to help you create intelligent agents that can browse the web and process …
      Dockerfile
      MIT License
      0000 Updated Dec 17, 2025
    • Turn your natural language questions into executable SQL. This demo leverages LlamaIndex and Open WebUI to help you analyze your database schema and retrieve in…
      Shell
      MIT License
      0000 Updated Dec 17, 2025
    • Generate robust Python solutions using Qwen3-Coder and Ampere Optimized llama.cpp on Ampere Altra & AmpereOne CPUs. Includes a Dockerized UI.
      Shell
      MIT License
      0000 Updated Dec 12, 2025
    • Python
      0220 Updated Jul 28, 2025
    • 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
      Python
      Apache License 2.0
      6.9k000 Updated Jun 16, 2025
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      Apache License 2.0
      33k000 Updated Jun 16, 2025
    • Shell
      Apache License 2.0
      1000 Updated Apr 21, 2025
    • State-of-the-Art Text Embeddings
      Python
      Apache License 2.0
      2.8k000 Updated Dec 13, 2024
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      Apache License 2.0
      33k000 Updated Dec 5, 2024
    • Llama3-8B scale out scripts
      Python
      0000 Updated Sep 7, 2024
    • Shell
      1110 Updated Aug 28, 2024
    • Scripts to reproduce AI results on AmpereOne platform.
      Jupyter Notebook
      0100 Updated Aug 19, 2024
    • Fork of tensorflow serving for ARM64 build
      C++
      Apache License 2.0
      2300 Updated Jul 31, 2024
    • local-rag

      Public
      Python
      1000 Updated May 22, 2024
    • Integrating Ampere's high performance LLM inference with popular application building frameworks in the industry
      Python
      Apache License 2.0
      1030 Updated May 22, 2024
    • whisper

      Public
      Robust Speech Recognition via Large-Scale Weak Supervision
      Python
      MIT License
      12k001 Updated Apr 25, 2024
    • AutoGPTQ

      Public
      An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
      Python
      MIT License
      539110 Updated Mar 13, 2024
    • Qualcomm Cloud AI SDK (Platform and Apps) enable high performance deep learning inference on Qualcomm Cloud AI platforms delivering high throughput and low late…
      Jupyter Notebook
      Other
      17000 Updated Mar 11, 2024
    • Python
      3520 Updated Mar 5, 2024
    • LlamaIndex is a data framework for your LLM applications
      Python
      MIT License
      7.2k000 Updated Mar 4, 2024
    • 🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
      Python
      Apache License 2.0
      33k000 Updated Dec 16, 2023
    • Stable Diffusion web UI
      Python
      GNU Affero General Public License v3.0
      30k000 Updated Nov 24, 2023