You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Enterprise-grade orchestration for high-density LLM inference on Ampere computing. Real-time monitoring of aggregate throughput, session peak TPS, and special…
Gradio-based launcher for Ampere-optimized AI demos: LLM Chat with RAG (Ollama), YOLOv11 Object Detection, and Whisper Speech-to-Text. Runs in Docker containers…
AmpereComputingAI/ampere-ai-agents’s past year of commit activity
Build and automate Agentic AI workflows. This demo leverages n8n, Ollama, and SearXNG to help you create intelligent agents that can browse the web and process …
Turn your natural language questions into executable SQL. This demo leverages LlamaIndex and Open WebUI to help you analyze your database schema and retrieve in…
Qualcomm Cloud AI SDK (Platform and Apps) enable high performance deep learning inference on Qualcomm Cloud AI platforms delivering high throughput and low late…