ampere

Star

Here are 29 public repositories matching this topic...

RobTillaart / INA226

Sponsor

Star

Arduino library for INA226 power sensor

arduino sensor power voltage ampere

Updated Jan 10, 2026
C++

dougeeai / llama-cpp-python-wheels

Star

Pre-built wheels for llama-cpp-python across platforms and CUDA versions

Updated Apr 18, 2026

Sandermage / genesis-vllm-patches

Star

Runtime patches for vLLM — Qwen3.6 (27B int4 / 35B-A3B FP8) on consumer Ampere. 50+ patches: TurboQuant KV, MTP / DFlash / ngram spec-decode, FULL cudagraph, 256K-320K context. v7.64: P67 non-pow-2 GQA, Cliff 1 fix, 6 docs (FAQ/HARDWARE/CONFIGS/CLIFFS), Genesis Compat Layer.

Updated May 1, 2026
Python

AmpereComputingAI / llama.cpp

Star

Ampere optimized llama.cpp

meta ai llama arm64 ampere llm llamacpp

Updated Jan 30, 2026
Python

egaoharu-kensei / flash-attention-triton

Star

Cross-platform FlashAttention-2 Triton implementation for Turing+ GPUs with custom configuration mode

Updated Jan 12, 2026
Python

AmpereComputingAI / ampere_model_library

Star

AML's goal is to make benchmarking of various AI architectures on Ampere CPUs a pleasurable experience :)

machine-learning natural-language-processing computer-vision model-zoo tensorflow inference pytorch artificial-intelligence arm64 aarch64 ampere armv8-a onnxruntime mlperf-inference dlrm large-language-models yolov8 llama2

Updated Feb 26, 2026
Python

husjon / valheim_server_oci_setup

Sponsor

Star

Setup instructions for running Valheim on Oracle Cloud Infrastructure using Arm also available at https://codeberg.org/husjon/valheim_server_oci_setup

arm ampere valheim

Updated Feb 23, 2026
Shell

BmdOnline / EnergyMonitor

Star

Arduino energy monitor, using SCT-013-030 current sensors

Updated Jun 3, 2019
OpenSCAD

thc1006 / qwen3.6-speculative-decoding-rtx3090

Star

First public benchmark of llama.cpp speculative decoding on Qwen3.6-35B-A3B with a single RTX 3090 (post PR #19493 merge, 2026-04-19). 19 configurations covering ngram-cache, ngram-mod, and classic draft with vocab-matched Qwen3.5-0.8B. Finding: no variant achieves net speedup on Ampere + A3B MoE. Raw JSON, plots, full reproducibility.

benchmark cuda moe ampere mixture-of-experts inference-benchmark llama-cpp ggml local-llm llm-inference qwen speculative-decoding qwen3 rtx-3090

Updated Apr 26, 2026
Python

fosshostorg / aarch64.com

Star

We are a fosshost project which is delivering ARM-based hardware into multiple, global data centers. We document and keep a diary of our project daily.

ecosystem arm arm64 ampere fosshost

Updated May 19, 2022
TypeScript

groxaxo / GPTQ-Pro

Star

GPTQ optimized for Ampere (RTX 3090/3060) — Marlin JIT nvcc compatibility fixes included

nvidia marlin ampere 4bit pro gptq gemma4

Updated Apr 30, 2026
Python

badr42 / oke_A1

Star

Terraform to provision an OCI OKE cluster on Ampere A1 Processors, and then deploy nginx on it

nginx oci ampere iaac

Updated Dec 10, 2022
HCL

ADLINK / meta-adlink-ampere

Star

Single Yocto layer for all Ampere Altra Arm 64-bit based Computer on Modules (COM-HPC). This layer has the support for the following products AADP, AADK, AADR, AVA

linux cloud embedded server yocto ampere adlink ampere-altra com-hpc soafee

Updated Dec 17, 2025
BitBake

pantaleone-ai / private-ai-stack

Sponsor

Star

Deploy a complete, self-hosted AI stack for private LLMs, agentic workflows, and content generation. One-command Docker Compose deployment on any cloud.