tesla_agent // Local Agentic AI for Utilities

Important

Explore the Interactive Guide & Dashboard Live at:
👉 boxwrench.github.io/tesla_agent

Welcome to tesla_agent! This repository is a clean-room, plug-and-play template designed to teach water treatment operators and utility professionals how to build and run local, private agentic AI workflows on consumer AMD hardware.

This is the guide we wish we had when we started: verbose, explanatory, data-driven, and honest about what didn't work.

The public repo has four jobs:

Job	What to use
Learn	The chapter guide explains local agents, Strix Halo hardware, safety, and utility workflows in plain language.
Reproduce	The reference matrix records model files, checksums, backend versions, gates, and measured benchmark rows.
Choose	The model ladder and decision tree help pick a default, quality lane, speed lane, or experimental lane without trying every release from scratch.
Build safely	The workflow and safety chapters show how to run a supervised, local water-agent workflow without connecting an agent to production systems.

Track recommendation and benchmark pivots in the Changelog. For a machine- and human-readable map of canonical sources, see Repository Map and Canonical Sources.

For related writing on utility work, technology, and practical operations, see Title 22 — water, systems, strategy.

Tracking the wider field? Of Agents and Aquifers is a running, curated collection of research papers, repositories, and notable finds at the intersection of AI / agentic systems and utilities — the reading list behind this work (source repo).

⚠️ Legal Disclaimer & Warning

Caution

CRITICAL INFRASTRUCTURE WARNING: This repository and its associated scripts, guides, and models are educational resources and research prototypes only. They are NOT certified, approved, or designed for use in real-time control, automated process adjustment, regulatory reporting, or direct operations of public drinking water systems, wastewater treatment facilities, municipal SCADA systems, or any other critical infrastructure.

The code, data, and models are provided "AS IS" without warranties of any kind, express or implied. Under no circumstances shall the authors or copyright holders be liable for any operational failures, water quality compliance violations, health hazards, equipment damage, or legal penalties resulting from the use of this software.

🛑 Before You Start: The Apprentice Has the SCADA Keys

Warning

An agent with high-level permissions is three things at once: an apprentice you've handed the SCADA console, a five-year-old with your phone, and a junior staffer with the corporate credit card. Confident, eager, and unsupervised, it will do something you didn't intend.

Before you give an agent write access to anything that matters, read Chapter 11 — Agent Safety. It covers sandboxing, least-privilege access, credential isolation, spend limits, kill switches, and the incident playbook for when (not if) something goes sideways.

The single most important control for cloud agents is a hard spend cap set on the provider side, today, before the first run — it's the difference between a $5 lesson and a $1,000 hole when an agent loops on a failing task overnight. (Every model in this repo runs locally specifically so this failure mode does not exist.)

The cheapest hour you'll ever spend on this stack is the one you spend reading the safety chapter before you let an agent run unattended for the first time.

📖 The Guide

Start here and read in order — each chapter builds on the last. Written for water-treatment and utility professionals new to AI, not just engineers.

#	Chapter	What it covers
01	What is Agentic AI?	Core concepts — agents, tools, the tool-call loop
02	Why Local?	Privacy, offline capability, and cost
03	The Hardware	Strix Halo, unified memory, and GTT pools
04	The Journey	What failed, what worked, and why
05	Setup	Step-by-step build with troubleshooting
06	Verification	Running the Nonce Gate and Coding Eval
07	Choosing a Model	Benchmarks and reasoning toggles
08	Speed and Tuning	Vulkan vs ROCm, MTP speed lanes, reasoning budgets
09	Building Your Workflow	Putting it to work on real utility tasks
10	How Agents Work Together	One agent vs pipelines, batch, and orchestrators
11	Agent Safety	Sandboxing, least privilege, spend limits, kill switches — read before write access

Prefer a rendered, interactive version? See the live site. For technical lookups, jump to the Reference.

🛠️ Reference Testing Stack

All benchmarks were run on local consumer hardware with the following configuration:

Hardware (APU): AMD Ryzen Strix Halo (gfx1151), 128 GB LPDDR5X system RAM (configured via modprobe with 96 GB GTT graphics memory pool).
Server Backend: llama.cpp/llama-server (stable build b9247; opt-in MTP lanes reproduced on b9360, with Gemma QAT MTP probes on Atomic b9019). The default backend is Vulkan/RADV (Mesa 25.2.8); ROCm 7.2.x (HIP) is the retained fallback and the path some rows are measured on (see matrix).
Parameters: Greedy decoding (temperature = 0), context buffers scaled from 8,192 to 32,768, Flash Attention active.

📊 Model Performance Matrix

Below are the actual measured results across the different configurations. Community consensus still rates Qwen models highly for reasoning, and Qwen remains available in this stack. The recommendations below follow the local Strix Halo agent-workflow gates and pairwise results measured for this repository.

Current model ladder (stable set, 2026-06-02):

CODE / general workhorse (default): Qwen 3.6 35B-A3B MoE — Vulkan default, ROCm fallback.
PLAN / AGENTIC baseline: Qwen 3.5 35B-A3B MoE.
QUALITY champion: StepFun Step-3.7-Flash (UD-IQ4_XS + MTP Q8_0 draft) — graduated 2026-06-02, replacing Qwen 122B at the same decode speed with higher quality.
AMERICAN-ONLY tier (US-origin models, for agencies that may require domestic-only model provenance): gpt-oss-120B (OpenAI) for quality/speed and Gemma 4 31B IT (Google) as the coding second-opinion.
HARD-CODE challenger: Qwen3-Coder-Next. Quiver (break-glass): Qwen 3.6 27B Dense.
Retired: Qwen 3.5 122B (2026-06-02).

Warning

Correction (2026-05-31): An earlier version of this table listed Gemma 4 31B IT Q6_K at "43-48 tok/s" — that number was misattributed and belonged to gpt-oss-120B Vulkan quality runs. The corrected Gemma 31B figure is below. Gemma 31B is a dense model; it reads the full weight set each token, making it memory-bound slow on this APU. MoE models (Qwen 35B, 122B, gpt-oss) route fewer active parameters per token and are correspondingly faster.

Model & Quantization	RAM Footprint	Context Window	Think Toggle	Planning Quality (Scorecard)	Generation Speed (Decode)	Nonce Gate (Tool Use)	Verdict / Fit
gpt-oss-120B (MXFP4, 3 shards)	~63 GB	32,768	High reasoning	Pairwise: 5-1 vs Qwen 35B; 4-2 vs Qwen 122B	~46 tok/s (Vulkan)	3 / 3 Pass	AMERICAN-ONLY quality/speed — US-origin (OpenAI) lane for agencies that may require domestic-only model provenance
Gemma 4 31B IT (Q6_K)	25.2 GB	32,768	On for coding	Pairwise: 4-2 vs Gemma 26B-A4B	~8.25 tok/s tg128; ~7.7 tok/s sustained (Vulkan; pp8192 ~133.6 tok/s)	3 / 3 Pass	AMERICAN-ONLY coding second-opinion — US-origin (Google), dense/slow decode; use orchestrated path
Qwen 3.6 35B MoE (Vulkan RADV)	21.7 GB	32,768	On	82 / 84	~58.5 tok/s (Vulkan)	3 / 3 Pass	CODE/general workhorse (default)
Qwen 3.5 35B-A3B MoE (MXFP4)	21.0 GB	32,768	On	79 / 84	47.3 tok/s (ROCm)	3 / 3 Pass	PLAN/AGENTIC baseline
Qwen 3.6 35B MoE MXFP4-MTP (Vulkan RADV)	19.3 GB	32,768	On	same production quant	~72.7 tok/s (+24% vs workhorse)	3 / 3 Pass	Opt-in speed lane using `--spec-type draft-mtp`; technique surfaced via strix-halo-guide
Qwen 3.6 35B MoE Q4_K_M-MTP (Vulkan RADV)	20.7 GB	32,768	On	Won quality pairwise 4-2	~81.2 tok/s (+39% vs workhorse)	3 / 3 Pass	Opt-in speed lane; human-check regulatory figures
Qwen 3.6 35B MoE (ROCm)	21.7 GB	32,768	On	82 / 84	44.2 tok/s (ROCm)	3 / 3 Pass	ROCm fallback backend
Qwen 3.6 35B MoE (ROCm)	21.7 GB	32,768	Off	82 / 84	43.7 tok/s	3 / 3 Pass	Cuts wall-time in half for prose (falls to 1/3 coding E2E)
StepFun Step-3.7-Flash MTP (UD-IQ4_XS + Q8_0 draft, Vulkan RADV)	88.79 GiB + 3.5 GB draft	12,288	model-native	plain StepFun pairwise: 6-0 vs gpt-oss-soulfix; 4-0-2 vs 122B	27.9 tok/s; pp 183.5 tok/s; wall std 78.0 s	3 / 3 Pass	QUALITY champion (graduated 2026-06-02); MTP acceptance 89.3%; `ub=256` default (ubatch sweep 2026-06-06). Replaced Qwen 122B at the same decode speed with higher quality. (Public source/checksum pins still pending — see reference matrix.)
Qwen 3.5 122B MoE (MXFP4)	70.0 GB	12,288	On	80 / 84	19.4 tok/s (ROCm)	3 / 3 Pass	Retired 2026-06-02 — StepFun replaces at same decode speed, higher quality. Retained on disk for regression only
Qwen 3.5 122B MoE MTP (MXFP4_MOE, Vulkan RADV)	~70 GB	12,288	On	quality parity: 3-3 tie vs prior MTP config	28.3 tok/s; pp 324.9 tok/s	3 / 3 Pass	Retired 2026-06-02 (tuned 122B speed lane; `DRAFT_N=1`, `PMIN` unset) — kept as a technical record
StepFun Step-3.7-Flash plain (UD-IQ4_XS, Vulkan RADV)	88.79 GiB	16,384 gate / 32,768 coding	model-native	6-0 vs gpt-oss-soulfix; 4-0-2 vs 122B	20.4-22.3 tok/s; pp 212.0 tok/s	3 / 3 Pass	QUALITY champion — plain (no-draft) lane; coding 4/5 E2E
Gemma 4 26B-A4B IT (UD-Q6_K_XL)	21.2 GB	32,768	Off	Pairwise: 2-4 vs Gemma 31B	44.8 tok/s tg128; pp512 1002.8 tok/s	3 / 3 Pass	Verified plain-control baseline; simpler lane for general reasoning/JSON/prose
Gemma 4 26B-A4B QAT Q4_0	13.45 GiB	32,768	Off	quality control vs non-QAT Q4 pending	59.4 tok/s; pp 1194.4 tok/s	3 / 3 Pass	Fast Gemma QAT lane; official Google QAT GGUF, best general Gemma speed row so far
Gemma 4 26B-A4B QAT Q4_0 + MTP/Q8 KV (QAT head)	13.45 GiB + ~310 MiB assistant	12,288	Off	QAT-matched head; quality control pending	71.4 tok/s; pp 729.3 tok/s	3 / 3 Pass	Single-stream speed lane; MTP acceptance 91.8% with matched QAT head; 29.6 s wall std
Gemma 4 12B QAT Q4_0	6.50 GiB	32,768	Off	quality control pending	25.7 tok/s; pp 666.5 tok/s	not run	Compact QAT row; slower than 26B-A4B on this stack
Gemma 4 12B QAT Q4_0 + MTP/Q8 KV (QAT head)	6.50 GiB + ~313 MiB assistant	12,288	Off	QAT-matched head; quality control pending	45.6 tok/s; pp 539.9 tok/s	not run	Single-stream +77% vs plain; MTP acceptance 78.4%; 46.0 s wall std
Gemma 4 31B QAT Q4_0	16.44 GiB	32,768	Off	quality control pending	11.0 tok/s; pp 204.2 tok/s	not run	Dense QAT control; faster than prior Q6 but still memory-bound
Gemma 4 31B QAT Q4_0 + MTP (QAT head)	16.44 GiB + ~337 MiB assistant	12,288	Off	QAT-matched head; quality control pending	19.1 tok/s; pp 203.6 tok/s	not run	Single-stream +73% vs plain; MTP acceptance 60.4%; 110.4 s wall std
Qwen 3.6 27B Dense (UD-Q4_K_XL)	16.4 GB	32,768	On	0-6 vs Qwen 122B	9.6-11.5 tok/s tested normal decode	3 / 3 Pass	Experimental — not in the stack (see note)
Qwen 3.6 27B Dense (UD-Q4_K_XL)	16.4 GB	32,768	Off	—	9.6-11.5 tok/s tested normal decode	3 / 3 Pass	Experimental — not in the stack
Qwen3-Coder-Next (UD-Q4_K_XL, Vulkan RADV)	49.6 GB	32,768	Off	one orchestrated 4-step coding run: saved grader checks PASS	44.4 tok/s; pp 723.2 tok/s	3 / 3 Pass recorded	128GB Coder Challenger; Vulkan b9360 promoted over ROCm

Note

MTP speed options are opt-in. The Qwen3.6-35B-A3B-MTP GGUFs carry a native next-token prediction head, so recent llama-server builds can self-speculate with --spec-type draft-mtp and no separate draft model. The workhorse default remains the standard MXFP4 Qwen 3.6 35B lane. The speed technique was surfaced by the community strix-halo-guide; see the acknowledgments below and the MTP case study.

Gemma 4 26B-A4B plain control baseline: The no-spec Vulkan lane with --reasoning off and F16 KV now measures pp512 ~1003 tok/s and tg128 ~44.8 tok/s with Hermes nonce 3/3. It is the simpler lane for general reasoning, JSON extraction, and prose; the MTP comparison only pays off on heavy code generation.

Gemma 4 QAT Q4_0 sweep: The official Google QAT GGUFs are now measured with matched QAT assistant heads for all MTP rows. QAT means quantization-aware training: the model is trained/adapted while accounting for the low-precision target, with the goal of retaining more behavior at Q4 than a simple post-training quant. The strongest row is 26B-A4B QAT plain Vulkan at 59.4 tok/s and 1194.4 tok/s prefill; with the QAT-matched MTP head it reaches 71.4 tok/s single-stream at 91.8% acceptance (the earlier 56.9% acceptance came from a mismatched non-QAT head — that gap was entirely the head mismatch). 12B QAT MTP (QAT head) reaches 45.6 tok/s (+77% vs plain) at 78.4% acceptance; 31B QAT MTP (QAT head) reaches 19.1 tok/s (+73% vs plain) at 60.4% acceptance and 110.4 s wall std. Quality comparison against ordinary non-QAT Q4/K-quant controls is still pending.

Latest large-model MTP lanes: Qwen 122B MTP (now retired as of 2026-06-02, kept only as a technical record) reached a tuned Vulkan profile (DRAFT_N=1, PMIN unset) at 28.3 tok/s decode with 81.8% MTP-probe acceptance. Its quality role is now held by the StepFun champion. StepFun Step-3.7-Flash MTP reaches 27.9 tok/s decode (wall std 78.0 s) with 89.3% acceptance using ub=256 — a ubatch sweep (2026-06-06) showed smaller micro-batches reduce per-speculative-step latency and compound over long outputs (+7% tg, −5% wall std vs the prior ub=512 default). The DRAFT_N / PMIN levers are exhausted for StepFun: acceptance is model-determined at ~89–90% regardless of config.

Note

The dense Qwen 3.6 27B is benchmarked but NOT in the production stack — community discussion often treats it as a strong reasoner, but the local Strix Halo gates did not corroborate that for this workflow. A blind quality pairwise put it 0-6 against the 122B on the standard prompt set, and tested backends remained around 9.6-11.5 tok/s for normal decode. It remains a break-glass "arrow in the quiver" for tough, blocked projects where trying a different dense trace might help, not a first- or second-line choice.

Technical aside (why it's interesting even though unshipped): DFlash speculative decoding lifts the dense route to ~31 tok/s (2.82×) with a footprint-minimized Q4_K_M draft. That result came from a separate dense-model path; for Qwen 35B MoE, the current opt-in speed path is native MTP.

Note

Why is Gemma 4 31B so much slower than Qwen 35B MoE? Both are ~25 GB models, but they have very different internal architectures. Qwen 3.6 35B is a Mixture-of-Experts model that activates only ~3B parameters per token — so each decode step reads far less weight data from memory. Gemma 4 31B is a dense model: every one of its 31B parameters must be read for every token generated. On a memory-bandwidth-bound APU like Strix Halo, this difference dominates, producing ~8 tok/s for the dense Gemma vs ~58.5 tok/s for the MoE Qwen workhorse on the same hardware.

Tip

For full reproducibility data, model checksums, evaluation methodologies, and detailed post-mortems of failed attempts (such as vLLM compilation timeouts and MoE speculative decoding latency overhead), see the Reproducibility Matrix & Deep-Dive.

For the long-form research that informed these recommendations — Strix Halo backend tradeoffs, MoE integration architecture, prompt-architecture patterns — see the Research folder.

🧪 Does It Actually Work on Real Plant Data?

The benchmarks above measure the model. A separate, ongoing evaluation measures the whole agent doing real operational work: given a plain operator question and a pointer to data, can the local stack read the data itself, write and run its own analysis, and reach a sound, operator-facing conclusion — entirely on the local machine, with nothing leaving the box?

We test it against the public SWaT water-treatment dataset across three different kinds of problem:

Pump short-cycling — event detection (a duty-cycle fault)
Membrane fouling — trend fitting (a slow pressure-rise trend on real, cyclically-cleaned data)
pH / acid-dosing control — control-loop tracking (is the controlled variable holding its band against its actuator?)

The short answer: it basically works — as a supervised assistant, with caveats. The local agent (Qwen 3.6 35B-A3B MoE) reliably runs the full loop to completion, reproduces hand-computed ground-truth numbers to several significant figures on clean data, and writes clear operator briefs. Fully local is verified per run, not assumed.

The most interesting finding is one we didn't expect: on real, messy data the agent repeatedly out-reasoned our hand-built answer key. On the membrane data it discovered the clean-in-place cycles on its own and analyzed the steady trend between them — a more correct method than our first ground truth. On the pH data it caught a 3-minute acid-pump dropout that crashed pH to 6.0, flagged it as a water-quality event, and disagreed with our rubric — which a domain expert then confirmed the agent had gotten right. Four times the evaluation ended up auditing us, not the model.

It is not an unsupervised operator. The honest caveats — a single yes/no verdict on a borderline case can flip between runs; most results are one MoE model; real-data windows are still short — are documented alongside the wins, because the goal here is to help people learn to work with a new tool, not to sell one. Chapter 04 — The Journey and 11 — Agent Safety carry that same "what failed and why" spirit.

Read it / audit it yourself:

📋 The field guide — the full write-up: the scenario, the verbatim prompts, the agent's own output, and where it stumbled and what we changed.
🧪 The evaluation bundle — the probes, scorers, synthetic stubs, derived real-data slices, and every captured run (prompt sent, transcript, answer, chart, score) behind each claim.
💡 Insights — the transferable lessons (a benchmark number physics says is impossible; why decode speed didn't pick the winner; "detected ≠ diagnosed"; the day a domain expert sided with the agent over our own answer key).
📊 Scoreboard — one row per run, across all probes.

The full SWaT dataset (~128 MB) is not redistributed here; get it from the public source. The synthetic stubs and the exact real-data slices the agent saw are committed, so the results reproduce without it.

1. Core Objectives

100% Data Privacy: Run models completely offline. Your local logs, documents, and sensitive data files never leave your workstation.
Plug-and-Play Setup: Built specifically for the AMD Strix Halo (gfx1151) Unified Memory Architecture.
Teach-by-Building: Designed for learners and professionals with little to no machine learning experience.

2. Directory Structure

tesla_agent/
│
├── README.md                      # Front door: what, who, and reading order
├── REPO_MAP.md                    # Canonical-source policy and folder roles
├── CHANGELOG.md                   # Public-facing recommendation/history log
├── LICENSE                        # CC BY-NC 4.0 (Attribution Required, Non-Commercial)
│
├── guide/                         # THE TEACHING GUIDE (Read in order)
│   ├── 01-what-is-agentic-ai.md   # Core concepts (agents, tool use)
│   ├── 02-why-local.md            # Privacy, offline capability, and costs
│   ├── 03-the-hardware.md         # Strix Halo, UMA, and GTT pools
│   ├── 04-the-journey.md          # History: what failed and what worked
│   ├── 05-setup.md                # Step-by-step setup with troubleshooting
│   ├── 06-verification.md        # Running the Nonce Gate and Coding Eval
│   ├── 07-choosing-a-model.md     # Benchmarks and reasoning toggles
│   ├── 08-speed-and-tuning.md     # Going faster: Vulkan, MTP, budgets
│   ├── 09-building-your-workflow.md # Transition to real-world utility work
│   ├── 10-orchestrating-agents.md # One agent vs pipelines, batch, orchestrators
│   └── 11-agent-safety.md         # SAFETY: sandboxing, spend limits, kill switches
│
├── reference/                     # TECHNICAL REFERENCE (Quick lookups)
│   ├── README.md                  # Pinned versions, checksums, benchmarks
│   ├── glossary.md                # Plain-language glossary of ML terms
│   ├── architecture-diagram.md    # Mermaid stack diagram
│   ├── decision-tree.md           # Mermaid model chooser flowchart
│   └── reproducibility-matrix.md  # Canonical benchmark/checksum source
│
├── research/                      # DEEP RESEARCH ARTIFACTS (Long-form)
│   ├── README.md                  # Index, reading order, who-this-is-for
│   ├── strategic-architecture-frontier-moe-strix-halo.md
│   ├── cognitive-dual-stack-engineering-local-workflows.md
│   ├── high-performance-orchestration-reasoning-architectures.md
│   ├── american-stack-research-synthesis.md
│   ├── american-stack-research-synthesis-round2.md
│   ├── strix-halo-agentic-evaluation-report.md
│   ├── infrastructure-profiling-framework.md
│   ├── mtp-speculative-decoding-strix-halo.md
│   └── gemma-4-26b-control-vs-mtp-strix-halo.md
│
├── eval/                          # THE EVALUATION FRAMEWORK
│   ├── README.md                  # Explanation of the testing approach
│   ├── coding/                    # 4-step sequential coding evaluation
│   ├── quality/                   # 7-dimension planning quality rubric
│   └── realdata-eval/             # Water-treatment data evaluation bundle
│
├── docs/                          # GitHub Pages dashboard and guide mirror
│   ├── index.html                 # Interactive guide/dashboard
│   ├── app.js                     # Dashboard model finder and verifier logic
│   └── guide/                     # Mirror of guide/ for the rendered site
│
├── assets/                        # Static images used by markdown docs
├── archive/                       # Preservation notes for old working states
│
└── scripts/                       # PORTABLE SYSTEM UTILITIES
    ├── config.env.example         # Template configuration file
    ├── setup/                     # Host configuration scripts
    ├── serving/                   # Model server launchers
    └── eval/                      # Validation test runners

3. Quickstart Workflow

To get started, follow the guide chapters in order, or execute these quick setup commands:

Host Diagnostic: Check compatibilities:
```
bash scripts/setup/check_host.sh
```
VRAM Pool Allocation: Allocate graphics memory (requires sudo & reboot):
```
sudo bash scripts/setup/apply_gtt.sh
```
Environment Setup: Export driver overrides (source this in every new window):
```
source scripts/setup/set_hsa_env.sh
```

Launch Inference Server: Configure environment files and start:

cp scripts/config.env.example scripts/config.env
# Edit scripts/config.env to match your path folders
bash scripts/serving/serve_rocm.sh

Run Nonce Gate Verification: Prove tool calling is working:
```
bash scripts/eval/nonce_gate.sh
```

4. Interactive Web Dashboard

This repository includes a gorgeous interactive guide and log verifier dashboard. You can access it in two ways:

Online: Visit the live deployment at boxwrench.github.io/tesla_agent
Offline/Local: Simply open the docs/index.html file in any web browser.

Use the dashboard to select recommended models, follow interactive setup steps with troubleshooting assistance, and paste log outputs to test them against the Nonce Gate verifier.

5. Security & Data Sovereignty

Public utilities handle critical infrastructure. Sending SCADA readings or operating logs to public cloud APIs violates standard security policies. Running this stack locally protects your data sovereignty, ensuring zero network data leaks.

Acknowledgments

The MTP (multi-token-prediction / self-speculative decoding) speed work was led by the community strix-halo-guide by hogeheer499, whose reproducible Strix Halo benchmarks pointed us at --spec-type draft-mtp and the Qwen3.6-35B-A3B-MTP GGUFs. We independently reproduced their pipeline; our Q4_K_M requant came out SHA-identical to theirs, and we adapted the quant choice to this repo's quality bar.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tesla_agent // Local Agentic AI for Utilities

⚠️ Legal Disclaimer & Warning

🛑 Before You Start: The Apprentice Has the SCADA Keys

📖 The Guide

🛠️ Reference Testing Stack

📊 Model Performance Matrix

🧪 Does It Actually Work on Real Plant Data?

1. Core Objectives

2. Directory Structure

3. Quickstart Workflow

4. Interactive Web Dashboard

5. Security & Data Sovereignty

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
archive		archive
assets		assets
docs		docs
eval		eval
guide		guide
reference		reference
research		research
scripts		scripts
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
HANDOFF.md		HANDOFF.md
LICENSE		LICENSE
README.md		README.md
REPO_MAP.md		REPO_MAP.md

Folders and files

Latest commit

History

Repository files navigation

tesla_agent // Local Agentic AI for Utilities

⚠️ Legal Disclaimer & Warning

🛑 Before You Start: The Apprentice Has the SCADA Keys

📖 The Guide

🛠️ Reference Testing Stack

📊 Model Performance Matrix

🧪 Does It Actually Work on Real Plant Data?

1. Core Objectives

2. Directory Structure

3. Quickstart Workflow

4. Interactive Web Dashboard

5. Security & Data Sovereignty

Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages