CodeIAK

Local-first, offline-capable AI coding agent with multi-model orchestration, reviewable changes, and agentic workflows — inspired by the usefulness of Cursor/Codex-style workflows, but designed to run locally with configurable GGUF models and controlled execution.

Status: Active Alpha · Work in Progress
This repository: Public project page and showcase — not a full code release or install guide.

CodeIAK is a local-first, offline-capable AI coding agent for developers who want powerful automation without giving up control. It runs on your machine using local GGUF models, a dedicated workspace UI, and a backend that can analyze and change project files — but treats every edit as a reviewable proposal with backups, validation, repair, and revert paths when things go wrong.

Unlike a generic chat wrapper around a single model, CodeIAK is built as a small agent runtime: purpose-specific model slots, mode-aware workflows, and transparent execution you can inspect step by step.

Audit and risk workflows are used as professional demo scenarios, but they are not the product boundary. CodeIAK is designed as a general-purpose local coding agent for controlled file editing, data-to-UI workflows, validation, and traceable execution.

Longer term, the project explores what a Cursor/Codex-style coding workflow can look like when it is local-first, model-configurable and privacy-preserving.

Why CodeIAK

Many coding assistants either depend on the cloud, hide what they changed, or apply edits without a safe rollback path. CodeIAK takes a different approach:

Runs locally — GGUF models via llama-cpp-python; no cloud API required for core operation
Separates thinking from doing — Ask and Debug explain without writing files; Plan and Agent prepare bounded workspace changes
Makes automation inspectable — proposals appear as diffs in a Review tab before apply
Protects your workspace — backups before writes, revert paths, allowlisted commands, offline-by-default networking
Routes intelligently — multiple model roles, mode-aware workflows, and seamless mode recommendations when a request belongs elsewhere

The goal is not opaque code generation. It is inspectable, reversible automation on local projects you own.

At a glance

Modes: Ask · Plan · Agent · Debug · Multitask · Targeted Ask routing
Model Manager: Eight configurable GGUF slots with memory suitability hints
Review-first edits: Propose → diff preview → apply → validate → repair
Agent execution: Autonomous bounded runs with milestones, activity timeline, and revert
Multitask: Decompose objectives into sequential subtasks with review gates
Document and data intelligence: Attach PDF, DOCX, and XLSX as read-only source context; selected Agent workflows can transform spreadsheet data into validated workspace outputs
Offline-first: Local backend, local models, explicit network permission gate for downloads

Interface preview

CodeIAK's primary interface is a controlled three-panel workspace: project navigation and mode controls on the left, chat-based task intake in the center, and execution/review tools on the right.

CodeIAK's main workspace combines mode-aware task intake, local model controls, reviewable changes, activity traces, preview tools and controlled execution in a single interface.

Demo — Excel-to-Dashboard Agent Workflow

codeiak_excel_to_dashboard_demo.mp4

Excel stress-test data → deterministic metrics → UIgen dashboard rendering → data-lock validation → React build validation → local preview.

CodeIAK receives an ICAAP-style stress-test Excel file as a read-only source, calculates risk indicators deterministically, locks the calculated values into a dashboard specification, uses UIgen to generate the React interface, validates the generated output, and serves the result locally.

Stage	What CodeIAK does
Source	Reads the Excel file as read-only input
Calculation	Computes stress-test metrics deterministically
UI generation	Uses UIgen to render the React interface
Control	Runs data-lock validation and build validation
Output	Modifies only `src/App.jsx` and serves local preview

Execution principle: the LLM is not used to invent spreadsheet values. Calculations are performed deterministically from the attached Excel data; UIgen is used for interface structure, layout and presentation.

Runtime evidence

_{Live System Map. The route shows Agent runtime, UI model involvement, backup, apply and validation.}

_{Backups. CodeIAK creates restore points before applying Agent changes.}

_{Review and validation. The run modifies only src/App.jsx, passes build validation and remains revertible.} _{Local preview. The generated React dashboard is served inside CodeIAK after validation.}

Browser output

Browser preview — dashboard overview. The generated React dashboard shows deterministic KPIs, critical capital position, data-quality indicators, baseline-vs-stressed comparison, control exceptions and management insight.

Browser preview — stress impact table. Row-level stress metrics preserve source values while adding calculated variation and severity classification.

Core capabilities

Conversational modes (no automatic file writes)

Ask — Explain code, answer questions, inspect attachments, and orient within a project. Supports Orchestrator routing (CodeIAK chooses the workflow) or Targeted Ask (route directly to Dialogue or Specialist models).

Debug — Diagnose errors, logs, and tracebacks in a read-only diagnostic mode.

Operational modes (workspace-aware)

Plan — Analyze selected files and produce a single review-first proposal for manual approval.

Agent — Run a bounded autonomous controller: backup → apply safe edits → validate (e.g. npm build, Python CLI) → optional repair passes, with execution summaries and revert support.

Multitask — Turn a natural-language objective into a structured multi-step plan, execute subtasks sequentially, and pause when a child step needs Review approval.

Model Manager

Configure eight local model slots: Fast, Balanced, Heavy, Python, UI, Dialogue, Vision, and Specialist. CodeIAK’s orchestrator selects roles per task type — decomposition, proposals, conversational Ask, self-knowledge, context compaction, and more. Memory suitability hints help avoid loading models that exceed available RAM.

Document and data intelligence

Attach PDF, DOCX, and XLSX files to requests. Attachments remain read-only source material: they are not silently imported into the workspace or modified in place.

In selected Agent workflows, spreadsheet data can be parsed, classified, calculated deterministically, and used to generate validated project files such as React dashboards.

Safety pipeline

Workspace-scoped file access
Review tab with diff preview before apply
Automatic backups before writes; restore and execution revert
Validation (npm build, Python CLI in sandbox contexts)
Conservative repair for common build failures
Allowlisted commands only — no arbitrary shell

Transparency features

Self Knowledge — Grounded answers about CodeIAK’s own modes, architecture and behavior, using deterministic FAQ material and sourced excerpts from internal documentation/code.
Mode arbitration — When a request doesn’t fit the current mode, CodeIAK recommends a switch and can continue with the same prompt
Live System Map — Architecture route visualization from current execution signals in the status panel

Domain labs and data-to-UI workflows

Specialized assistants exist for Excel formulas/templates, SQL drafting, VBA review, and UI design guidance.

Most domain labs remain chat- or plan-oriented today. However, selected Agent workflows are already supported — for example, spreadsheet-to-dashboard generation with deterministic calculations, UIgen rendering, data-lock validation, backup, build validation and local preview.

Customization and extensibility

CodeIAK is designed around configurable model slots and role-specific workflows. Users can assign different local GGUF models to Fast, Balanced, Heavy, Python, UI, Dialogue, Vision and Specialist roles.

Because inference runs locally, capability depends on the hardware and models available on the user's machine. More powerful machines can run larger or more specialized models; smaller machines can still use lighter models and narrower workflows.

Model paths can point to locally stored GGUF files, including models downloaded from Hugging Face and configured manually. Over time, domain suites can be expanded for SQL, VBA, Excel, UI generation, finance/risk, documentation, or other language/framework-specific assistants.

How the system works

User request
    → Intent & mode routing
    → Model role selection (orchestrator / targeted model)
    → Workspace analysis & proposal generation
    → Review (diff, metadata, validation plan)
    → [User approves]
    → Backup → Apply → Validate → [Repair if needed]
    → Activity timeline + optional revert

In Multitask, the same pipeline applies per subtask, with explicit review gates when a step produces a proposal.

For data-to-UI workflows, CodeIAK separates data truth from interface generation: deterministic code calculates and locks values, while UIgen handles layout and presentation under validation.

Architecture at a glance

┌─────────────────────────────────────────────────────────┐
│  CodeIAK UI (React / Vite)                              │
│  Chat · Modes · Model Manager · Review · Activity · …   │
└──────────────────────────┬──────────────────────────────┘
                           │ HTTP (local)
┌──────────────────────────▼──────────────────────────────┐
│  CodeIAK Backend (Flask)                                  │
│  Routing · Proposals · Agent controller · Multitask · …   │
└──────┬───────────────────────────────┬──────────────────┘
       │                               │
┌──────▼──────┐                 ┌──────▼──────┐
│ local_llm   │                 │ workspace/  │
│ GGUF slots  │                 │ backups/    │
│ llama-cpp   │                 │ validation  │
└─────────────┘                 └─────────────┘

All inference stays on-machine. No cloud API is required for core operation.

Multi-model routing

Pattern	Behavior
Orchestrator (Ask)	CodeIAK selects dialogue, specialist suggestions, or standard Ask paths per request
Targeted Ask	Bypass general routing; send directly to configured Dialogue or Specialist slot
Specialist profile	Domain display name, description, and trigger terms in Model Manager
Specialist–Builder (Agent)	Optional domain brief merged into builder proposals for structured artifacts

Vision slot: configurable in Model Manager (including multimodal projector path) — inference routing is not active yet.

Offline-first operation

Default offline mode restricts network activity.
Downloads Allowed is an explicit, user-controlled gate for dependency installs.
Future controlled network tools may allow the agent to download files, retrieve packages or verify external information only when explicitly authorized.
The principle is permissioned access, not silent internet browsing.
Backend binds to local host; models load from local GGUF paths you configure.

OpenClaw integration (optional, external)

CodeIAK includes an optional OpenClaw orchestration bridge for external agent workflows. OpenClaw can delegate structured tasks to CodeIAK tools and coordinate higher-level flows, while CodeIAK remains the controlled execution runtime: file changes, review gates, backups, validation and revert stay inside CodeIAK's safety layer.

What exists today:

Read-only CLI tools (listFiles, analyze, inspect, proposeWorkspaceChanges, …)
Multitask HTTP tools (multitaskProposePlan, multitaskExecutePlan) against the local CodeIAK server
A deterministic in-app bridge that delegates to validated CodeIAK workflows

Design constraints:

OpenClaw does not bypass CodeIAK’s workspace safety layer or apply file changes on its own in Multitask flows
When a child step requires review, the external orchestrator must stop — apply happens in CodeIAK’s Review tab
CodeIAK remains the execution runtime; human-in-the-loop review is non-negotiable

A deeper in-app OpenClaw director remains a future direction; the current integration is an external orchestration bridge over validated CodeIAK workflows.

Current development status

CodeIAK is in Active Alpha · Work in Progress. Substantial areas are implemented and covered by automated tests, including:

Six active UI modes plus Targeted Ask routing
Model Manager with eight slots
Propose / Review / Apply / Backup / Revert
Agent autonomous runs with validation and repair
Multitask plan → execute with review gates
Document and data attachments (PDF / DOCX / XLSX)
Selected Excel-to-dashboard Agent workflows
Context compaction for long Ask sessions
Mode recommendation handoffs
Self Knowledge and Live System Map

Current alpha limitations:

Lab mode UI is disabled (“Coming soon”)
Multitask does not auto-resume after a review gate in the same run
Vision inference is not routed yet
SQL and VBA labs remain chat-oriented today; Excel support includes selected Agent workflows, while broader workbook editing and formula-writing automation remain future work
Deep session persistence and advanced plan editing are not in scope for this alpha

Roadmap (indicative)

Near term

Active Alpha hardening and manual test campaign completion
Product-facing polish for core intake, mode handoff and review flows
Public documentation and selective open-release planning
Better README/demo packaging for the public showcase

Medium term

Expanded domain suites for SQL, VBA, Excel, UI generation, documentation and other language/framework-specific workflows
Stronger Agent runtime and Multitask resume after review
Vision slot routing for screenshot and image-assisted tasks
Richer specialist auto-routing and domain profile configuration
Controlled network access for explicitly authorized downloads, package retrieval and external information checks

Longer term

Local-first Cursor/Codex-style coding workflows with deeper autonomous planning under review gates
Controlled local file discovery across user-approved folders outside the active workspace
Deeper project memory and cross-session continuity
Broader model customization through local GGUF models, including manually configured Hugging Face downloads
Installable public release with setup guides, licensing clarity and hardware/model recommendations

Roadmap items are directional, not commitments.

Public release note

This GitHub repository (codeiak) is a public-facing project page for CodeIAK.

You cannot install or run CodeIAK from this repository today.
Source code, licensing, and setup instructions will be published in a later phase.
This page summarizes architecture, capabilities, and current status based on the private alpha codebase.
See NOTICE.md for rights and release status.

For portfolio and recruitment purposes, treat this README as the canonical public description until a full release is announced.

Technologies

Layer	Stack
UI	React, Vite
Backend	Python, Flask
Inference	`llama-cpp-python`, local GGUF models (Qwen2.5-Coder family defaults)
Testing	The private alpha codebase includes extensive Python unit and integration tests, plus frontend unit tests for key UI logic (not included in this showcase repository).

Design principles

Local-first — your code stays on your machine
Review before write — automation must be inspectable
Reversible changes — backups and revert are first-class
Mode honesty — wrong-mode requests get recommendations, not silent misfires
Multi-model by design — one model is not enough for all task types
Transparency over magic — activity timelines, system maps, and sourced self-knowledge

CodeIAK — inspectable automation for local development.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
docs/media		docs/media
.gitignore		.gitignore
NOTICE.md		NOTICE.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CodeIAK

Why CodeIAK

At a glance

Interface preview