Skip to content

ruicore/codex-skills

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 

Repository files navigation

Codex Skills

Reusable AI-assisted engineering workflows for Codex.

This repository is not a prompt gallery. It is a small operating manual library for repeatable engineering work: diagnosis, code review, architecture review, test-driven development, specification writing, PRD-to-issue planning, and durable decision traces.

The goal is to make AI-assisted engineering more reliable under real project pressure. Each skill defines a workflow that an agent can follow with local repository evidence, explicit validation, and clear boundaries.

This is a personal skill library, not an official OpenAI project.

Why This Exists

I use Codex as an engineering partner, not only as a code generator. The work that benefits most from reusable skills is the work where consistency matters:

  • turning vague requests into scoped implementation plans
  • debugging with a reproduce, minimize, instrument, fix, and regression-test loop
  • reviewing architecture using repository-local evidence
  • preserving decisions so future work does not rediscover the same constraints
  • keeping TDD practical and behavior-focused
  • converting product or engineering briefs into executable issue plans

The skills are intentionally written as operating procedures because the target outcome is repeatable engineering behavior, not clever wording.

Install

Copy the skill directory you want into your Codex skills directory:

mkdir -p ~/.codex/skills
cp -R skills/diagnose ~/.codex/skills/

Then review the copied SKILL.md before relying on it in another repository. Some skills include supporting notes or scripts; keep those files with the skill directory.

Use

Each skill is self-contained:

  • skills/*/SKILL.md contains the main workflow.
  • skills/*/agents/openai.yaml contains optional discovery metadata.
  • skills/*/*.md contains supporting references, templates, or conventions.
  • skills/*/scripts/ contains helper scripts when a workflow needs one.

Use a skill when the task matches the workflow, not because the skill name sounds related. For example, use diagnose when there is a concrete bug or performance regression; use architecture-review when the question is about ownership, boundaries, authority, or change surface.

Skills and Engineering Outcomes

Skill Engineering outcome What it protects against
diagnose Builds a fast feedback loop before fixing bugs. Guessing, unverified fixes, missing regression tests.
tdd Drives implementation through behavior-focused red-green-refactor slices. Brittle tests, implementation-coupled tests, speculative code.
architecture-review Reviews ownership, authority, boundaries, drift, and change surface. Scattered rules, duplicated concepts, unclear module responsibility.
agent-legibility-review Finds repository navigation risks for future coding agents. Hidden conventions, conflicting docs, ambiguous task entry points.
grill-me Applies direct senior-engineer critique to plans and implementation choices. Weak assumptions, vague tradeoffs, under-specified risks.
grill-with-docs Grounds critique in local repository docs and decision records. Generic advice that ignores project-specific constraints.
python-backend-review Reviews Python backend fundamentals across typing, async, config, logging, packaging, and tests. Backend regressions caused by ecosystem or maintainability blind spots.
python-ecosystem-review Checks important third-party library usage and integration boundaries. Misused dependencies, version drift, unsafe ecosystem assumptions.
write-a-prd Turns rough ideas into structured product or engineering briefs. Ambiguous requirements and unreviewable implementation starts.
prd-to-issues Splits a brief into scoped, dependency-aware execution issues. Horizontal task splitting, untestable tickets, unclear sequencing.
decision-trace-writer Records durable technical decisions for future readers and agents. Repeated rediscovery of settled constraints and rationale.

Design Principles

  • Workflow over prompt: a useful skill defines steps, evidence, checkpoints, and validation.
  • Local evidence first: repository docs, code, tests, configs, and runtime behavior outrank generic best practices.
  • Bounded scope: each skill should do one kind of engineering work well.
  • Observable outcomes: a skill should produce a review, test, issue plan, decision trace, or verified fix.
  • Validation before confidence: claims should be backed by tests, repro loops, source inspection, or explicit uncertainty.
  • Reusable but not universal: these skills encode opinions and defaults; adapt them before using them in a different engineering culture.

Example Workflow

Vague task:

"This API sometimes returns the wrong result. Can you fix it?"

Repeatable skill-driven flow:

  1. Use diagnose to build a deterministic feedback loop: failing test, replay script, CLI fixture, or HTTP request.
  2. Reproduce the exact symptom and capture the failure mode.
  3. Generate ranked, falsifiable hypotheses before changing code.
  4. Add targeted instrumentation that maps to one hypothesis at a time.
  5. Convert the minimized repro into a regression test at the correct seam.
  6. Fix the bug and rerun both the regression test and the original repro.
  7. If the bug exposed unclear ownership or poor test seams, use architecture-review to identify the smallest boundary improvement.
  8. If the fix requires a broader change, use prd-to-issues to split follow-up work into verifiable slices.
  9. Use decision-trace-writer when an important tradeoff or constraint needs to be preserved.

The value is not that the agent receives a better phrase. The value is that the work becomes inspectable, repeatable, and easier to resume later.

Maintenance

This repository is versioned informally through Git history. When changing a skill:

  • keep the skill focused on its original engineering outcome
  • update supporting notes when workflow assumptions change
  • prefer concrete examples over broad advice
  • remove stale instructions rather than layering exceptions
  • test scripts or templates locally when they are part of the workflow

Because these are personal workflows, treat them as examples of agent operating design rather than universal best practices.

Releases

No releases published

Packages

 
 
 

Contributors

Languages