Codex Academic Skills

A curated list of research-oriented skills that currently work with OpenAI Codex.

This list stays intentionally conservative. Every entry is kept only when the upstream source is one of the following:

an official OpenAI Codex skill
a repository that explicitly documents Codex interoperability
a repository that follows the open Agent Skills format Codex can read

Skill names below follow the upstream skill name or folder slug as closely as practical, so install paths and prompt mentions stay close to the source repository.

What Are Codex Skills?

Codex skills are folder-based instruction bundles that help Codex handle specific tasks more reliably.

A typical skill usually includes:

a SKILL.md file with trigger rules and workflow guidance
optional scripts, templates, and references
a stable folder structure that Codex can discover from standard skill locations

In practice, a good skill works like a reusable playbook. Codex loads it when the task matches, follows the instructions, and combines that guidance with the local repository context.

Inclusion Rules

This list keeps entries that satisfy at least one of the following:

official OpenAI Codex skills
repositories that explicitly document Codex support or interoperability
repositories built around the open Agent Skills format that Codex can consume with little or no adaptation

This list intentionally excludes:

skills that are exclusive to other platforms
workflows that rely on platform-specific built-ins and do not translate cleanly into reusable Codex skills
repositories whose Codex compatibility is unclear

How To Use This List

Treat this repository as a research-workflow index, not a marketplace. The tables help narrow the search space; the upstream SKILL.md remains the source of truth.

If you are new to the list, a task-based pass is usually enough:

For workflow design, research orchestration, and context management, start with sections 1 and 2.
For paper drafting and formal scholarly writing, start with section 3.
For literature review and evidence synthesis, start with section 4.
For demos, figures, talks, and polished presentation assets, start with section 5.
For experiments, evaluation, fine-tuning, and reproducibility work, start with section 6.
For mechanistic interpretability and model analysis, start with section 7.

Skill List

1. Planning and Workflow

Skill	What It Does	Link
`project-development`	Helps scope LLM projects and design practical research-agent architectures.	muratcankoylan/Agent-Skills-for-Context-Engineering
`notion-research-documentation`	Researches across Notion and synthesizes cited briefs and reports.	openai/skills
`notion-knowledge-capture`	Captures conversations, notes, and decisions into structured Notion pages for wiki, FAQ, or decision-log reuse.	openai/skills
`notion-meeting-intelligence`	Prepares agendas, pre-reads, and decision docs in Notion using existing context plus Codex research.	openai/skills
`autoresearch`	Orchestrates end-to-end autonomous AI research projects, routing literature, experiments, synthesis, and paper-writing workflows.	Orchestra-Research/AI-Research-SKILLs
`brainstorming-research-ideas`	Guides structured ideation for high-impact research directions.	Orchestra-Research/AI-Research-SKILLs
`creative-thinking-for-research`	Applies creativity frameworks to generate novel research ideas.	Orchestra-Research/AI-Research-SKILLs
`dspy`	Uses declarative prompt programming and optimizers to build structured research-agent workflows.	Orchestra-Research/AI-Research-SKILLs
`guidance`	Controls generation with regexes and grammars for structured outputs, JSON/XML/code, and multi-step prompting workflows.	Orchestra-Research/AI-Research-SKILLs
`instructor`	Produces Pydantic-validated structured outputs for extraction, labeling, and reliable research automation.	Orchestra-Research/AI-Research-SKILLs
`outlines`	Constrains generation with grammars and finite-state machines for structured outputs and synthetic data workflows.	Orchestra-Research/AI-Research-SKILLs
`multi-agent-patterns`	Designs supervisor, swarm, and hierarchical multi-agent systems with explicit coordination and context isolation.	muratcankoylan/Agent-Skills-for-Context-Engineering
`memory-systems`	Designs persistent agent memory architectures, compares frameworks, and evaluates retrieval quality across sessions.	muratcankoylan/Agent-Skills-for-Context-Engineering
`tool-design`	Designs agent-facing tools and MCP interfaces with clearer contracts, naming, and reduced selection ambiguity.	muratcankoylan/Agent-Skills-for-Context-Engineering

2. Deep Thinking and Research Framing

Skill	What It Does	Link
`context-fundamentals`	Explains how context works in agent systems.	muratcankoylan/Agent-Skills-for-Context-Engineering
`context-degradation`	Diagnoses lost-in-the-middle and other context failure modes.	muratcankoylan/Agent-Skills-for-Context-Engineering
`context-compression`	Compresses long sessions while preserving critical state.	muratcankoylan/Agent-Skills-for-Context-Engineering
`context-optimization`	Applies caching, masking, compaction, and partitioning strategies to extend effective context capacity and reduce cost.	muratcankoylan/Agent-Skills-for-Context-Engineering
`evaluation`	Builds evaluation frameworks for agent systems with rubric-based scoring, regressions, and outcome-focused quality gates.	muratcankoylan/Agent-Skills-for-Context-Engineering
`advanced-evaluation`	Covers LLM-as-a-judge and bias-aware automated evaluation.	muratcankoylan/Agent-Skills-for-Context-Engineering

3. Writing and Scholarly Communication

Skill	What It Does	Link
`doc`	Codex-oriented DOCX workflow with rendering checks.	openai/skills
`notion-research-documentation`	Useful for research briefs and structured evidence summaries.	openai/skills
`pdf`	Reads, creates, and reviews PDFs when layout and rendering matter.	openai/skills
`slides`	Creates and edits `.pptx` slide decks with editable output and layout validation.	openai/skills
`huggingface-paper-publisher`	Publishes papers on Hugging Face Hub, links them to models or datasets, and manages paper metadata.	huggingface/skills
`ml-paper-writing`	Writes publication-ready ML/AI/Systems papers.	Orchestra-Research/AI-Research-SKILLs
`systems-paper-writing`	Provides paragraph-level blueprints and venue-specific guidance for OSDI, SOSP, ASPLOS, NSDI, and EuroSys papers.	Orchestra-Research/AI-Research-SKILLs

4. Literature Reading and Evidence Synthesis

Skill	What It Does	Link
`notion-research-documentation`	Turns multi-source findings into cited literature notes.	openai/skills
`pdf`	Useful for paper packets, annotated drafts, and layout-sensitive reading workflows.	openai/skills
`transcribe`	Transcribes interviews, meetings, or recorded talks with optional speaker diarization.	openai/skills
`whisper`	Runs robust multilingual speech recognition and translation for interviews, lectures, podcasts, and audio corpora.	Orchestra-Research/AI-Research-SKILLs
`huggingface-papers`	Looks up Hugging Face paper pages and structured paper metadata for summaries or analysis.	huggingface/skills
`llamaindex`	Builds document ingestion and retrieval pipelines for research corpora.	Orchestra-Research/AI-Research-SKILLs
`faiss`	Provides high-performance dense retrieval for paper collections.	Orchestra-Research/AI-Research-SKILLs
`sentence-transformers`	Generates embeddings for literature search, clustering, and retrieval.	Orchestra-Research/AI-Research-SKILLs

5. Visualization and Presentation

Skill	What It Does	Link
`huggingface-gradio`	Builds Gradio web UIs and interactive research demos in Python.	huggingface/skills
`huggingface-trackio`	Tracks training metrics, alerts, and dashboards with Hugging Face Trackio.	huggingface/skills
`slides`	Builds editable slide decks for talks, posters, and result reviews.	openai/skills
`academic-plotting`	Generates publication-quality charts, ablations, and architecture figures for ML papers.	Orchestra-Research/AI-Research-SKILLs
`presenting-conference-talks`	Turns papers into Beamer/PPTX talk decks with speaker notes and talk scripts.	Orchestra-Research/AI-Research-SKILLs
`speech`	Generates narration, accessibility reads, and voiceovers via the OpenAI Audio API.	openai/skills
`imagegen`	Creates or edits bitmap figures, mockups, infographics, and other visual assets for papers or demos.	openai/skills
`transformers-js`	Runs Hugging Face models directly in JavaScript for browser-side demos and interactive research artifacts.	huggingface/skills
`langsmith`	Adds tracing, evaluation, and monitoring to LLM research workflows.	Orchestra-Research/AI-Research-SKILLs
`phoenix`	Open-source observability for tracing, evaluation, and experiment analysis.	Orchestra-Research/AI-Research-SKILLs
`tensorboard`	Visualizes scalars, embeddings, profiles, and training diagnostics.	Orchestra-Research/AI-Research-SKILLs
`stable-diffusion`	Generates figures, concept art, and presentation assets for multimodal research.	Orchestra-Research/AI-Research-SKILLs

6. Data and Experimentation

Research workflows now depend on reproducible data handling, evaluation, fine-tuning, and deployment. This section keeps those skills together.

Skill	What It Does	Link
`jupyter-notebook`	Creates clean, reproducible Jupyter notebooks for experiments and tutorials.	openai/skills
`spreadsheet`	Creates, edits, and analyzes spreadsheets with formula-aware workflows and visual checks.	openai/skills
`hf-cli`	Manages Hugging Face auth, repos, papers, datasets, buckets, jobs, and endpoints from the `hf` CLI.	huggingface/skills
`huggingface-datasets`	Explores Hugging Face datasets via the Dataset Viewer API, including configs, rows, search, filters, and parquet access.	huggingface/skills
`huggingface-community-evals`	Adds and manages evaluation results in model cards, imports external scores, and runs custom HF Hub evaluations.	huggingface/skills
`huggingface-tool-builder`	Builds reusable scripts around the Hugging Face API for metadata collection, automation, and repeatable research workflows.	huggingface/skills
`huggingface-llm-trainer`	Trains or fine-tunes language models with TRL on Hugging Face Jobs, including SFT, DPO, GRPO, reward models, and GGUF export.	huggingface/skills
`huggingface-vision-trainer`	Trains or fine-tunes detection and classification models with Transformers Trainer on Hugging Face Jobs or locally.	huggingface/skills
`huggingface-jobs`	Runs Python workloads, scheduled jobs, and CPU/GPU/TPU experiments on Hugging Face infrastructure.	huggingface/skills
`axolotl`	Provides YAML-first fine-tuning workflows for 100+ models, including LoRA, QLoRA, DPO, and multimodal training.	Orchestra-Research/AI-Research-SKILLs
`llama-factory`	Provides WebUI and CLI workflows for no-code or low-code fine-tuning across 100+ language and multimodal models.	Orchestra-Research/AI-Research-SKILLs
`unsloth`	Accelerates LoRA/QLoRA fine-tuning with lower memory use for rapid local experimentation.	Orchestra-Research/AI-Research-SKILLs
`peft`	Covers parameter-efficient fine-tuning with LoRA, QLoRA, DoRA, and related adapter methods.	Orchestra-Research/AI-Research-SKILLs
`trl-fine-tuning`	Uses TRL for post-training workflows such as SFT, DPO, PPO, GRPO, and reward-model training.	Orchestra-Research/AI-Research-SKILLs
`grpo-rl-training`	Specializes in GRPO-based post-training for reasoning, verifiable tasks, structured outputs, and custom reward functions.	Orchestra-Research/AI-Research-SKILLs
`ray-data`	Scales batch inference, preprocessing, and multi-modal ETL from a single machine to clusters.	Orchestra-Research/AI-Research-SKILLs
`nemo-curator`	Curates training corpora with GPU-accelerated deduplication, quality filtering, PII redaction, and multimodal cleanup.	Orchestra-Research/AI-Research-SKILLs
`weights-and-biases`	Tracks experiments, sweeps, artifacts, and model registries.	Orchestra-Research/AI-Research-SKILLs
`mlflow`	Handles experiment tracking, model registry, deployment, and autologging workflows.	Orchestra-Research/AI-Research-SKILLs
`lm-evaluation-harness`	Runs standardized LLM benchmarks such as MMLU, HumanEval, GSM8K, and TruthfulQA.	Orchestra-Research/AI-Research-SKILLs
`bigcode-evaluation-harness`	Benchmarks code models with HumanEval, MBPP, MultiPL-E, and `pass@k` workflows.	Orchestra-Research/AI-Research-SKILLs
`nemo-evaluator`	Runs reproducible multi-backend benchmarking across 100+ LLM/VLM benchmarks with containerized workflows.	Orchestra-Research/AI-Research-SKILLs
`vllm`	Serves LLMs with high-throughput inference and OpenAI-compatible endpoints.	Orchestra-Research/AI-Research-SKILLs
`sglang`	Serves LLMs and VLMs with fast structured generation, prefix caching, and strong JSON/tool-calling workflows.	Orchestra-Research/AI-Research-SKILLs
`llama-cpp`	Runs quantized LLMs on CPUs, Apple Silicon, and non-CUDA hardware for local or edge research deployments.	Orchestra-Research/AI-Research-SKILLs

7. Interpretability and Model Analysis

Skill	What It Does	Link
`transformer-lens`	Supports mechanistic interpretability research with HookPoints, activation caching, and causal tracing on transformer internals.	Orchestra-Research/AI-Research-SKILLs
`saelens`	Trains and analyzes sparse autoencoders for monosemantic feature discovery and superposition research.	Orchestra-Research/AI-Research-SKILLs
`nnsight`	Runs local or remote interpretability experiments on PyTorch models, including very large models via NDIF.	Orchestra-Research/AI-Research-SKILLs
`pyvene`	Performs causal interventions, activation patching, and interchange intervention training on PyTorch models.	Orchestra-Research/AI-Research-SKILLs

Installation and Usage

This repository is a curated list, not a package manager. Current Codex docs distinguish between:

local skill folders for authoring and day-to-day use
plugins for distributing reusable skill bundles more broadly

In practice, most entries here are still used by placing an upstream skill folder in a standard Codex skill directory. Some third-party repositories also ship their own installers or plugin manifests; when they do, prefer the upstream installation path they document.

Install a skill in Codex

Current Codex docs describe these standard skill locations:

repository scope: .agents/skills/<skill-name>/
user scope: ~/.agents/skills/<skill-name>/

Example 1: install an official curated skill from openai/skills

$skill-installer pdf

Example 2: install a Hugging Face skill manually

mkdir -p ~/.agents/skills
cd /tmp
git clone --depth 1 https://github.com/huggingface/skills.git
cp -R skills/skills/huggingface-papers ~/.agents/skills/

Example 3: install the Orchestra Research bundle with its upstream installer

npx @orchestra-research/ai-research-skills

Example 4: manually copy a single Orchestra skill if you only want one

mkdir -p ~/.agents/skills
cd /tmp
git clone --depth 1 https://github.com/Orchestra-Research/AI-Research-SKILLs.git
cp -R AI-Research-SKILLs/20-ml-paper-writing/academic-plotting ~/.agents/skills/

Some older guides and repos still mention .codex/skills, but the current OpenAI documentation uses .agents/skills as the standard local location.

Use a skill in Codex

Once the folder is available in a valid Codex skill location, you can invoke it naturally in your prompt.

Examples:

Use autoresearch to set up an experiment loop for this idea.
Use academic-plotting to turn these ablation results into camera-ready figures.
Use hf-cli to inspect the model, dataset, and paper metadata for this checkpoint.
Use transformer-lens to run activation-patching experiments on this model.
Use huggingface-gradio to build a demo for this paper artifact.

Recommended usage pattern

Pick one skill for one clear bottleneck.
Start with a narrow task instead of a full workflow.
Read the upstream SKILL.md before relying on the result.
If the source repository ships its own installer, plugin manifest, or fallback AGENTS.md, read its install docs before mixing methods.
For academic work, manually check citations, claims, equations, data handling, and benchmark settings.
If a skill touches remote services or external datasets, verify authentication, quotas, privacy, and licensing before running it at scale.

License

The content of this repository is released under the MIT License.

Third-party skills linked from this list keep their own licenses. Always check the original repository before installing or redistributing anything.

If you notice a dead link, a naming change, or a clearly better entry for the list, a short issue or PR is enough.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.github/workflows		.github/workflows
data		data
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README.zh-CN.md		README.zh-CN.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Codex Academic Skills

Table of Contents

What Are Codex Skills?

Inclusion Rules

How To Use This List

Skill List

1. Planning and Workflow

2. Deep Thinking and Research Framing

3. Writing and Scholarly Communication

4. Literature Reading and Evidence Synthesis

5. Visualization and Presentation

6. Data and Experimentation

7. Interpretability and Model Analysis

Installation and Usage

Install a skill in Codex

Use a skill in Codex

Recommended usage pattern

License

References

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Codex Academic Skills

Table of Contents

What Are Codex Skills?

Inclusion Rules

How To Use This List

Skill List

1. Planning and Workflow

2. Deep Thinking and Research Framing

3. Writing and Scholarly Communication

4. Literature Reading and Evidence Synthesis

5. Visualization and Presentation

6. Data and Experimentation

7. Interpretability and Model Analysis

Installation and Usage

Install a skill in Codex

Use a skill in Codex

Recommended usage pattern

License

References

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages