Skip to content

hideaway007/2d-comic-video-workflow

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

2D Comic Video Workflow

Local-first tooling for turning static comic pages, narration beats, and storyboard prompts into reviewable 2.5D motion-comic video assets.

This repository is not a one-click AI video black box. It keeps the production chain inspectable:

comic pages / storyboard beats
  -> panel or vertical-page planning
  -> structured motion plan
  -> deterministic normalization
  -> Remotion preview rendering
  -> FFmpeg and QA verification

What It Does

  • Detects and crops comic panels, then builds a stable panel_pack.json.
  • Supports 9:16 single-page story frames for short-form comic videos.
  • Validates narration beats, director visual plans, storyboard continuity, image prompts, and runtime inputs.
  • Converts AI-friendly planning output into deterministic motion_plan.json data.
  • Renders conservative 2.5D motion previews with Remotion.
  • Produces QA reports, review sheets, and FFprobe-backed video checks.
  • Includes Codex skills under .codex/skills/video-upstream-planner and .codex/skills/video-generation-template for generic narration-first 9:16 video production.

Current Boundaries

  • Runs locally. No hosted SaaS or deployed backend is claimed.
  • Keeps generated media out of git by default.
  • Does not include private API keys, personal run history, or generated video/image outputs.
  • TTS integration expects user-provided provider credentials through environment variables.
  • AI planning is optional; the deterministic mock planner keeps the sample workflow runnable without external model access.

Requirements

  • Node.js 24 or compatible
  • npm
  • FFmpeg and FFprobe

Check:

node --version
npm --version
ffmpeg -version
ffprobe -version

Install

npm install
npm --prefix render/remotion install

Quick Start

Generate sample inputs and project artifacts:

npm run build:sample

Run tests:

npm test

Render previews after building sample output:

npm run render:preview

Run QA and verification:

npm run qa
npm run verify

Input Layout

Put real comic pages here:

input/pages/
  page_001.png
  page_002.png

Supported image formats are png, jpg, jpeg, and webp. Local outputs are written to project_output/.

Output Layout

project_output/
  panels/
    panel_pack.json
    crops/
  plans/
    analysis_plan.json
    normalizer_report.json
    motion_plan.json
  assets/
  render/
    remotion/
  reports/
    qa_report.md
    review_sheet.md

For 9:16 video runs, the skill workflow uses:

project_output/control-page-runs/<run-id>/
  01_script/
  02_prompts/
  03_images/
  05_video/

Main Scripts

  • npm run build: build panel packs, planning input, analysis plan, normalized motion plan, and runtime assets.
  • npm run build:sample: create deterministic sample pages and build the sample workflow.
  • npm run render:preview: render Remotion previews.
  • npm run qa: write QA reports and review sheets.
  • npm run verify: verify assets, videos, and QA gates.
  • npm run validate:vertical-prompts -- <run-folder>: validate 9:16 storyboard/image prompt contracts.

Codex Skill

The project-local skills are:

.codex/skills/video-upstream-planner/
.codex/skills/video-generation-template/

They define a generic narration-first workflow for explainers, fiction shorts, product demos, documentary shorts, social ads, and custom visual videos:

  1. Build and validate the 00_brief upstream planning pack.
  2. Write narration and a review package.
  3. Build a beat-first timeline.
  4. Create a director visual plan.
  5. Create storyboard continuity plan.
  6. Generate 9:16 vertical page prompts and image worker batches.
  7. Gate image review before TTS and video rendering.

Optional TTS

The TTS script reads provider credentials from environment variables or a local env file. Do not commit credentials.

export DOUBAO_TTS_API_KEY=...
# or
export VOLCENGINE_TTS_API_KEY=...

Dry-run mode can validate request construction without calling the provider.

Repository Hygiene

Ignored by default:

  • node_modules/
  • input/
  • project_output/
  • generated audio/video/image outputs
  • local editor files

This public package was exported as a clean snapshot without the source repository history.

License

MIT

About

Local-first 2D comic to 2.5D motion-comic video workflow with Remotion, FFmpeg, QA gates, and a Codex skill

Topics

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors