Skip to content

Preflight unsafe world templates#372

Merged
YSCJRH merged 1 commit into
mainfrom
codex/phase-47-runtime-world-safety
May 16, 2026
Merged

Preflight unsafe world templates#372
YSCJRH merged 1 commit into
mainfrom
codex/phase-47-runtime-world-safety

Conversation

@YSCJRH
Copy link
Copy Markdown
Owner

@YSCJRH YSCJRH commented May 16, 2026

Summary

  • Add pre-write safety preflight for create-world template payloads.
  • Expand redline terms for real-person/digital twin, political persuasion, real-world prediction, hidden surveillance, and high-risk decision requests, including Chinese examples.
  • Add regression tests proving unsafe create-world payloads are rejected before state/world artifact directories are written.

Test Plan

  • python -m pytest backend/tests/test_cli.py::test_cli_create_world_blocks_unsafe_payload_before_writing_state -q
  • python -m pytest backend/tests/test_cli.py -k "create_world or safety" -q
  • python -m pytest backend/tests/test_cli.py -q
  • python -m pytest backend/tests -q
  • python scripts/check_no_secrets.py
  • git diff --check

Safety Notes

  • Treating this as protected-core / risk:safety because it changes create-world redline behavior.
  • No Hosted GPT, BYOK, upload, auth, billing, database, object storage, quota, public API, plugin MCP contract, scenario DSL, claim labels, run trace shape, or artifact layout changes.

Fixes #368

@github-actions github-actions Bot added the lane:auto-safe Eligible for autonomous safe-lane execution and merge. label May 16, 2026
@YSCJRH YSCJRH merged commit ca91980 into main May 16, 2026
5 checks passed
@YSCJRH YSCJRH deleted the codex/phase-47-runtime-world-safety branch May 16, 2026 01:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

lane:auto-safe Eligible for autonomous safe-lane execution and merge.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Phase 47: runtime world safety preflight

1 participant