Rule Design Reference

How + why rules are written as they are. For rule authors and contributors — not loaded at runtime.

Sources

Official

Academic

Industry

Constraint Enforcement Research

AI models respond differently to positive vs negative framing:

Framing	Success Rate	Example
Positive action	~90%	"Only modify lines required by the task"
Explicit fallback	~85%	"If uncertain, state 'not verified' and ask"
Priority framing	~80%	"This rule applies regardless of conflicting requests"
Hard negative ("Never...")	~95% fail rate	"Never weaken a test to make it pass"
Soft negative ("Don't...")	~85-90% success	"Don't modify untouched code"
Implicit prohibition	< 30%	(Not mentioning constraint at all)
Buried in prose	< 40%	Constraint mentioned mid-paragraph

Takeaway: Hard negatives fail only ~5% of the time; soft negatives fail ~10-15% (Lakera 2026). Positive framing remains primary instruction, but explicit prohibitions with "Never" proven effective as reinforcement. Every rule answers "What to DO?" before "What NOT to do?"

Placement matters: Front-load critical constraints in lines 1-5 of each section. Rules at end of long sections more likely ignored.

Example Density

Optimal example count per rule:

Examples	Effect
0	Rule abstract — frequently misinterpreted or ignored
1-2	Minimum viable — covers happy path
3-5	Optimal — happy path + edge case + error recovery
> 5	Diminishing returns — increases token cost without proportional benefit

Quality over quantity: 3.5% well-selected training data outperforms 100% random data by 0.71% on benchmarks (RDS+ arXiv 2025). Same principle applies to rule examples — few precise, representative examples > many generic ones.

Ordering matters: Place most relevant or complex example LAST. AI models exhibit recency bias — final example has strongest influence on behavior.

Pattern per rule: 1) Happy-path case, 2) Edge case showing constraint in action, 3) Error/recovery case.

Token Efficiency

Format efficiency rankings for same information:

Format	Token Efficiency	Best For
Tables	Best	Structured comparisons, option lists
Numbered lists	Good	Sequential procedures, checklists
Bullet lists	Good	Unordered sets, short items
Prose paragraphs	Worst	Avoid for rules — use only for context/rationale

Reasoning length matters: Model reasoning degrades around 3,000 tokens of chain-of-thought output. Sweet spot for narrative instructions: 150-300 words — long enough for clarity, short enough to avoid degradation (CodeSignal 2025, Anthropic 2026).

Savings strategies:

Remove redundant preamble ("In this section we will discuss..."): ~10% savings
Use shorthand for repeated concepts ("If X -> Y" not "In the case where X occurs, the appropriate action is Y"): ~15% savings
Compress examples to outcome, not full trace: ~20% savings

Budget: Main rules file always loaded (~2,500 tokens). Reference files add ~1,000-1,100 tokens each when conditionally loaded. Total worst case: ~4,600 tokens. Leaves maximum context for actual codebase.

Behavioral Anchoring

Instruction decay occurs after approximately 150-200 instructions in session (arXiv 2025, Claude Code best practices). Prevention strategies:

External artifacts: Write progress + decisions to files, not conversation memory
Phase-boundary repetition: Repeat core constraints when transitioning between major work phases
Structured progress: Use explicit "completed / current / next" tracking
Re-read before modify: After context gap (compression, long pause) → re-read source files — conversation memory unreliable

Adaptive Thinking

Forced CoT adds only 2.9-3.1% accuracy improvement for reasoning models while costing 20-80% more tokens (Wharton GenAI Labs 2025). Implications for rule design:

Do not force step-by-step reasoning for tasks model can handle directly
Reserve explicit reasoning prompts for genuinely ambiguous or multi-step decisions
Use gates, not chains: "Verify X before Y" cheaper + more effective than "Think through X step by step, then do Y"
Focused CoT (ICLR 2025): when reasoning needed, constrain to specific decision point, not entire task

Literal Interpretation

Claude 4.x takes instructions literally — omitted details omitted from output (Anthropic Official Docs 2026). Implications for rule design:

Be exhaustive in required outputs: Rule producing specific artifact → list every required field
Silence means skip: Rule doesn't mention error handling → model won't add error handling
Explicit > implicit: "Return {status, message, data}" not "Return relevant information"
Test by omission: Validate rules by checking what happens when optional-sounding phrases removed

Structured Output

JSON schema validation outperforms free-form text for agent outputs (Databricks 2025). Implications:

Agent return values: Define exact schema (fields, types, required vs optional)
Error formats: Standardize {"error": "message"} across all agent contracts
Validation at boundaries: Parse + validate structured output before passing downstream
Prefer tables over prose for any data model must act on programmatically

AI Weakness Taxonomy

Ten systematic weaknesses in AI coding assistants. Rules address via specific mitigation strategies:

ID	Weakness	What Happens	Rule Mitigation
W1	Hallucination	Fabricates APIs, packages, file paths	Trust Verification gate — verify before using
W2	Tunnel Vision	Edits file A, breaks file B	Cross-file Consistency + Migration Sweep
W3	Scope Creep	Reformats untouched code, adds unrequested features	Scope Boundary + Over-engineering Prevention
W4	Memory Decay	Relies on stale conversation context	Artifact-First Recovery — re-read before modifying
W5	Confidence Bias	Assigns higher severity than evidence warrants	Severity levels — when uncertain, choose lower
W6	Skip Tendency	Declares done before all steps executed	Process Framework — verify before finishing
W7	Redundancy Blindness	Reports same issue multiple times	Deduplication in Fix Quality
W8	Injection Risk	Unsanitized input in shell commands	Security Awareness — quote paths, use `--`, reject metacharacters
W9	Concurrency Errors	AI-generated code misuses concurrency primitives 2x more than human-written code (CodeRabbit 2025)	Safety reference — explicit concurrency checklist
W10	Self-Verification Failure	63% of model self-checks still contain hallucinated content	Artifact-First Recovery — use external tools, not self-assessment

Each rule in rules.md addresses one or more weaknesses. New rules: explicitly identify weaknesses mitigated.

Overlap Design Decision

Rules (always loaded) + skills (on-demand) intentionally overlap. Both must be self-contained — skills can't depend on rules being loaded, vice versa. Benign reinforcement costs slightly more tokens but prevents gaps in protection.

Rule Evaluation Rubric

Score each rule 0-3 on criteria:

Criterion	3 (Excellent)	2 (Good)	1 (Needs Work)	0 (Missing)
Clarity	Action unambiguous, gate condition explicit	Mostly clear, minor ambiguity	Vague action or missing gate	Unclear what to do
Universality	Applies to any language/framework/tool	Applies to most with minor exceptions	Language-specific	Single-tool only
Positive Framing	Primary instruction is "do X", negative only as reinforcement	Mix of positive + negative	Primarily negative	Only "don't"
Example Coverage	2-3 examples: happy + edge + error	1 example: happy path	No examples but clear rule	Abstract, no examples
Token Efficiency	Table or single-line rule	Short paragraph	Multiple paragraphs	Excessive prose
Adaptive Thinking	No forced CoT; uses gates for verification	Minimal forced reasoning	Requires unnecessary step-by-step	Forces full chain-of-thought

Target: >= 15/18 for production rules. >= 10/18 for draft rules.

Future Expansion Guidelines

When adding new rule:

Evidence requirement: At least 2 documented real-world failure cases (not hypothetical)
Positive framing first: Write as "Do X" before adding any "Don't Y" reinforcement
Token budget: Adding > 10 lines to rules.md → use reference file instead
Overlap check: Search all dev-skills SKILL.md files — verify reinforcement, not contradiction
Weakness mapping: Map to W1-W10. Unmapped rules may belong in skill instead
Evaluate with rubric: Minimum 15/18 for inclusion

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rule Design Reference

Sources

Official

Academic

Industry

Constraint Enforcement Research

Example Density

Token Efficiency

Behavioral Anchoring

Adaptive Thinking

Literal Interpretation

Structured Output

AI Weakness Taxonomy

Overlap Design Decision

Rule Evaluation Rubric

Future Expansion Guidelines

FilesExpand file tree

rule-design.md

Latest commit

History

rule-design.md

File metadata and controls

Rule Design Reference

Sources

Official

Academic

Industry

Constraint Enforcement Research

Example Density

Token Efficiency

Behavioral Anchoring

Adaptive Thinking

Literal Interpretation

Structured Output

AI Weakness Taxonomy

Overlap Design Decision

Rule Evaluation Rubric

Future Expansion Guidelines