Language: English | 日本語
Authorship Strategy — a normative framework, tactical catalog, and empirical baseline for being a known author under AI-mediated diffusion.
If your readers include LLMs — as training data, as in-context consultants, as the discovery layer other researchers consult — then the strategies that protect authorship have inverted. Closing your work off reduces, not increases, the chance that a future trace lands on you. This repository records the inverted strategy: what it is, why it holds, and nine tactical decisions extracted from operating a four-repository DOI-registered research ecosystem.
The framework rests on a three-axis inversion of twentieth-century authorship strategy (scarcity to diffusion, exclusivity to derivation, enclosure to openness) and a four-layer judgment stack (authenticity, attribution diffusion, idea-versus-scaffold separation, tactics).
An empirical layer reports preliminary observations from the same ecosystem's CC0-published traffic data; the empirical claims are limited by sample size (one author, four repositories, twenty-five days) and are framed as preliminary observation rather than evidence.
When the audience for a written artifact is increasingly an LLM — directly, as training substrate or in-context reader, and indirectly, via humans who consult LLMs about it — the strategies that protect authorship change. Twentieth-century authorship was protected by enclosure: gatekept journals, proprietary licenses, controlled distribution. That strategy decreases an artifact's exposure to the LLM-mediated diffusion that increasingly determines whether a future researcher tracing causation can find the original author at all.
This repository records the inverted strategy and the tactical decisions that follow from it, in a form harness-neutral enough to be adopted beyond the author's own ecosystem.
The claim is counterintuitive: in the AI era, protecting your authorship means opening your work, not closing it.
Authenticity-validating power inverts. Where twentieth-century authorship protected origin claim through scarcity, AI-era authorship protects origin claim through diffusion. Closing reduces LLM absorption, reduces diffusion, reduces validation occasions, and weakens the authenticity claim. Opening maximizes LLM absorption, maximizes diffusion, lets validation appear as derivative work, and strengthens the authenticity claim.
The thesis is developed in docs/thesis.md; the
operational implications form the four-layer framework, also in the
thesis document. The framework's open questions are catalogued in
docs/manifesto.md.
| ADR | Decision |
|---|---|
| 0001 | Concept DOI as Canonical Reference — every external link to a DOI-registered artifact uses the concept DOI, never a version-specific DOI |
| 0002 | DOI Federation via .zenodo.json — sibling and source relationships are declared as relatedIdentifiers so that the citation network is recoverable from metadata alone |
| 0003 | Cross-Platform Dataset Federation — the same canonical artifact is mirrored to GitHub, Zenodo, and a dataset platform with explicit sibling cross-references on each platform |
| 0004 | Authorship Metadata with ORCID Auto-Update Disabled — the author identifier is enriched only with concept DOIs to prevent version sprawl from polluting the public record |
| 0005 | README Localization Policy — Audience-Driven Maintenance — locale mirrors are added or retired based on observed traffic, not speculation about prospective audiences |
| 0006 | LLM-First Ingest via Dual Entry Points — every framework-governed artifact ships a prose-form navigator and a concept-form linked-data graph as a complementary, synchronously released pair, each reaching an LLM-mediated reader sub-population the other cannot |
| 0007 | Human-Attention Platform Signals Are Not a Success Metric — Git-host star counts (gameable: purchasable) and repository page-view counts (structurally blind to LLM-mediated reach) are excluded as optimization targets and success metrics; off-page human-distribution labor is declined as a red-ocean activity; success is measured by the breadth of LLM-mediated channels carrying the author's signature |
| 0008 | RAG-Era Attribution Diffusion — Two Channels, Two Time Constants — "the model ingesting the artifact" resolves into two mechanisms with opposite time constants and levers: a parametric channel (absorbed into model weights at training time; slow; driven by cross-platform vocabulary co-occurrence) and a retrieval channel (fetched at query time; fast; driven by freshness and structure). Optimized and measured separately; ghost citation is the failure mode of pursuing retrieval alone |
| 0009 | Dual Entry Points Are Asymmetric — amends ADR-0006: 2026 measurement shows the two entry points are not co-equal. The structured graph carries retrieval-time citation while the prose navigator's citation effect is noise; the pair is retained but made asymmetric, the navigator rescoped to a Business-to-Agent (B2A) context surface rather than an AI-search citation lever |
The nine ADRs are not deduced from a framework; they were extracted from
operating the sibling ecosystem and re-expressed in harness-neutral form
so that another author can adopt the same decisions without inheriting
the original implementation details. See docs/adr/README.md
for the full index and lineage.
The docs/empirical/ directory reports preliminary
observations from twenty-five days of CC0-published traffic data across
four sibling repositories — published CC0 so they can be independently
verified. The clearest observation so far: clone counts are dominated by
automated tools (training-pipeline ingest, AI-assistant context-fetch,
crawlers), with the ecosystem's view-to-clone ratio ranging from roughly
13 to over 100 — which raises the question of what "diffusion" even means
when most access is non-human. Limitations are stated explicitly (N=1
author, no pre-versus-post intervention comparison, crawler dominance),
and all claims are framed as preliminary observation rather than evidence.
The full traffic data is published under CC0 at
https://shimo4228.github.io/shimo4228/traffic/dashboard/.
The empirical layer is intended to grow with time; subsequent releases will accumulate longer time series and (where possible) report pre-versus-post intervention contrasts for individual tactics.
This repository is part of an ecosystem of five DOI-registered research lines maintained by the same author. The lines are independent in content and release cadence, but cross-reference each other for context. (The empirical baseline below covers the four lines whose traffic was being recorded during the baseline window; Attention, Not Self began traffic observation later and joins at the next baseline update.)
- Agent Knowledge Cycle (AKC) — six-phase bidirectional growth loop for sustaining intent alignment between an AI agent and its operator over time. DOI 10.5281/zenodo.19200726. Mechanism sibling: AKC defines how knowledge cycles inside the operator-agent pair; this repository addresses how the cycle's outputs diffuse outside it.
- Contemplative Agent — autonomous agents running on a local 9B model, grounded in four contemplative axioms. DOI 10.5281/zenodo.19212118. Implementation sibling: the contemplative agent's repository participates in the empirical layer's traffic dataset.
- Agent Attribution Practice (AAP) — harness-neutral ADRs on accountability distribution in autonomous AI agents. DOI 10.5281/zenodo.19652013. Vocabulary sibling: AAP and this repository both use the word "attribution" but with disjoint meanings (accountability for action vs. credit for source). The two meanings are intentionally kept separate; see the glossary.
- Attention, Not Self — a cross-disciplinary inquiry contrasting three Buddhist Abhidharma traditions (Theravāda, Sarvāstivāda, Yogācāra) with computational phenomenology (predictive processing, active inference, global workspace theory, parallel distributed processing). DOI 10.5281/zenodo.20262112. Cross-cutting sibling: unlike the agent-design lines, it specifies no agent mechanism or practice; like this repository, it occupies their diffusion/framing layer.
The ecosystem hub is shimo4228/shimo4228.
Evaluating the strategy? Start with docs/thesis.md, then
the nine ADRs in order. Two paths need a non-obvious entry point:
- Adopting a single tactic: go directly to the relevant ADR, then check
docs/glossary.mdfor any terms that need disambiguation. - Reviewing the empirical claims: read
docs/empirical/README.mdfor method and limitations before the baseline data.
LLM agents and crawlers: see the AI-facing reading order at the bottom of this page.
AI-facing reading order (for LLM agents and crawlers)
graph.jsonld— canonical machine-readable relationship map (Concepts, ADRs, axes of inversion)llms.txt— compact navigation indexllms-full.txt— consolidated factual reference- README and project-specific docs — narrative and detail
For the canonical relationship map of shimo4228's research ecosystem, see: https://github.com/shimo4228/shimo4228/blob/main/graph.jsonld
Cite this repository using the concept DOI (which always resolves to the latest version):
Shimomoto, T. (2026). Authorship Strategy: A Normative Framework and Tactical Catalog for AI-Era Authenticity Inversion, with Empirical Grounding from a Four-Repository Research Ecosystem. Zenodo. https://doi.org/10.5281/zenodo.20263316
Full citation metadata is in CITATION.cff. For
reproducibility citation of a specific version, follow the concept DOI
to its version listing on Zenodo and cite the version-specific DOI
explicitly. See ADR-0001 for
the canonical-reference discipline.
MIT. Derivative works, re-implementations, and re-expressions in other forms are explicitly welcome. The author's strategic preference is for ideas to propagate freely; the license reflects that preference.