Public datasets released alongside ForesightFlow research.
| Dataset | Description | License | Size | Tag |
|---|---|---|---|---|
| coordination-traces-100 | 500 LLM reasoning traces (100 markets × 5 coordination configs) from the Phase 0.5 shakedown of "Coordination as an Architectural Layer for LLM-Based Multi-Agent Systems" | CC-BY 4.0 | 8.8 MB | coordination-traces-100-v1 |
| ffic-inventory | 8 publicly documented Polymarket insider-trading cases mapped to 24 on-chain market identifiers, released alongside the paper "ForesightFlow: Real-Time Detection of Informed Trading in Decentralized Prediction Markets" | CC-BY 4.0 | 21 KB | ffic-inventory-v1 |
| polymarket-deadline-ils | Population-scale Deadline-ILS (ILS^dl) scores for 88 Polymarket markets across military/geopolitical, regulatory, and corporate categories. Includes LLM-recovered event dates (T_event), bootstrap CIs, and full 2,375-market attrition chain. Snapshot: 2020–2026. | CC-BY 4.0 | 311 KB | polymarket-deadline-ils-v2 |
| polymarket-resolution-typology | Three-class classification of 911,237 Polymarket markets by resolution mechanism (deadline_resolved / event_resolved / unclassifiable), with category labels and volume metadata. Snapshot: 2020–2026-04-27. | CC-BY 4.0 | 242 MB (LFS) | polymarket-resolution-typology-v1 |
| pmxt-stylized-facts-v1 | Per-market stylized-fact measurements (SF1–SF9) for 13,314 resolved Polymarket binary-event markets, week 2026-04-21 to 2026-04-27. Empirical foundation for Nechepurenko (2026) Paper 1 (event-linked perpetuals). | CC-BY 4.0 | 1.6 MB | pmxt-stylized-facts-v1 |
| pmxt-counterfactual-replay-v1 | Counterfactual simulation results (E2 margin recalibration + E3 resolution-zone protocol comparison) for 13,000+ resolved Polymarket binary-event markets. Per-(engine/mechanic, leverage, class) liquidation rates, bad-debt frequencies, drawdown, and PnL. Companion to pmxt-stylized-facts-v1 and Nechepurenko (2026) Paper 1. | CC-BY 4.0 | 19 KB | pmxt-counterfactual-replay-v1 |
| pmxt-behavioral-clusters-v1 | Fill-side behavioral clusters, feature tiers, and per-market microstructure signatures from 13.4M OrderFilled events on Polymarket CTFExchange (43,116 markets, 77,203 addresses, 2026-04-21 to 2026-04-27). k-means k=5 archetypes, 6 reviewer-defensible feature tiers, bilateral Spearman analysis with BH-FDR correction. Companion to Nechepurenko (2026) Paper 4. |
CC-BY 4.0 | 17.8 MB (LFS) | pmxt-behavioral-clusters-v1 |
Datasets released as the empirical foundation for the four-paper Event-Linked Perpetuals programme by Maksym Nechepurenko (Devnull Research). See each bundle's README for schema, methodology, and citation details.
| Bundle | Folder | DOI | Status |
|---|---|---|---|
| Bundle 1 | pmxt-stylized-facts-v1/ |
10.5281/zenodo.20107449 | Released |
| Bundle 2 | pmxt-counterfactual-replay-v1/ |
10.5281/zenodo.20108387 | Released |
| Bundle 3 | pmxt-behavioral-clusters-v1/ |
TBD (Zenodo forthcoming) | Released |
Create a subdirectory under the repo root. Each dataset directory must contain:
README.md— description, schema, quick-startDATASHEET.md— Gebru et al. (2021) datasheetCITATION.cff— citation metadataLICENSE— dataset license (CC-BY 4.0 recommended)data/— data files
Update this top-level README index, then tag as <dataset-name>-v1.
If you use these datasets, please cite the papers they accompany:
@misc{nechepurenko2026population-leakage,
title = {Information Leakage at Population Scale: An Evaluation of the Polymarket Insider-Relevant Subpopulation},
author = {Nechepurenko, Maksym},
year = {2026},
url = {http://ssrn.com/abstract=6686819},
note = {SSRN Working Paper 6686819}
}Full preprint: https://foresightflow.org/publications/information-leakage-population-scale.
Empirical Evaluation of Deadline-Resolved Information Leakage on Documented Polymarket Insider Cases
@misc{nechepurenko2026deadline-leakage,
title = {Empirical Evaluation of Deadline-Resolved Information Leakage on Documented Polymarket Insider Cases},
author = {Nechepurenko, Maksym},
year = {2026},
url = {https://papers.ssrn.com/abstract=6687398},
note = {SSRN Working Paper 6687398}
}Full preprint: https://foresightflow.org/publications/deadline-resolved-information-leakage.
@misc{nechepurenko2026coordination,
title = {Coordination as an Architectural Layer for LLM-Based Multi-Agent Systems: An Information-Controlled Empirical Study on Prediction Markets},
author = {Nechepurenko, Maksym and Shuvalov, Pavel},
year = {2026},
url = {https://papers.ssrn.com/abstract=6687518},
note = {SSRN Working Paper 6687518}
}Full preprint: https://foresightflow.org/publications/coordination-architectural-layer.
@misc{nechepurenko2026elp,
title = {Resolution-Aware Perpetual Futures on Binary Prediction Markets: An Empirical Risk-Design Framework Using Polymarket Data},
author = {Nechepurenko, Maksym},
year = {2026},
note = {Working paper. Companion datasets: pmxt-stylized-facts-v1
(DOI: 10.5281/zenodo.20107449), pmxt-counterfactual-replay-v1
(DOI: 10.5281/zenodo.20108387).}
}@misc{nechepurenko2026pmxt_clusters_paper,
title = {Fill-Side Non-Retail Trading on Polymarket: An Empirical Study of Behavioral
Tiers and Microstructure Signatures Under Quote-Attribution Constraints},
author = {Nechepurenko, Maksym},
year = {2026},
note = {Working paper. Companion dataset: pmxt-behavioral-clusters-v1
(DOI: 10.5281/zenodo.XXXXXXXX).}
}