llm-logparser

Convert full LLM export dumps into clean, human-readable Markdown — offline-first, deterministic, CLI-centric.

llm-logparser parses conversation logs (JSON / JSONL / NDJSON), normalizes them into thread records, and exports GitHub-Flavored Markdown with metadata — built for reproducibility, audits, archiving, and migration.

No cloud. No telemetry. Your data stays local.


✨ What it does

  • Parse → Normalize → Export (Markdown)
  • Thread-based layout with YAML front-matter
  • Automatic splitting (size / count / auto)
  • Localized timestamps (locale + timezone support)
  • Chain mode: parse & export in one command
  • Deterministic, offline workflows
  • Future-proof architecture (multi-provider adapters)

The MVP currently focuses on OpenAI logs; support for additional providers such as Claude and Gemini is planned.
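
For illustration only, a single normalized record in parsed.jsonl might look roughly like this (the field names are an assumption, not the documented schema; internally, timestamps stay UTC ISO-8601, as noted under Localization):

{"thread": "abc123", "provider": "openai", "role": "user", "timestamp": "2025-10-18T01:00:00Z", "content": "Good morning!"}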


🚀 Quick Start

Install (local dev):

pip install -e .

Parse an export:

llm-logparser parse \
  --provider openai \
  --input examples/messages.jsonl \
  --outdir artifacts

Export a parsed thread to Markdown:

llm-logparser export \
  --input artifacts/output/openai/thread-abc123/parsed.jsonl \
  --timezone Asia/Tokyo \
  --formatting light

End-to-end (parse → export everything):

llm-logparser chain \
  --provider openai \
  --input examples/messages.jsonl \
  --outdir artifacts \
  --timezone Asia/Tokyo

📁 Directory Layout

artifacts/
  output/
    openai/
      thread-<conversation_id>/
        parsed.jsonl
        thread-<conversation_id>__*.md
        meta.json (optional)

Pass only the root via --outdir. The tool creates output/<provider>/... automatically.


📝 Markdown Format (Overview)

Each file begins with YAML front-matter:

---
thread: "abc123"
provider: "openai"
messages: 42
range: 2025-10-01 〜 2025-10-18
locale: "ja-JP"
timezone: "Asia/Tokyo"
updated: "2025-10-18T10:15:00Z"
checksum: "<sha1>"
---

Messages follow in timestamp order:

## [User] 2025-10-18 10:00
Good morning!

## [Assistant] 2025-10-18 10:01
Good morning — how can I help today?

Markdown is GFM-compatible and preserves:

  • fenced code blocks
  • links
  • tables
  • quotes

🌍 Localization

llm-logparser supports localized timestamps and messages.

You can control output formatting using:

--locale   en-US | ja-JP | …
--timezone Asia/Tokyo | UTC | …

  • Dates in Markdown are rendered using the selected locale
  • Internally, timestamps remain UTC ISO-8601 for reproducibility
  • Missing or unknown locales gracefully fall back to en-US
  • --locale takes precedence when both --locale and --lang are supplied (--lang exists for compatibility)

Example:

llm-logparser export \
  --input parsed.jsonl \
  --locale ja-JP \
  --timezone Asia/Tokyo

🪓 Splitting

--split size=4M
--split count=1500
--split auto     # size=4M + count=1500

Extra tuning:

--split-soft-overflow 0.20
--split-hard
--tiny-tail-threshold 20
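
A single export run might combine these flags as follows; the values shown are illustrative, not required defaults:

llm-logparser export \
  --input parsed.jsonl \
  --split size=4M \
  --split-soft-overflow 0.20 \
  --tiny-tail-threshold 20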

🔗 Chain Mode

Runs parse → export in one flow:

--parsed-root       reuse existing parsed threads
--export-outdir     place Markdown elsewhere
--dry-run           parse only (no writes)
--fail-fast         stop on first export error
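
As a sketch, a chain run that writes Markdown to a separate directory (the exports/markdown path is just an example) and stops on the first export error might look like:

llm-logparser chain \
  --provider openai \
  --input examples/messages.jsonl \
  --outdir artifacts \
  --export-outdir exports/markdown \
  --fail-fast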

🛠 CLI Reference (MVP)

Parse

llm-logparser parse \
  --provider openai \
  --input <file> \
  --outdir artifacts \
  [--dry-run] [--fail-fast]

Export

llm-logparser export \
  --input parsed.jsonl \
  [--out <md>] \
  [--split auto|size=N|count=N] \
  [--timezone <IANA>] \
  [--formatting none|light]

Chain

llm-logparser chain \
  --provider openai \
  --input <raw> \
  --outdir artifacts \
  [other export options...]

🔒 Security & Privacy

  • Offline-first
  • No telemetry
  • Sensitive logs stay local
  • Deterministic output for audits

🗺 Roadmap

  • CLI MVP (parse/export/chain)
  • Minimal HTML viewer
  • Additional providers (Claude / Gemini / …)
  • Apps SDK integration (experimental)
  • GUI (later stage)

🤝 Contributing

PRs welcome! Good places to start:

  • adapters
  • exporter improvements
  • localization

Principles:

  • deterministic core
  • provider-specific behavior lives in adapters
  • offline by default

📄 License

MIT — simple and permissive.


Author

"The words you weave are not mere echoes;
they carry weight,
and may they never be lost to the tide of time."

© 2025 Ashes Division — Reyz Laboratory
