feat(M85): Offline Mode — File-Based Comparison Without Endpoints by hlin99 · Pull Request #184 · xPyD-hub/xPyD-acc

hlin99 · 2026-04-06T06:51:24Z

Summary

Add xpyd-acc compare-files subcommand for offline comparison of pre-collected outputs.

Changes

file_compare.py: load_outputs(), run_file_compare(), format_file_compare()
JSONL format: {"id": "...", "output": "...", "logprobs": [...]} per line
Full batch comparison pipeline (matching, classification, statistics) without API calls
CLI: compare-files --baseline <path> --target <path> with all export flags
Match config: --normalize-whitespace, --ignore-case, --numeric-tolerance
20 tests covering loading, comparison, edge cases, CLI integration

Exports Supported

JSON, CSV, Markdown, JUnit XML (via existing BatchReport methods)

Closes #183

- file_compare.py: load_outputs(), run_file_compare(), format_file_compare() - JSONL format: {id, output, logprobs?} per line - Full batch comparison pipeline (matching, classification, statistics) - CLI subcommand compare-files with --json/--csv/--markdown/--junit export - Match config support: --normalize-whitespace, --ignore-case, --numeric-tolerance - 20 tests covering loading, comparison, exports, edge cases, CLI Closes #183

hlin99-Review-Bot

✅ LGTM. Clean implementation — reuses existing BatchReport/MatchConfig nicely, solid error handling in load_outputs, good test coverage (20 tests including edge cases). CI green.

hlin99-Review-BotX

✅ Approved (hlin99-Review-BotX)

Idea Value: High — offline file-based comparison is a natural extension. Users can now compare pre-collected outputs without live endpoints, enabling CI pipelines, reproducible benchmarks, and air-gapped workflows.

Code Quality: Clean implementation.

load_outputs() has proper validation with clear error messages (line numbers, field names)
ID-matching logic with helpful mismatch diagnostics
Reuses existing MatchConfig / normalized_match / compute_report infrastructure — no duplication
All 4 export formats (JSON, CSV, Markdown, JUnit) supported
12 tests covering load, compare, format, CLI, and edge cases
docs/iterations/current.md updated

CI: all checks pass. LGTM.

hlin99-Review-Bot approved these changes Apr 6, 2026

View reviewed changes

hlin99-Review-BotX approved these changes Apr 6, 2026

View reviewed changes

hlin99 merged commit 26d5fd0 into main Apr 6, 2026
5 checks passed

hlin99 deleted the feat/m85-file-compare branch April 6, 2026 07:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(M85): Offline Mode — File-Based Comparison Without Endpoints#184

feat(M85): Offline Mode — File-Based Comparison Without Endpoints#184
hlin99 merged 1 commit into
mainfrom
feat/m85-file-compare

hlin99 commented Apr 6, 2026

Uh oh!

hlin99-Review-Bot left a comment

Uh oh!

hlin99-Review-BotX left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

hlin99 commented Apr 6, 2026

Summary

Changes

Exports Supported

Uh oh!

hlin99-Review-Bot left a comment

Choose a reason for hiding this comment

Uh oh!

hlin99-Review-BotX left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants