asr-bench

Local evaluation harness for comparing ASR workflows across:

worker-faster_whisper
Qwen3-ASR
FireRedASR2S

It contains:

benchmark manifests
a durable curated sample catalog in sample_catalog.json
local runner scripts
report generation scripts
the mobile-friendly HTML report template in report/index.html

Generated artifacts are intentionally excluded from git:

downloaded audio in data/
benchmark outputs in out/
report runtime cache in report/data.json and report/runs.json
report source-of-truth DB in report/report.db

Report API

The local report server keeps source registrations in SQLite and exposes a small HTTP API for reading and registering them:

GET /api/report/sources
GET /api/report/data
POST /api/report/rebuild
POST /api/report/register-source
POST /api/report/register-batch

Example batch registration:

curl -X POST http://127.0.0.1:18745/api/report/register-batch \
  -H 'Content-Type: application/json' \
  -d '{
    "batch_key": "eval-20260318-fr",
    "batch_label": "法语对比",
    "sources": [
      {"engine": "worker", "result_file": "/home/kevinzhow/github/asr-bench/out/a.json"},
      {"engine": "qwen", "result_file": "/home/kevinzhow/github/asr-bench/out/b.json"}
    ]
  }'
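
The same batch registration can be sketched in Python using only the standard library. This is a minimal illustration, not part of the repo: the helper names `build_batch_request` and `send` are hypothetical, the host, port, endpoint path, and payload fields are copied from the curl example above, and the response is returned as raw text because its schema is not documented here.

```python
import json
import urllib.request

# Host and port copied from the curl example; adjust if the server runs elsewhere.
BASE_URL = "http://127.0.0.1:18745"


def build_batch_request(batch_key, batch_label, sources, base_url=BASE_URL):
    """Build (without sending) a POST request for /api/report/register-batch.

    `sources` is an iterable of (engine, result_file) pairs.
    """
    payload = {
        "batch_key": batch_key,
        "batch_label": batch_label,
        "sources": [
            {"engine": engine, "result_file": result_file}
            for engine, result_file in sources
        ],
    }
    return urllib.request.Request(
        f"{base_url}/api/report/register-batch",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request, timeout=30):
    """Send a prepared request and return the response body as text.

    The response format is an unknown here, so it is not parsed.
    """
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")


# Build the request from the curl example; nothing is sent at this point.
request = build_batch_request(
    "eval-20260318-fr",
    "法语对比",
    [
        ("worker", "/home/kevinzhow/github/asr-bench/out/a.json"),
        ("qwen", "/home/kevinzhow/github/asr-bench/out/b.json"),
    ],
)
print(request.get_method(), request.full_url)
```

With the report server running, `send(request)` performs the actual registration. Keeping request construction separate from sending makes the payload easy to inspect before anything is written to the SQLite store.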

Layout

scripts/
- report build / registration / serving
report/
- static report frontend
manifest*.json
- benchmark sample lists
sample_catalog.json
- curated user-provided sample inventory grouped by issue / language / scenario
run_*
- local benchmark entrypoints
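
The naming conventions above make it easy to enumerate what a checkout offers. The following is a small sketch, assuming only the `manifest*.json` and `run_*` patterns listed in this section; the function name `list_bench_files` is hypothetical and does not exist in the repo.

```python
from pathlib import Path


def list_bench_files(repo_root="."):
    """Return manifest and runner file names found directly under repo_root."""
    root = Path(repo_root)
    return {
        # Benchmark sample lists, e.g. manifest.json or manifest.ja_zh.json
        "manifests": sorted(p.name for p in root.glob("manifest*.json")),
        # Local benchmark entry points, e.g. run_qwen_batch.py
        "runners": sorted(p.name for p in root.glob("run_*")),
    }


found = list_bench_files(".")
print(f"{len(found['manifests'])} manifests, {len(found['runners'])} runners")
```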

Notes

This repo depends on neighboring local repos for actual model execution:

/home/kevinzhow/worker-faster_whisper
/home/kevinzhow/github/Qwen3-ASR
/home/kevinzhow/github/FireRedASR2S

The benchmark repo itself stores orchestration and reporting only.
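
Because model execution lives in those neighboring checkouts, a quick pre-flight check can save a failed run. This is a sketch only: the paths are the ones listed above, and `missing_repos` is a hypothetical helper, not something the runner scripts are known to provide.

```python
from pathlib import Path

# Neighboring repos named in the notes above.
NEIGHBOR_REPOS = [
    "/home/kevinzhow/worker-faster_whisper",
    "/home/kevinzhow/github/Qwen3-ASR",
    "/home/kevinzhow/github/FireRedASR2S",
]


def missing_repos(paths=NEIGHBOR_REPOS):
    """Return the subset of paths that are not existing directories."""
    return [p for p in paths if not Path(p).is_dir()]


missing = missing_repos()
if missing:
    print("Missing neighbor repos:")
    for path in missing:
        print(f"  {path}")
else:
    print("All neighbor repos found.")
```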

