- Run
preprocess_data.pyto generate dictionaries containing n-best asr scores for each utterance. - Run
lllm_scoring.pyto update dictionaries with llm scores for each utterance. (forgpt2andbert) - Run
combined_scores.pywith arg--lambda_paramto combine the asr and llm scores. - Run
compute_error_rate.pyto compute the error rate for a given hypothesis dictionary. gridsearch.shTests error rates on a range of lambda values.hyp_comb_10_dict_test_other.jsoncontains the hypotheses and all the scores for the automasking experimenthyp_comb_masks_10_dict_test_other.jsoncontains the hypotheses and all the scores for the selective mask-based experiment
saagar-parikh/ASR_LLM_Rescoring
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|
Repository files navigation
Releases
No releases published
Languages
- Jupyter Notebook 71.3%
- Shell 15.0%
- Python 13.7%