docs: LoCoMo retrieval algorithm comparison findings #49

Open
rajkripal wants to merge 1 commit into main from docs/retrieval-algorithm-comparison

Conversation

@rajkripal

Owner

Adds `papers/locomo-run/retrieval-algorithm-comparison.md` with results
from four retrieval configurations tested on conv-26 (n=199), plus
cross-validation on conv-30 and conv-41.

Uniform scoring shows a small gain on conv-26 (+0.014 F1) but collapses
on conv-41 (-0.181 F1 vs qe-gte). Recommendation: keep vector+recency
blend as default; do not ship uniform scoring.
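
For readers unfamiliar with the metric, here is a minimal sketch of a token-overlap F1 of the kind commonly used for QA scoring. It assumes the F1 figures above are of this type; the actual scorer used for these runs is not shown in this PR, and `token_f1`, `mean_f1`, and the toy answer pairs are hypothetical names and data for illustration only.

```python
from collections import Counter


def token_f1(prediction: str, reference: str) -> float:
    """Token-overlap F1 between a predicted and a reference answer."""
    pred = prediction.lower().split()
    ref = reference.lower().split()
    if not pred or not ref:
        # Both empty counts as a match; one empty counts as a miss.
        return float(pred == ref)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


def mean_f1(pairs):
    """Average token F1 over (prediction, reference) pairs."""
    return sum(token_f1(p, r) for p, r in pairs) / len(pairs)


# Toy data, not taken from the LoCoMo runs.
baseline = [("the cat sat", "the cat"), ("paris", "paris")]
candidate = [("the cat", "the cat"), ("london", "paris")]
delta = mean_f1(candidate) - mean_f1(baseline)
print(f"delta F1: {delta:+.3f}")
```

A per-conversation delta computed this way is what makes the conv-26 versus conv-41 comparison meaningful: a configuration has to hold up on every conversation, not only on the one it was tuned on.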

Also removes `tests/test_retrieval_uniform.py`. It was an untracked
orphan on main (the uniform feature was never merged) and imports
`_entity_match_score` from `core.retrieval`, a symbol that does not
exist on main, so the test fails at collection with an `ImportError`.
Deleting the orphan keeps pytest clean without needing ignores.
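
The failure mode can be reproduced in isolation. The sketch below uses a stand-in module, `fake_retrieval` (a hypothetical name, not the real `core.retrieval`), that lacks `_entity_match_score`, and shows that a `from ... import ...` of a missing name raises `ImportError`. In a test file that import sits at module level, which is why pytest fails during collection, before any test runs.

```python
import sys
import types

# Stand-in for a module that exists but lacks the symbol (hypothetical name).
fake = types.ModuleType("fake_retrieval")
fake.retrieve = lambda query: []
sys.modules["fake_retrieval"] = fake


def try_import() -> str:
    """Return the exception class name raised by the import, or 'ok'."""
    try:
        from fake_retrieval import _entity_match_score  # noqa: F401
    except ImportError as exc:
        return type(exc).__name__
    return "ok"


result = try_import()
print(result)
```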
