A comparative study of recommender system strategies for news articles using the MIND dataset, focusing on both accuracy and beyond-accuracy metrics.
- 1. Introduction
- 2. Problem Statement
- 3. Approach
- 4. Dataset & EDA
- 5. Methods
- 6. Evaluation Metrics
- 7. Results
- 8. Discussion
- 9. Conclusion
- 10. How to Run
- 11. Future Work
Recommender systems are a core component of modern digital platforms.
This project compares:
- Content-Based Filtering (CBF)
- Collaborative Filtering (CF)
- Hybrid Models
on the MIND dataset, a widely used benchmark in news recommendation.
- Data sparsity
- Cold-start problem
- Popularity bias
- Trade-offs between accuracy, diversity, and novelty
Evaluate and compare recommender models across multiple metrics.
Models implemented:
- Popularity Baseline
- Content-Based (TF-IDF)
- Collaborative Filtering (SVD)
- Hybrid Model
Pipeline:
- EDA
- Model implementation
- Evaluation
- Comparison
π https://msnews.github.io/
- ~1M users
- ~160k articles
- ~2.2M sessions
- Long-tail distribution
- Category imbalance
- Sparse user interactions
- Short textual features
π Leads to:
- Strong CBF performance
- Weak CF in small data
- Need for hybrid model
- Popularity-based ranking
- TF-IDF on title, abstract, category
- Cosine similarity
- Recency-weighted user profile
- User-item matrix
- Truncated SVD (32 dims)
- Embedding-based similarity
[ S = \omega_{cf} \cdot norm(S_{cf}) + \omega_{cbf} \cdot norm(S_{cbf}) ]
Best weights:
- CF = 0.1
- CBF = 0.9
- AUC
- MRR
- nDCG@5 / nDCG@10
- Novelty
- Diversity
| Model | AUC | MRR | nDCG@10 | Novelty |
|---|---|---|---|---|
| Baseline | 0.5318 | 0.2671 | 0.3098 | 14.04 |
| CF | 0.5429 | 0.2632 | 0.3088 | 14.58 |
| CBF | 0.6073 | 0.3306 | 0.3718 | 16.03 |
| Hybrid | 0.6126 | 0.3310 | 0.3731 | 15.80 |
| Model | AUC | MRR | nDCG@10 |
|---|---|---|---|
| Baseline | 0.5385 | 0.2618 | 0.3079 |
| CF | 0.5541 | 0.2795 | 0.3223 |
| CBF | 0.6059 | 0.3300 | 0.3711 |
| Hybrid | 0.6084 | 0.3272 | 0.3692 |
π Hybrid performs best overall.
- CBF dominates in sparse data
- CF improves with scale
- Hybrid balances both
Trade-offs:
- Accuracy vs diversity
- Novelty vs popularity
- CBF = strongest standalone
- CF = data-dependent
- Hybrid = best overall
Download from:
π https://msnews.github.io/
Files needed:
- MINDsmall_train.zip
- MINDsmall_dev.zip
data/
βββ MINDsmall_train/
β βββ behaviors.tsv
β βββ news.tsv
β βββ entity_embedding.vec
β βββ relation_embedding.vec
β
βββ MINDsmall_dev/
β βββ behaviors.tsv
β βββ news.tsv
β βββ entity_embedding.vec
β βββ relation_embedding.vec- behaviors.tsv β user clicks & impressions
- news.tsv β article metadata
- entity_embedding.vec β entity embeddings
- relation_embedding.vec β relation embeddings
behaviors.tsv and news.tsv are used in this project.
pip install -r requirements.txtpython run_all.pypython run_all.py --max-eval-sessions 5000python run_all.py --json-output results.jsonproject-root/
βββ run_all.py
βββ requirements.txt
βββ data/
βββ baseline/
βββ collaborative_filtering/
βββ content_based_filtering/
βββ hybrid_filtering/- Neural CF
- Transformer embeddings
- Temporal modeling
- Better hybrid strategies
- π Full Report: https://github.com/BitterOcean/MIND-Recommendation-System/blob/main/Report.pdf
- π Presentation: https://github.com/BitterOcean/MIND-Recommendation-System/blob/main/Presentation.pdf
- π» Code: https://github.com/BitterOcean/MIND-Recommendation-System