TIB-SID: A bilingual (English/German) dataset of library catalog records with GND subject indexing for research on automated subject tagging and extreme multi-label classification.
nlp information-retrieval dataset gnd digital-libraries subject-indexing xmtc library-metadata controlled-vocabulary extreme-multilabel-classification
-
Updated
Mar 14, 2026 - Python