Data Scientist at Event Registry
Artificial Intelligence • Natural Language Processing • Cross-lingual Language Models • Semi-Automatic Text Processing • Data Visualization
| Package | Description | GitHub Stars | PyPI |
|---|---|---|---|
| anonipy | Data anonymization library supporting multiple anonymization strategies and techniques | ||
| datachart | Flexible data visualization library with simple API and extensive customization options |
| Dataset | Description | GitHub Stars | Repository |
|---|---|---|---|
| OG2021 | Comprehensive dataset from the 2021 Tokyo Olympics | Clarin.si | |
| SloATOMIC 2020 | Slovene translation of the ATOMIC 2020 commonsense reasoning dataset | Clarin.si |
eriknovak/cookiecutter-ml-dvc — Template for machine learning experiments using DVC for version control and reproducibility (in development).
# Install pipx for running cookiecutter
pip install pipx
# Create a new project using the template
pipx run cookiecutter gh:eriknovak/cookiecutter-ml-dvceriknovak/cookiecutter-ml-hpc — Template for machine learning experiments on HPC clusters with SLURM workload manager (in development).
# Install pipx for running cookiecutter
pip install pipx
# Create a new project using the template
pipx run cookiecutter gh:eriknovak/cookiecutter-ml-hpc



