This repository holds the homework of Graph Databases course developed by group LODB.
Group Members (University email id| Matricola| Github ID):-
- Mehuly Chakraborthy (mehuly.chakraborthy@studenti.unipd.it| 2107432| mehuly25)
- Hufsa Memon (Hufsa.memon@studenti.unipd.it| 2106237| Hufsa05)
- Francisca Chidubem Ezeobi (franciscachidubem.ezeobi@studenti.unipd.it| 2110514| FrancyCat)
The repository is organised as follows:
- Dataset: This folder contains the datasets in yaml format that we used for our project.
- Notebooks: This folder contains the ipynb files for data cleaning and serialisation for rdf triples.
- Ontology: This folder contains the .ttl file for our ontology and the visual schema of the same.
- RDFTriples: This folder contains the rdf triples (.ttl files) generated using the ontology and notebooks.
For our project, we have chosen to use the Women's T20 International Cricket Dataset from link mentioned above. The dataset includes detailed match information such as players, teams, runs, balls, deliveries, etc. for each match over a time span from 2009 to 2019. We intend to model an ontology that answers the over all match details - toss result, venue of the match, players of each team, winning runs and so on - for each match provided in the dataset.