Acquire MARBLE Data #1

@janetzki

Goal

As a developer, I want to acquire the MARBLE data so that we can use its mappings from words to lexical domains. For Hebrew and Greek, these mappings indicate which semantic domain a word, phrase, or verse belongs to.
Motivation: disambiguate words -> reduce false negatives (FNs) -> raise DC recall above 0.30

Tasks

  • get lexical domain mappings for the Old Testament
  • get lexical domain mappings for the New Testament
  • create a list of all data that we already can access
    • --> 2 parquet files
  • parse mappings from words to lexical domains
  • think about how we can map lexical domains to SDs
  • see A.1.3 in proposal
  • try it out
  • look at what we already have
    • a) mappings from words in verses to MARBLE domains
    • b) mappings from MARBLE domains to SDs
    • c) mappings from words in verses to SDs
  • refine c) with a) and b) (set intersection)
  • (alternative idea: add lexical domains to the graph (#GNN))
  • (disambiguate example verse with lexical domains)
  • (create more labels by trying out the semi-automatic labeling function from [Refinery](https://github.com/code-kern-ai/refinery) (NLP IDE).)
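
A minimal sketch of the "refine c) with a) and b)" step, assuming all three mappings can be loaded into plain Python dicts of sets. The keys, domain names, SD names, and the `refine` function are hypothetical placeholders, not the real MARBLE or SD identifiers. Keeping the original candidates when MARBLE gives no evidence, or when the intersection is empty, is also an assumption that we may want to revisit.

```python
# Hypothetical toy data: real verse/word IDs, MARBLE domains, and SDs will differ.

# a) words in verses -> MARBLE lexical domains
word_to_marble = {
    ("verse_1", "word_1"): {"domain_A"},
    ("verse_1", "word_2"): {"domain_B", "domain_C"},
    ("verse_1", "word_4"): {"domain_D"},
}

# b) MARBLE lexical domains -> SDs
marble_to_sds = {
    "domain_A": {"sd_1"},
    "domain_B": {"sd_3"},
    "domain_C": {"sd_3", "sd_5"},
    "domain_D": {"sd_9"},
}

# c) words in verses -> candidate SDs (what we already have)
word_to_sds = {
    ("verse_1", "word_1"): {"sd_1", "sd_2"},
    ("verse_1", "word_2"): {"sd_3", "sd_4"},
    ("verse_1", "word_3"): {"sd_6"},  # no MARBLE entry for this word
    ("verse_1", "word_4"): {"sd_7"},  # MARBLE disagrees with every candidate
}


def refine(word_to_sds, word_to_marble, marble_to_sds):
    """Keep only the candidate SDs in c) that a) and b) also support."""
    refined = {}
    for word, candidates in word_to_sds.items():
        # Collect every SD reachable via the word's MARBLE domains (a -> b).
        supported = set()
        for domain in word_to_marble.get(word, set()):
            supported |= marble_to_sds.get(domain, set())

        intersection = candidates & supported
        # Assumption: fall back to the unrefined candidates if nothing overlaps,
        # so that a gap in a) or b) never removes all labels for a word.
        refined[word] = intersection if intersection else set(candidates)
    return refined


refined = refine(word_to_sds, word_to_marble, marble_to_sds)
for word, sds in sorted(refined.items()):
    print(word, sorted(sds))
```

With this toy data, `word_1` and `word_2` are narrowed to one SD each, while `word_3` and `word_4` keep their original candidates because of the fallback.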

Notes
