Figure out how to get the database to properly recognise author papers #5

@edoyango

Currently, the database doesn't seem to retrieve papers based on their author(s), e.g.:

python query_data.py "Do you know of any papers authored by Edward Yang?"
Response:  Yes, based on the provided context, it is known that Edward Yang has 
co-authored a research paper titled "Numerical investigation of the mechanism of granular 
flow impact on rigid control structures". The year of submission and acceptance are 
provided as well.
Sources: ['data/A_Review_on_Ocular_Biomechanic_Models_for_Assessing_Visual_Fatigue_in_Virtual_Reality.pdf:16:1', 
'data/RCP-Projects.aspx.html:None:22', 
'data/A_Review_on_Ocular_Biomechanic_Models_for_Assessing_Visual_Fatigue_in_Virtual_Reality.pdf:15:12', 
'data/1-s2.0-S0266352X20300379-main-1.pdf:20:9', 'data/s11440-021-01162-4.pdf:0:0']

This is partially right, as that is one of the papers included in the dataset. Interestingly, data/1-s2.0-S0266352X20300379-main-1.pdf (my other paper included in the dataset) was ranked as more relevant, but was not mentioned by the LLM, probably because the database returned a chunk from later in the paper.
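If that guess is right, a chunk from deep inside a paper (like `20:9`) carries no text linking it to an author, so the LLM has nothing to go on. One possible workaround is to prepend an author line to every chunk before it is embedded. The sketch below is only an illustration under assumptions: chunks are plain dicts with hypothetical `source` and `text` keys, and `AUTHORS_BY_SOURCE` is a hypothetical hand-written mapping with made-up file names and authors (in practice it might come from PDF metadata). None of these names come from the repo.

```python
# Hypothetical mapping from source file to its authors (made-up example data).
AUTHORS_BY_SOURCE = {
    "data/example_paper.pdf": ["Jane Doe", "John Roe"],
}


def add_author_header(chunk, authors_by_source):
    """Return a copy of `chunk` with an 'Authors: ...' line prepended to its text.

    Chunks whose source has no known authors are returned as an unchanged copy.
    """
    authors = authors_by_source.get(chunk["source"], [])
    if not authors:
        return dict(chunk)
    header = "Authors: " + ", ".join(authors)
    # Keep the authors both in the text (so they are embedded with it) and as metadata.
    return {**chunk, "text": header + "\n" + chunk["text"], "authors": authors}


# A chunk from late in the paper that does not mention any author itself.
late_chunk = {"source": "data/example_paper.pdf", "text": "Results are shown in Fig. 9."}
print(add_author_header(late_chunk, AUTHORS_BY_SOURCE)["text"])
```

With something like this, every chunk of a paper would carry the author names into both the embedding and the context passed to the LLM, at the cost of slightly longer chunks.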

Another example:

python query_data.py "Do you know of any papers authored by Michael Milton?"
Response:  No, there is no information in the provided context that indicates if Michael Milton 
has authored any papers or not. The term "Milton" refers to a high-performance computer 
(HPC) at WEHI, not an individual author.
Sources: ['data/RCP-Projects.aspx.html:None:22', 
'data/Milton-SLURM-2022-uplift.aspx.html:None:0', 
'data/What-is-Milton.aspx.html:None:0', 
'data/RCP-AnnualSummary.aspx.html:None:20', 
'data/RCP-AnnualReport.aspx.html:None:20']

Need to figure out how to get the database to return/recognise author information.
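
One direction that might be worth trying is storing authors as structured metadata on each chunk and matching on that field directly, rather than relying on embedding similarity (which, in the second example, latched onto "Milton" the HPC). A minimal sketch, assuming chunks are plain dicts with a hypothetical `authors` list; the ids, names, and the `chunks_by_author` helper are made up for illustration and are not part of the repo:

```python
def chunks_by_author(chunks, name):
    """Return the chunks whose 'authors' metadata contains `name` (case-insensitive)."""
    wanted = name.casefold().strip()
    return [
        chunk
        for chunk in chunks
        if any(wanted == author.casefold().strip() for author in chunk.get("authors", []))
    ]


# Made-up example chunks; a web page has no authors recorded.
chunks = [
    {"id": "data/example_paper.pdf:0:0", "authors": ["Jane Doe", "John Roe"], "text": "..."},
    {"id": "data/example_paper.pdf:20:9", "authors": ["Jane Doe", "John Roe"], "text": "..."},
    {"id": "data/example_page.html:None:0", "authors": [], "text": "..."},
]

matches = chunks_by_author(chunks, "jane doe")
print([chunk["id"] for chunk in matches])
```

An exact match on a metadata field would only return chunks from documents that actually list that author, so a page that merely mentions a similar word would not be picked up. It would still need some way to pull the author name out of the question, which this sketch does not attempt.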
