This repository was archived by the owner on Sep 25, 2025. It is now read-only.

Unable to parse daily tracklog #74

@jleong-openedx

Hi,

I have run into a problem with edx2bigquery: it fails to process the tracklog for one particular day. The error shown in the BigQuery interface is:

gs://tracklog-2017-11-17.json.gz: Error while reading data, error message: JSON parsing error in row starting at position 12345: No such field: event_struct.duration. (error code: invalid)

The portion in the file that triggers this is:

"event_struct": {"duration": 123.45

This only occurs for files whose events contain that field (event_struct.duration), which the error indicates is not in the destination table's schema.
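
To find which fields in a tracklog line are missing from the schema, something like the following sketch may help. It assumes the schema is available as a list of field dicts in BigQuery's JSON schema layout ("name", "type", "fields"); the schema fragment, the sample line, and the helper name unknown_fields are hypothetical and only for illustration. It does not descend into repeated (list-valued) records.

```python
import json

def unknown_fields(record, schema, prefix=""):
    """Return dotted paths of keys in `record` that `schema` does not declare."""
    declared = {field["name"]: field for field in schema}
    missing = []
    for key, value in record.items():
        path = prefix + key
        field = declared.get(key)
        if field is None:
            # Key is absent from the schema at this nesting level.
            missing.append(path)
        elif isinstance(value, dict) and field.get("type") == "RECORD":
            # Recurse into nested records such as event_struct.
            missing.extend(unknown_fields(value, field.get("fields", []), path + "."))
    return missing

# Hypothetical schema fragment, not the real edx2bigquery tracklog schema.
schema = [
    {"name": "time", "type": "TIMESTAMP"},
    {"name": "event_struct", "type": "RECORD", "fields": [
        {"name": "code", "type": "STRING"},
    ]},
]

# Hypothetical tracklog line containing the field from the error message.
line = '{"time": "2017-11-17T00:00:00", "event_struct": {"duration": 123.45, "code": "x"}}'
print(unknown_fields(json.loads(line), schema))  # ['event_struct.duration']
```

Running this over each line of the decompressed tracklog would list every field that the load job is likely to reject.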

As a result, the load job fails and no data from this tracklog file is uploaded to BigQuery. edx2bigquery itself fails silently and reports no error.

Could someone please look into this? I did some searching and found what looks like a similar situation, with a possible solution: https://stackoverflow.com/questions/25279116/cannot-insert-new-value-to-bigquery-table-after-updating-with-new-column-using-s
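
If the fix is to add the missing column to the schema, as that link seems to suggest, a minimal sketch of the schema change could look like this. It again assumes a schema held as a list of field dicts in BigQuery's JSON schema layout; the helper name add_nullable_field and the schema fragment are hypothetical, and the FLOAT type is a guess based on the value 123.45. It only edits the in-memory schema and does not update any table.

```python
import copy

def add_nullable_field(schema, dotted_path, field_type):
    """Return a copy of `schema` with `dotted_path` added as a NULLABLE field."""
    new_schema = copy.deepcopy(schema)  # leave the caller's schema untouched
    fields = new_schema
    *parents, leaf = dotted_path.split(".")
    for name in parents:
        record = next((f for f in fields if f["name"] == name), None)
        if record is None:
            # Create intermediate records that do not exist yet.
            record = {"name": name, "type": "RECORD", "mode": "NULLABLE", "fields": []}
            fields.append(record)
        fields = record.setdefault("fields", [])
    if not any(f["name"] == leaf for f in fields):
        fields.append({"name": leaf, "type": field_type, "mode": "NULLABLE"})
    return new_schema

# Hypothetical schema fragment, not the real edx2bigquery tracklog schema.
tracklog_schema = [
    {"name": "time", "type": "TIMESTAMP", "mode": "NULLABLE"},
    {"name": "event_struct", "type": "RECORD", "mode": "NULLABLE", "fields": [
        {"name": "code", "type": "STRING", "mode": "NULLABLE"},
    ]},
]

# FLOAT is an assumption inferred from the sample value 123.45.
updated_schema = add_nullable_field(tracklog_schema, "event_struct.duration", "FLOAT")
```

The new field is marked NULLABLE so that rows from other days, which lack it, would still be valid against the updated schema.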

Thank you!
