GPS Support: Data Model, Standardization & Open Data Sources
Hi all,
I wanted to open a conversation around GPS support in Kloppy. As more users work with wearable tracking data (PlayerData, Catapult, STATSports, etc.), it feels like a good time to explore what GPS support in Kloppy could look like — and what would be needed to get there.
This issue is a place to discuss and brainstorm. Nothing is set in stone — just starting the conversation.
🧱 Data model
Currently, Kloppy assumes a match-based structure (with Team, Periods, etc. see Metadata), which makes sense for optical tracking. But GPS data often comes from training sessions with different structures — like multiple drills or free-form running.
- Do we need to introduce new concepts like
Session and Drill?
- Can these live alongside
Periods in the same core model?
- What should the shared/common data structure be?
🧭 Standardization
GPS data is often in lat/lon — which is tricky to work with for analysis. Ideally, we could:
- Convert lat/lon to pitch coordinates (if pitch dimensions and orientation are known, including the four corners in lat/lon)
- Apply standard smoothing techniques to reduce noise
- Support different sampling rates and interpolation strategies
Would love to hear thoughts on how best to handle coordinate normalization and smoothing in a consistent way.
🌍 Open data & ingestion
To experiment, we’ll need access to GPS data from different providers.
- Are there any open datasets available (even small ones)?
- What formats are typically used — CSV, JSON, FIT files?
- Can Ingestify play a role in fetching/standardizing the raw data?
Would love to hear from anyone who has worked with real GPS data, or who knows where to find it.
GPS Support: Data Model, Standardization & Open Data Sources
Hi all,
I wanted to open a conversation around GPS support in Kloppy. As more users work with wearable tracking data (PlayerData, Catapult, STATSports, etc.), it feels like a good time to explore what GPS support in Kloppy could look like — and what would be needed to get there.
This issue is a place to discuss and brainstorm. Nothing is set in stone — just starting the conversation.
🧱 Data model
Currently, Kloppy assumes a match-based structure (with
Team,Periods, etc. see Metadata), which makes sense for optical tracking. But GPS data often comes from training sessions with different structures — like multiple drills or free-form running.SessionandDrill?Periodsin the same core model?🧭 Standardization
GPS data is often in lat/lon — which is tricky to work with for analysis. Ideally, we could:
Would love to hear thoughts on how best to handle coordinate normalization and smoothing in a consistent way.
🌍 Open data & ingestion
To experiment, we’ll need access to GPS data from different providers.
Would love to hear from anyone who has worked with real GPS data, or who knows where to find it.