Merged
20 changes: 20 additions & 0 deletions README.md
@@ -176,6 +176,26 @@ make data-build
- ✅ Full validation and transformation
- ✅ Integrated confidence scoring

#### CSV templates

**Basic template (recommended starting point):**

```csv
id,city,provider,latitude,longitude,download,upload,latency,jitter,packet_loss,timestamp
1,Your City Name,Your ISP Name,-23.5505,-46.6333,100.0,15.0,30.0,5.0,0.5,2026-01-15T10:00:00
```

**Complete template (example dataset):**

```csv
id,city,provider,latitude,longitude,download,upload,latency,jitter,packet_loss,timestamp
1,São Paulo,Starlink,-23.5505,-46.6333,165.4,22.8,28.5,3.2,0.1,2026-01-15T10:30:00
2,Belo Horizonte,Claro,-19.9167,-43.9345,92.1,15.3,38.7,6.5,0.8,2026-01-15T11:00:00
3,Curitiba,Vivo,-25.4284,-49.2733,110.5,18.2,32.1,4.8,0.3,2026-01-15T11:30:00
4,Porto Alegre,TIM,-30.0346,-51.2177,88.7,14.1,42.3,7.2,1.1,2026-01-15T12:00:00
5,Manaus,Viasat,-3.1190,-60.0217,75.3,9.8,68.2,15.7,2.5,2026-01-15T12:30:00
```
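
As a rough illustration of how a file could be checked against the template header above before ingestion, here is a minimal sketch. The function name `check_template_rows` and the choice of which columns to treat as numeric are assumptions made for this example; they are not taken from the project's pipeline, which may validate differently.

```python
import csv
import io
from datetime import datetime

# Column order copied from the template header above.
EXPECTED_COLUMNS = [
    "id", "city", "provider", "latitude", "longitude",
    "download", "upload", "latency", "jitter", "packet_loss", "timestamp",
]

# Assumption for this sketch: these columns should parse as floats.
NUMERIC_COLUMNS = [
    "latitude", "longitude", "download", "upload",
    "latency", "jitter", "packet_loss",
]


def check_template_rows(text):
    """Return a list of human-readable problems found in the CSV text."""
    reader = csv.DictReader(io.StringIO(text))
    if reader.fieldnames != EXPECTED_COLUMNS:
        return [f"unexpected header: {reader.fieldnames}"]

    problems = []
    # Data rows start on line 2 because line 1 is the header.
    for line_no, row in enumerate(reader, start=2):
        for name in NUMERIC_COLUMNS:
            try:
                float(row[name])
            except (TypeError, ValueError):
                problems.append(f"line {line_no}: {name} is not numeric: {row[name]!r}")
        try:
            datetime.fromisoformat(row["timestamp"])
        except (TypeError, ValueError):
            problems.append(f"line {line_no}: timestamp is not ISO 8601: {row['timestamp']!r}")
    return problems
```

Calling `check_template_rows(open("my_data.csv", encoding="utf-8").read())` on a file built from the basic template should return an empty list; the file name here is only a placeholder.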

Copilot AI Feb 10, 2026


The templates present `jitter` and `packet_loss` as first-class measurement metrics, but the Manual CSV pipeline currently preserves these columns only as string metadata (see `src/sources/manual_csv.py`, where they are copied into `measurement.metadata`), and they are not part of the canonical `MeasurementSchema`. This is likely to mislead users into thinking those values will be validated or used in scoring and analysis. Either clarify here that these fields are stored as metadata only for Manual CSV ingestion, or remove them from the Manual CSV templates and link to the `upload_csv.py` template docs instead.

Suggested change
Note: in the Manual CSV pipeline, the `jitter` and `packet_loss` columns are preserved as string metadata only and are **not** part of the canonical `MeasurementSchema` or used directly in scoring/analysis.
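
To make the distinction concrete, the sketch below shows what "preserved as string metadata only" could look like. It is not the code in `src/sources/manual_csv.py`: the `Measurement` dataclass, its fields, and `row_to_measurement` are hypothetical stand-ins written for this illustration.

```python
from dataclasses import dataclass, field

# Columns that, per the comment above, are kept as metadata rather than schema fields.
METADATA_ONLY_COLUMNS = ("jitter", "packet_loss")


@dataclass
class Measurement:
    """Hypothetical stand-in for the canonical schema (not the real MeasurementSchema)."""
    city: str
    provider: str
    download: float
    upload: float
    latency: float
    metadata: dict = field(default_factory=dict)


def row_to_measurement(row):
    """Convert one CSV row (a dict of strings) into a Measurement."""
    measurement = Measurement(
        city=row["city"],
        provider=row["provider"],
        # Schema fields are parsed into numbers, so they can be validated and scored.
        download=float(row["download"]),
        upload=float(row["upload"]),
        latency=float(row["latency"]),
    )
    # Metadata-only columns are copied through as strings, with no parsing or validation.
    for name in METADATA_ONLY_COLUMNS:
        value = row.get(name)
        if value not in (None, ""):
            measurement.metadata[name] = str(value)
    return measurement
```

Under this sketch, a `jitter` value such as `"5.0"` ends up in `measurement.metadata["jitter"]` as the string `"5.0"`, so a malformed value would pass through unnoticed, which is the behavior the note warns users about.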

See the [complete guide](docs/MANUAL_CSV_PIPELINE.md) for details.

## Roadmap (high level)