🚆 SEPTA Delay Scraper

SEPTA Delay Scraper is an open-source project that collects real-time train data from SEPTA's public APIs, including:

Train positions (train_view.py)
Real-time trip updates (trip_updates.py)
GTFS schedule updates (rrschedules.py)

The scraper runs every 10 minutes (rrschedules.py runs once a day) and stores data in SQLite databases. This project is containerized with Docker, making deployment easy on any server.

🎯 Features

Scrapes real-time train positions
Stores historical delay data
Downloads & updates GTFS schedules
Fully automated with cron jobs inside Docker

📥 Deployment Guide

1. Install Docker & Docker Compose

Before running the scraper, install Docker:

sudo apt update && sudo apt install -y docker.io docker-compose

Verify installation:

docker --version
docker-compose --version

2. Clone the Repository

gh repo clone nathankong97/septa-delay
cd septa-delay

3. Build & Run the Scraper

docker-compose up -d --build

This will:

Install dependencies
Run scrapers every 10 minutes
Store data in SQLite databases inside data/, json file inside scraping/
Persist logs inside logs/

🐳 Managing the Scraper

Check Running Containers

docker ps

View Logs

docker logs septa_scraper

Shutdown Docker

docker-compose down

Access Container

docker exec -it septa_scraper

📁 Project Structure

septa-delay/
├── data/                 # Stores SQLite databases (Persistent)
├── logs/                 # Stores log files
├── scraping/             # Stores json files
├── septa/
│   ├── core/
│   │   ├── database.py   # Database handling
│   │   ├── fetcher.py    # API fetch logic
│   │   ├── logger.py     # Logging system
│   ├── rrschedules.py   # Fetch GTFS data and final updates
│   ├── train_view.py    # Fetch live train positions
│   ├── trip_updates.py  # Fetch real-time trip updates
├── config.py             # Configuration settings
├── Dockerfile            # Docker build instructions
├── docker-compose.yml    # Manages Docker services
├── run_scraper.sh        # Auto-runs all scrapers
├── requirements.txt      # Python dependencies
└── README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🚆 SEPTA Delay Scraper

🎯 Features

📥 Deployment Guide

1. Install Docker & Docker Compose

2. Clone the Repository

3. Build & Run the Scraper

🐳 Managing the Scraper

Check Running Containers

View Logs

Shutdown Docker

Access Container

📁 Project Structure

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
data		data
logs		logs
scraping		scraping
septa		septa
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
config.py		config.py
docker-compose.yml		docker-compose.yml
requirements.txt		requirements.txt
run_scraper.sh		run_scraper.sh

License

nathankong97/septa-delay

Folders and files

Latest commit

History

Repository files navigation

🚆 SEPTA Delay Scraper

🎯 Features

📥 Deployment Guide

1. Install Docker & Docker Compose

2. Clone the Repository

3. Build & Run the Scraper

🐳 Managing the Scraper

Check Running Containers

View Logs

Shutdown Docker

Access Container

📁 Project Structure

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages