Data engineering projects and learnings:
- DE Trainings - Big data engineering homeworks: SQLite, Parquet, file processing, database operations
- Apache Airflow - Workflow orchestration: DAGs, operators, scheduling, monitoring, security
- GCP Data Engineering - Full pipeline: Airflow, PostgreSQL, Data Lake, BigQuery, Dataproc, Composer
- Lithuania Statistics Pipeline - Dataflow ETL from Statistics Lithuania API to GCP Storage and BigQuery
- LT Transport Dashboard - Car/motorcycle market analytics: Dataflow, BigQuery, Looker Studio, scheduled pipeline
- VNO Airplane Spotting - Flask web app with flight data analysis, deployed on GCP App Engine
- Rust vs Python Performance - Data processing speed comparison with benchmarks and performance analysis
- Real-time Streaming (Rust) - Event-driven pipeline: Pub/Sub → Cloud Run → BigQuery with Rust
- Gemini Pro Translator - LLM integration in BigQuery for multilingual text translation using SQL
- Rust DataFusion vs PySpark - 10 billion rows benchmark: 2.7x faster performance with DataFusion
- Google Pipe SQL - Comparison of standard SQL vs Google's new pipe syntax for readability
- BigQuery Optimization - Query performance tuning using historical execution patterns
- GCP Cost Optimization - Resource management, sustained use discounts, preemptible instances strategies
- Creative Data Engineering - Analysis of creativity aspects in data engineering with visualization
- GCP CMEK Integration - Customer-managed encryption keys implementation for secure data ingestion