Senior Site Reliability Engineer at Thomson Reuters (Technology - Service Management division), based in Gdansk, Poland.
Focus: resilient enterprise infrastructure, monitoring and observability, AI-powered automation, Docs-as-Code, project roadmap management, and cross-team SRE practices.
- Site Reliability Engineering: incident management, on-call (follow-the-sun), SLO/error budgets
- Monitoring and Observability: Datadog, Prometheus, Grafana, Loki, ELK
- Infrastructure as Code: Ansible, Terraform, GitHub Actions CI/CD
- Documentation: Docs-as-Code, Markdown, ServiceNow KB automation, CI/CD publishing
- Project and Roadmap Management: Scrum/Agile facilitation, cross-team coordination
- AI Dev Tools: GitHub Copilot, Claude Code, Cursor AI, LiteLLM, MCP server development
- Virtualization and Security: Proxmox VE, Docker, Kubernetes, CrowdSec, nftables
- Scripting and Automation: PowerShell, Bash, Python
- Incident lifecycle management: triage, priority assessment, post-incident reviews
- Global on-call support using a follow-the-sun model
- Root-cause analysis and reliability improvements
- SLO-driven burn-rate alerting aligned with error budgets
- Cross-team coordination across international engineering groups
- Datadog platform administration: dashboards, APM, synthetic tests, log pipelines
- Consolidating non-standardized monitors into template-based multi-alert monitors
- Alert noise reduction, tier-based routing, standardized naming and tagging conventions
- Deployment and management of Prometheus, Grafana, Loki, and the ELK stack
- Developing and maintaining strategic project roadmaps
- Coordinating feature implementation across international teams
- Aligning technical initiatives with business objectives
- Scrum/Agile facilitation: daily standups, sprint planning, retrospectives
- Docs-as-Code workflows: Markdown, Git version control, CI/CD automated publishing
- Documentation lifecycle: creation, review, approval, updates, archival, version tracking
- Documentation audits: accuracy, completeness, compliance with enterprise standards
- ServiceNow KB automation: GitHub Actions pipeline for Markdown-to-HTML publishing
- AI development tools: GitHub Copilot, Claude Code, Cursor AI, LiteLLM integration
- MCP (Model Context Protocol) server development for IDE integrations
- Custom Swagger/OpenAPI specifications for enterprise AI service integrations
- Automated workflows: PowerShell, Bash, Python, Microsoft Power Automate
- AWS (EC2, ASG, EKS, FSx, S3) and Azure (AKS) cloud operations
- Proxmox VE administration: LXC container and VM provisioning, disaster recovery
- Ansible IaC and GitHub Actions CI/CD for automated infrastructure deployment
- Docker, Kubernetes (EKS, AKS), Helm chart management
- Security hardening: CrowdSec, nftables, fail2ban, nginx, Let's Encrypt
"Building reliable systems, one automation at a time."





