Rafael Bernardo Sathler

Rafael Bernardo Sathler

Infrastructure Engineer • DevOps • Platform Engineer • AWS/GCP Cloud Architect

📧 rafaelbsathler@gmail.com 📱 +55 21 997988682 📍 Rio de Janeiro, Brazil

Professional Summary

Platform Engineer with 10+ years evolving from Linux system administration to enterprise-scale cloud-native architecture. Worn many hats across DevOps, infrastructure automation, and platform engineering. Led 8,000-repo GitHub Actions migration, orchestrated zero-downtime Kubernetes upgrades protecting revenue, and delivered $265K+ cost reductions through rightsizing initiatives, spot-instance strategies, and automation serving 5,000+ engineers across AWS/GCP.

Professional Experience

Platform Engineer
Bluecore
2024-08 - 2025-06

Re-platformed observability and future-proofed infrastructure for a high-traffic e-commerce platform.

  • Re-platformed observability infrastructure: Migrated 300+ Datadog monitors and 150+ dashboards to Chronosphere, reducing monitoring costs by 40% (~$200K annually)
  • Zero-downtime Kubernetes upgrades: Orchestrated GKE fleet upgrades to v1.32 across multi-region clusters, protecting Black Friday/Cyber Monday revenue
  • Process optimization: Authored 20+ runbooks and implemented GitOps workflows, reducing patch lead-time by 50% and infrastructure tickets by 30%
  • Technical mentorship: Provided deep technical guidance to 10+ engineering teams, solving complex infrastructure challenges and driving adoption of best practices
Senior Software Engineer
Uber
2023-08 - 2024-08

Drove company-wide migration from Atlassian Bamboo to GitHub Actions, scaling infrastructure to serve 5,000 engineers and 8,000 repos.

  • Enterprise CI/CD transformation: Led migration of 8,000 repositories from Bamboo to GitHub Actions, scaling 1,000-node GKE runner fleet with 99%+ uptime and <5min job startup SLO
  • Monorepo at scale: Supported Bazel across the organization, enabling teams to leverage developing toolchains and CI/CD pipelines with GitHub Actions, reducing build times by 60% for 5,000+ engineers
  • Cost optimization at scale: Implemented Terraform automation and spot-instance strategy, achieving $65K annual savings while reducing median build latency by 40%
  • Security & compliance: Established audit-grade compliance with automated logging, secret scanning, and SAML enforcement, closing critical audit findings
  • Platform architecture: Designed and implemented self-service infrastructure platform reducing deployment time from 24+ hours to <4 hours through advanced automation and orchestration
  • Technical documentation: Authored comprehensive 'CI/CD Playbook' and Bazel optimization guides, becoming the go-to technical resource for platform teams
Solutions Architect
Triumph Technology Solutions LLC
2023-04 - 2023-07

Led enterprise AWS cloud transformation and migration projects for Fortune 500 clients.

  • AWS MAP assessments: Executed comprehensive enterprise cloud migration assessments covering infrastructure analysis, cost optimization, and technical framework design
  • Microservices architecture: Designed and implemented scalable microservices that reduced component dependencies and improved system scalability by 300%
  • Hybrid cloud solutions: Developed secure hybrid cloud infrastructure enabling seamless data transfer between on-premise and cloud environments
  • DevOps strategy: Implemented enterprise DevOps practices that improved system reliability and enabled continuous delivery workflows
Infrastructure Engineer
Toptal
2022-03 - 2023-02

Built and maintained cloud-native infrastructure for global talent marketplace serving top-tier clients.

  • Cloud-native solutions: Designed and implemented scalable infrastructure solutions using Kubernetes, Terraform, and ArgoCD on Google Cloud
  • Solution architecture: Designed complex technical architectures translating business requirements into scalable, resilient infrastructure solutions
  • Automation excellence: Developed CI/CD pipelines and infrastructure-as-code solutions that reduced deployment time by 60%
  • Performance optimization: Monitored and optimized cloud infrastructure performance, achieving 99.9% uptime SLA
Senior DevOps Engineer
Globo.com
2018-08 - 2022-06

Achieved Guinness World Record supporting Big Data infrastructure for Brazil's largest media company.

  • 🏆 Guinness World Record: Ensured zero-downtime operation of voting infrastructure for 'Big Brother Brasil', handling record-breaking public votes that surpassed 'American Idol'
  • Big Data at scale: Architected and maintained Hadoop/Kafka clusters processing millions of requests per second with 99.99% availability
  • Production systems: Designed, deployed, and maintained highly available systems supporting Brazil's top media properties with shared ownership model
  • Advanced data engineering: Architected innovative solutions using Spark/Hadoop ecosystem for real-time analytics, processing millions of events with sub-second latency
Senior DevOps Engineer
MeuPatrocínio
2016-12 - 2018-08

Led infrastructure automation and security initiatives for Brazilian fintech platform.

  • Infrastructure automation: Developed scripts and tools that improved system efficiency by 50% and generated significant cost savings
  • CI/CD implementation: Built comprehensive deployment pipeline that improved system reliability, reduced deployment time by 70%, and increased uptime to 99.9%
  • Security hardening: Implemented multi-layered security measures protecting against attacks and ensuring data privacy compliance
  • Configuration management: Automated server provisioning and configuration management, enabling rapid scaling and consistent deployments
Infrastructure Engineer
Funcional Health Tech | Fidelize
2015-04 - 2016-03

Managed pharmaceutical industry infrastructure with focus on compliance and reliability.

  • Healthcare compliance: Managed CentOS-based infrastructure using Linux, Puppet, PostgreSQL, Redis, and VMware in regulated pharmaceutical environment
  • Multi-platform support: Ensured reliable operation of diverse web applications across multiple programming languages and database technologies
  • Proactive monitoring: Implemented Nagios monitoring solutions with 24/7 alerting, achieving 99.8% uptime and rapid issue resolution
  • System optimization: Applied deep infrastructure expertise to eliminate performance bottlenecks, achieving 60% reduction in server-related incidents through proactive monitoring and tuning
  • Advanced staging automation: Implemented staging results posting to release-please PRs with ARM64 performance metrics and user-friendly summaries
Support Specialist
Acol Consultoria & Sistemas
2014-08 - 2015-04

Advanced from intern to specialist role, managing enterprise IT infrastructure and client support.

  • Career progression: Promoted from intern to Support Specialist within 8 months, demonstrating rapid skill development and value delivery
  • Enterprise support: Provided technical support for enterprise clients, maintaining 95%+ customer satisfaction ratings
  • Infrastructure management: Managed servers, databases, and applications using Linux, VMware, and PostgreSQL technologies
  • System stability: Utilized monitoring tools and proactive maintenance to ensure reliable operation of critical business applications

Technical Skills

Cloud Platforms & IaC

AWSGCPTerraformAnsibleHelm

Container Orchestration

KubernetesDockerHelmKustomizeIstioArgoCDGoogle Kubernetes EngineAmazon EKS

CI/CD & DevOps

GitHub ActionsGitLab CIJenkinsAtlantisBazel

Observability & Monitoring

PrometheusGrafanaELKLokiMimirDatadogChronosphereNew RelicSignOz

Programming & Scripting

PythonBashGoJavaScriptYAMLHCLSQL

Big Data & Databases

Apache KafkaHadoopApache SparkPostgreSQLRedisElasticsearchBigQuery

Security & Compliance

RBACSAML/SSOVaultSecurity ScanningCompliance AuditingInfrastructure Security

Soft Skills

Effective in cross-functional teamsClear technical writingLeadershipMentoringProblem solving

Key Projects

Resume-as-Code Platform
World-class resume generation system showcasing enterprise-grade platform engineering and DevOps excellence
  • Architected comprehensive three-tier CI/CD pipeline: PR Preview (Netlify) → Staging (Netlify) → Production (GitHub Pages)
  • Implemented Infrastructure-as-Code approach with Docker containerization, multi-stage builds, and cross-platform support (ARM64/AMD64)
  • Built automated dependency management system with intelligent PR recreation and zero-maintenance updates achieving 99%+ pipeline success rate
  • Engineered multi-format PDF generation system (Screen/Print/ATS optimized) with Puppeteer achieving 99.5% bandwidth reduction through image optimization
  • Designed comprehensive testing strategy with 20 viewport/theme combinations, accessibility compliance (WCAG 2.1 AA), and performance monitoring
  • Created automated release management with Release-Please integration, conventional commits enforcement, and semantic versioning
  • Implemented enterprise security practices including vulnerability scanning, secret detection, and environment protection rules
  • Achieved 95+ Lighthouse scores, sub-3-second load times, and comprehensive observability with detailed pipeline monitoring
Platform EngineeringDevOpsCI/CDDockerGitHub ActionsInfrastructure as CodeMulti-Environment DeploymentPerformance OptimizationSecurity AutomationRelease Management
Self-Service Terraform Platform with Atlantis Integration
Enterprise-grade GitOps platform enabling teams to self-apply Terraform through GitHub PR interface using Atlantis
  • Built self-service Terraform deployment system using Atlantis enabling 50+ developers to independently manage infrastructure
  • Integrated Atlantis with GitHub PR workflows for automated Terraform plan/apply via PR comments
  • Implemented GitOps workflows with Terragrunt and Atlantis allowing teams to apply infrastructure changes seamlessly
  • Created secure multi-environment promotion pipelines with Atlantis server reducing deployment time from days to minutes
  • Established governance controls with policy-as-code using Sentinel and OPA integrated with Atlantis workflows
  • Released v4.2.0 with enhanced industry-standard version management and comprehensive CI/CD pipeline
TerraformAtlantisTerragruntGitOpsGitHub ActionsOPASentinelSelf-ServiceInfrastructure as Code

Education

Bachelor in Computer Science
Universidade do Estado do Rio de Janeiro (UERJ)
2011-06 - 2016-06

Training

Kubernetes Fundamentals (LFS258)
The Linux Foundation
2025-08-25
  • 35-hour hands-on training covering cluster architecture, container orchestration, Helm, Ingress, RBAC, and production troubleshooting
  • Linux Foundation training course with certificate of completion and verified Credly digital badge