Kartik Patel
I build systems that don't wake people up at 3 AM.
Site Reliability Engineer based in Ahmedabad, India. I started with a curiosity for how things break — and spent the last few years turning that curiosity into infrastructure that doesn't.
The Journey
Started Computer Engineering
Joined Gujarat Technological University for B.E. Computer Engineering. Picked up Linux and Python early on — not because the syllabus demanded it, but because I wanted to understand how things actually worked under the hood.
Built the Foundation
Final year project: an Ansible-based Linux server automation framework that cut manual deployment time by 80%. That project wasn't just code — it was the first time I saw infrastructure as a product that needed reliability engineering, not just deployment scripts.
First Role — Platform Operations Intern
Joined Parkar with zero production access. Left four months later with Zabbix monitoring deployed across Linux server fleets, Grafana dashboards live for dev teams, and an AWS Cloud Practitioner cert earned mid-internship. Promoted to full-time before review season.
Platform Operations Engineer
Full ownership of monitoring architecture, incident lifecycle, and automation strategy across production cloud environments. Reduced MTTD by 40%, cut unplanned downtime by 35%, and eliminated 80% of operational toil through Ansible automation. This is where I operate.
How I Think
The principles that shape how I approach reliability engineering.
Observability first
You can't fix what you can't see. Every system I touch gets instrumented before it gets optimised.
Automate the toil
If a human is doing it more than twice, it should be a script. If a script is running more than daily, it should be a pipeline.
Incidents are data
A post-mortem that doesn't change something is just documentation. Every failure is a permanent improvement waiting to happen.
Reliability is a product
SRE isn't a support function. It's an engineering discipline with measurable outcomes — SLOs, error budgets, and decision-making frameworks.
Want to work together?
I'm open to SRE & Platform roles globally.
Designing resilient cloud infrastructure and self-healing systems that prioritize reliability, scalability, and operational excellence.