Site reliability engineer with 8 years in the field and experience in production incident response, observability, and automation across payment and distributed backend systems. Led incident response for critical payment systems, driving a 50% reduction in MTTR and a 30% improvement in on-call efficiency. Owned the primary on-call rotation, resolving 40+ production incidents and configuring Prometheus and Grafana.
Key Highlights
Reduced MTTR by 50% and improved on-call efficiency by 30% for critical payment systems through cross-functional collaboration and postmortem-driven fixes.
Owned primary on-call rotation for distributed backend services, resolving 40+ production incidents and executing emergency patches for Sev2 outages.
Built observability infrastructure across Splunk, Prometheus, and Grafana, configuring alerts, metrics, and dashboards to enforce SLO compliance.
Cut internal tooling page load latency from 240 ms to 34 ms through lazy loading, improving reliability of tools used by store associates.
Developed an Image Translator tool using PyTesseract and GenAI to automate extraction and translation of non-English text from images, reducing ticket resolution time by 40%.
Led incident response for critical payment systems, driving a 50% reduction in MTTR and 30% improvement in on-call efficiency through cross-functional collaboration and postmortem-driven fixes.
Developed and deployed automation tools to reduce operational toil, eliminate manual intervention, and improve team engineering capacity.
Software Development Engineer
Amazon
Feb 2022 – Mar 2023
Developed and integrated pagination for Orders Dashboard dynamic pages using functional components and custom hooks, increasing capacity from 50 to 200+ orders.
Built support for variable weight items across backend services, including the Order Management Service.
Associate Data Scientist
Prakshep Pvt. Ltd.
Sep 2017 – Oct 2018
Created a dynamic web scraper using BeautifulSoup and Selenium to collect weather data based on geospatial and crop inputs, improving speed by 30%.
Developed a license plate detection and recognition model for Indian vehicles using YOLOv2 with OpenCV and OpenALPR, achieving 86% test accuracy.