Mytresh Ravi

DevOps • Site Reliability • Cloud Engineer

Designing resilient cloud-native systems, improving reliability, and scaling infrastructure through automation

About Me

DevOps / Site Reliability Engineer with 4 years of experience designing, deploying, and operating highly available cloud-native systems on AWS and Kubernetes. Strong expertise in infrastructure automation, CI/CD, monitoring & observability, disaster recovery, and incident response. Proven ability to improve availability, reduce MTTR, and support production systems under real-world operational constraints.

Professional Experience

Site Reliability Engineer — Xoriant Solutions

Jun 2024 – Jan 2026

Led cloud infrastructure migration and modernization initiatives, improving platform resilience and fault tolerance. Redesigned critical workloads, resulting in a 35% increase in system availability and a 40% reduction in unplanned downtime.

Designed and implemented Jenkins-based CI/CD pipelines integrated with Git workflows, cutting release cycle time by 50% and reducing deployment-related production incidents.

Managed containerized workloads using Docker and Kubernetes, optimizing resource utilization and deployment reliability across multiple environments.

Ran disaster recovery validation, including failover drills and chaos testing, reducing MTTR by 50% and confirming that recovery workflows worked end to end.

DevOps Engineer — Synectiks

Jan 2020 – Nov 2023

Designed, deployed, and operated AWS-based cloud infrastructure supporting more than 50 microservices across development, staging, and production environments, emphasizing high availability and horizontal scalability.

Built infrastructure automation workflows using Terraform and Ansible, eliminating approximately 70% of manual provisioning tasks and improving deployment consistency and reliability across environments.

Maintained 99.9% uptime by implementing proactive monitoring, alerting, and observability strategies using CloudWatch, Grafana, and log-driven diagnostic tooling.

Participated in 24×7 on-call rotations, performing production incident triage, mitigation, and root cause analysis. Implemented long-term fixes that improved platform stability and reduced recurrence of critical issues.

Led cost optimization and performance tuning initiatives, identifying inefficient resource utilization patterns and implementing improvements without compromising SLA targets.

Collaborated closely with engineering teams to improve deployment strategies, operational workflows, and reliability best practices across cloud-native services.

Skills & Expertise

AWS / Cloud Architecture
Kubernetes / Docker
Terraform / Ansible
CI/CD / Jenkins / Git
Monitoring / Grafana / CloudWatch
Incident Response
Disaster Recovery
Reliability Engineering
Linux / Bash / Python

Featured Projects

🚗

Driver Drowsiness Detection and Alerting System

IEEE Published Research

Developed a real-time fatigue detection system leveraging computer vision techniques and Eye Aspect Ratio (EAR) tracking to identify drowsiness events with high accuracy and low latency.

Achieved 92% detection accuracy with alerts triggered in under 200 ms, keeping the system responsive enough for safety-critical scenarios.

Implemented modular architecture, robust error handling, and performance optimizations to maintain stability under varying environmental conditions.

Validated model behavior across diverse lighting and head-movement scenarios to minimize false positives and improve real-world reliability.

Python • OpenCV • Real-Time Processing • Computer Vision

🌎

Real-Time IoT Environmental Monitoring System

Designed and implemented an end-to-end telemetry pipeline for continuous monitoring of gas concentration, temperature, and humidity sensors.

Built a secure MQTT → Python → InfluxDB ingestion pipeline capable of handling high-frequency time-series data streams with sub-second telemetry updates.

Optimized retention policies and storage strategies to maintain long-term historical visibility while keeping storage utilization efficient.

Developed Grafana dashboards with anomaly detection logic, enabling rapid identification of abnormal sensor behavior and environmental spikes.

MQTT • Python • InfluxDB • Grafana • Time-Series Data

🩺

COVID-19 Detection from Chest X-Ray Images

Developed a deep learning pipeline using convolutional neural networks (CNNs) and transfer learning to classify chest X-ray images for COVID-19 detection.

Achieved 95.5% classification accuracy using the VGG16 architecture while prioritizing high recall to minimize false-negative diagnoses.

Automated inference workflows supporting batch predictions across 10,000+ medical images, significantly reducing manual review effort.

Designed the model pipeline as a reusable inference module enabling integration into future research or clinical validation systems.

Python • CNN • VGG16 • Deep Learning • Medical Imaging

📧 mytreshravi26@gmail.com
📞 +1 (732) 799-8325
📍 New Jersey, United States