Shivdevkumar T R
Lead SRE Engineer
Hyderabad | /in/shivdevkumar/ | +91 8903432228 | shivdkum@gmail.com
Dedicated and experienced Lead Site Reliability Engineer with a proven track record of optimizing system
performance and ensuring high availability of mission-critical applications. Proficient in implementing and
managing monitoring solutions and Site Reliability Engineering (SRE) practices, and in Python
programming for automation and scripting tasks.
Work History
2021-07 - Current
Lead Site Reliability Engineer
Copart, Hyderabad
Lead a team of engineers in developing and maintaining internal applications.
Hands-on backend development experience.
Experience with AWS EC2, S3, ELB, Auto Scaling, and CloudFormation.
Automated ad hoc troubleshooting steps for resolving alerts.
Mentored junior engineers, sharing knowledge of best practices for site reliability
engineering methodologies.
Managed and optimized infrastructure on AWS Cloud and on-premises with
Kubernetes, Docker, and VMware, resulting in improved resource utilization and
cost savings of 15%.
Automated deployment and configuration processes with Terraform, Jenkins,
Ansible, and Puppet, reducing manual effort and increasing efficiency.
Ensured high availability of services by developing comprehensive disaster
recovery plans and backup procedures.
Evaluated new technologies and tools to enhance overall system performance,
stability, and security.
Handled incident bridge calls to resolve incidents, performed root cause
analysis and postmortems, and proposed solutions to mitigate potential issues in
the future.
2019-01 - 2021-06
Site Reliability Engineer
Copart, Dallas
Implemented comprehensive monitoring and logging solutions with New Relic
and Prometheus in both on-premises and cloud environments, enabling
real-time visibility into system health.
Conducted regular performance tuning and capacity planning to ensure system
reliability and scalability.
Demonstrated proficiency in cloud concepts and best practices, including
elasticity, scalability, security, and cost optimization.
2017-07 - 2018-12
Systems Engineer Intern
Copart, Dallas
Assisted in the design and implementation of CI/CD pipelines using Jenkins,
accelerating software delivery cycles.
Contributed to the configuration and management of Kubernetes clusters to
support containerized applications.
Supported the development and maintenance of automation scripts using
Python for various infrastructure tasks.
2014-01 - 2016-07
Technical Support Engineer
Cisco, Bangalore
Provided technical support and troubleshooting assistance to customers on
networking products and solutions.
Collaborated with cross-functional teams to resolve complex technical issues
and ensure customer satisfaction.
Developed and delivered technical training sessions for internal teams and
customers.
2013-03 - 2013-12
Customer Support Engineer
Sutherland Global Services, Chennai
Provided remote assistance to clients, ensuring timely resolution of software and
hardware concerns.
Mentored junior members of the team on best practices in issue resolution
techniques.
Served as an escalation point for challenging technical inquiries, demonstrating
expertise in product knowledge and problem-solving abilities.
Conducted root cause analysis of technical issues, implementing preventive
measures for future occurrences.
Skills
DevOps Tools: Kubernetes, Jenkins, Docker, Spinnaker, GitHub, VMware
Programming: Python, Golang
Monitoring: New Relic, Grafana, Prometheus
Infra/Config: Terraform, Ansible, Puppet, REST API
Database: SQL, Postgres, MongoDB
Cloud: AWS, GCP
Certifications
2024-06
Certified Kubernetes Administrator (CKA)
2024-08
AWS Certified Solutions Architect - Associate
Education
2016-2018
MS in Computer Science
University of Texas - Dallas
GPA: 3.67
2008-2012
B.Tech in Information Technology
Anna University - Chennai
GPA: 3.86