SRE / Reliability Engineer (Lead) with skills SRE Engineering, Kubernetes, Python Test Scripting, ITSM Principles, Dynatrace, Bash Scripting for location Any Infogain Base Location (Noida, Gurugram, Bangalore, Mumbai, Pune)
ROLES & RESPONSIBILITIES
Education:
Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field.
Certifications in Dynatrace (e.g., Dynatrace Certified Professional or similar) are a plus.
Experience:
8+ years of experience in application performance monitoring (APM), systems engineering, or site reliability engineering (SRE).
2+ years of hands-on experience implementing and managing Dynatrace in an enterprise environment, with a focus on full-stack monitoring and performance optimization.
Experience in monitoring distributed applications, microservices, containers, and Large Enterprise ecosystems .
Familiarity with cloud environments (AWS, Azure)
Technical Expertise:
Strong knowledge of Dynatrace platform capabilities (e.g., AI-driven insights, Distributed Tracing, PurePath, Real User Monitoring, Log Monitoring).
Experience with cloud-native technologies like Kubernetes, Docker, and container orchestration tools.
Proficiency with scripting and automation tools (e.g., Python, Bash).
Familiarity with monitoring best practices, such as defining SLOs, SLIs, and implementing monitoring as code.
Experience integrating Dynatrace with third-party tools like ITSM (ServiceNow), ticketing systems, and CI/CD tools.
Soft Skills:
Strong analytical skills with the ability to identify performance bottlenecks and recommend optimization strategies.
Excellent troubleshooting skills, with a focus on proactive monitoring and performance improvement.
Ability to collaborate effectively with cross-functional teams and communicate technical concepts to both technical and non-technical stakeholders.
Strong written and verbal communication skills.
Preferred Qualifications:
Experience with other observability tools (e.g., Prometheus, Grafana, Splunk, New Relic).
Familiarity with cloud-native application architectures and microservices patterns (e.g., service meshes, API gateways).
Knowledge of incident management and post-mortem processes in a highly available environment.
Familiarity with continuous integration/continuous deployment (CI/CD) processes and DevOps practices.
Experience in a large-scale enterprise environment with complex distributed systems.
EXPERIENCE
- 8-11 Years
SKILLS
- Primary Skill: SRE Engineering
- Sub Skill(s): SRE Engineering
- Additional Skill(s): Kubernetes, Python Test Scripting, ITSM Principles, Dynatrace, Bash Scripting
ABOUT THE COMPANY
Infogain is a human-centered digital platform and software engineering company based out of Silicon Valley. We engineer business outcomes for Fortune 500 companies and digital natives in the technology, healthcare, insurance, travel, telecom, and retail & CPG industries using technologies such as cloud, microservices, automation, IoT, and artificial intelligence. We accelerate experience-led transformation in the delivery of digital platforms. Infogain is also a Microsoft (NASDAQ: MSFT) Gold Partner and Azure Expert Managed Services Provider (MSP).
Infogain, an Apax Funds portfolio company, has offices in California, Washington, Texas, the UK, the UAE, and Singapore, with delivery centers in Seattle, Houston, Austin, Kraków, Noida, Gurgaon, Mumbai, Pune, and Bengaluru.