site reliability engineer in bengaluru / bangalore

randstad india
position type
apply now

bengaluru / bangalore
Information Technology
position type
reference number
randstad india

job description

site reliability engineer in bengaluru / bangalore

The team is looking for highly motivated and talented DevOps/Site Reliability Engineers (SRE) engineers within its Big Data ecosystem to build the next generation of Customer Intelligence Platform. Responsibilities:
• Build, deploy, and manage business applications to cloud platforms using Containers orchestration, Service mesh, API gateways, CI/CD components & Observability stacks.
• Collaborate with Product Managers and Developers in self-sufficient teams to implement and follow best DevOps capabilities and practices.
• Own service or service availability and strong expertise in troubleshooting complex production issues. Requirements:
● 8 to 14+ years of relevant work experience.
● Software engineers with a bent towards Operations engineering or vice versa.
● Experience in managing Spark and Elastic search clusters at scale.
● Determine future capacity needs and investigate new products and/or features.
● Keep an eye and always look for opportunities to the cost associated with cloud infrastructure.
● Have a passion for automation by creating tools using Python, Java, or other JVM languages.
● Experience with CI/CD practices (Jenkins), Deployment patterns, and relevant toolsets.
● Infrastructure as code & Configuration management with tools like Terraform, Ansible, etc
● Observability practices and toolchains (Monitoring, Metrics, Logging, Alerts & Tracing)
● Guide to improve the stability, security, efficiency, and scalability of systems.
● Good experience with GCP (Google Cloud) is desirable.
● Expertise in configuration management (such as Ansible, salt) for deploying, configuring, and managing servers and systems● The candidate should be adept at prioritizing multiple issues in a fast pace environment
● Should be able to understand complex architectures and be comfortable working with multiple teams
● Ability to conduct performance analysis and troubleshoot large scale distributed systems
● Cloud security / DevSecOps Practices


Cloud, Devops, scripting, Terraform