Site Reliability Engineering
Pune, India
Must have
-
Minimum 3 to 5 years of experience
-
Ensure applications and systems run reliably in the cloud
-
Collaborate with developers to:
-
ensure the right infrastructure such as databases, APIs, caches, etc are configured as per requirement
-
Improve the services through rigorous testing and release proces
-
consult in system design and capacity planning
-
-
Monitor availability and system health by gathering the right metrics from OS, applications
-
Use of tools / scripts to automate platform infrastructure maintenance process
-
Experience in distributed and scalable environment such as Kubernetes
-
Skilled on monitoring tools such as Grafana, Datadog, AppDynamics, Dynatrace, New Relic, etc
-
Must have experience in Terraform
Good to have
-
Experience in Azure cloud
-
Experience in PostgreSQL
-
Experience in MongoDB