Site Reliability Engineer

Tata Consultancy Services

Are you a Site Reliability Engineer seeking a new interesting challenge ?

If your answer is yes, it’s your lucky day so keep reading, it can be just what you’re looking for !

✍️ WHAT WILL YOU DO?

We are looking for a dynamic, proactive and talented person to join our team and perform the following tasks :

  • Ensure availability and stability of critical Java-based applications and cloud platforms.
  • Manage and resolve complex production incidents in 24×7 environments.
  • Conduct RCA and post-mortems to prevent recurring issues.
  • Monitor and improve system reliability, observability, and performance.
  • Troubleshoot Java applications, including JVM, GC, and performance issues.
  • Drive automation and IaC initiatives using Terraform, Ansible, Bash, and Python.
  • Collaborate with development and architecture teams to improve scalability and resilience.

WHAT TECHNOLOGIES WILL YOU USE?

  • Java Platform Troubleshooting (JVM, GC Analysis, Heap & Thread Dumps)
  • Java Application Performance & Diagnostics
  • Spring Boot
  • AWS
  • Linux / Unix
  • Terraform
  • Ansible, Bash & Python Automation
  • Dynatrace
  • CloudWatch
  • Splunk
  • ELK
  • Prometheus
  • Grafana

Por favor, para solicitar este trabajo visita es.whatjobs.com.