DevOps & Infrastructure | Hyderabad, India | Full-Time

Infrastructure Engineer

Design the cloud infrastructure that keeps TAL models available at low latency for every user on earth.

About This Role

Reliability at TAL Corp is a promise to every person who depends on our systems. The Infrastructure team makes sure that promise holds at any scale.

Responsibilities
  • Design and manage Kubernetes clusters for model serving and training workloads
  • Build and maintain CI/CD pipelines for rapid, safe deployment
  • Own the observability stack: metrics, logging, and distributed tracing
  • Drive cost efficiency across cloud compute and storage
  • Develop runbooks and conduct game-day failure simulations
Requirements
  • BS/MS in Computer Science or Systems Engineering
  • 4+ years in DevOps, SRE, or cloud infrastructure
  • Deep expertise in Kubernetes, Terraform, and AWS/GCP
  • Experience with GPU cluster management
  • Strong scripting skills in Python and Bash
Nice to Have
  • Experience with Slurm or other HPC schedulers
  • Background in FinOps or cloud cost optimisation
  • Certifications in AWS/GCP architecture

TAL Corp is an equal opportunity employer. We believe the best team reflects the full diversity of humanity — because we are building for all of it.

Apply Through Training

At TAL Corp you don't just send a résumé; you prove yourself. Apply by joining our training program. Top performers who complete it are hired into this role.

  1. Register and start your 7-day program
  2. Train, build real skills, and earn a credential
  3. Top performers go straight into hiring
