We are seeking a Senior DevOps Engineer / SRE to take end-to-end ownership of infrastructure reliability, scalability, security, and cost efficiency across both IT services projects and product-based platforms. This role goes beyond pipeline management—you will be responsible for production stability, system observability, performance optimization, and cloud cost governance, supporting applications built using Node.js, JavaScript, React.js, React Native, and Python, deployed across monolithic, microservices, and serverless architectures.
Key Responsibilities
● Own production infrastructure reliability, uptime, and performance across all environments
● Architect, deploy, and maintain scalable, secure AWS infrastructure and dedicated servers
● Design and manage containerized workloads using Docker and orchestration platforms (ECS / EKS / Kubernetes)
● Build, optimize, and enforce CI/CD pipelines using GitHub Actions and modern Git workflows
● Implement infrastructure as code (IaC) using Terraform or CloudFormation
● Establish and maintain monitoring, logging, and alerting systems for proactive incident detection
● Define and track SLOs, SLIs, and error budgets aligned with business priorities
● Lead incident response, root cause analysis (RCA), and post-mortems
● Optimize AWS cloud costs through right-sizing, usage analysis, and architectural improvements Support high-traffic and multi-tenant SaaS systems with a focus on isolation and scalability
● Ensure security, compliance, access control, secrets management, and audit readiness
● Collaborate closely with backend, frontend, and mobile teams to improve reliability and deployment workflows
● Maintain clear documentation for infrastructure, operational processes, and runbooks