Key Responsibilities:
- Deliver system reviews, tuning, and resilience improvements
- Resolve incidents, analyze problems, and drive recovery
- Collaborate on solution design and deployment with Engineering teams
- Define and enforce Linux configuration standards
- Provide technical support to development and operations teams
- Lead troubleshooting and root cause analysis (RCA), and manage outsourced technical teams
- Plan and deliver OS upgrades, patching, and software upgrades
- Approve and supervise major change requests
- Conduct performance analysis and tuning, and drive automation
- Manage backup, recovery, and asset inventory
- Design access control frameworks and monitoring for service reliability
- Mentor junior engineers and support compliance/audit requirements
- Conduct capacity planning and performance management
Functional & Technical Competencies:
- Linux, Solaris, UNIX systems and services (volume manager, NTP, DNS, clustering, etc.)
- Troubleshooting and performance tuning for Linux/UNIX
- Linux server clustering implementation
- Information security knowledge
- Networking and data communication
- Experience running middleware and applications on Linux/UNIX (an advantage)
Requirements:
- Strong Linux/UNIX administration experience
- Experience with clustering, troubleshooting, and performance tuning
- Knowledge of information security and networking
- Experience with automation and scripting
- Strong problem-solving and leadership skills