Job Overview
We are seeking a detail driven Application Operations Engineer to manage enterprise monitoring platforms, automation tools, and operational systems. This role plays a key part in ensuring visibility across applications and infrastructure, supporting batch processing, and maintaining operational stability.
The ideal candidate is experienced in monitoring tools, alert management, and operational workflows, with the ability to support incident response and continuous service improvement.
Key Responsibilities
Monitoring & Observability
- Monitor enterprise dashboards across multiple platforms to ensure system and application health
- Investigate alerts and resolve threshold breaches to maintain service availability
- Fine tune alert configurations to reduce noise and improve signal accuracy
Batch & File Transfer Operations
- Support and monitor batch job execution using enterprise scheduling tools
- Track and manage file transfer operations to ensure timely and secure data movement
- Troubleshoot job failures and coordinate resolution with relevant teams
Log Analysis & Incident Support
- Analyze logs using centralized logging platforms to identify anomalies and root causes
- Support incident investigations and collaborate with infrastructure and application teams
- Escalate critical alerts to the appropriate technical teams for resolution
Patch & Automation Support
- Validate patch deployment activities across managed systems
- Maintain and support automation workflows related to monitoring and operations
- Assist in improving automation processes for operational efficiency
Reporting & Governance
- Maintain monitoring dashboards, reports, and operational metrics
- Generate SLA and performance reports for stakeholders
- Ensure monitoring coverage is aligned with new applications and services
Disaster Recovery & Continuous Improvement
- Support DR monitoring validation and ensure observability during failover scenarios
- Maintain documentation for tools, processes, and configurations
- Drive improvements in monitoring effectiveness and operational workflows
Requirements
- Diploma or Degree in Information Technology, Computer Science, or related discipline
- Hands on experience with monitoring tools such as SolarWinds, DataDog, or similar platforms
- Experience with logging and observability tools such as OpenSearch, FluentBit, or equivalent
- Familiarity with batch job scheduling tools such as Control-M or similar systems
- Experience with file transfer solutions such as Serv-U or equivalent platforms
- Exposure to endpoint management or patching tools such as Endpoint Central
- Strong understanding of incident management and monitoring best practices
- Ability to analyze logs, troubleshoot issues, and support root cause analysis
- Experience working in SLA driven operations environments
- Good communication and documentation skills
Please send your detailed resume in MS Word format to resume@goldtecHRs.com with
- Education Level
- Working experiences
- Each employment background
- Reason for leaving each employment
- Last drawn salary
- Expected salary
- Date of availability