Role Summary
We are seeking an experienced Elastic Platform Architect to lead the architecture, design, governance, and implementation of a next-generation enterprise observability and operational intelligence platform based on the Elastic ecosystem.
The successful candidate will be responsible for defining enterprise observability architecture, designing scalable Elastic platform solutions, establishing platform engineering standards, and driving the implementation of cloud-native monitoring capabilities across complex enterprise environments.
The role requires strong expertise in Elastic Stack technologies, Kubernetes-based platforms, GitOps deployment methodologies, Infrastructure-as-Code, DevSecOps practices, and enterprise integration architecture. The candidate will work closely with enterprise architects, cybersecurity teams, infrastructure teams, platform engineers, and business stakeholders to deliver highly resilient, secure, and scalable observability platforms supporting critical business operations.
Key Responsibilities
Enterprise Architecture & Platform Strategy
Define and maintain the target-state architecture roadmap for enterprise observability and monitoring platforms.
Develop architecture blueprints, reference architectures, design standards, and governance frameworks for Elastic-based platforms.
Lead architecture reviews, technical assurance activities, and solution governance processes.
Establish platform engineering standards covering scalability, availability, security, performance, and operational resilience.
Evaluate emerging observability technologies and recommend strategic platform enhancements.
Collaborate with enterprise architecture teams to align observability capabilities with broader technology transformation initiatives.
Elastic Platform Architecture & Design
Architect and govern enterprise deployments of:
Elasticsearch
Kibana
Logstash
Elastic Agent
Fleet Server
Elastic APM
Elastic Cloud on Kubernetes (ECK)
Design distributed Elastic platform architectures supporting large-scale telemetry ingestion and analytics workloads.
Establish platform standards for:
Index Management
Shard Allocation
Cluster Scaling
Data Retention
Performance Optimization
Disaster Recovery
Define and govern Index Lifecycle Management (ILM) and Snapshot Lifecycle Management (SLM) strategies.
Design platform resiliency and high-availability frameworks supporting enterprise-grade service levels.
Observability & Operational Intelligence
Design enterprise observability frameworks covering:
Infrastructure Monitoring
Application Performance Monitoring (APM)
Distributed Tracing
Centralized Logging
Security Event Monitoring
Service Health Monitoring
Operational Analytics
Establish telemetry collection and data normalization standards across multiple technology domains.
Define alerting, event correlation, escalation, and incident visibility frameworks.
Develop monitoring standards, service health models, and operational dashboards supporting business and technical stakeholders.
Define SLI, SLO, SLA, and reliability measurement frameworks.
Cloud-Native & Kubernetes Architecture
Design Kubernetes-native observability platforms leveraging Elastic Cloud on Kubernetes (ECK).
Define deployment architectures supporting:
Kubernetes
OpenShift
Hybrid Cloud
Air-Gapped Environments
Architect GitOps deployment frameworks utilizing:
FluxCD
GitLab
Helm
Establish platform lifecycle management standards covering upgrades, deployments, configuration management, and operational governance.
Design secure multi-cluster deployment architectures supporting enterprise scalability requirements.
Platform Integration Architecture
Define enterprise integration patterns for onboarding infrastructure, application, network, and security telemetry sources.
Architect integrations across:
Enterprise Applications
Cloud Platforms
Security Solutions
Network Infrastructure
Storage Platforms
Kubernetes Environments
Identity & Access Management Platforms
Design data ingestion and enrichment frameworks supporting operational analytics and observability use cases.
Lead integration of observability platforms with IT Service Management and operational workflow systems.
Security, Compliance & Governance
Define platform security architecture including:
RBAC
Authentication & Authorization
Encryption Standards
Certificate Management
Secrets Management
Audit Controls
Collaborate with cybersecurity teams to implement observability-driven security monitoring capabilities.
Ensure compliance with enterprise security policies, governance standards, and regulatory requirements.
Establish platform governance processes supporting change management, operational controls, and risk management.
Automation, DevSecOps & Platform Engineering
Define Infrastructure-as-Code standards utilizing:
Terraform
Ansible
Establish CI/CD and GitOps operating models supporting platform lifecycle automation.
Design automated deployment, testing, upgrade, rollback, and recovery processes.
Drive platform engineering best practices to improve operational efficiency and reliability.
Promote automation initiatives supporting scalability, consistency, and operational excellence.
Technical Leadership & Stakeholder Engagement
Provide technical leadership and architectural guidance to engineering and operations teams.
Facilitate architecture workshops, technical reviews, and stakeholder engagement sessions.
Mentor platform engineers and observability specialists on architecture standards and best practices.
Develop architecture documentation, technical standards, solution blueprints, operational procedures, and implementation guidelines.
Support strategic planning, technology assessments, and platform modernization initiatives.
Required Qualifications & Experience
Technical Expertise
Extensive experience designing and implementing enterprise observability platforms.
Strong hands-on expertise in:
Elasticsearch
Kibana
Logstash
Elastic Agent
Fleet Management
Elastic APM
Elastic Cloud on Kubernetes (ECK)
Strong understanding of distributed systems, large-scale data platforms, and enterprise monitoring architectures.
Cloud & Platform Technologies
Kubernetes
OpenShift
Docker
AWS
Azure
Hybrid Cloud Environments
Automation & Platform Engineering
Strong experience with:
Terraform
Ansible
GitOps
GitLab
FluxCD
Helm
CI/CD Pipelines
Observability & Operations
Experience designing:
Monitoring Frameworks
Logging Architectures
Tracing Solutions
Alerting Frameworks
Event Correlation Models
Operational Intelligence Platforms
Enterprise Integration
Experience integrating observability platforms with:
ServiceNow
Enterprise Security Platforms
Identity Management Solutions
Network and Infrastructure Technologies
Preferred Qualifications
Elastic Certified Observability Engineer
Certified Kubernetes Administrator (CKA)
HashiCorp Terraform Associate
Red Hat OpenShift Certification
GitLab Professional Certifications
AWS and/or Azure Certifications
Experience delivering observability platforms within government agencies, financial institutions, telecommunications organizations, or large enterprise environments.
Experience with AIOps, OpenTelemetry, Service Intelligence, and advanced analytics platforms.
Employment Type
Contract Position (6 Months)
Renewable subject to project requirements, performance, and business needs.