Job ID: JOB_ID_4617
Job Summary:
We are seeking an experienced Elastic SRE Architect with over 10 years of experience in Site Reliability Engineering (SRE) or DevOps architecture. The ideal candidate will have a strong background in implementing observability and monitoring solutions for enterprise applications and infrastructure, with hands-on experience using OpenTelemetry for distributed tracing and telemetry data collection. Expertise in designing architectures for large-scale production IT ecosystems and building customer journey-based monitoring is crucial. This role requires a deep understanding of modern infrastructure management, cloud-native architectures, and automation frameworks.
Key Responsibilities:
- Implement observability and monitoring solutions for enterprise applications and infrastructure.
- Utilize OpenTelemetry for distributed tracing and telemetry data collection.
- Design and architect large-scale production IT ecosystems.
- Build customer journey-based monitoring for applications, infrastructure, and network components.
- Ensure strong understanding of modern infrastructure management and cloud-based environments.
- Develop and implement automation frameworks and infrastructure tooling.
- Utilize scripting or automation tools like Python or configuration management tools like Ansible.
- Work with monitoring platforms, logging systems, and distributed system diagnostics.
- Troubleshoot and optimize performance for high-availability production systems.
- Collaborate with DevOps, infrastructure, and application teams to improve reliability and system performance.
- Apply strong understanding of cloud-native architectures and modern infrastructure platforms.
- Document architecture and communicate effectively with stakeholders.
Required Skills:
- Site Reliability Engineering (SRE) or DevOps architecture
- Observability and monitoring solutions
- OpenTelemetry
- Elasticsearch and Elastic AIOps
- Large-scale production IT ecosystem architecture
- Customer journey-based monitoring
- Modern infrastructure management
- Cloud-based environments (cloud-native architectures)
- Automation frameworks and infrastructure tooling
- Python scripting
- Ansible
- Monitoring platforms, logging systems, distributed system diagnostics
- Troubleshooting and performance optimization
- Excellent communication, architecture documentation, and stakeholder collaboration skills
Experience:
- 10+ Years of experience in SRE or DevOps architecture.
Location:
- Cary, NC (Onsite) – Local Only.
Employment Type:
- W2/C2C Contract
Special Requirements
Local to Cary, NC only. Onsite.
Compensation & Location
Salary: $70 – $90 per year (Estimated)
Location: Cary, NC
Recruiter / Company – Contact Information
Email: nshu.singh@scalable-systems.com
Recruiter Notice:
To remove this job posting, please send an email from
nshu.singh@scalable-systems.com with the subject:
DELETE_JOB_ID_4617