Job ID: JOB_ID_387

Role Overview: OMS Site Reliability Engineer

We are looking for a dedicated Site Reliability Engineer (SRE) with specialized expertise in IBM Sterling Order Management Systems (OMS) to join our engineering team in Marlborough, MA. This role is pivotal in ensuring the reliability, scalability, and performance of our large-scale distributed order management platforms. As an SRE, you will bridge the gap between development and operations, applying software engineering principles to solve complex infrastructure and operational challenges.

Core Responsibilities

  • System Reliability: Take ownership of the Sterling OMS platform’s health, ensuring it meets or exceeds strict Service Level Agreements (SLAs) and Service Level Objectives (SLOs).
  • Automation and Tooling: Reduce operational toil by developing sophisticated automation scripts and tools using Java and Python. Streamline deployment pipelines and incident response workflows.
  • Incident Management: Lead the technical response to production incidents. Conduct thorough Root Cause Analysis (RCA) and implement permanent fixes to prevent recurrence.
  • Observability: Design and implement comprehensive monitoring, logging, and tracing solutions to provide deep visibility into system performance and user experience.
  • Capacity Planning: Perform proactive performance tuning and capacity planning to ensure the system can handle peak seasonal order volumes without degradation.
  • Cloud and Containerization: Manage and optimize OMS workloads across cloud platforms (AWS/Azure/GCP) using Kubernetes and Docker for container orchestration.
  • CI/CD Integration: Enhance and maintain continuous integration and continuous deployment (CI/CD) pipelines to ensure safe and rapid software delivery.

Technical Environment and Culture

The Marlborough office offers a collaborative environment where you will work closely with software developers, product managers, and offshore teams. We value an operations mindset that prioritizes stability and efficiency. You will be expected to communicate effectively across different time zones and technical levels, ensuring that reliability is built into the software lifecycle from the start. This 6+ month contract offers the opportunity to work on high-impact systems that are central to our retail and supply chain operations.

Required Skills and Qualifications

  • Extensive experience with IBM Sterling OMS or similar enterprise order management platforms.
  • Strong programming skills in Java and Python, with a focus on automation.
  • Hands-on experience with Linux administration and cloud infrastructure (AWS, Azure, or GCP).
  • Proficiency in container technologies like Docker and Kubernetes.
  • Solid understanding of CI/CD tools and modern DevOps practices.
  • Excellent communication skills for coordinating with global teams.

Compensation & Location

Salary: $145,000 – $195,000 per year (Estimated)

Location: Marlborough, MA


Recruiter / Company – Contact Information

Recruiter / Employer: KK Software Associates

Email: ankitkumar.s@kksoftwareassociates.com


Interested in this position?
Apply via Email

Recruiter Notice:
To remove this job posting, please send an email from
ankitkumar.s@kksoftwareassociates.com with the subject:

DELETE_JOB_ID_387

to delete@join-this.com.