Job ID: JOB_ID_3944
Job Summary:
We are seeking a handson analyst/engineer who is strong in Oracle PL/SQL and Unix/Linux to own daily batch job monitoring, perform root-cause analysis (RCA) for failures and performance issues, and ensure endtoend stability of data processing in a Manhattan (DFIO/SCM) ecosystem. The role includes publishing daily open items, prioritizing work by business impact, and collaborating with DBAs and the Manhattan product team to resolve outliers and configuration issues.
Key Responsibilities:
- Batch Operations & Monitoring: Monitor and manage daily/overnight batch jobs (Unix shell + SQL/PLSQL steps); validate pre-/post-conditions and recover/rewind missed or skipped steps. Triage job failures, capture diagnostics, and execute reruns safely with appropriate approvals and documentation.
- Root Cause Analysis (RCA): Investigate failures by reviewing Unix logs, job runbooks, and database logs; pinpoint breaking step(s) and document the actual cause (e.g., data, logic, config, capacity). For each failure/performance incident, produce a clear RCA and corrective & preventive actions (CAPA); drive closure.
- Database Analysis (PL/SQL & Performance): Analyze PL/SQL packages, procedures, and functions implicated in failures; identify defects, data issues, or edge cases. Diagnose longrunning queries; obtain and interpret execution plans; coordinate with DBA for index, stats, and planstability actions. Maintain and leverage an uptodate entityrelationship (ER) understanding of core tables and data flows across DFIO/SCM modules.
- Manhattan UI & Configuration Support: For Manhattan UI configuration change requests, validate expected behavior in lower environments; create/execute test scenarios and capture evidence. For unclear or outlier behaviors, collaborate with the Manhattan product team to confirm productlevel settings and recommended fixes.
- Daily Operations Governance: Publish daily open items and status across the team queue; prioritize by business criticality and SLAs; call out blockers and risks. Adhere to change/incident processes (e.g., ServiceNow: incidents, problems, change requests), maintain accurate runbooks and knowledge base articles.
- Quality, Testing & Readiness: Create and execute system test plans for fixes and configuration changes; support release readiness and postrelease validation.
- Stakeholder Management: Partner with business users to understand impacts and timelines; provide clear, concise communication on incident status, ETAs, and next steps.
Must-Have Skills:
- Oracle SQL & PL/SQL: Strong query writing, debugging, and package/procedure analysis.
- Unix/Linux: Proficient with shell scripting, log parsing (grep/awk/sed), and standard job orchestration (cron or enterprise schedulers).
- RCA & Troubleshooting: Demonstrated ability to read logs, trace data across tables, and isolate failure causes; produce actionable CAPA.
- Performance Tuning Basics: Reading execution plans, understanding indexing/statistics, and working closely with DBAs on remediation.
- SCM Functional Context: Working knowledge of Supply Chain, with exposure to Inventory planning and/or Demand forecasting processes (tool-agnostic).
- Process & Communication: Experience with incident/change management and structured status reporting to business/IT.
Good to Have:
- Experience supporting Manhattan DFIO application (including afterhours/incident support), understanding common processing/data errors.
- Familiarity with cloud data warehouses such as Snowflake or BigQuery for downstream analytics/performance considerations.
- Retail domain exposure.
Experience Range:
6-8 years overall, with substantial time in PL/SQL, Unix batch operations, and production support.
Education:
Bachelors degree in Computer Science, Engineering, Information Systems, or equivalent practical experience.
KPIs / Success Measures:
- Reduction in repeat batch failures and mean time to resolve (MTTR).
- Ontime completion rates for critical jobs and adherence to SLA.
- Quality of RCAs/CAPAs and knowledge base contributions.
- Positive stakeholder feedback on communication and prioritization.
Work Model & Availability:
Comfortable with shift coverage / oncall rotation to support critical batch windows, as needed.
Tools & Environment (Representative):
Oracle DB, PL/SQL Developer/SQL*Plus or equivalent IDE; Unix/Linux shell; ticketing (ServiceNow); job schedulers; monitoring
Special Requirements
Visa: H1B, USC, OPT or H4-EAD. Duration: 6 months. Oncall rotation required.
Compensation & Location
Salary: $70,000 – $100,000 per year (Estimated)
Location: Atlanta, GA
Recruiter / Company – Contact Information
Recruiter / Employer: Scalable Systems
Email: .sharma@scalable-systems.com
Recruiter Notice:
To remove this job posting, please send an email from
.sharma@scalable-systems.com with the subject:
DELETE_JOB_ID_3944