Job ID: JOB_ID_6855
Job Summary:
We are seeking a Jr. SRE (Site Reliability Engineer) to join our team. This role is responsible for the end-to-end onboarding, validation, and readiness of new Azure VM and hardware SKUs. You will own host and guest configuration compliance, control plane onboarding, SKU qualification execution, and production readiness gating in close collaboration with Azure Compute, Networking, Storage, Fabric, and Capacity teams. The engineer acts as a technical owner and integrator, ensuring that every SKU meets published specifications, reliability standards, and operational readiness before General Availability (GA).
Responsibilities:
- Lead end-to-end onboarding of new Azure VM and hardware SKUs.
- Integrate SKUs into the Azure control plane and lifecycle management systems.
- Validate host configuration including firmware, drivers, and platform settings.
- Validate guest configuration including OS images, VM sizing, and features.
- Ensure compliance with published Azure VM and hardware specifications.
- Define and execute SKU qualification scope and test objectives.
- Coordinate synthetic and scenario-based validation activities.
- Track SKU readiness, risks, and blockers across qualification gates.
- Manage work items, test plans, and defects in Azure DevOps (ADO).
- Partner with compute, network, storage, fabric, and capacity teams.
- Collaborate with hardware vendors to resolve platform and BOM issues.
- Provide clear go/no-go signals for SKU launch readiness.
- Support private preview, public preview, and GA launch milestones.
- Produce and maintain SKU onboarding and qualification documentation.
- Drive first-time quality and continuous improvement in SKU onboarding processes.
Qualifications:
- Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.
- 4+ years of experience in Azure infrastructure, cloud engineering, or platform engineering.
- Strong understanding of Azure VM architecture, host/guest boundaries, and control plane concepts.
- Hands-on experience with SKU onboarding and qualification workflows.
- Experience in DevOps, SRE, test automation, or tooling engineering roles, including bug management.
- Hands-on experience with Azure infrastructure or cloud-scale automation.
- Knowledge of server hardware platforms (CPU, memory, NICs, GPUs, storage).
- Experience in cloud compute engineering, virtualization, or VM platform roles.
- Experience in hardware platform engineering, server design, or SKU definition roles.
- Understanding of firmware (BIOS/BMC) and driver dependencies.
- Hands-on experience with Azure or hyperscale cloud hardware platforms.
- Familiarity with compute hardware engineering.
- Relevant certifications, such as an Azure certification.
- Exceptional analytical, problem-solving, and communication skills.
- Ability to collaborate effectively with cross-functional teams.
- Strong organizational skills with a focus on meeting deadlines and achieving project goals.
Special Requirements
Visa: USC, GC; LinkedIn: Required, with photo; location must match the project
Compensation & Location
Salary: $110,000 – $160,000 per year
Location: Remote
Recruiter / Company – Contact Information
Recruiter / Employer: Microsoft
Email: dmeyourhotlist@gmail.com