Posted 4 hours ago

Job ID: JOB_ID_4152

Job Summary

We are looking for a skilled Site Reliability Engineer (SRE) to join our team in Sunnyvale, CA. This role involves ensuring the reliability, scalability, and performance of our systems through automation, proactive monitoring, and robust infrastructure management. The ideal candidate will have extensive experience in AWS, Linux, CI/CD pipelines, and scripting languages.

Key Responsibilities

  • Ensure the reliability, availability, and performance of production systems and services.
  • Develop and implement automation for operational tasks, deployments, and incident response.
  • Manage and maintain cloud infrastructure, primarily on AWS.
  • Monitor system health, performance, and security using tools like CloudWatch and Grafana.
  • Troubleshoot and resolve complex infrastructure and application issues.
  • Implement and manage CI/CD pipelines to streamline software delivery.
  • Write and maintain scripts (Bash, Python) to automate manual processes and improve efficiency.
  • Participate in on-call rotations to provide 24/7 support for critical systems.
  • Collaborate with development and operations teams to design and build reliable and scalable systems.
  • Contribute to the continuous improvement of SRE practices and tooling.
  • Apply strong system and infrastructure knowledge to maintain and improve system stability.

Required Skills and Qualifications

  • BS/MS in Computer Science or equivalent practical experience.
  • At least 8 years of experience in Reliability Engineering, DevOps, or an infrastructure-focused role.
  • Strong expertise in AWS and the Linux operating system.
  • Proficiency in standard networking protocols, component troubleshooting, and system diagnostics.
  • Demonstrated ability to write effective Bash and Python scripts for task automation.
  • Proactive approach to SRE duties and on-call support.
  • Advanced experience with programming languages such as Python and Java.
  • Passion for designing, building, and maintaining reliable and scalable systems.
  • Advanced knowledge and hands-on experience with CI/CD systems (e.g., Jenkins, GitLab CI, CircleCI).
  • A strong belief in automation as a means to reduce operational load through software solutions.
  • Deep understanding of systems and infrastructure architecture.
  • Experience with monitoring tools such as CloudWatch and Grafana, and with release management processes.
  • Strong sense of ownership, integrity, and excellent communication and collaboration skills.

Additional Information

  • This is a 6-month contract role.
  • The position requires on-site presence five days a week in Sunnyvale, CA.
  • A LinkedIn profile is mandatory for this role.

Special Requirements

On-site five days a week. Must have a LinkedIn profile. On-call support required.


Compensation & Location

Salary: $65 – $85 per year

Location: Sunnyvale, CA


Recruiter / Company – Contact Information

Email: anand@livemindz.com


Interested in this position?
Apply via Email
