Job ID: JOB_ID_4608
Job Summary:
Scalable Systems Inc. is seeking an experienced Golang Lead Site Reliability Engineer (SRE) for an onsite contract position in Austin, TX. The ideal candidate will have over 10 years of experience, with a strong focus on Go (Golang) for building scalable backend services, Kubernetes, and AWS cloud infrastructure. This role requires strong leadership and communication skills to collaborate with engineering teams and management. If you are passionate about SRE principles and have a proven track record in maintaining high system reliability, we encourage you to apply.
Key Responsibilities:
- Lead the design, development, and maintenance of scalable backend services using Go (Golang).
- Write high-quality, maintainable code and comprehensive unit tests in Go.
- Manage and optimize Kubernetes deployments, including StatefulSets and load balancers.
- Leverage Amazon Web Services (AWS) for cloud infrastructure management and scaling.
- Design, implement, and maintain CI/CD pipelines for automated build and deployment processes.
- Implement and manage observability and monitoring tools such as Grafana.
- Utilize Prometheus, Alertmanager, and Loki for effective monitoring and alerting.
- Apply Site Reliability Engineering (SRE) practices to ensure system reliability and performance.
- Troubleshoot complex distributed systems and cloud-native applications.
- Provide strong communication and leadership to engineering teams and management.
Required Skills and Experience:
- 10+ years of overall IT experience.
- Extensive hands-on experience with Go (Golang) for backend service development.
- Proficiency in writing high-quality, maintainable Go code and unit tests.
- In-depth experience with Kubernetes, including deployments, StatefulSets, and load balancers.
- Strong experience with Amazon Web Services (AWS) cloud infrastructure.
- Proven experience in designing and maintaining CI/CD pipelines.
- Hands-on experience with observability and monitoring tools (e.g., Grafana).
- Experience with Prometheus, Alertmanager, and Loki.
- Solid understanding of Site Reliability Engineering (SRE) principles, monitoring, and system reliability.
- Experience troubleshooting distributed systems and cloud-native applications.
- Excellent communication and leadership skills.
Job Details:
- Job Role: Golang Lead SRE
- Location: Austin, TX (Onsite)
- Job Type: W2/C2C Contract
- Experience: 10+ Years
Special Requirements
Onsite role. Accepts W2/C2C Contract.
Compensation & Location
Salary: $120,000 – $160,000 per year (Estimated)
Location: Austin, TX
Recruiter / Company – Contact Information
Email: nshu.singh@scalable-systems.com
Recruiter Notice:
To remove this job posting, please send an email from
nshu.singh@scalable-systems.com with the subject:
DELETE_JOB_ID_4608