NEWPosted 8 hours ago

Job ID: JOB_ID_4104

Job Description: AWS Cloud Platform Engineer

We are seeking a skilled AWS Cloud Platform Engineer to join our team. This role is crucial for maintaining and enhancing our cloud infrastructure, ensuring its reliability, stability, and performance. You will be responsible for a wide range of tasks, from provisioning and supporting application infrastructure to implementing automation and ensuring security best practices.

Key Responsibilities:

  • Application Infrastructure Provisioning & Support: Manage and support the provisioning of application infrastructure in the cloud.
  • Building OS Images (Golden Image): Create and maintain standardized OS images for cloud deployments to ensure consistency and efficiency.
  • Ensuring Reliability, Stability & Recoverability: Guarantee the overall IT infrastructure is reliable, stable, and recoverable in case of failures.
  • Capacity Planning: Optimize server and application performance through effective capacity planning and resource management.
  • Management of Server Space: Oversee and manage server space and remotely managed shared storage.
  • Hardware & Software Updates: Perform regular hardware and software updates, patching, and OS upgrades.
  • Firewall Policy Validation & Troubleshooting: Validate and execute firewall policies, and troubleshoot any related errors.
  • Backup, Restoration & Reporting: Manage backup and restoration tasks, along with automated reporting.
  • Infrastructure as Code (IaC): Work on IaC for server provisioning using Terraform & GitHub.
  • Cloud Operations – Monitoring and Alerting: Continuously monitor the performance, availability, and health of cloud resources (VMs, containers, databases, Load Balancers, networks, applications). Set up and manage monitoring tools (e.g., CloudWatch, Azure Monitor) or integrate cloud components with enterprise monitoring tools like Dynatrace. Define and implement alerting mechanisms for proactive issue identification. Analyze logs, metrics, and traces for system insights.
  • Incident Management and Troubleshooting: Act as the first line of defense for cloud incidents, quickly identifying, troubleshooting, and resolving issues. Participate in on-call rotations for 24/7 coverage. Collaborate with development, security, and other IT teams to resolve complex problems. Document incident resolutions and contribute to post-incident analysis (PIRs/RCAs).
  • Infrastructure Management: Manage day-to-day cloud infrastructure operations, including provisioning, deprovisioning, and scaling of resources. Perform regular maintenance tasks like patching, updates, and backups. Implement and manage cloud configuration using Terraform for consistency. Optimize resource utilization and performance.
  • Automation and Scripting: Develop and implement automation scripts (e.g., Python, Power, Bash) to streamline operational tasks. Automate provisioning, deployment, and scaling using IaC tools like Terraform. Integrate automation into CI/CD pipelines.
  • Security and Compliance: Implement and enforce security best practices (e.g., IAM, network security groups, encryption). Monitor for security vulnerabilities and threats, and respond to security incidents.
  • Cost Optimization and Financial Management (FinOps): Monitor and analyze cloud spending to identify cost-saving opportunities. Implement cost optimization strategies (rightsizing, reserved instances, identifying idle resources). Generate cost reports and provide recommendations.
  • Cloud Governance: Define and enforce cloud governance policies, standards, and procedures for resource usage, security, and cost. Develop and maintain documentation for cloud operations processes, runbooks, and best practices.
  • Collaboration and Communication: Work closely with DevOps, cloud architects, security teams, and other stakeholders. Communicate clearly with technical and non-technical audiences regarding system status, incidents, and changes. Participate in cross-functional meetings.
  • Capacity Planning: Monitor resource utilization trends and forecast future capacity needs to ensure scalability. Plan for scaling based on anticipated demand.
  • Continuous Improvement: Proactively identify areas for improvement in cloud operations processes, tools, and infrastructure. Implement automation and process enhancements. Stay up-to-date with cloud technologies and best practices.
  • Vendor Management: Work with cloud service providers (AWS, Azure, GCP) to resolve issues and optimize service utilization.

Required Skills and Experience:

  • Proficiency in Cloud Platform (AWS)
  • Experience with IaC for server provisioning via Terraform & GitHub
  • Experience with DevOps practices and CI/CD pipelines
  • Sound understanding of cloud security principles and compliance with industry standards
  • Ability to write and run scripts in various programming languages like Python, Power, Bash
  • Skills in monitoring cloud resources, logging performance, and optimizing costs
  • Experience in the finance/banking industry is required.

Special Requirements

Onsite


Compensation & Location

Salary: $50 – $70 per year

Location: Woodland Hills, CA


Recruiter / Company – Contact Information

Email: ra.s@avanceservices.com


Interested in this position?
Apply via Email

Recruiter Notice:
To remove this job posting, please send an email from
ra.s@avanceservices.com with the subject:

DELETE_JOB_ID_4104

to delete@join-this.com.