Job Overview
We are seeking a skilled Cloud AI Infrastructure Specialist to enhance our GenAI platform in Sydney.
This role is pivotal in ensuring the stability, scalability, and continuous improvement of our GenAI infrastructure.
The successful candidate will design and maintain cloud-native infrastructure, collaborate with cross-functional teams to integrate ML workflows and services, and drive automation, observability, and CI/CD best practices.
A strong focus on security, governance, and cost-efficiency is essential, as is supporting the deployment and monitoring of AI/ML models in production.
Key Responsibilities:
* Design and maintain GenAI platform infrastructure in the cloud as part of a cross-functional team.
* Integrate ML workflows and services, and drive automation, observability, and CI/CD practices.
* Ensure security, governance, and cost-efficiency across platforms.
* Support deployment, monitoring, and operation of AI/ML models in production.
* Collaborate with ML engineers, data scientists, and product teams to onboard emerging technologies and improve capabilities.
Key Qualifications:
* AWS: Proficient in Amazon SageMaker, EKS, Lambda, and Step Functions, with infrastructure as code in CloudFormation or Terraform.
* GCP & AI Tooling: Experience with Vertex AI, AI Platform, BigQuery.
* MLOps & DevOps: Familiarity with AI/ML model deployment and MLOps on AWS.