Job Opportunity
We are seeking a seasoned Cloud Operations Specialist to join our team. This role is for an Australia-based Senior TechOps engineer – primarily focusing on Cadence Opensource technology – that includes operating, maintaining, upgrading and continuously improving the Managed Service for Cadence (across AWS, Azure and GCP) to deliver a great customer experience.
Responsibilities:
* Collaborate with our Managed Service Product development team to establish Cadence operational requirements and support procedures.
* Respond to customer queries and incidents, diagnose and solve complex technical issues by liaising with customer's engineers. This will include written communication via support tickets and occasional video-call based support.
* Work extensively on Apache Cassandra, Kafka, Opensearch, PostgreSQL, along with Cloud providers such as AWS, GCP and Azure.
* Assist/mentor Level-1 team members to develop their technical capabilities on Cadence.
* Perform complex cluster operations such as migrations, upgrades and maintenance.
* Provide expert operational support to our nodes running in the cloud (AWS, Azure and GCP) as well as On-premise, using technologies such as Linux (Debian), Docker, and languages including Java, Python and Bash.
* Investigate issues and apply standard maintenance procedures to optimize the performance and stability of production systems.
* Liaise with the Development and Product Management team through all stages of the development cycle to ensure proper release processes/procedures are being followed.
* Develop and continually improve our suite of internal automation tools, applications, and processes.
Requirements:
* Minimum 3-5 years of working experience in addition to managing Production environment, including performance benchmarking and tuning on application and kernel level.
* Strong Linux skills with experience in cloud environments is a must, preferably AWS or GCP or Azure. Should be comfortable working from the command line. This is essential, there are no GUIs here.
* Familiarity with installing and maintaining VMs and applications in scale, including upgrade, migration and life cycle management.
* Ability to debug applications using logs and metrics, and replicate issues in local environment.
* Preferably experience with Ansible, Prometheus, Terraform, Grafana and Docker.
* Good fundamental computer science / software engineering skills and knowledge, particularly operating system internals, memory management, and networking.
* Ideally, programming skills in languages such as GO, Python, Java, Bash scripting, SQL and source code control using Git.
* Exceptional ability to communicate clearly and professionally in written and verbal English (essential).
Preferred Skills:
* Customer service experience is favorable.
* Passion for all things IT, and especially open source.
As a member of our team, you will have the opportunity to work with cutting-edge technology, collaborate with experienced professionals, and grow your skills and expertise. If you are a motivated and dedicated individual who is passionate about technology and problem-solving, we encourage you to apply for this exciting opportunity.