Job Description
Join the team redefining how the world experiences design.
Where and How You Can Work
Our flagship campus is in Sydney, with additional campuses in Melbourne and co‐working spaces in Brisbane, Perth, and Adelaide. You have the choice of where and how you work; we trust our team members to choose the balance that empowers them and their teams to achieve their goals.
What You'll Be Doing
As Canva scales, change continues to be part of our DNA. This role will give you the flavour of what you'll be working on, but it will evolve over time.
At the Moment, This Role Is Focused On
* Designing, building, and continuously improving our observability platforms, pipelines, and tooling used by all Canva engineers.
* Providing technical leadership and expertise to drive pragmatic solutions and dive into impactful design decisions.
* Brainstorming, researching, and prototyping to optimise our metrics and continuous profiling platforms, refining their operational effectiveness and reliability.
* Proactively improving metrics, continuous profiling querying, visualisation, and monitoring user experience, and advocating best practices.
* Participating in team ceremonies, knowledge sharing, brainstorming sessions, etc.
* Becoming an observability champion, evangelising best practices and guiding other engineers in the observability space.
* Identifying and advocating for solutions cross‐functionally to ensure all engineers can make the best use of our metrics, continuous profiling, and insights platforms.
You're Probably a Match If
* You are proficient and happy to code in Python, Golang, and/or Java.
* You have deep knowledge and understanding of computer engineering fundamentals and first principles.
* You are proficient with infrastructure‐as‐code; we're a Terraform and Jsonnet shop, but strong experience with other IaC tools will do the trick.
* You have practical experience with and knowledge of AWS services (EC2, EKS, Lambda, SQS, Kinesis, S3, MSK) or equivalent.
* You have experience with observability technologies – competency with tools like Prometheus, Grafana, OpenTelemetry, or equivalent.
* You have experience with highly reliable and available high‐throughput distributed data pipeline systems, with highly scalable databases.
Not Essential, But Helpful Experience
* Deep experience with OpenTelemetry, which underpins much of the tooling the team owns.
* Experience designing, building, and running metrics and monitoring infrastructure at a large scale; experience with distributed Prometheus clusters (Thanos, Cortex, Mimir, M3) or similar databases is highly regarded.
* Experience with data handling / pipelining at scale (experience with Kinesis and Kafka is highly regarded).
* Experience with Kubernetes, particularly running stateful workloads at scale.
* Experience with data security and data obfuscation.
* Experience with alerting and SLA/SLO theory and industry best practices.
About The Team
The Observability Insights, Metrics and Profiles Team is part of the Observability sub‐group and is responsible for the end‐to‐end experience for visualisation, metrics, continuous profiling, and standardised alerting for Canva products and services.
Our goal is to provide our development team with world‐class tools to view how their services are performing in production. We achieve this by combining industry‐leading third‐party solutions with our own in‐house developed solutions. We work across the entire stack maintaining metrics and profiling SDKs (Java, Golang, Python), data pipelines, and related infrastructure to tie it all together.
As we scale, all of these areas require more sophisticated solutions to ensure Canva developers can continue to grow without compromising on reliability or availability.
What's In It For You?
We also offer a range of benefits to set you up for success in and outside of work.
Here's a Taste of What's on Offer
* Equity packages – we want our success to be yours too.
* Inclusive parental leave policy that supports all parents and carers.
* An annual Vibe & Thrive allowance to support your wellbeing, social connection, office setup, and more.
* Flexible leave options that empower you to be a force for good, take time to recharge, and support you personally.
#J-18808-Ljbffr