Role Summary
We’re looking for hands-on Data Engineers with 2 years of experience to join our growing backend team. You’ll be responsible for building and maintaining the data pipelines that power our machine learning models and mobile app. This is a great opportunity to work with modern data tools, solve real-world data problems, and help shape the future of sports analytics.
What You’ll Be Doing
- Build and maintain data pipelines for ingesting NBA data from external APIs, using Azure Data Factory
- Transform ingested raw JSONL data into structured formats using Python notebooks (pandas/dataframes), publishing to Databricks Delta tables across a medallion architecture
- Design and manage dimension tables (SCD2) and develop surrogate keys for linking common external datasets for pipeline redundancy
- Ensure data quality through testing and validation of transformation logic
- Collaborate with data scientists and developers to support ML model development through engineering of features
- Work within the Azure ecosystem, including, but not limited to, Data Factory, Databricks, ADLS Gen2, Azure Queue Storage, Function Apps and Cosmos DB
- Use GitHub for version control and collaborate in an Agile environment using Jira
What You Bring
You’re a problem-solver who enjoys working with data at scale, writing clean Python code, and building systems that are reliable and scalable.
Must-haves:
- Strong Python skills, especially for data transformation using pandas/dataframes
- Experience designing and maintaining dimension tables (SCD2)
- Ability to develop and manage surrogate keys for data linking
- Experience testing and validating data pipelines to ensure accuracy and integrity
- Ability to conceptualize and design solutions to complex data transformation problems
Nice-to-haves:
- Familiarity with the Databricks ecosystem, including Unity Catalog, Delta tables, and medallion architecture
- Experience with Azure Data Factory and cloud-based data ingestion
- Interest in sports (NBA, AFL, NRL, NFL, soccer) and understanding of sports statistics
- Experience using GitHub for code management and collaboration
Why Join XO Sports?
- Work on a greenfield project with modern tools and real-world impact
- Help shape the data backbone of a cutting-edge sports tech platform
- Join a passionate, fast-moving team with a clear product vision
- Flexible work setup – remote, hybrid, or in-office
- Market-competitive salary + equity options
Your application will include the following questions:
- Which of the following statements best describes your right to work in Australia?
- How many years' experience do you have as a Data Engineer?
- Which of the following programming languages are you experienced in?
- Have you worked in a role which requires experience with machine learning techniques?
- How many years' experience do you have working in an agile environment?
- Which of the following issue and bug tracking software do you have experience with?
📌 Data Engineer – Python, Azure, Sports Data
🏢 XO Sports
📍 Melbourne