The ideal candidate will have proven experience in data engineering, data analysis, or a related field, with strong experience in SQL, Python, or R for data manipulation and analysis. Experience with developing and improving data pipelines using tools like Apache Spark, Hadoop, or similar is also highly desirable.