Responsibilities
* Design, develop, and maintain scalable data pipelines using AWS services such as S3, Glue, Lambda, and EMR.
* Build and optimize data warehousing solutions using Amazon Redshift and Teradata platforms.
* Develop ETL/ELT processes to ingest, transform, and load large volumes of structured and unstructured data.
* Write efficient SQL queries and tune their performance for complex data transformations and reporting.
* Ensure data quality, integrity, and governance across multiple data sources and systems.
* Implement data modeling techniques (star/snowflake schemas) for analytical and reporting use cases.
* Monitor, troubleshoot, and enhance data pipelines for reliability, scalability, and performance.
* Collaborate with data analysts, data scientists, and business stakeholders to understand data requirements.
* Automate workflows and deployments using CI/CD tools and infrastructure-as-code practices.
* Maintain documentation, enforce best practices, and ensure compliance with security and data policies.