Key Responsibilities
- Design and implement data pipelines for ETL processes
- Develop scalable solutions for data storage and retrieval
- Optimize SQL queries and database performance for analytics
- Collaborate with data scientists to enable machine learning workflows
- Ensure data integrity and security across all systems
- Monitor and maintain data infrastructure for reliability
Requirements
- 4+ years of experience in data engineering or related fields
- Proficiency in Python, SQL, and big data tools (Spark, Hadoop)
- Experience with cloud data services (AWS Glue, Redshift) and data warehousing
- Strong understanding of data modeling and schema design
- Problem-solving skills with a focus on performance and scalability