Key Responsibilities
- Design and maintain scalable data pipelines and ETL processes
- Develop data models and optimize database schemas
- Ensure data quality and integrity across systems
- Collaborate with analytics teams to provide clean datasets
- Monitor data pipeline performance and troubleshoot issues
- Implement best practices for data security and governance
Requirements
- 3-5 years of experience in data engineering
- Proficiency in Python and SQL
- Experience with big data tools (Spark, Hadoop)
- Knowledge of cloud data services (AWS Glue, Redshift)
- Understanding of data warehousing concepts