Key Responsibilities
- Design, build, and maintain scalable data pipelines and infrastructure
- Develop ETL processes to transform raw data into actionable insights
- Optimize database performance and query efficiency
- Collaborate with data scientists and analysts to support analytics initiatives
- Ensure data integrity, security, and compliance with best practices
- Monitor and troubleshoot data systems for reliability
Requirements
- 5+ years of experience in data engineering and pipeline development
- Proficiency in Python, SQL, and big data technologies (Spark)
- Experience with data warehousing and ETL tools
- Strong understanding of database design and optimization
- Familiarity with cloud platforms and distributed computing