Key Responsibilities
- Design, build, and maintain scalable data pipelines and ETL processes
- Optimize data storage and retrieval systems for performance and cost efficiency
- Develop and deploy machine learning models for data processing and analytics
- Collaborate with data scientists and analysts to enable data-driven decision making
- Ensure data integrity, security, and compliance with industry standards
- Monitor and troubleshoot data infrastructure to minimize downtime
Requirements
- 5+ years of experience in data engineering or related fields
- Proficiency in big data technologies (Hadoop, Spark, Kafka)
- Strong programming skills in Python, Java, or Scala
- Experience with cloud platforms (AWS, GCP, Azure) and data warehousing solutions
- Knowledge of SQL, NoSQL databases, and data modeling principles