About the Role
Tata Technologies is seeking a skilled Data Scientist with a focus on Machine Learning and Data Science in enterprise settings. The ideal candidate will have experience in developing and deploying robust data solutions.
Responsibilities
- Perform Python engineering with strong development, maintenance/debugging, unit/integration testing, and CI/CD practices.
- Work on GenAI document extraction, including prompting and evaluation.
- Handle PDF/document parsing and text matching/validation.
- Utilize Azure OpenAI Services, VLM/OCR/layout models, ReqIF/XML handling, and DNG/DOORS import workflows.
- Apply basic DevOps principles, including containers, logging/monitoring.
Requirements
- Experience: 3+ years in ML/data science in enterprise settings.
- Deep Learning: Proficiency in supervised/unsupervised learning, Segmentation, anomaly detection, model evaluation, feature engineering, and Pytorch.
- Programming: Expert-level proficiency in Python.
- Data: Familiarity with ETL/ELT, and handling structured/unstructured data.
- Tools: Experience with Git, VSCode, MLflow, Docker, Azure ML Studio, Azure DevOps.
- Domain: Manufacturing/Quality inspection department experience is preferred but not mandatory.
Nice to Have Skills
- Docker
- Kubernetes
- ReactJS
- Frontend development