About the Role
Mirage is seeking a Research Scientist to push the boundaries of large language models for multimodal creative tasks. You’ll develop new approaches for adapting and extending LLMs to understand and operate over complex, real-world data, particularly video. This role focuses on advancing model capabilities, improving reasoning and control, and enabling new forms of interaction between language and time-based media.
Responsibilities
- Develop novel approaches for training and adapting large language models
- Design new objectives, datasets, and fine-tuning strategies
- Explore multimodal reasoning and structured generation
- Run systematic experiments to improve model behavior and reliability
- Design evaluation frameworks for complex, real-world tasks in video analysis
- Analyze failure modes and iterate on model improvements
What makes you a great fit
- MS/PhD in ML, CS, or related field
- Strong track record in LLMs, NLP, video understanding, or multimodal learning
- Deep understanding of transformers and modern LLM techniques
- Experience with fine-tuning, alignment, or post-training methods, especially for adapting models to generate structured outputs
- Strong experimental rigor and research taste