logo

Lightricks

Research Scientist – LTX Model Evaluation

Department
Research
Job Type / Location
Jerusalem
Experience Required
2+ years
Posted On

About Us

Lightricks is an AI-first company dedicated to creating next-generation content creation technology for businesses, enterprises, and studios. Our mission is to bridge the gap between imagination and creation. At the core of our technology is LTX-2, an open-source generative video model, designed to deliver expressive, high-fidelity video with unmatched speed. This model powers both our own products and a growing ecosystem of partners through API integration.

We are globally recognized for pioneering consumer creativity with products like Facetune, a leading creative brand that introduced AI-powered visual expression to hundreds of millions of users worldwide. We combine deep research, user-first design, and end-to-end execution to bring the future of expression to all.

The Team

Following the success of LTX-2, our widely adopted open-source text-to-audio+video model, we are expanding our efforts to develop cutting-edge audio+video generation models. We are currently hiring Research Scientists to join our Model Evaluation team, which is part of the LTX Foundational Model group.

The Model Evaluation team serves as the central nervous system of the LTX Foundation Model group. Our role is not only to measure performance but also to define what "good" entails across a vast array of use cases. While we empower the next generation of creative tools, LTX also functions as a foundational engine for simulation pipelines, game engines, synthetic data generation, architectural rendering, and digital avatars. We act as the critical bridge between raw research and industrial-grade reliability, building the benchmarks that ensure our models are world-class for both artists and engineers.

The Role

As a Research Scientist in Model Evaluation, you will be the ultimate authority on model quality and utility. Your responsibilities will include designing automated judges, reward models, evaluation datasets, and benchmarking ecosystems that will shape the future of LTX. Your mission is to provide the "ground truth" for our pre-training and post-training teams. You will blend the rigor of a researcher with the intuition of a product-thinker, developing metrics that capture both the aesthetic soul of a video and the functional precision required for high-stakes professional use.

Key Responsibilities

  • Steer Training & Research: Systematically evaluate model checkpoints to provide actionable insights that guide training experiments and architectural decisions.
  • Design Benchmark Ecosystems: Develop and run rigorous benchmarks for release candidates against competitive models, ensuring LTX-2 remains world-class.
  • Build Next-Gen Metrics: Develop robust automatic metrics and Reward Models (e.g., for RL, ITS, auto-research agents) that quantify complex attributes like temporal coherence, physical correctness, spatial accuracy, and foley synchronization.
  • Diagnose & Analyze: Perform deep root-cause analysis on model failures, providing the diagnostic clarity needed for researchers to implement targeted fixes.
  • Scale Evaluation: Collaborate with platform engineers to deploy evaluation frameworks across large-scale GPU clusters.

Ideal Candidate Profile

  • Technical Depth: Master’s or PhD in Computer Vision, ML, or a related field, with strong software engineering skills and comfort in complex ML training environments.
  • The "Metric" Mindset": Deep expertise in evaluation methodology and statistical rigor. You understand why standard metrics often fail and how to build better ones.
  • Perceptual Intuition: Possess a sharp "eye and ear" for quality. You can articulate subtle nuances in motion or sound that automated systems might miss and leverage that intuition to improve our reward models.
  • Data-Driven Detective: You enjoy diving into datasets to uncover the "why" behind the numbers, taking pride in curating and specializing data for specific evaluation tasks.
  • Product-Minded Scientist: You can think like an end-user, caring that our models not only "beat the benchmark" but also work reliably in professional pipelines.
  • Statistical Rigor: You understand experimental design, significance testing, and the nuances of perceptual quality assessment.

Perks & Benefits

  • Daily door-to-door shuttles, offering Car-to-go subscriptions from several locations in central Israel, plus free parking and train-station pickups.
  • Two chef-led restaurants on site by the legendary Machneyuda Group, plus a bakery filled daily with fresh pastries.
  • Empowerment with cutting-edge tools and learning opportunities for growth and success through workshops, platform access and training, subscriptions, and clear guidelines for responsible AI use.

View Assessment Process

Think you'll be a good fit?