GPTZero

LLM / Machine Learning Engineer

Department: Engineering
Job Type / Location: New York
Experience Required: 3+ years

About the Role

This role is part of Jobright TNT, a private hiring network connecting top talent with top AI startups. This is not a mass job posting: only select, high-signal candidates are invited to Jobright TNT and recommended directly to hiring teams. The hiring company, GPTZero, is an AI detection platform that helps educators and publishers verify whether content was written by humans or AI. It is used by over 4M people. Salary: $140K/yr - $250K/yr.

Role Responsibilities

  • Fine-tune and evaluate state-of-the-art language models
  • Optimize prompts to maximize classification accuracy, personalize outputs, and enforce style guidelines
  • Develop multi-agent workflows incorporating data from diverse sources using RAG
  • Improve and iterate on AI agents using observability and experimentation tools
  • Stay up to date with the latest literature and emerging technologies to solve novel problems
  • Work closely with product and design teams to develop intuitive applications that create societal impact

Qualifications

Required

  • 3+ YOE in Python
  • 1+ YOE with an LLM framework like LangChain or LlamaIndex
  • 1+ YOE with agentic or RAG applications
  • Strong exploratory data analysis (EDA) skills, used to turn data into pragmatic solutions
  • Experience pushing the cutting edge of LLM abilities on novel tasks with subjective outputs
  • Excellent software engineering skills, with experience building highly extensible and modular codebases as well as complex pipelines
  • Self-starter (pitch, plan, and implement as a project owner in a fast-paced team)
  • Highly motivated to make positive societal impact
  • Ability to wear multiple hats and be a leader as our team grows
  • Visa to work in Canada or the US

Preferred

  • Strong open-source portfolio
  • Publications at top-tier ML venues
  • Experience working in an early-stage startup environment
  • Experience with a prompt optimization framework like DSPy or TextGrad
