We're a fast-growing startup building production-grade AI agents for enterprise customers at scale. We're looking for Applied AI Engineers who can own the design, build, and deployment of agentic workflows powered by Large Language Models (LLMs), from early prototypes to production systems that deliver concrete business value in enterprise workflows.
In this role, you'll work closely with customers on real-world business problems, often building first-of-their-kind agent workflows that integrate LLMs with tools, APIs, and data sources. While our pace is startup-fast, the bar is enterprise-high: agents must be reliable, observable, safe, and auditable from day one. You'll collaborate with customers, product, and platform teams, and help shape how agentic systems are built, evaluated, and deployed.
Responsibilities:
Work with enterprise customers and internal teams to turn business workflows into scalable, production-ready agentic AI systems.
Design and build LLM-powered agents that reason, plan, and act across tools and data sources with enterprise-grade reliability.
Balance rapid iteration with enterprise requirements, evolving prototypes into stable, reusable solutions.
Define and apply evaluation and quality standards to measure success, failures, and regressions.
Debug real-world agent behavior and systematically improve prompts, workflows, tools, and guardrails.
Contribute to shared frameworks and patterns that enable consistent delivery across customers.
Preferred experience:
Strong agent design skills, including prompt engineering, tool use, multi-step agent workflows (e.g., ReAct), and failure handling.
Ability to reason about and balance trade-offs between customization and reuse, and among autonomy, control, cost, latency, and risk.
Hands-on experience with modern LLMs (e.g., GPT, Claude, Gemini), vector databases, and agent/orchestration frameworks (e.g., LangChain, LangGraph, LlamaIndex, or custom solutions).
Ability and willingness to travel up to 25% of the time (flexible).
Strong written and verbal communication skills.
3+ years of experience building and shipping production software; 2+ years working with LLMs or AI APIs.
Strong communication skills and experience leading technical discussions with customers or partners.
Strong programming skills in Python and/or JavaScript/TypeScript.
Practical experience with RAG, agent workflows, evaluation, and performance optimization.
Bachelor's degree in Computer Science or a related technical field.
Experience working in a fast-moving startup environment.
Prior work delivering AI or automation solutions to enterprise customers.
Familiarity with human-in-the-loop workflows, fine-tuning, or LLM evaluation techniques.
Experience with cloud deployment and production operations for AI systems.
Background in applied ML, NLP, or decision systems.