Key Responsibilities
- Design, implement, and deploy production-grade AI agents with multi-step reasoning, tool-calling workflows, and human-in-the-loop coordination
- Build agent harnesses for reliability, including context management, tool definitions, memory, feedback loops, and observability
- Develop agentic infrastructure for robot operations, backend services, and operator interfaces
- Architect and maintain scalable backend services in Go, Rust, or TypeScript/Node.js with REST/GraphQL APIs
- Implement production-grade reliability features: retry logic, cost controls, structured output validation, and sandboxed tool execution
- Create systematic evaluation frameworks (evals, golden datasets, regression suites) to measure agent quality and catch regressions
Requirements
- 5+ years of professional software engineering with full-stack production experience
- Strong command of Python and/or TypeScript for clean, testable, and maintainable code
- Backend proficiency in Go, Rust, or TypeScript/Node.js with API design and system integration
- Frontend expertise in React/Next.js for building functional, production-grade UIs
- Experience with Docker, Kubernetes, CI/CD pipelines, and LLM-specific tracing and cost tracking