About the Role

Fieldguide is building AI agents for the most complex audit and advisory workflows. We’re a San Francisco-based vertical AI company operating in a $100B+ market undergoing rapid transformation. Over 50 of the top 100 accounting and consulting firms trust Fieldguide to power mission-critical work.

As an AI Engineer, you’ll build Fieldguide’s intelligence layer—agentic workflows, architectures, and evaluation systems that power enterprise-grade agents. You’ll operate at the intersection of product engineering, applied AI, and production systems.

What You’ll Own

Build and Ship AI Agents

Implement agentic workflows that automate complex audit tasks
Turn customer problems into discrete, testable agent behaviors
Integrate LLMs, tools, and retrieval logic into reliable agent experiences
Monitor and maintain agents in production with a focus on reliability and performance

Execute with AI-Native Leverage

Use AI tooling to accelerate how you design, build, and test features
Prototype quickly and harden systems for enterprise reliability
Build evaluations and feedback loops to improve agent outputs
Write prompts and retrieval pipelines that perform at scale

Contribute to Product Impact

Work closely with senior engineers and product to scope and deliver features
Translate customer workflows into clear engineering requirements
Identify and surface capability gaps to improve team velocity

Who You Are

You’re a strong software engineer who’s built your skills for an AI-native world. These principles resonate with you:

Bias to building: You move fast and resolve uncertainty by shipping
AI-native instincts: You're excited to use LLMs, agents, and automation as core tools
Curiosity and learning velocity: You ramp quickly on new problems and technology
Collaborative: You communicate proactively and seek feedback often
Attention to detail: You care about quality in code and customer-facing outputs

Experience

We care more about capability and trajectory than years on a resume, but most strong candidates have:

1–3 years shipping production software in real-world systems
Proficiency in TypeScript and/or Python
Some exposure to LLM APIs (OpenAI, Anthropic, Gemini) or AI tooling
Familiarity with retrieval pipelines, RAG concepts, or vector databases is a plus
Experience with React and Postgres is helpful but not required
Comfortable working in ambiguity with direction from senior engineers

What Should Excite You

Enterprise-grade reliability: Building systems professionals depend on
Human-in-the-loop design: Knowing when to automate vs. when to surface decisions
Nuanced evaluation: Audits require judgment, so feedback structures matter
Explainability: Making AI outputs and reasoning transparent and trustworthy
Complex domains: Navigating compliance and enterprise rigor while moving fast
Shipping daily value: Delivering agent experiences customers use every day

AI Engineer