Join our team
We build research infrastructure that helps teams understand how AI agents behave, fail, and improve in dynamic, real-world environments, before they reach production.
We’re a team of AI researchers and engineers building infrastructure to evaluate and steer frontier AI systems.
Our work combines research rigor with hands-on engineering, from publishing at top ML venues to shipping production-grade evaluation tools.
Our Core Values
Customer Obsession
We are active listeners, prioritizing customer needs and delivering experiences that surpass expectations.
Our commitment to customers means crafting personalized solutions to address their unique challenges.
We build and nurture meaningful, long-term relationships with our customers.
Growth Mindset
We are lifelong learners, fostering an environment that nurtures curiosity and personal development.
Change is our ally, not our enemy. We adapt and evolve to stay at the forefront of our field.
Every challenge is an opportunity to grow, and we are unafraid to step outside of our comfort zone.
Adaptability & Proactivity
We thrive in uncertainty, using our skills and resources to navigate evolving landscapes.
We strive to stay flexible and patient, recognizing that our journey involves both small steps and giant leaps.
While we might face temporary obstacles, we see them as stepping stones, not roadblocks.
Kindness & Positivity
We champion a culture of trust, building meaningful relationships within our team.
Compassion, empathy, and respect aren't just words; they're the backbone of our interactions.
We lead by example, bringing kindness and positivity to the world around us.
How we support you
We support our team with the tools, flexibility, and trust needed to do deep, meaningful work
Flexible work & focus time
Work in a way that supports deep thinking and sustainable pace
We value realism, diversity, and scalable difficulty in our environments. We set up scenarios with real-world rules, data, tools (actions), and rewards, iterating on complexity as we go.
Growth & learning
We invest in long-term growth, not just short-term output
Ownership & impact
Small teams mean real responsibility and visible impact
Well-being & balance
We care about sustainable work, not burnout