Role

You’ll own the AI product experience for Pocket, focusing on prompts, agent behaviors, evaluation, and reliability to deliver best-in-class summaries and assistant workflows.

You’ll work closely with product, design, and backend to turn messy real-world inputs into consistently great outputs. You’ll join a small, high-output team that ships at the level of world-class consumer tech companies.

Compensation: Competitive salary + equity

Location: San Francisco

Responsibilities

AI Product Engineering

Design and iterate on prompt and agent systems that power summaries, structured notes, and “ask the assistant” experiences.
Build robust tool-using agent workflows: retrieval, function calling, planning, and safe execution.
Define output schemas and consistency rules so results are reliable, scannable, and trustworthy to users.
Develop evaluation harnesses for quality: golden sets, regression tests, rubric-based scoring, and human review loops.
Reduce failure modes: hallucinations, instruction drift, bad formatting, missing context, and unsafe outputs.
Tune for latency and cost while preserving quality; implement caching and fallbacks.
Partner with backend to productionize pipelines (queueing, retries, idempotency, observability).

Data & Feedback Loops

Build feedback mechanisms and labeling workflows to continuously improve quality.
Analyze failures, cluster issues, and translate them into prompt/tooling changes.
Create dashboards/metrics for quality, latency, and user trust.