Writing

research notes on deep research apis, verification workflows, and building production-grade research systems.

building a deep research api benchmark from scratchfeb 2026
read a 95-page survey, found no real benchmarks existed, started building one. here's the process and first data.
why ai2's dr tulu isn't on the index (yet)jan 2026
the first open deep research model is impressive — but it's solving a different problem than the apis we track
observability in deep research apisdec 2025
what you can actually verify: comparing verification, attribution, and reasoning traceability across providers
verification debtdec 2025
why unverified citations compound risk in production research systems
citation quality metricsdec 2025
a framework for evaluating source reliability across deep research apis
enterprise deep researchdec 2025
what operators and decision-makers should know before adopting deep research apis