Use cases

What people build with NEO

Real projects built by NEO — from LLM benchmarks to agent swarms. Pick a workflow below to browse, or start with a featured use case.

Featured

3× recall ↑

Build Pipelines

Audit a RAG pipeline end-to-end

NEO inspects retrieval, prompts, and model calls, finds zero-context failures, and delivers a report with corrected thresholds.

60% cost ↓

Evaluate & Benchmark

Benchmark models before a production swap

Run structured quality, latency, and cost comparisons across providers automatically. Pick models with evidence, not intuition.

Production-ready

Build Agents

Compose a self-healing multi-agent swarm

NEO wires tool boundaries, shared memory, and retry logic so your agent swarm stays reliable as requirements change.


Browse by workflow

Same stack you're already debugging

Agents with brittle tool calls. Prompts that need another pass. Evals before you trust a model swap. NEO lives in VS Code or Cursor and helps you turn that work into real code and runs, so you iterate on behavior, not boilerplate.
