niah & mcp.heptafox.com are live

Testing infrastructure for
GenAI & AI agents

HeptaFox builds the datasets, MCP servers, and scrape targets you need to test RAG pipelines, agents, and MCP clients — under realistic, reproducible conditions.

# GenAI # Agents # MCP # RAG # Evals # Testing
// products

Built for evaluating AI systems

Each product targets one part of the AI testing stack. NIAH and the MCP Sandbox are live today; Scrape is on the way.

// the stack

From retrieval to agents to evals

HeptaFox focuses on the unglamorous-but-essential layer: reproducible test data and environments that let you measure GenAI systems instead of guessing.

RAG & retrieval

Controlled haystacks and ground-truth needles to quantify recall, precision, and long-context degradation.

Agents & tools

Deterministic MCP servers and scrape targets so agent behavior is testable, not anecdotal.

Evals & testing

Fixed, versioned environments that make regression testing of GenAI pipelines actually possible.