AI Agent Index
Zep

4.3 / 5 (editorial score)

by Zep AI

MCP · Editorial review
Context engineering platform for AI agents with a temporal knowledge graph, Graph RAG, and automated context assembly. Free tier available; paid plans start at $125/month. SOC 2 Type 2 and HIPAA certified.

Zep is a context engineering platform for AI agents built on Graphiti, an open-source temporal knowledge graph engine. Unlike traditional memory systems that rely on static RAG or chat history alone, Zep automatically extracts entities, relationships, and facts from conversations and business data, then builds a unified context graph that evolves as information changes. When a fact is superseded (for example, when a user changes their preferences or a business relationship shifts), Zep invalidates the old fact while preserving its historical context and provenance. This temporal awareness is what separates Zep from simpler vector-based memory systems such as mem0.

The core workflow has three steps:

  • Ingest: chat messages, JSON business data, and documents are sent via the API.
  • Build: Graphiti constructs a temporal knowledge graph with automatic entity extraction and fact invalidation.
  • Assemble: Zep retrieves relevant context and formats it for the LLM, with P95 retrieval latency under 200 ms.

On the LoCoMo benchmark, Zep achieves 80.32% accuracy at 189 ms using single-shot retrieval rather than slower agentic loops. On LongMemEval, Zep scores 63.8% on temporal retrieval tasks versus mem0's 49%, with accuracy improvements of up to 18.5% and a 90% latency reduction versus baseline implementations. On the Deep Memory Retrieval benchmark, Zep scores 94.8% versus MemGPT's 93.4%.

Pricing has four tiers:

  • Free: full API access, no credit card required.
  • Flex ($125/month): 50,000 credits included, 5 projects, 10 custom entity types, auto-topup.
  • Flex Plus ($375/month): 200,000 credits, analytics, webhooks, custom extraction instructions, API logs.
  • Enterprise (custom pricing): on-premises deployment, SSO, HIPAA BAA, dedicated SLA.

The Graphiti MCP Server gives Claude Desktop, Cursor, VS Code with Copilot, and other MCP-compatible clients persistent knowledge graph memory across sessions. Self-hosting the open-source Graphiti library requires Docker, FalkorDB or Neo4j, and an LLM API key. Python, TypeScript, and Go SDKs are available, and Zep works with LangChain, LangGraph, CrewAI, and any other agent framework. Customers include AWS, Samsung, HoneyBook, Twin Health, and Thrive AI Health. Founded 2023. SOC 2 Type 2 and HIPAA certified.
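The fact-invalidation behavior described above can be illustrated with a small, self-contained sketch. This is not Zep's or Graphiti's API or implementation; the `Fact` and `TemporalFactStore` names, their fields, and the matching rule (same subject and predicate, different value) are assumptions chosen only to show the idea of superseding a fact while keeping its history.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List, Optional


@dataclass
class Fact:
    # Hypothetical record shape, not Graphiti's schema.
    subject: str
    predicate: str
    value: str
    valid_from: datetime
    invalid_at: Optional[datetime] = None  # None means the fact is still current


class TemporalFactStore:
    """Toy store: new facts supersede old ones, but old ones are never deleted."""

    def __init__(self) -> None:
        self.facts: List[Fact] = []

    def assert_fact(self, subject: str, predicate: str, value: str, at: datetime) -> None:
        for fact in self.facts:
            if (fact.subject, fact.predicate) != (subject, predicate):
                continue
            if fact.invalid_at is not None:
                continue  # already superseded, leave its history untouched
            if fact.value == value:
                return  # same fact is already current, nothing to do
            fact.invalid_at = at  # supersede the old fact, keep it for history
        self.facts.append(Fact(subject, predicate, value, valid_from=at))

    def current(self, subject: str, predicate: str) -> Optional[str]:
        for fact in self.facts:
            if (fact.subject, fact.predicate) == (subject, predicate) and fact.invalid_at is None:
                return fact.value
        return None

    def as_of(self, subject: str, predicate: str, when: datetime) -> Optional[str]:
        # Return the value that was valid at the given point in time.
        for fact in self.facts:
            if (fact.subject, fact.predicate) != (subject, predicate):
                continue
            still_valid = fact.invalid_at is None or when < fact.invalid_at
            if fact.valid_from <= when and still_valid:
                return fact.value
        return None


store = TemporalFactStore()
store.assert_fact("alice", "prefers_contact_via", "email", datetime(2024, 1, 1))
store.assert_fact("alice", "prefers_contact_via", "sms", datetime(2024, 6, 1))
print(store.current("alice", "prefers_contact_via"))
print(store.as_of("alice", "prefers_contact_via", datetime(2024, 3, 1)))
```

In this sketch the second assertion marks the "email" fact as invalid from June 2024 rather than deleting it, so a query for the current preference and a query for the preference in March 2024 can both be answered from the same store.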

Pricing: freemium · Free

Segment: B2B

Setup: easy

Verified: May 14, 2026

Capabilities

data-analysis · autonomous · deep-research

Pros & Limitations

Editorial assessment

Pros

  • Temporal knowledge graph tracks how facts change over time: when information is updated or contradicted, Zep invalidates the outdated fact while preserving its historical context, which makes it more accurate than vector-only memory systems in complex multi-session agent workflows
  • Strong benchmark results: 80.32% accuracy at 189 ms on LoCoMo and 63.8% on LongMemEval versus mem0's 49%, achieved with single-shot retrieval rather than slow agentic tool-call loops
  • Graphiti MCP server gives Claude Desktop, Cursor, and other MCP clients persistent knowledge graph memory across sessions; Python, TypeScript, and Go SDKs get started in three lines of code with no agent framework lock-in

Limitations

  • Steep pricing jump from the free tier to Flex at $125/month with no intermediate tier, so early-stage teams and lower-volume projects that outgrow the free credit limits face a significant cost increase
  • Credit-based pricing makes costs difficult to forecast at production scale, as usage spikes trigger auto-topup charges that require careful per-request credit monitoring to avoid unexpected billing
  • Graphiti MCP server is experimental and self-hosted, requiring Docker, FalkorDB or Neo4j, and LLM API key management, meaning teams seeking a fully managed zero-infrastructure MCP memory solution face meaningful setup and operational overhead
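To make the pricing limitations above more concrete, here is a rough sketch of the arithmetic behind the Flex and Flex Plus tiers. It uses only the list prices and included credits stated in this listing. Overage and auto-topup rates are not given here, so they are deliberately left out, and the function and variable names are illustrative.

```python
# Included-credit arithmetic for the paid tiers named in this listing.
# Overage / auto-topup rates are not stated in the listing and are not modeled.
PLANS = {
    "Flex": {"monthly_usd": 125, "included_credits": 50_000},
    "Flex Plus": {"monthly_usd": 375, "included_credits": 200_000},
}


def usd_per_included_credit(plan: str) -> float:
    """Monthly price divided by the credits bundled with the plan."""
    p = PLANS[plan]
    return p["monthly_usd"] / p["included_credits"]


def cheapest_plan_covering(credits_needed: int):
    """Cheapest listed plan whose included credits cover the need, else None."""
    covering = [
        (p["monthly_usd"], name)
        for name, p in PLANS.items()
        if p["included_credits"] >= credits_needed
    ]
    return min(covering)[1] if covering else None


for name in PLANS:
    print(f"{name}: ${usd_per_included_credit(name):.6f} per included credit")
```

On these numbers, Flex works out to $0.0025 per included credit and Flex Plus to $0.001875, so Flex Plus costs three times as much for four times the credits. A workload slightly above 50,000 credits per month is where forecasting gets hard, because the answer depends on top-up rates this listing does not state.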

Technical Details

Deployment: cloud · self-hosted · API
Avg setup time: Under 5 minutes (free tier with full API access, no credit card, three lines of code to production)
Autonomous rate: Automatically extracts entities, relationships, and facts from conversations and business data without manual management; invalidates outdated facts as information changes; retrieves and assembles relevant context for agents at under 200 ms P95 latency.
MCP compatible: Yes
Integrations: Python · TypeScript · Go · LangChain · LangGraph · CrewAI · Claude · OpenAI · Gemini · Groq
Security: SOC 2 Type 2 · HIPAA

Industries

SaaS · Enterprise · B2B · DevTools
