Context engineering platform for AI agents with a temporal knowledge graph, MCP server, and Graph RAG. Free tier; paid plans from $125/month ($104/month billed annually). SOC 2 Type II certified and HIPAA compliant.
Zep is a context engineering platform for AI agents built on Graphiti, an open-source temporal knowledge graph engine with over 27,000 GitHub stars. Unlike traditional memory systems that rely on static RAG or chat history alone, Zep automatically extracts entities, relationships, and facts from conversations and business data, then builds a unified context graph that evolves as information changes. When a fact is superseded (for example, when a user changes their preferences or a business relationship shifts), Zep automatically invalidates the old fact while preserving its historical context and provenance. This temporal awareness separates Zep from simpler vector-based memory systems such as mem0. The core workflow has three steps: ingest (chat messages, JSON business data, and documents sent via the API), build (Graphiti constructs a temporal knowledge graph with automatic entity extraction and fact invalidation), and assemble (Zep retrieves relevant context and formats it for the LLM at under 200ms P95 retrieval latency). Named customers include AWS, Samsung, Writer, HoneyBook, Twin Health, Thrive AI Health, Praktika.ai, AGI Inc, Harper, FlockX, and Aurasell.
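The fact-invalidation behavior described above can be illustrated with a toy in-memory store. This is a minimal sketch of the general idea only, not Zep's or Graphiti's actual data model or API: the names `Fact`, `TemporalFactStore`, `assert_fact`, `current`, and `as_of` are hypothetical, and the rule that a new fact supersedes any open fact with the same subject and predicate is a simplifying assumption.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List, Optional


@dataclass
class Fact:
    """Hypothetical fact record; not Graphiti's real schema."""
    subject: str
    predicate: str
    value: str
    valid_from: datetime
    invalid_at: Optional[datetime] = None  # None means the fact is still current
    source: str = ""  # provenance, e.g. the message the fact was extracted from


class TemporalFactStore:
    """Toy store that closes superseded facts instead of deleting them."""

    def __init__(self) -> None:
        self.facts: List[Fact] = []

    def assert_fact(self, subject: str, predicate: str, value: str,
                    at: datetime, source: str = "") -> None:
        # Assumption: a new fact supersedes any open fact on the same
        # (subject, predicate). The old fact is kept, with an end timestamp.
        for fact in self.facts:
            if (fact.subject, fact.predicate) == (subject, predicate) \
                    and fact.invalid_at is None:
                fact.invalid_at = at
        self.facts.append(Fact(subject, predicate, value, at, None, source))

    def current(self, subject: str, predicate: str) -> Optional[Fact]:
        # Return the single open fact, if any.
        for fact in self.facts:
            if (fact.subject, fact.predicate) == (subject, predicate) \
                    and fact.invalid_at is None:
                return fact
        return None

    def as_of(self, subject: str, predicate: str, when: datetime) -> Optional[Fact]:
        # Return the fact that was valid at the given point in time.
        for fact in self.facts:
            if (fact.subject, fact.predicate) != (subject, predicate):
                continue
            if fact.valid_from <= when and (fact.invalid_at is None or when < fact.invalid_at):
                return fact
        return None


store = TemporalFactStore()
store.assert_fact("alice", "prefers_channel", "email", datetime(2026, 1, 1), "msg-1")
store.assert_fact("alice", "prefers_channel", "sms", datetime(2026, 3, 1), "msg-2")

latest = store.current("alice", "prefers_channel")
earlier = store.as_of("alice", "prefers_channel", datetime(2026, 2, 1))
```

Here `latest` holds the SMS preference, while `earlier` still resolves to the email preference along with its original source, which is the property that separates a temporal graph from a store that simply overwrites values.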
On the LoCoMo benchmark, Zep achieves 80.32% accuracy at 189ms with single-shot retrieval and no slow agentic loops. On LongMemEval, Zep scores 63.8% on temporal retrieval tasks versus mem0's 49%, with accuracy improvements of up to 18.5% and a 90% latency reduction versus baseline implementations. On the Deep Memory Retrieval benchmark, Zep scores 94.8% versus MemGPT's 93.4%. The Graphiti MCP Server gives Claude Desktop, Cursor, VS Code with Copilot, and other MCP-compatible clients persistent knowledge graph memory across sessions. Python, TypeScript, and Go SDKs are available. Zep works with LangChain, LangGraph, CrewAI, and any other agent framework through its API-first architecture with webhook support. Named gaps: there is no no-code interface for non-developers; self-hosted Graphiti requires Docker, FalkorDB or Neo4j, and LLM API key management; and the HIPAA BAA is available on Enterprise only.
Pricing verified live on getzep.com/pricing as of June 2026. The Free tier at $0 includes 1,000 credits per month for prototyping, with limits applied. Flex at $125 per month (or $104 per month billed annually at $1,250 per year, saving 17%) includes 50,000 credits per month, auto-topup at 20% with 30-day rollover, 600 requests per minute, 5 projects, 10 custom entity and edge types, and 1-day API logs. Flex Plus at $375 per month (or $312.50 per month billed annually at $3,750 per year) includes 200,000 credits per month, 60-day rollover, 1,000 requests per minute, 10 projects, 20 custom entity and edge types, custom extraction instructions, webhooks, analytics, and 7-day API logs. Enterprise is custom-priced and includes on-premises or BYOC deployment, SSO, HIPAA BAA, dedicated SLA, VPC deployment, and 1-year audit and API logs. Overage pricing is $25 per 10,000 credits on Flex and $75 per 40,000 credits on Flex Plus. Credits are consumed based on Episode size: 1 credit per Episode up to 350 bytes, plus 1 credit per additional 350 bytes. Retrieval, storage, threads, users, and graph storage consume zero credits. Security confirmed live on getzep.com/enterprise: SOC 2 Type II certified via annual third-party audits with continuous monitoring; HIPAA-compliant infrastructure with BAAs available for healthcare organizations processing PHI. Deployment options include Zep Cloud (managed), BYOK (your own encryption keys), and BYOC (deployed inside your VPC).
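The credit and overage figures above lend themselves to a quick estimate. The sketch below is a rough calculator built only from the numbers quoted in this listing, under three labeled assumptions: a partial 350-byte chunk rounds up to a full credit, overage is billed in whole blocks, and rollover and auto-topup are ignored. The function names are hypothetical and are not part of any Zep SDK.

```python
BYTES_PER_CREDIT = 350  # from the listing: 1 credit per 350 bytes of Episode size

# Plan figures as quoted in this listing (USD, monthly billing).
PLANS = {
    "Flex": {"monthly": 125.0, "credits": 50_000,
             "overage_price": 25.0, "overage_block": 10_000},
    "Flex Plus": {"monthly": 375.0, "credits": 200_000,
                  "overage_price": 75.0, "overage_block": 40_000},
}


def ceil_div(numerator: int, denominator: int) -> int:
    """Integer ceiling division without floating point."""
    return -(-numerator // denominator)


def episode_credits(size_bytes: int) -> int:
    """Credits for one Episode. Assumption: partial chunks round up, minimum 1."""
    if size_bytes < 0:
        raise ValueError("size_bytes must be non-negative")
    return max(1, ceil_div(size_bytes, BYTES_PER_CREDIT))


def monthly_cost(plan: str, credits_used: int) -> float:
    """Estimated monthly bill. Assumption: overage is billed in whole blocks,
    and rollover and auto-topup are ignored."""
    p = PLANS[plan]
    overage = max(0, credits_used - p["credits"])
    blocks = ceil_div(overage, p["overage_block"])
    return p["monthly"] + blocks * p["overage_price"]


def annual_savings_pct(monthly_price: float, annual_price: float) -> float:
    """Percentage saved by paying annually instead of twelve monthly payments."""
    full_year = monthly_price * 12
    return (full_year - annual_price) / full_year * 100


small = episode_credits(350)         # fits in one chunk
larger = episode_credits(1_000)      # 1,000 bytes spans three 350-byte chunks
flex_bill = monthly_cost("Flex", 60_000)
flex_savings = annual_savings_pct(125.0, 1_250.0)
```

Under these assumptions, a 1,000-byte Episode costs 3 credits, 60,000 credits on Flex comes to $150 for the month, and the Flex annual discount works out to about 16.7%, which the pricing page rounds to 17%. Per credit, overage is $0.0025 on Flex and $0.001875 on Flex Plus.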
Zep is not the right fit for teams that need a no-code memory interface: the platform is developer-facing and requires API or SDK integration, and no visual configuration tool exists for non-technical users. Teams on tight budgets face a significant pricing gap: the free tier is limited to 1,000 credits per month for prototyping, and the next tier jumps to $125 per month ($104 per month billed annually) with no intermediate option, so teams whose moderate usage exceeds the free limits have no gradual upgrade path. Teams needing real-time browser or screen context rather than conversation and document memory should evaluate Browser Use (free, open-source) instead; Zep is optimized for structured conversation and entity data, not web interaction context.
Current state as of Q2 2026: Zep has surpassed 27,000 GitHub stars on the Graphiti repository and now positions itself as "The Context Lake for AI agents", with enterprise-scale deployment options including cloud, BYOK, and BYOC. Named customers include Fortune 500 companies AWS and Samsung alongside Twin Health, Thrive AI Health, Writer, HoneyBook, Praktika.ai, and others. S&P Global Market Intelligence has published coverage of Zep. The platform has launched dedicated comparison pages against mem0, Letta, AWS AgentCore, Vertex AI Memory Bank, Cognee, and Supermemory, reflecting competitive maturity in the agent memory category. Annual billing with 17% savings was added to the Flex and Flex Plus tiers. No G2 listing exists for Zep AI; the primary evidence signals are GitHub adoption, named enterprise customers, and analyst coverage.