Zep AI (YC W24)

Zep AI (YC W24)

Software Development

The Foundational Memory Layer for AI

About us

Long-Term Memory for AI Assistants. Recall, understand, and extract data from chat histories. Power personalized AI experiences.

Industry
Software Development
Company size
2-10 employees
Type
Privately Held
Founded
2023

Employees at Zep AI (YC W24)

Updates

  • Zep AI (YC W24) reposted this

    View profile for Santiago Valdarrama, graphic

    Computer scientist and writer. I teach hard-core Machine Learning at ml.school.

    Knowledge graphs are huge for AI Agents! A knowledge graph is the difference between a dumb AI agent and one that blows everyone's mind. Agents need memory and must know how to keep it updated over time (This is difficult, and it's the main reason most agents you've seen get dumber overnight!) This is where a knowledge graph helps. A knowledge graph is a network of connected points, each representing a piece of information. It's a very efficient structure for capturing complex relationships between data. Google uses a (huge) knowledge graph as part of Search. (Probably the largest knowledge graph in the world.) It was arguably one of the best improvements to Search since it was created. For building AI agents, knowledge graphs have two advantages: 1. They make it easier to extract facts from memory 2. They make it easier to update facts as they change The second point is crucial: You want agents to keep up with the world and update old facts as they discover new information. Here is a recommendation that will teach you how to use a knowledge graph as the memory layer of an AI agent: Ken Collins wrote an excellent article in which he builds a chat history for Llama 3 using Zep AI (YC W24)'s AI Memory (backed by a knowledge graph.) Here is a link to the article: https://fnf.dev/4fPAXtx This article is a great example of how to build agents that keep up with change. Zep is an open-source library that will serve as your agent's memory. You can connect it to any agent framework, model, or platform. The article's source code is in TypeScript, but you can use Zep with Python or Go as well. In a few bullet points: 1. You send messages to your AI agent 2. Zep synthesizes the information into a knowledge graph 3. You can retrieve any relevant facts from memory extremely fast Thanks to the Zep team for sponsoring this post.

  • Thanks for the shout out, Kesha Williams!

    View profile for Kesha Williams, graphic

    AI Advisor • Head of Enterprise Architecture • AWS Hero (Machine Learning) • Award-Winning Engineer • Keynote Speaker • Tech Influencer

    Have you heard of Zep AI (YC W24)? In the fast-evolving world of AI, Zep is the memory layer for AI agents we've been waiting for. The ability of AI agents to retain and utilize contextual information over time is paramount. Zep addresses this need by providing a robust memory layer that enhances the capabilities of AI agents and assistants. ➡️ Temporal Knowledge Graph: At the heart of Zep lies a temporal knowledge graph, a dynamic structure that models the evolving relationships between complex entities such as users and products. This graph enables AI agents to understand and reason about changes over time, ensuring their responses remain accurate and contextually relevant. ➡️ Low-Latency Memory Retrieval: Zep distinguishes itself with its low-latency memory retrieval system. By avoiding reliance on large language model (LLM)-based agentic behavior for memory access, Zep ensures that AI agents can swiftly recall pertinent information, leading to more responsive and efficient interactions. ➡️ Platform Independence: Designed with flexibility, Zep is platform, model, and framework-independent. Developers can integrate Zep into their AI systems regardless of the underlying technologies, making it a versatile solution for many applications. ➡️ Getting Started with Zep: Embarking on the journey with Zep is straightforward. The platform offers comprehensive documentation and SDKs in multiple programming languages, including Python, TypeScript, and Go, facilitating seamless integration into existing AI projects. And the best part? Zep is available now as a community edition, with a hosted version launching soon. Whether you're an AI developer or a product owner, this is your chance to explore how Zep can redefine what's possible in AI memory. 🔗 Ready to dive in? Start your Zep journey today: https://fnf.dev/4gyzIiY 

  • Zep AI (YC W24) reposted this

    If you give a RAG agent chunks of space articles from 1994, it’ll tell you Pluto’s a planet. But what happens if you give it chunks of articles from 2004, and chunks from an article from 2024 that says Pluto is no longer a planet? The LLM might get confused, or bias towards the more frequent mention of Pluto being a planet. Vector databases have enabled AI agents to perform incredibly well at information retrieval. But RAG doesn’t do a great job of encoding time. Vector databases match for keyword and semantic relevance…not “How recent is this information?” or “Is this fact still valid?” Unless you design your system to account for this, you’ll get inaccurate results. That’s fine if time isn’t an issue for your AI agent. But if you’re designing an enterprise product that talks to users, it probably is: - Your customer support agent should know past issues your customers ran into - Your product recommendation agent should know the customer bought Nikes, then returned them because they “didn’t like them” - Your company docs agent should know you used to support Salesforce in v1, but deprecated support in v2 LLMs, surprisingly enough, do a great job understanding this. But they can only reason with the information they have, and if the vector database doesn't retrieve temporally relevant chunks, it doesn’t matter. Luckily, we solved this with Graphiti, Zep’s open-source graph library. DM if you’re interested in learning more.

  • Zep AI (YC W24) reposted this

    I think all the hype around LLMs has caused us to overlook Small Language Models. Let me explain - you might actually realize you need them. We’re building the foundational memory later for AI agents. That means our customers expect low latency - no one wants to wait more than a few seconds for a response from an AI agent. Just imagine trying to build a sales bot that takes 15 seconds to answer your question. You’d just buy from somebody else. So the ability to do high-leverage tasks (like sales) quickly is extremely powerful. This is where smaller-language models can give you leverage. Fine-tuning for specific tasks and using accurate memory to ensure right data is provided can result in faster results and with the same accuracy as frontier models. Example models that come to mind: Microsoft Phi-3, Llama3.2 11B Just don’t feel like you have to throw the largest and most expensive model at a problem.

  • Zep AI (YC W24) reposted this

    “What if OpenAI copies your idea?” We’re living this out right now, and let me tell you what it’s like. First, some background: LLMs have a problem with memory. You shouldn’t just send a huge data dump to GPT-4o as context - it's inefficient, and you’ll need to worry about hallucination and poor recall. We realized this in 2023, and now we’ve built the foundational memory layer for AI. Now, your agents can work intelligently with real context - extracting facts and observations from dialogue. …And OpenAI, of course, realized they needed this too. They’re developing similar features, and shipping new models. Models that will get better, faster, cheaper, with larger context windows, and better reasoning capabilities. But I’m not worried, for two key reasons. First, we built Zep to benefit from these model improvements. Second, models alone can’t solve some key enterprise use cases: - Deciding which enterprise data to feed into a prompt from a huge universe of user touch points - Building low-latency pipelines to deliver this data - Securing and governing the data with multi-tenancy, privacy compliance, RBAC, etc These are enterprise necessities that A) aren’t the domain of a model company and B) get easier as models get better. On the surface we look similar, but really, we’re playing different games. That’s why I’m not worried.

  • Zep AI (YC W24) reposted this

    Seeing Anthropic Claude solving complex tasks using Zep AI (YC W24) for memory is fun! 💥 ➡️ Anthropic announced the Model Context Protocol yesterday, and Paul P. built an MCP server for Zep. With Zep, Claude has access to a Knowledge Graph memory spanning business data and chat history, enabling it to solve complicated tasks such as customer support tickets. 💬 + 🗄️ Developers can add business data, such as CRM tickets, emails, billing information, and more to Zep’s Knowledge Graph. With MCP, Claude can query Zep for relevant memory, assisting with queries such as “Help, I can’t log in!”. 🪄⭐️ The Zep PR is up on the MCP Server GitHub. https://lnkd.in/grkeNUBV Looking forward to other interesting projects built using MCP. If you try out the Zep Server, let us know what you think! ❤️

  • Zep AI (YC W24) reposted this

    There's so much at stake in next week's US elections. 🇺🇸 The Russian state has tried hard to influence the outcomes, sowing disbelief and discord. The Zep AI (YC W24) team built a visual tool to explore Russia's recent activities. View the Explorer: ➡️ https://lnkd.in/g7Mh7FmC The app refers to 50+ sources, including US DOJ indictments, US and foreign government research, non-governmental organization research, and media articles. Offering users a detailed view of these malign influence operations. The Explorer uses Graphiti, Zep's open-source Knowledge Graph library. The AI Assistant was built with LangChain's LangGraph framework and FastHTML. We've spent hours delving into the data. 🔎 We hope you find it as fascinating as we did. Let us know what you think!

  • Zep AI (YC W24) reposted this

    This is a fantastic overview, Eric Vyacheslav! Thanks for the writing about Zep AI (YC W24)! ❤️ 🚀

    View profile for Eric Vyacheslav, graphic

    AI/ML Engineer | Ex-Google | Ex-MIT

    You can now give your AI models long-term memory that actually learns and adapts over time. Zep is a memory layer that allows your agents to store, retrieve, and update information over time using a temporal knowledge graph. This lets your agents to remember key facts, track changes, and reason about evolving data in real-time. Key features: - Instant memory retrieval: Get relevant data from memory in milliseconds without slowing down your LLM. - Built-in temporal reasoning: Zep automatically updates as facts change, so your agents can adapt to new information without manual intervention. - Framework-agnostic: Integrate Zep with Python, TypeScript, or Go—whatever fits your stack. Try it out: https://lnkd.in/gYgcRX3A

Similar pages

Browse jobs

Funding

Zep AI (YC W24) 2 total rounds

Last Round

Pre seed

US$ 500.0K

See more info on crunchbase