Posting it here as well because I hope it will be useful for the LocalLLaMA community!
Paper: https://arxiv.org/abs/2310.08560
GitHub: https://github.com/cpacker/MemGPT
Overview
- LLMs are increasingly being used for perpetual chats
- Limited context lengths make perpetual chat challenging
- MemGPT manages a virtual context (inspired by virtual memory in operating systems) to create unbounded LLM context; a rough sketch of the idea follows this list
- With MemGPT, we demonstrate that LLMs can be taught to manage their own memory!
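To make the OS analogy concrete, here is a minimal, hypothetical Python sketch of the paging idea: a bounded "main context" (fast memory) that evicts older messages to an unbounded archival store (slow memory). This is not the MemGPT implementation; the class, method names, and whitespace "tokenizer" are all illustrative stand-ins.

```python
from collections import deque

class VirtualContext:
    """Bounded 'main context' (fast memory) plus an unbounded
    'archival' store (slow memory) that older messages page out to."""

    def __init__(self, max_tokens):
        self.max_tokens = max_tokens
        self.main_context = deque()  # messages currently in-window
        self.archival = []           # messages paged out of the window

    def _tokens(self, text):
        # Crude stand-in for a real tokenizer (e.g. tiktoken).
        return len(text.split())

    def _used(self):
        return sum(self._tokens(m) for m in self.main_context)

    def append(self, message):
        """Add a message, then evict the oldest messages to archival
        storage until the main context fits the token budget again."""
        self.main_context.append(message)
        while self._used() > self.max_tokens and len(self.main_context) > 1:
            self.archival.append(self.main_context.popleft())  # page out

    def search_archival(self, query, k=3):
        """Naive keyword lookup; MemGPT instead exposes retrieval to the
        LLM itself as callable functions (e.g. over embeddings)."""
        hits = [m for m in self.archival if query.lower() in m.lower()]
        return hits[:k]

ctx = VirtualContext(max_tokens=40)
for i in range(20):
    ctx.append(f"message {i}: notes on topic {i % 4}")
print(len(ctx.main_context), "in window,", len(ctx.archival), "archived")
print(ctx.search_archival("topic 2"))
```

The key point the paper makes is that the eviction and retrieval steps are not hardwired as above: the LLM is given memory-editing functions and decides for itself when to page data in and out.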
Abstract:
Large language models (LLMs) have revolutionized AI, but are constrained by limited context windows, hindering their utility in tasks like extended conversations and document analysis. To enable using context beyond limited context windows, we propose virtual context management, a technique drawing inspiration from hierarchical memory systems in traditional operating systems that provide the appearance of large memory resources through data movement between fast and slow memory. Using this technique, we introduce MemGPT (Memory-GPT), a system that intelligently manages different memory tiers in order to effectively provide extended context within the LLM's limited context window, and utilizes interrupts to manage control flow between itself and the user. We evaluate our OS-inspired design in two domains where the limited context windows of modern LLMs severely handicap their performance: document analysis, where MemGPT is able to analyze large documents that far exceed the underlying LLM's context window, and multi-session chat, where MemGPT can create conversational agents that remember, reflect, and evolve dynamically through long-term interactions with their users.
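The "interrupts" in the abstract refer to event-driven control flow: user messages (and other events) interrupt the agent, and the LLM's function calls can request a follow-up "heartbeat" so the model keeps control and chains several calls before yielding back to the user. Below is a rough, hypothetical sketch of that loop, assuming a structured function-call interface; `llm_step` and the function names here stand in for a real LLM call and are not the MemGPT API.

```python
def llm_step(context):
    # Stand-in for a real LLM call returning a structured function call.
    # Fakes one step that archives a note and requests a heartbeat,
    # then a second step that replies to the user and yields control.
    if not any("archived" in m for m in context):
        return {"function": "archival_insert",
                "args": {"text": "user likes hiking"},
                "request_heartbeat": True}
    return {"function": "send_message",
            "args": {"text": "Noted! I'll remember that."},
            "request_heartbeat": False}

def run_agent(user_message):
    context = [f"user: {user_message}"]
    pending = True                      # an interrupt (user event) arrived
    while pending:
        call = llm_step(context)
        if call["function"] == "archival_insert":
            context.append(f"system: archived '{call['args']['text']}'")
        elif call["function"] == "send_message":
            print("assistant:", call["args"]["text"])
        # A heartbeat lets the model chain calls; otherwise yield control.
        pending = call["request_heartbeat"]

run_agent("I like hiking")
```

The heartbeat flag is what lets the agent do multi-step memory management (search, summarize, write) between user turns without any input from the user.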