Long Term Memory extension #675
13 comments · 8 replies
-
Wow! It looks interesting, and a great feature for chat mode!
-
@wawawario2 I have a question: does LTM only take effect the next time you load a chat? Can it function as additional context? Or is the only way to use LTM to keep reloading your character?
-
By merging the extension here, you end up losing your ability to easily modify it. Since this is a very complex extension in active development, I encourage you to convert your repository into something that can be installed like this:
I have added a mention of your extension in the wiki: https://github.com/oobabooga/text-generation-webui/wiki/Extensions Do you want me to merge the changes to modules/chat.py?
-
@wawawario2 This is great; I haven't looked at the code yet, but can you describe how it works? Also, if you open up the issues queue on your repo, I'd gladly test and possibly contribute. Lack of long-term memory (be it through context, a LoRA, a softprompt, or some other voodoo) is the biggest thing holding these bots back from becoming parts of our lives (at least, for the very bleeding-edge early adopters like us). I look forward to this! Let me know if I can help!
-
This is so great. This link describes how the LTM module works under the hood.
-
Yeah, this is very cool. I'm excited to see where this goes. Thanks for your efforts so far.
-
I think that this commit broke the extension: e722c24. The fix should be simple. I will also need to add a "continue" parameter to
-
I never realized this extension was the vector embedding storage system. Could I feed an entire GitHub repository, with its various files, into it, and query for related lines using the extension? That would be magic. I never really played with ChatGPT and its vector embedding storage, so I don't have a feel for its effectiveness on code.
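As a rough sketch of the kind of retrieval being asked about here (this is not the extension's actual code; the function names are made up, and a toy bag-of-words vector stands in for a real embedding model), indexing lines and querying by similarity might look like:

```python
import math
import re
from collections import Counter

def embed(text):
    """Toy bag-of-words 'embedding' -- a stand-in for a real embedding model."""
    return Counter(re.findall(r"\w+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def query(store, question, top_k=3):
    """Return the top_k stored lines most similar to the question."""
    q = embed(question)
    ranked = sorted(store, key=lambda item: cosine(q, item["vec"]), reverse=True)
    return [item["line"] for item in ranked[:top_k]]

# Index individual lines from hypothetical repo files.
lines = [
    "load the model weights from disk",
    "save a checkpoint of the model",
    "installation instructions for the extension",
]
store = [{"line": ln, "vec": embed(ln)} for ln in lines]
print(query(store, "how do I load the model?", top_k=1))
```

A real setup would swap `embed` for an actual embedding model and keep the vectors in persistent storage, but the retrieval shape is the same.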
-
This is a really great feature, I hope it can be quickly integrated into the code.
-
I installed this extension and it worked fine for two days, but then my models started to go crazy. I don't know why: whichever model I switched to, I got irrelevant responses, the responses mixed up the persons, or the Chronos model inserted a specific detail from LTM into one of its responses (syntax and all!)
-
Hey all, as a heads up this extension is no longer in development. Anyone still using this extension should migrate. I really appreciate all the support over the past few months.
-
What are the relationship and the differences between RAG ("just read my files") and long-term memory? Is it merely a coherence issue?
-
Hey wawawario, I've really been meaning to get in contact with you to ask a simple question about your extension. I want to know: if I edit long_term_memory.db with DB Browser for SQLite (as I have been able to), will the embeddings be automatically updated, or will it still use the old embeddings? I want to retroactively change memories, as we are trying to implement oobabooga in a small-business setting with business reports as memories, and it would be much easier than re-submitting all of the memories in order to change the contents of one of them.
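If it turns out the stored vectors do go stale after a manual edit, one workaround is to update the text and recompute the embedding together. The sketch below assumes a hypothetical `memories(id, text, embedding)` schema and a toy `toy_embed` function standing in for whatever model the extension actually uses; it is not the extension's real schema or API:

```python
import json
import sqlite3

def toy_embed(text):
    """Stand-in for the extension's real embedding model (ASSUMPTION)."""
    return [float(len(text)), float(sum(map(ord, text)) % 1000)]

def update_memory(conn, memory_id, new_text):
    """Rewrite a memory's text AND recompute its embedding in one transaction,
    so the stored vector never goes stale. Schema here is hypothetical."""
    vec = json.dumps(toy_embed(new_text))
    with conn:
        conn.execute(
            "UPDATE memories SET text = ?, embedding = ? WHERE id = ?",
            (new_text, vec, memory_id),
        )

# Demo against an in-memory database with the assumed schema.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE memories (id INTEGER PRIMARY KEY, text TEXT, embedding TEXT)")
conn.execute(
    "INSERT INTO memories VALUES (1, 'old report', ?)",
    (json.dumps(toy_embed("old report")),),
)
update_memory(conn, 1, "Q3 revenue grew 12%")
text, emb = conn.execute("SELECT text, embedding FROM memories WHERE id = 1").fetchone()
```

The point of the pattern is simply that the text column and its vector are updated atomically, rather than hand-editing one and hoping the other is regenerated.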
-
EDIT: As a quick heads up, the repo has been converted to a proper extension, so you no longer have to manage a fork of ooba's repo.
So I'm working on a long-term memory module. Right now, I'm using this UI as a means to field-test it and make improvements, but if there's any interest in merging this module directly into this repo, I can align some of my priorities accordingly (mainly adding some features necessary to make it usable for a general-purpose audience). It's currently a decoupled extension so it should have minimal, if any, impact on the rest of the repo. The current implementation is detailed in the readme. I'm open to any suggestions.
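Since the post calls the module a decoupled extension, the general pattern (sketched below with made-up names and a fake memory store, not the extension's actual code) is an input-modifier hook that retrieves relevant memories and prepends them to the prompt before generation:

```python
RELEVANT_MEMORY_LIMIT = 2  # illustrative cap, not a real setting

def fetch_relevant_memories(user_input):
    """Stand-in for the extension's vector lookup against its memory store."""
    fake_store = ["Anon's birthday is in June", "Anon dislikes spoilers"]
    return fake_store[:RELEVANT_MEMORY_LIMIT]

def input_modifier(user_input):
    """Hook pattern used by webui extensions: rewrite the prompt, leave
    the rest of the pipeline untouched."""
    memories = fetch_relevant_memories(user_input)
    if not memories:
        return user_input
    context = "\n".join(f"[Memory] {m}" for m in memories)
    return f"{context}\n{user_input}"

print(input_modifier("What should we do for my birthday?"))
```

Because the hook only rewrites the string handed to the model, the extension stays decoupled from the rest of the repo, which matches the "minimal, if any, impact" claim above.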