Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Text-embeddings #10

Open
haesleinhuepf opened this issue Nov 18, 2024 · 3 comments
Open

Text-embeddings #10

haesleinhuepf opened this issue Nov 18, 2024 · 3 comments
Assignees
Labels
enhancement New feature or request

Comments

@haesleinhuepf
Copy link
Member

In order to know which slides are related to each other and group slides by content, it might be useful to explore text-embeddings. To minimize costs, I propose to try mxbai-embed-large and ollama-python.

We could reuse infrastructure developed in #8 to turn PDFs into images, then images into text, and text into embeddings. The embedding could then be explored interactively using a notebook similar to this one.

@lea-33 lea-33 added the enhancement New feature or request label Nov 19, 2024
@lea-33 lea-33 self-assigned this Nov 19, 2024
@lea-33
Copy link
Collaborator

lea-33 commented Nov 19, 2024

Thanks! I am working on it. I just wanted to add that in the example notebook the code for the stackview.sliceplot is somehow not working correctly as you don't actually see the plot! But it worked after i tried it myself, so it should work actually.

@haesleinhuepf
Copy link
Member Author

Don't worry about the interactive plot. This doesn't render on the github website :-)

@lea-33
Copy link
Collaborator

lea-33 commented Nov 19, 2024

Don't worry about the interactive plot. This doesn't render on the github website :-)

Alright! I also tried to do it in the Notebook I downloaded but still I can't really select the data points like it is supposed to be. I only can draw a circle but the data points are not chosen as a selection afterwards.. Have you ever encountered that problem?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

When branches are created from issues, their pull requests are automatically linked.

2 participants