Add ingest folders with symlink #748
Perfect.
But have you tested it?
What do you mean by separate storage? Can you specify further?
Do you mean a different hard disk/SSD, or something else?
@spaceymonk I am asking just for clarification.
Okay.
On Sat 6 Apr, 2024, 11:08 AM Berktuğ Kaan Özkan, ***@***.***> wrote:
- I've tested it, of course.
- By storage I meant another directory in the filesystem that I can
  symlink to under the SOURCE_DOCUMENTS folder. For example, I store my
  documents under the ~/Documents path, and if I wanted to run *localGPT*
  on my documents, I would have to copy or move all the files into the
  SOURCE_DOCUMENTS path.
  With this change I just run `ln -s ~/Documents/ Documents` and it
  automatically detects and ingests all the files under ~/Documents/.
- There is one potential problem: as the documentation states, if you
  create a loop in your path, i.e. symlink a parent directory inside a
  child directory, the walk may run into an infinite loop, because Python
  does not keep track of the paths it has already visited.
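One possible way to guard against the symlink loop described above is to remember the real path of every directory already visited and prune anything seen before. This is only a minimal sketch and not part of the PR: the helper name `walk_files_once` is hypothetical, and it assumes the ingest script walks the source folder with `os.walk`.

```python
import os


def walk_files_once(source_dir):
    """Yield file paths under source_dir, following symlinks,
    but descending into each real directory at most once."""
    visited = set()  # resolved paths of directories already walked
    for root, dirs, files in os.walk(source_dir, followlinks=True):
        real_root = os.path.realpath(root)
        if real_root in visited:
            # Already walked via another path (e.g. a symlink loop):
            # clear dirs in place so os.walk does not descend further.
            dirs[:] = []
            continue
        visited.add(real_root)
        for name in files:
            yield os.path.join(root, name)
```

The trade-off is that a directory reachable through two different links is ingested only once, under whichever path the walk reaches first.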
Any updates?
Hi, I wanted to propose a simple change in the `ingest` script.
Reason behind it: I store my documents in separate storage and, instead of copying files, I symlink them into the `SOURCE_DOCUMENTS` folder. It works when I link documents one by one (i.e. giving the full path for each file). But I have documents in nested folders, and linking directories does not work because the `followlinks` flag defaults to `False` (see the documentation).
This PR solves this problem.
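To illustrate the behaviour described above, here is a small self-contained sketch. It assumes the flag in question is the `followlinks` parameter of Python's `os.walk`. The helper `list_files` and the temporary directory layout are made up for the demonstration and are not taken from the ingest script.

```python
import os
import tempfile


def list_files(source_dir, followlinks=False):
    """Return sorted paths (relative to source_dir) of all files found."""
    found = []
    for root, _dirs, files in os.walk(source_dir, followlinks=followlinks):
        for name in files:
            full_path = os.path.join(root, name)
            found.append(os.path.relpath(full_path, source_dir))
    return sorted(found)


with tempfile.TemporaryDirectory() as tmp:
    # Hypothetical layout: documents live outside SOURCE_DOCUMENTS,
    # in a nested folder, and are exposed through a directory symlink.
    nested = os.path.join(tmp, "external_docs", "nested")
    os.makedirs(nested)
    with open(os.path.join(nested, "notes.txt"), "w") as fh:
        fh.write("hello")

    source = os.path.join(tmp, "SOURCE_DOCUMENTS")
    os.makedirs(source)
    os.symlink(
        os.path.join(tmp, "external_docs"),
        os.path.join(source, "Documents"),
        target_is_directory=True,
    )

    # Default behaviour: the symlinked directory is not descended into.
    without_links = list_files(source)
    # With followlinks=True the nested file is discovered.
    with_links = list_files(source, followlinks=True)
    print(without_links)
    print(with_links)
```

With the default setting the walk finds nothing under the linked directory. With `followlinks=True` it finds the nested file.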