Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve exclude path filesystem logic #13894

Closed

Conversation

johnlarkin1
Copy link

@johnlarkin1 johnlarkin1 commented Nov 27, 2023

See the issue above for more detail, but I don't think that the exclude parameter is being handled really like what the API is indicating / what we were thinking (largely due to annoyances with pathlib's Path.match). Let me know if you all want me to add some more unit tests and build this out a little bit more.

@dosubot dosubot bot added the size:XS This PR changes 0-9 lines, ignoring generated files. label Nov 27, 2023
Copy link

vercel bot commented Nov 27, 2023

The latest updates on your projects. Learn more about Vercel for Git ↗︎

1 Ignored Deployment
Name Status Preview Comments Updated (UTC)
langchain ⬜️ Ignored (Inspect) Visit Preview Nov 27, 2023 1:53pm

@dosubot dosubot bot added Ɑ: doc loader Related to document loader module (not documentation) 🤖:improvement Medium size change to existing code to handle new use-cases labels Nov 27, 2023
@johnlarkin1 johnlarkin1 marked this pull request as draft November 27, 2023 02:55
@johnlarkin1
Copy link
Author

Hey team, I wasn't able to actually run poetry install --with test so if there are issues with the formatting / builds let me know how to resolve and I definitely can.

Here was the issue i filed: #13912

@johnlarkin1 johnlarkin1 marked this pull request as ready for review November 27, 2023 14:10
@dosubot dosubot bot added size:S This PR changes 10-29 lines, ignoring generated files. and removed size:XS This PR changes 0-9 lines, ignoring generated files. labels Nov 27, 2023
Copy link
Member

@efriis efriis left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

pathlib's match sounds like a mess! However, I'm not sure the included logic is the best way of handling it, because it's a bit too custom.

I haven't looked too deeply into this, but given the python bug is tagged only up to python3.9, does the issue resolve on newer python versions?

If possible, I'd love to resolve this without any custom path matching logic in the langchain library to overcome issues in python.

@hwchase17 hwchase17 closed this Jan 30, 2024
@baskaryan baskaryan reopened this Jan 30, 2024
@ccurme ccurme added the langchain Related to the langchain package label Jun 21, 2024
@ccurme
Copy link
Collaborator

ccurme commented Jul 22, 2024

Closing as author is unresponsive, will re-open if needed.

@ccurme ccurme closed this Jul 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Ɑ: doc loader Related to document loader module (not documentation) 🤖:improvement Medium size change to existing code to handle new use-cases langchain Related to the langchain package size:S This PR changes 10-29 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants