
feat: support reranker #1532

Closed · wants to merge 11 commits into from

Conversation

@Anhui-tqhuang (Contributor) commented Jan 23, 2024

@imartinez hey, a review would be appreciated.

I want to add support for a reranker as a node postprocessor.

The functionality of the Reranker is as follows:

  1. It scores the similarity between the query and each document retrieved by the retriever.
  2. Documents with a similarity score below `cut_off` are excluded from the results.
  3. If fewer than `top_n` documents survive the cut-off, the system falls back to returning the top `top_n` documents, ignoring the `cut_off` score.
  4. The `hf_model_name` parameter lets users specify which FlagReranker model from Hugging Face to use for reranking.

Use the `enabled` flag to toggle the Reranker on or off as needed.
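The cut-off/fallback logic described above can be sketched roughly like this. This is a minimal illustration, not the PR's actual code: the `cut_off`/`top_n` names follow the description, and the scores are hypothetical stand-ins for whatever the FlagReranker model would return.

```python
# Hypothetical sketch of the reranker post-filtering described above.
# In the real component the similarity scores would come from a
# cross-encoder reranker model; here they are plain floats so the
# control flow is easy to follow.

def rerank_filter(scored_docs, cut_off=0.5, top_n=2):
    """scored_docs: list of (doc_id, similarity) pairs from the reranker."""
    # Highest similarity first.
    ranked = sorted(scored_docs, key=lambda pair: pair[1], reverse=True)
    # Step 2: drop everything below the cut-off.
    kept = [pair for pair in ranked if pair[1] >= cut_off]
    # Step 3: if too few survive, fall back to the top_n best, ignoring cut_off.
    if len(kept) < top_n:
        kept = ranked[:top_n]
    return kept

docs = [("doc1", 0.9), ("doc2", 0.2), ("doc3", 0.7)]
print(rerank_filter(docs, cut_off=0.5, top_n=2))   # → [('doc1', 0.9), ('doc3', 0.7)]
print(rerank_filter(docs, cut_off=0.95, top_n=2))  # nothing passes the cut-off, so the top 2 are returned anyway
```

Note the fallback in step 3 guarantees the downstream query engine always receives at least `top_n` documents (when that many were retrieved), even with an aggressive cut-off.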

@imartinez (Collaborator) commented:

@Anhui-tqhuang I think the idea is great! I have a general question though: those reranker models can be used through FlagEmbedding (as you proposed) or through SentenceTransformers. The benefit of the latter is that LlamaIndex already contains a SentenceTransformerRerank class. That way we could add the benefits you are proposing without:

  • depending on an extra library (FlagEmbedding)
  • having to write a custom reranker class

I'd love to know your opinion. Maybe you want to give the SentenceTransformer reranker a try using the same model. The rest of the PR looks like an amazing addition to the project. I'd ask you to update the documentation to reflect it (fern/docs/pages...)

Thanks!

@Anhui-tqhuang (Contributor, Author) commented:

Hey, thanks for your suggestion! I'll try the SentenceTransformer approach and update the documentation when I'm back from the Spring Festival!

@Anhui-tqhuang (Contributor, Author) commented Feb 9, 2024

By the way, LlamaIndex already supports an LLM reranker, but it is not very stable, since the LLM's output is unpredictable; as a result, privateGPT can hit errors when the response from the LLM cannot be parsed.

For example, I have a parser that expects the LLM's output in the following format:

  • doc 1: relevances 10
  • doc 3: relevances 8
  • doc 2: relevances 6

Sometimes it gives:

  • doc 1 relevances 10
  • doc 3 relevances 8
  • doc 2 relevances 6

Other times it gives:

  • doc 1: relevances 10 (this is the most relevant doc)
  • doc 3: relevances 8 this doc has mentioned the xxxx
  • doc 2: relevances 6

It can even produce summaries, which is not expected at all.

No matter what kind of prompt I write for the LLM reranker, it cannot pass all the cases.

That's the reason I want to use a dedicated reranker model: its result contains only the similarity score.

I might need some time to investigate whether we can use SentenceTransformers to run that dedicated model directly.
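To make the parsing problem concrete, here is a small illustration. The regex patterns are my own stand-ins, not the actual parser from this PR or from LlamaIndex: a strict pattern that expects exactly `doc N: relevances M` breaks as soon as the colon disappears or trailing commentary is appended, while even a lenient pattern still fails on free-form summaries.

```python
import re

# Strict pattern: exactly "doc <id>: relevances <score>" and nothing else.
STRICT = re.compile(r"^doc (\d+): relevances? (\d+)$")
# Lenient pattern: colon optional, trailing commentary tolerated.
LENIENT = re.compile(r"^doc (\d+):? relevances? (\d+)")

outputs = [
    "doc 1: relevances 10",                                  # expected format
    "doc 1 relevances 10",                                   # missing colon
    "doc 1: relevances 10 (this is the most relevant doc)",  # extra commentary
    "Here is a short summary of the documents.",             # free-form summary
]

for line in outputs:
    strict = bool(STRICT.match(line))
    lenient = bool(LENIENT.match(line))
    print(f"strict={strict!s:<5} lenient={lenient!s:<5} {line!r}")
```

Only the first line satisfies the strict pattern, and no pattern can rescue the summary case, which is why a dedicated reranker model that returns a bare float score per document sidesteps parsing entirely.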

@Anhui-tqhuang (Contributor, Author) commented Feb 18, 2024

@imartinez hey, I took another pass over the model card: https://huggingface.co/BAAI/bge-reranker-large#usage-for-reranker

It cannot be used with SentenceTransformers.

Moreover, as noted in my previous comments, I still want to avoid using an LLM for reranking, because its output is not stable enough to parse; a dedicated model that returns a float score is much more stable. That is also the reason I want a customized component instead of using LLMReranker.


@Anhui-tqhuang (Contributor, Author) commented:

@imartinez hey, could you please take a pass? It would be appreciated.

@cloudrage999 commented Feb 28, 2024

Hi Anhui,

I'm using your PR in my privateGPT instance, with bge-reranker as well. When do you think this PR will be approved?

Also, is bge-reranker the best local reranker we can have?

One more thing: I made a PR about query search results, in order to be able to do both semantic search and classic keyword-based search and retrieve more relevant context for the user query.

Can you please take a look at it and tell me your opinion on it, its bugs, etc.?
Thanks, man.

@Anhui-tqhuang (Contributor, Author) commented Feb 29, 2024

@cloudrage999 hey,

> When do you think this PR will be approved?

I am still waiting for a review from @imartinez.

> Also, is bge-reranker the best local reranker we can have?

To be honest, I haven't tried other reranker models, so I cannot tell.

@Anhui-tqhuang (Contributor, Author) commented:

@cloudrage999

> One more thing: I made a PR about query search results, in order to be able to do both semantic search and classic keyword-based search and retrieve more relevant context for the user query.
>
> Can you please take a look at it and tell me your opinion on it, its bugs, etc.?

Could you show me a link to the PR, please?

3 participants