use AISearch import and vectorize data wizard with azure-search-openai-demo #2275

evan2k · 2025-01-13T13:15:41Z

I was trying to use Azure AI Search import and vectorize data wizard (from the portal) with the app.
It seems that the wizard is quite opinionated on the name of index fields it uses when
creating the index, which makes it incompatible with the app, unless you make changes to
the code and use the same names. Am I missing something or am I correct in my assumption?
thank you.

pamelafox · 2025-01-14T21:30:51Z

I just helped another team use the app with the integrated vectorization. These were the changes they had to make.

Modify this code in approach.py to pull out the correct field names.

 
                documents.append(
                    Document(
                        id=document.get("id"),
                        content=document.get("content"),
                        embedding=document.get("embedding"),
                        image_embedding=document.get("imageEmbedding"),
                        category=document.get("category"),
                        sourcepage=document.get("sourcepage"),
                        sourcefile=document.get("sourcefile"),
                        oids=document.get("oids"),
                        groups=document.get("groups"),
                        captions=cast(List[QueryCaptionResult], document.get("@search.captions")),
                        score=document.get("@search.score"),
                        reranker_score=document.get("@search.reranker_score"),
                    )
                )

You probably want id to be "chunk_id", and also for sourcepage and sourcefile to be "chunk_id", as I've been told that field contains the filename. You need filenames for citations to work. The embedding field should be "text_vector", I think. The rest of the fields are optional and don't need updating.

Change this line of code in approach.py to "text_vector" (or whatever your embedding field is called) -

return VectorizedQuery(vector=query_vector, k_nearest_neighbors=50, fields="embedding")

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

use AISearch import and vectorize data wizard with azure-search-openai-demo #2275

use AISearch import and vectorize data wizard with azure-search-openai-demo #2275

evan2k commented Jan 13, 2025 •

edited

Loading

pamelafox commented Jan 14, 2025

use AISearch import and vectorize data wizard with azure-search-openai-demo #2275

use AISearch import and vectorize data wizard with azure-search-openai-demo #2275

Comments

evan2k commented Jan 13, 2025 • edited Loading

pamelafox commented Jan 14, 2025

evan2k commented Jan 13, 2025 •

edited

Loading