Currently we use Vector search for RAG and manually do tokenization and vectorization. I have talked to Pamela Fox, and she proposed the next improvements:
- Use Hybrid search instead of vector search https://aka.ms/ragrelevance
- Add semantic ranker to reduce noise from the vector search
- Use Azure AI search built in vectorization
- Use Azure AI search built in splitter
- Change the documentation accordingly.
Currently we use Vector search for RAG and manually do tokenization and vectorization. I have talked to Pamela Fox, and she proposed the next improvements: