Retrieval Augmented Generation (RAG)
Overview
ref: https://github.com/chatchat-space/Langchain-Chatchat/blob/master/README_en.md
My Pick
Raw
Examples
Mistral-7B-Instruct Multiple-PDF Chatbot with Langchain & Streamlit
Fixed notebook is Chat_with_MultiplePDFs_Mistral_7B_Instruct1.ipynb
-
Youtube: https://www.youtube.com/watch?v=tqpXvPzteT4
-
Colab: https://colab.research.google.com/drive/11sf5LAF5EC1M0cDh-pUyowvS7EflwlMH?usp=sharing
-
⚠️ This notebook didn't use GPU. To support GPU will need.
!CMAKE_ARGS="-DLLAMA_CUBLAS=on" FORCE_CMAKE=1 pip install llama-cpp-python
and
llm = LlamaCpp( streaming = True, model_path="./mistral-7b-instruct-v0.1.Q4_K_M.gguf", temperature=0.75, top_p=1, verbose=True, n_ctx=4096, n_gpu_layers=30, n_threads=2, n_batch=521, )