- Get a llam-2 model (llama-2-7b.Q5_K_M.gguf)
- Install llama-cpp-python with GPU support
- Install dependencies
- Run application
streamlit run app.py
- Minimize hallucination
- Create chains to validate input
- Ensure speaker is always extracted (if present)
- Prompt to accomodate many kinds of speech