A custom implementation of the langchain Refine chain for generative question answering.
Technologies used:
- Llama2-13b
- ctranslate2
- chromadb
- HuggingFace
- langchain (some non-LLM components)
Ran on an Nvidia RTX A6000 GPU.
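For context, a minimal sketch of how the retrieval side can be wired together from the non-LLM langchain components (HuggingFace embeddings + chromadb). The embedding model name, chunking parameters, and placeholder page text are assumptions, not necessarily what this repo uses:

```python
from langchain.embeddings import HuggingFaceEmbeddings
from langchain.schema import Document
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain.vectorstores import Chroma

# `pages` stands in for the scraped cancer.org text; placeholder content here.
pages = ["Lung cancer symptoms often include a persistent cough, chest pain, ..."]
docs = [Document(page_content=p) for p in pages]

# Chunk size/overlap are assumed values, not the exact ones used in this repo.
splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100)
chunks = splitter.split_documents(docs)

embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")
store = Chroma.from_documents(chunks, embeddings, persist_directory="chroma_db")

# At query time, the top-k chunks are fed into the Refine loop one by one.
retrieved = store.similarity_search("What are common symptoms of lung cancer?", k=4)
```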
Inference is faster than with the vanilla HuggingFace pipeline.
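The speedup comes from running a converted checkpoint through ctranslate2 instead of transformers. A hedged sketch, assuming the model was converted offline with `ct2-transformers-converter --model meta-llama/Llama-2-13b-chat-hf --output_dir llama2-13b-ct2 --quantization int8_float16` (the checkpoint name, output directory, and quantization level are assumptions):

```python
import ctranslate2
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-13b-chat-hf")
generator = ctranslate2.Generator("llama2-13b-ct2", device="cuda")

def generate(prompt: str, max_tokens: int = 512) -> str:
    # ctranslate2 consumes token *strings*, not ids.
    tokens = tokenizer.convert_ids_to_tokens(tokenizer.encode(prompt))
    result = generator.generate_batch(
        [tokens], max_length=max_tokens, include_prompt_in_result=False
    )[0]
    return tokenizer.decode(result.sequences_ids[0])
```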
Custom prompts lead to better answer quality than the base langchain implementation.
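A minimal sketch of the Refine loop itself, building on the `generate` and `retrieved` pieces above; the prompt wording is illustrative only, not the exact custom prompts from this repo:

```python
# Illustrative prompts, not the repo's actual ones.
INITIAL_PROMPT = (
    "Answer the question using only the context below. "
    "If the context is insufficient, say so.\n\n"
    "Context:\n{context}\n\nQuestion: {question}\nAnswer:"
)
REFINE_PROMPT = (
    "You have an existing answer:\n{answer}\n\n"
    "Refine it with the additional context below, but only if the context "
    "adds relevant facts; otherwise return the existing answer unchanged.\n\n"
    "Context:\n{context}\n\nQuestion: {question}\nRefined answer:"
)

def refine_qa(question: str, chunks: list) -> str:
    # Answer from the first chunk, then refine with each remaining chunk.
    answer = generate(INITIAL_PROMPT.format(context=chunks[0].page_content,
                                            question=question))
    for chunk in chunks[1:]:
        answer = generate(REFINE_PROMPT.format(answer=answer,
                                               context=chunk.page_content,
                                               question=question))
    return answer

print(refine_qa("What are common symptoms of lung cancer?", retrieved))
```

Letting the refine prompt return the existing answer unchanged when a chunk is irrelevant is one lever against the hallucinations mentioned in the TODO below.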
TODO:
- Scrape more data from the American Cancer Society website
- Try a bigger model, e.g. Llama2-70b
- Implement automatic detection of the GPU count to parallelize computation (see the sketch after this list)
- Perform further prompt engineering to fight hallucinations
- Try a different embedding model
- Experiment with context-aware chunking
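A possible starting point for the GPU-count item above, using ctranslate2's own device query; this helper is a sketch and does not exist in the repo:

```python
import ctranslate2

def make_generator(model_dir: str) -> ctranslate2.Generator:
    n_gpus = ctranslate2.get_cuda_device_count()  # detect available GPUs
    if n_gpus == 0:
        return ctranslate2.Generator(model_dir, device="cpu")
    # One model replica per GPU; concurrent generate_batch calls are then
    # dispatched across the replicas.
    return ctranslate2.Generator(
        model_dir, device="cuda", device_index=list(range(n_gpus))
    )
```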