This demo lets the user ask detailed questions about a range of historical events.
Raw data for this demo was taken from Wikipedia's list of "Vital articles" for history. These pages cover topics Wikipedia's editors believe should be of high quality, though not all of them have been assessed or meet that standard.
After the user inputs a question, the system looks up a relevant article by comparing the question against the downloaded articles, then attempts to answer the question from those articles using a process called retrieval-augmented generation (RAG).
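At its core, the retrieval step is a nearest-neighbour search over embeddings: the article whose embedding is most similar to the question's embedding is retrieved. A minimal sketch, with toy vectors standing in for real sentence embeddings and hypothetical article names:

```python
import math

def cosine_similarity(a, b):
    # Standard cosine similarity between two equal-length vectors.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def retrieve_best_article(question_vec, article_vecs):
    # Return the title whose embedding is closest to the question embedding.
    return max(article_vecs, key=lambda t: cosine_similarity(question_vec, article_vecs[t]))

# Toy data; a real system would use a sentence-embedding model and pgvector.
articles = {
    "French Revolution": [0.9, 0.1, 0.0],
    "World War II": [0.1, 0.9, 0.2],
}
best = retrieve_best_article([0.8, 0.2, 0.1], articles)
```

In the demo itself this search is delegated to pgvector rather than done in Python, but the similarity computation is the same idea.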
The articles were downloaded from Wikipedia using the download_data.py script. Any article consisting of meta-information was removed, as was any article that failed to download correctly.
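That filtering could be implemented along the following lines. This is only a sketch: the prefixes checked and the function name are assumptions, not the demo's actual rules.

```python
def is_usable_article(title, text):
    # Hypothetical filter: drop Wikipedia meta pages and failed/empty downloads.
    meta_prefixes = ("Wikipedia:", "Portal:", "Template:", "Category:", "Help:")
    if title.startswith(meta_prefixes):
        return False
    if not text or not text.strip():
        return False
    return True
```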
This system consists of the following:
- a Postgres server with the pgvector extension, running in a Docker container
- a Streamlit application that responds to user queries
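Wired together with docker-compose, the stack could look roughly like this. This is a sketch only: the service names, image tags, ports, and credentials are assumptions, not the demo's actual `docker-compose.yml`.

```yaml
# Sketch only: names, ports, and credentials are assumptions.
services:
  db:
    image: pgvector/pgvector:pg16   # Postgres with the pgvector extension
    environment:
      POSTGRES_PASSWORD: example
    ports:
      - "5432:5432"
  app:
    image: history_buffbot          # built below with `docker build . -t history_buffbot`
    ports:
      - "8501:8501"                 # Streamlit's default port
    depends_on:
      - db
```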
- Install the Poetry environment (this must come first, since the later steps use `poetry run`)

  ```shell
  poetry install
  ```

- Download the files from Wikipedia

  ```shell
  poetry run python3 dl_scripts/download_data.py
  ```

- Remove less helpful files

  ```shell
  bash dl_scripts/remove_short_files.sh downloaded_articles/*.txt
  ```
- Download the Hugging Face model (N.B. this requires your Hugging Face credentials in the environment)

  ```shell
  huggingface-cli download TheBloke/Wizard-Vicuna-7B-Uncensored-GGUF Wizard-Vicuna-7B-Uncensored.Q4_K_M.gguf --local-dir models --local-dir-use-symlinks False
  ```
- Build the Docker image locally

  ```shell
  docker build . -t history_buffbot
  ```
- Launch the stack

  ```shell
  poetry run docker-compose up -d
  ```
- Upload the reference files

  ```shell
  poetry run python3 db_scripts/upload_data.py downloaded_articles
  ```
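The upload script's job is to split each article into chunks before embedding and inserting them, since whole articles are too long to embed usefully. A sketch of the chunking step, with the table and column names in the comment being assumptions rather than the demo's actual schema:

```python
def chunk_text(text, size=500, overlap=100):
    # Fixed-size character windows with overlap, so a sentence that straddles
    # a chunk boundary still appears intact in at least one chunk.
    step = size - overlap
    return [text[i:i + size] for i in range(0, max(len(text) - overlap, 1), step)]

# For each chunk, the real script would compute an embedding and run
# something like (table and column names are assumptions):
#   INSERT INTO articles (content, embedding) VALUES (%s, %s)
```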
Now navigate to the Streamlit application in your browser; you should be able to ask short questions about historical events.
As this is a tool for demonstrating retrieval-augmented generation rather than a production-ready prototype, be aware that you may get unreliable results from such a small model.
I recommend consulting one of the community leaderboards, such as one of the following, for a model suitable for production.