You Believe Your LLM is Not Delusional? Think Again!

This repository contains the code accompanying our paper titled "You Believe Your LLM is Not Delusional? Think Again! A Study of LLM Hallucination on Foundation Models under Perturbation," which is currently under review.

Overview

In this work, we present an evaluation framework designed to detect hallucinations in Large Language Models (LLMs). Our approach involves:

  • Query Perturbation: Applying controlled perturbations at different levels (character, word, and sentence) to the input queries.
  • Consistency Score Calculation: Measuring the consistency between the responses generated for the original query and the perturbed query to identify potential hallucination scenarios.

Repository Structure

This repository is organized into the following sections:

1. Prepare Data

  • Description: Contains scripts to download datasets from Hugging Face Hub and apply various perturbations to the queries at character, word, and sentence levels.
  • Files and Modules:
    • perturbations: Module with functions to introduce character-level, word-level, and sentence-level perturbations (see the sketch below).
    • main.py: Script to download data from Hugging Face Hub and apply perturbations.
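
The repository's actual perturbation functions live in the perturbations module and are not reproduced here. The following is only a minimal sketch, with hypothetical function names and strategies, of what character-, word-, and sentence-level query perturbations can look like:

    import random

    def perturb_characters(query: str, n_swaps: int = 1) -> str:
        """Character-level perturbation: swap adjacent characters at random positions."""
        chars = list(query)
        for _ in range(n_swaps):
            if len(chars) < 2:
                break
            i = random.randrange(len(chars) - 1)
            chars[i], chars[i + 1] = chars[i + 1], chars[i]
        return "".join(chars)

    def perturb_words(query: str) -> str:
        """Word-level perturbation: drop one randomly chosen word."""
        words = query.split()
        if len(words) > 1:
            words.pop(random.randrange(len(words)))
        return " ".join(words)

    def perturb_sentence(query: str) -> str:
        """Sentence-level perturbation: rephrase by appending a harmless trailing clause."""
        return query.rstrip(".?! ") + ", if you happen to know?"

    original = "What is the capital of France?"
    print(perturb_characters(original))
    print(perturb_words(original))
    print(perturb_sentence(original))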

2. Generate Responses

  • Description: Includes code for querying LLMs and obtaining responses for both original and perturbed queries.
  • Files and Modules:
    • generate_response.py: Helper that formats the call to the LLM.
    • main.py: Script to generate responses from the LLM for both the original and the perturbed queries (see the sketch below).
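
The exact LLM backend and prompt template used by generate_response.py are not shown in this README. As a minimal sketch, assuming a Hugging Face transformers text-generation pipeline with gpt2 as a stand-in model and a hypothetical prompt format, response generation for an original and a perturbed query could look like this:

    from transformers import pipeline

    # Stand-in model; the repository may target a different foundation model.
    generator = pipeline("text-generation", model="gpt2")

    def format_prompt(query: str) -> str:
        """Hypothetical prompt template; the actual formatting lives in generate_response.py."""
        return f"Question: {query}\nAnswer:"

    def generate_response(query: str, max_new_tokens: int = 64) -> str:
        prompt = format_prompt(query)
        output = generator(prompt, max_new_tokens=max_new_tokens, do_sample=False)
        # The pipeline returns the prompt plus the continuation; keep only the continuation.
        return output[0]["generated_text"][len(prompt):].strip()

    original_query = "What is the capital of France?"
    perturbed_query = "Waht is the capital of France?"
    print(generate_response(original_query))
    print(generate_response(perturbed_query))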

3. Generate Scores

  • Description: Provides tools for calculating hallucination scores by comparing responses to original and perturbed queries.
  • Files and Modules:
    • utils.py: Contains the scoring model used to compute the scores (see the sketch below).
    • main.py: Script to generate scores.
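
The scoring model actually loaded in utils.py is not specified in this README. As one illustrative sketch, assuming a sentence-transformers embedding model, the consistency between the two responses can be measured with cosine similarity, where a low score flags a potential hallucination:

    from sentence_transformers import SentenceTransformer, util

    # Assumed embedding model; utils.py may load a different scorer.
    scorer = SentenceTransformer("all-MiniLM-L6-v2")

    def consistency_score(original_response: str, perturbed_response: str) -> float:
        """Cosine similarity between the two responses' embeddings (1.0 = identical meaning)."""
        embeddings = scorer.encode([original_response, perturbed_response], convert_to_tensor=True)
        return util.cos_sim(embeddings[0], embeddings[1]).item()

    score = consistency_score(
        "The capital of France is Paris.",
        "Paris is the capital city of France.",
    )
    # A score well below a chosen threshold (for example 0.8) would flag a potential hallucination.
    print(f"consistency score: {score:.3f}")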

Usage Instructions

  1. Prepare the environment: Ensure you have the required dependencies installed by running:

    pip install -r requirements.txt
  2. Run the data preparation pipeline:

    python -m prepare_data.main
  3. Generate LLM responses:

    python -m generate_responses.main
  4. Calculate hallucination scores:

    python -m generate_scores.main

Contact

For any questions or discussions, please feel free to reach out to the authors:

Name                Email Address
Anirban Saha        [email protected]
Binay Gupta         [email protected]
Anirban Chatterjee  [email protected]
Kunal Banerjee      [email protected]
