GitHub - HarshitSamani/Natural-Language-Inference: In this repository, we deal with the task of implementing Natural Language Inferencing (NLI) using the SNLI dataset. Different methods such as BiLSTM, BiGRU, Attention models and Logistic Regression are experimented.

Natural Language Inference (NLI)

Natural Language Inference (NLI) using SNLI dataset.

tags : natural language inferencing, natural language processing, NLI, SNLI, MNLI, deep learning, tensorflow

About The Project

This project is an implementation of the task of Natural Language Inference (NLI). In this task, we are given two sentences called premise and hypothesis. We are supposed to determine whether the ”hypothesis” is true (entailment), false (contradiction), or undetermined (neutral) provided that the ”premise” is true. For this project, we have used the Stanford Natural Language Inference (SNLI) dataset. From the dataset, we use the files snli_1.0_train.jsonl for training the model and snli_1.0_test.jsonl for testing the model. From each entry in these files, we consider the fields corresponding to gold_label, sentence1 and sentence2. sentence1 serves as the premise and sentence2 serves as the hypothesis and gold_label serves as the relationship label. The following models were implemented and the performance was evaluated.

Logistic regression classifier using TF-IDF features
Deep model classifiers for text such as GRU, LSTM and SumEmbeddings

Models overview

Logistic regression

Logistic regression model was trained using TF-IDF (Term Frequency-Inverse Document Frequency) features obtained using sklearn python library. The feature vector used to train the model is obtained by concatenating the TF-IDF vectors of premise and hypothesis. The model is trained (fit) using L-BFGS (Limited memory - Broyden–Fletcher–Goldfarb–Shanno algorithm) solver with a maximum iteration limit of 1000. The trained (fit) model is saved at ./models/LR.pickle for future uses and testing. The model attains an accuracy of 63.38% and the results of prediction are written to a file at ./tfidf.txt.

Deep learning models

The first step towards implementing a deep model for text is to convert each atomic discrete entity in the input (words or characters) into real vectors from $ \mathbf{R}^{d} $ so that their semantics are captured meaningfully. For this purpose, GloVe embedding has been used in this project. Different pretrained GloVe embeddings are available and the embedding chosen for this project is the one with 6 billion tokens trained over Wikipedia corpus. Once the training is completed the model is stored at ./model/ directory as a h5 file.

BiGRU (Bidirectional Gated Reccurent Unit)

The model attains an accuracy of 78.58% and the output text file and plots are saves at ./results/BiGRU/.

BiLSTM (Bidirectional Long Short Term Memory)

The model attains an accuracy of 76.38% and the output text file and plots are saves at ./results/BiLSTM/.

SumEmbeddings

A SumEmbedding lambda layer, which sums up all the embedding vectors in the sentence is used in this model. The model attains an accuracy of 80.38% and the output text file and plots are saves at ./results/SumEmbeddings/. Since this is the best performing model the output text files are stored at ./deep_model.txt as well

Results

Model	Accuracy
Logistic regression	63.38%
BiGRU	78.58%
BiLSTM	76.38%
SumEmbeddings	80.38%

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
Deep model		Deep model
TFIDF		TFIDF
results		results
utils		utils
Deep_Learning_Report_3.pdf		Deep_Learning_Report_3.pdf
LICENSE		LICENSE
README.md		README.md
deep_model.txt		deep_model.txt
main.py		main.py
tfidf.txt		tfidf.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Natural Language Inference (NLI)

About The Project

Models overview

Logistic regression

Deep learning models

BiGRU (Bidirectional Gated Reccurent Unit)

BiLSTM (Bidirectional Long Short Term Memory)

SumEmbeddings

Results

About

Releases

Packages

Languages

License

HarshitSamani/Natural-Language-Inference

Folders and files

Latest commit

History

Repository files navigation

Natural Language Inference (NLI)

About The Project

Models overview

Logistic regression

Deep learning models

BiGRU (Bidirectional Gated Reccurent Unit)

BiLSTM (Bidirectional Long Short Term Memory)

SumEmbeddings

Results

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages