BNLP is a natural language processing toolkit for Bengali Language.
-
Updated
Sep 11, 2024 - Jupyter Notebook
BNLP is a natural language processing toolkit for Bengali Language.
This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accpeted in Findings of the Annual Conference of the North American Chap…
Fully Configurable RAG Pipeline for Bengali Language RAG Applications. Supports both Local and Huggingface Models, Built with Langchain.
Dataset for identifying potential hates (e.g., political, religious, personal, gender abusive, geopolitical, etc.) for under-resourced Bengali language.
Bengali transformer using transformers
Bangla Text Augmentation
Fine tuned llama 3 models for context based question answering in bengali language.
[AAAI 2021] - Simple or Complex? Learning to Predict Readability of Bengali Texts.
A Python package for translating emoji, emoticons into Bengali text for NLP tasks.
[EACL 2021] - Unsupervised Abstractive Summarization of Bengali Text Documents.
Machine Learning approach to Bengali Corpus POS (Parts of Speech) Tagging using BNLP (Bengali Natural Language Processing) Toolkit. This is the Minor Project Presentation at Heritage Institute of Technology under the mentorship of Prof. Sandipan Ganguly.
Bengali News Summarization - BengaliGPT & T5
This repository consists of Bengali Text-Visualization using Word2Vec Model. A mini project under the mentorship of Prof. Sandipan Ganguly, HIT-K.
Machine Learning approach to Bengali corpus NER- Named Entity Recognition using BNLP. A mini project under the mentorship of Prof. Sandipan Ganguly, HIT-K
Machine Learning approach to Bengali Corpus POS Tagging using BNLTK. This is an experimenting project under the mentorship of Prof. Sandipan Ganguly, HIT-K.
Bengali POS Tagging using Indian Corpus through NLTK. A sample testing to apply POS Tagging under the supervision of Prof. Sandipan Ganguly, HIT-K.
Fine-tune mBart 50 for Bengali Sentence Error Correction
Repository to track the state of the art research progress in Bengali natural language processing for most common task
AI project to detect abusive comments in social media.
Add a description, image, and links to the bengali-nlp topic page so that developers can more easily learn about it.
To associate your repository with the bengali-nlp topic, visit your repo's landing page and select "manage topics."