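A relatively intelligent, general-purpose data extraction library for news websites, implemented in Golang, that requires no rule maintenance. It includes components for domain probing, web page encoding and language detection, classified extraction of page links, extraction of news elements from pages, and extraction of news body text.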
Updated Aug 30, 2024 - Go
A fully customisable language detection pipeline for spaCy
✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and langua
Zabanshenas is a solution for identifying the most likely language of a piece of written text.
Personalized anime recommendations based on collaborative filtering. Discover your next favorite anime!
Natural language processing tools developed by the World Bank's DECAT unit. A suite of text preprocessing and cleaning algorithms for NLP analysis and modeling.
The goal of this repository is to build a comprehensive set of tools and examples that leverage recent advances in NLP algorithms, neural architectures, and distributed machine learning systems. The content is based on our past and potential future engagements with customers as well as collaboration with partners, researchers, and the open sourc…
A crawl of only the Bengali comments on cricket news from the Bangladeshi newspaper Prothom Alo.
An in-depth look at sentiment analysis that leverages BERT embeddings.
An AI model that automatically detects the language of a given text, backed by three pre-trained NLP models and one ML algorithm.
A voice assistant for a university reception desk that answers questions on certain administrative topics in both English and French.
Converse effortlessly in more than 50 languages!
Read an image and speak its text content using Python.
Flask-based API that provides a user-friendly interface to summarize text in any language supported by the BART model and then translate it to English.
These are my mini-projects that you may be interested in trying too... Enjoy!!
Detects the language of each tweet and adds a language column to the DataFrame to record it.
Detect the language of a given sentence.
A Julia package for language identification.
Plateforme de Connaissances Unifiées (PCU) project (i.e., Unified Knowledge Platform)