This project aims to parse telegram channels specified and find info you want in channel history.
- make channels list
- parse it daily
- watch for the info you want
- creation of locked channels, where you want to send notifications about findings
- You need a valid telegram account with tied phone number
- You cannot parse locked channels until you are not in them
- make telegram channels list
- make patterns list
- clone repositories
- make list of github repos (currently you can parse only specific repo but it is not hard to add some abstractions for common cases)
When you run the script firstly it would save the channels data in specific data directory which was specified in the config file. To parse only actual massages for each channel the message_offset is created - it is the id of the latest seen message. Output formats: csv or json. You can write simple function for history search if you want. The script returns findings via RocketChat or telegram. For RocketChat you need channel hook, where bot would send findings and server domain name, for telegram: api_id and api_hash you can read about it here, username, bot token and channel, where bot would send findings. The first run requires authentication of telegram user. After that the username.session would be created.
git clone https://github.com/St-elin/tgosint.git
python3 -m venv <venv>
source <venv>/bin/activate
cd tgosint
pip install -r requirements.txt
python tgosint.py