Skip to content

This is a web scraping tool designed to extract academic papers from various conference proceedings.

License

Notifications You must be signed in to change notification settings

yfChang-cv/PaperSpider

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PaperSpider

This is a web scraping tool designed to extract academic papers from various conference proceedings.

  • At present, papers from CVPR/ICCV/WACV can be crawled.
  • Other conferences work in process.
  • And multithreading feature is still under development.

Install

git clone https://github.com/yfChang-cv/PaperSpider.git
cd PaperSpider
pip install -r requirements.txt

Usage

if you need proxies, just replace set_req_old with set_req_proxies.
if you want to crawl papers from other conferences, simply modify these content:

CVPR2022

head = 'https://openaccess.thecvf.com'
url = "https://openaccess.thecvf.com/CVPR2022?day=all"
filename = 'cvpr2022.csv'

ICCV2021

head = 'https://openaccess.thecvf.com'
url = "https://openaccess.thecvf.com/ICCV2021?day=all"
filename = 'iccv2021.csv'

License

MIT © Richard McRichface.

About

This is a web scraping tool designed to extract academic papers from various conference proceedings.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages