8000 GitHub - xfsprogram/spider: 裁判文书网爬虫
[go: up one dir, main page]

Skip to content

xfsprogram/spider

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

中国裁判文书网爬虫

A spider for China Judgements Online

It is only used for personal study and technical exchange, and cannot be used for commercial purposes.

Overview

This is a spider for 中国裁判文书网.

Features

  • Support IP proxy
  • Support multiple processes
  • Support full crawling
  • Divide data according to decision time and province

Run

python spider.py -num_processes 1 -start_time 2016-1-2 -end_time 2016-1-2

Result

  • raw data

image

  • processed data

image

If you have any questions, please open an issue.

Welcome to pull requests to improve this project!

About

裁判文书网爬虫

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • JavaScript 83.6%
  • Python 16.4%
0