spider.py 0.5
Newer version available (0.44)
Released:
Multithreaded crawling, reporting, and mirroring for Web and FTP
Navigation
Unverified details
These details have not been verified by PyPIProject links
Meta
- License: BSD License (BSD)
- Author: L. C. Rees
- Maintainer: L. C. Rees
- Tags spider , robot , crawler , ftp crawler , ftp robot , ftp spider , web crawler , web robot , web spider , web-bot , link checker , bad link finder , site management , web reporting
Classifiers
- Development Status
- License
- Operating System
- Programming Language
- Topic
Project description
This module provides multithreaded crawling, reporting, and mirroring for Web and FTP in one convenient library. Crawling depth, maximum number of URLs to crawl, and maximum number of threads are user-configurable. Reports can be generated on external URLS, internal redirects to outside URLs, unparsable HTML, non-HTTP/FTP URLs, and broken links.
Project details
Unverified details
These details have not been verified by PyPIProject links
Meta
- License: BSD License (BSD)
- Author: L. C. Rees
- Maintainer: L. C. Rees
- Tags spider , robot , crawler , ftp crawler , ftp robot , ftp spider , web crawler , web robot , web spider , web-bot , link checker , bad link finder , site management , web reporting
Classifiers
- Development Status
- License
- Operating System
- Programming Language
- Topic