crawlib

tool set for crawler project.

These details have not been verified by PyPI

Project links

Development Status
- 4 - Beta
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Natural Language
- English
Operating System
Programming Language

Project description

https://circleci.com/gh/MacHu-GWU/crawlib-project.svg?style=svg

https://img.shields.io/pypi/v/crawlib.svg

https://img.shields.io/pypi/l/crawlib.svg

https://img.shields.io/pypi/pyversions/crawlib.svg

https://img.shields.io/badge/STAR_Me_on_GitHub!--None.svg?style=social

https://img.shields.io/badge/Link-Document-blue.svg

https://img.shields.io/badge/Link-API-blue.svg

https://img.shields.io/badge/Link-Source_Code-blue.svg

https://img.shields.io/badge/Link-Install-blue.svg

https://img.shields.io/badge/Link-GitHub-blue.svg

https://img.shields.io/badge/Link-Submit_Issue-blue.svg

https://img.shields.io/badge/Link-Request_Feature-blue.svg

https://img.shields.io/badge/Link-Download-blue.svg

Welcome to crawlib Documentation

crawlib is a board-first-search crawler framework for targeting-crawler (For those you know where’s your data located and how’s been organized). You just need to focus on the data model and html extraction logic, and let the framework do the rest of things like:

duplicate filter
recursive crawling
status tracking
periodical update

Currently it supports mongodb as backend storage only.

Install

crawlib is released on PyPI, so all you need is:

$ pip install crawlib

To upgrade to latest version:

$ pip install --upgrade crawlib

Project details

These details have not been verified by PyPI

Project links

Development Status
- 4 - Beta
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Natural Language
- English
Operating System
Programming Language

Release history Release notifications | RSS feed

This version

0.1.1

Dec 31, 2019

0.1.0

Dec 27, 2019

0.0.27

Dec 30, 2018

0.0.26

Sep 9, 2018

0.0.25

Sep 6, 2018

0.0.24

Sep 5, 2018

0.0.23

Aug 22, 2018

0.0.22

Aug 19, 2018

0.0.21

Aug 18, 2018

0.0.20

Aug 15, 2018

0.0.19

Aug 15, 2018

0.0.18

Aug 14, 2018

0.0.17

Aug 12, 2018

0.0.16

Mar 22, 2018

0.0.15

Jan 22, 2018

0.0.14

Nov 24, 2017

0.0.13

Nov 20, 2017

0.0.12

Nov 19, 2017

0.0.11

Nov 19, 2017

0.0.10

Nov 8, 2017

0.0.9

Nov 8, 2017

0.0.8

Oct 30, 2017

0.0.7

Oct 2, 2017

0.0.6

Apr 6, 2017

0.0.5

Feb 7, 2017

0.0.4

Feb 6, 2017

0.0.3

Feb 2, 2017

0.0.2

Sep 14, 2016

0.0.1

Aug 29, 2016

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

crawlib-0.1.1.tar.gz (84.0 kB view details)

Uploaded Dec 31, 2019 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

crawlib-0.1.1-py2.py3-none-any.whl (139.7 kB view details)

Uploaded Dec 31, 2019 Python 2Python 3

File details

Details for the file crawlib-0.1.1.tar.gz.

File metadata

Download URL: crawlib-0.1.1.tar.gz
Upload date: Dec 31, 2019
Size: 84.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2 requests-toolbelt/0.9.1 tqdm/4.41.0 CPython/3.6.2

File hashes

Hashes for crawlib-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`a61ea39ed1d11111abc055a03a2a185483f5accb7d932d26c6b504820eff349a`
MD5	`9f3359c9167452b4ad94ea14302cd020`
BLAKE2b-256	`6dc31b46b84930da81f3af7bf27bee4929b6b70ed31100c57aeff8a98c9f0906`

See more details on using hashes here.

File details

Details for the file crawlib-0.1.1-py2.py3-none-any.whl.

File metadata

Download URL: crawlib-0.1.1-py2.py3-none-any.whl
Upload date: Dec 31, 2019
Size: 139.7 kB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2 requests-toolbelt/0.9.1 tqdm/4.41.0 CPython/3.6.2

File hashes

Hashes for crawlib-0.1.1-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`aa7e4cc142929568bc70c81d07bca1bc3d52904d302115748e8feb2fcf9f8268`
MD5	`3642acf8b3fa3bb50799e1f6293876dd`
BLAKE2b-256	`0fc37b489bde627c27bbc549b64005295d68b1024936a3b3c51958f357c02478`

See more details on using hashes here.

crawlib 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Welcome to crawlib Documentation

Install

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes