Nasy Crawler Framework -- Never had such a pure crawler.
Project description
Table of Contents
Prologue
Never had such a pure crawler like this nacf.
Although I often write crawlers, I don’t like to use huge frameworks, such as scrapy, but prefer
simple requests+bs4 or more general requests_html. However, these two are inconvenient for a
crawler. E.g. Places, such as error retrying or parallel crawling, need to be handwritten by
myself. It is not very difficult to write it while writing too much can be tedious. Hence I
started writing this nacf (Nasy Crawler Framework), hoping to simplify some error retrying or
parallel writing of crawlers.
Packages
| Package | Version | Description |
|---|---|---|
| requests-html | 0.10.0 | HTML Parsing for Humans. |
| nalude | 0.3.0 | A standard module. Inspired by Haskell’s Prelude. |
Usage
see tests.
Development Process
DONE Http Functions
DONE Get
DONE Post
DONE Bugs
DONE Fix an error from inspect.Parameter which caused the function parallel down. :err:1:
NEXT Docs
NEXT Usage
Epoligue
History
Version 1.0.2
- Data:
- Changes: Update nalude.
Version 1.0.1
- Data:
- Changes: Update requests-html.
Version 1.0.0
- Data:
- Changes: Now, old HTTP methods (
getandpost) cannot accept multiple URLs. Instead, we can usegetsandposts. - Adds: -
nacf.htmlnacf.jsonnacf.getsnacf.posts
- Includes: -
nalude
Version 0.1.2
- Data:
- Fixed:
inspect.Parametererror in last version.
Version 0.1.1
- Data:
- Ignored: An error caused by
inspect.Parameter - Help Wanted: Can someone help me about the Parameter?
Version 0.1.0
- Date:
- Commemorate Version: First Version
- Basic Functions.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file nacf-1.0.3.tar.gz.
File metadata
- Download URL: nacf-1.0.3.tar.gz
- Upload date:
- Size: 17.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/0.12.15 CPython/3.7.3 Darwin/18.5.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
845ca6adb3a8d977b4f3f2e14e7a83d6fce1178188bb8b10b50e0140692c0d47
|
|
| MD5 |
d833d1b96815be57d5299feb99ad1bda
|
|
| BLAKE2b-256 |
b4479b65af0294d957bd0966fec0146e53397eb0df7059103ef4508dc1633f88
|
File details
Details for the file nacf-1.0.3-py3-none-any.whl.
File metadata
- Download URL: nacf-1.0.3-py3-none-any.whl
- Upload date:
- Size: 20.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/0.12.15 CPython/3.7.3 Darwin/18.5.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b6f54ecf39f80381885d9ea1b75d40bd85d0a347d95c1ae2cabbddbc0ba075c3
|
|
| MD5 |
a18820e408b1f2278169931fe78073f3
|
|
| BLAKE2b-256 |
42e69e3131f0f66272f9e734471c8820e49c9313be250e12cb145040d8f360f6
|