Nasy Crawler Framework -- Never had such a pure crawler.
Project description
Table of Contents
Prologue
Never had such a pure crawler like this nacf
.
Although I often write crawlers, I don’t like to use huge frameworks, such as scrapy, but prefer
simple requests+bs4
or more general requests_html
. However, these two are inconvenient for a
crawler. E.g. Places, such as error retrying or parallel crawling, need to be handwritten by
myself. It is not very difficult to write it while writing too much can be tedious. Hence I
started writing this nacf (Nasy Crawler Framework), hoping to simplify some error retrying or
parallel writing of crawlers.
Packages
Package | Version | Description |
---|---|---|
requests-html | 0.10.0 | HTML Parsing for Humans. |
nalude | 0.3.0 | A standard module. Inspired by Haskell’s Prelude. |
Usage
see tests.
Development Process
DONE Http Functions
DONE Get
DONE Post
DONE Bugs
DONE Fix an error from inspect.Parameter which caused the function parallel down. :err:1:
NEXT Docs
NEXT Usage
Epoligue
History
Version 1.0.2
- Data:
- Changes: Update nalude.
Version 1.0.1
- Data:
- Changes: Update requests-html.
Version 1.0.0
- Data:
- Changes: Now, old HTTP methods (
get
andpost
) cannot accept multiple URLs. Instead, we can usegets
andposts
. - Adds: -
nacf.html
nacf.json
nacf.gets
nacf.posts
- Includes: -
nalude
Version 0.1.2
- Data:
- Fixed:
inspect.Parameter
error in last version.
Version 0.1.1
- Data:
- Ignored: An error caused by
inspect.Parameter
- Help Wanted: Can someone help me about the Parameter?
Version 0.1.0
- Date:
- Commemorate Version: First Version
- Basic Functions.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file nacf-1.0.3.tar.gz
.
File metadata
- Download URL: nacf-1.0.3.tar.gz
- Upload date:
- Size: 17.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/0.12.15 CPython/3.7.3 Darwin/18.5.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 845ca6adb3a8d977b4f3f2e14e7a83d6fce1178188bb8b10b50e0140692c0d47 |
|
MD5 | d833d1b96815be57d5299feb99ad1bda |
|
BLAKE2b-256 | b4479b65af0294d957bd0966fec0146e53397eb0df7059103ef4508dc1633f88 |
File details
Details for the file nacf-1.0.3-py3-none-any.whl
.
File metadata
- Download URL: nacf-1.0.3-py3-none-any.whl
- Upload date:
- Size: 20.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/0.12.15 CPython/3.7.3 Darwin/18.5.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b6f54ecf39f80381885d9ea1b75d40bd85d0a347d95c1ae2cabbddbc0ba075c3 |
|
MD5 | a18820e408b1f2278169931fe78073f3 |
|
BLAKE2b-256 | 42e69e3131f0f66272f9e734471c8820e49c9313be250e12cb145040d8f360f6 |