Nasy Crawler Framework -- Never had such a pure crawler.
Table of Contents
- Development Process
Never had such a pure crawler as this
Although I often write crawlers, I don't like to use huge frameworks such as Scrapy. I prefer requests + bs4, or the more general requests_html. However, these two are inconvenient for a crawler: things such as error retrying or parallel crawling have to be written by hand. None of it is hard to write, but writing it over and over gets tedious. Hence I started writing nacf (Nasy Crawler Framework), hoping to simplify the error-retrying and parallel parts of writing crawlers.
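For concreteness, the hand-written version of that boilerplate looks roughly like the sketch below, using plain requests plus a thread pool; the URLs, retry count, and worker count are illustrative only.

```python
# Hand-written retrying and parallel fetching on top of plain requests --
# the kind of boilerplate nacf aims to take care of.  URLs and limits are examples.
from concurrent.futures import ThreadPoolExecutor

import requests

URLS = ["https://example.com/page/1", "https://example.com/page/2"]


def fetch(url: str, retries: int = 3) -> str:
    """Fetch one URL, retrying a few times on network errors."""
    for attempt in range(retries):
        try:
            res = requests.get(url, timeout=10)
            res.raise_for_status()
            return res.text
        except requests.RequestException:
            if attempt == retries - 1:  # out of retries, give up
                raise


# Parallel crawling also has to be wired up manually.
with ThreadPoolExecutor(max_workers=4) as pool:
    pages = list(pool.map(fetch, URLS))
```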
| Dependency | Version | Description |
|---|---|---|
| requests-html | 0.10.0 | HTML Parsing for Humans. |
| nalude | 0.3.0 | A standard module. Inspired by Haskell’s Prelude. |
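The table above lists nacf's direct dependencies. Since requests-html handles the actual fetching and parsing, a minimal requests-html session looks like this (the URL is illustrative only):

```python
# Plain requests-html usage, the library nacf builds on.
from requests_html import HTMLSession

session = HTMLSession()
res = session.get("https://example.com")  # illustrative URL

title = res.html.find("title", first=True)
print(title.text if title else "no <title> found")
print(res.html.links)  # set of all links found on the page
```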
DONE Http Functions
DONE Fix an error from inspect.Parameter which caused the parallel function to fail. :err:1:
- Changes: Update nalude.
- Changes: Update requests-html.
- Changes: Now, the old HTTP methods (e.g. post) cannot accept multiple URLs. Instead, we can use … (the single-URL style is sketched after this changelog).
- Adds: -
- Includes: -
- Ignored: An error caused by inspect.Parameter in the last version.
- Help Wanted: Can someone help me with the Parameter issue?
- Commemorate Version: First Version
- Basic Functions.
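To make the change note about multiple URLs easier to picture, here is a self-contained toy sketch of a decorator-style crawl function bound to exactly one URL. The `url` decorator below is written inline purely for illustration; it is not nacf's actual API.

```python
# Toy stand-in for a decorator-style crawler API; NOT nacf's actual API.
# It illustrates the "one URL per decorated function" style mentioned above.
from functools import wraps
from typing import Any, Callable

import requests


def url(target: str) -> Callable:
    """Bind a single target URL to a crawl function."""
    def decorator(func: Callable[[requests.Response], Any]) -> Callable[[], Any]:
        @wraps(func)
        def wrapper() -> Any:
            return func(requests.get(target, timeout=10))
        return wrapper
    return decorator


@url("https://httpbin.org/get")  # one URL only; a list of URLs is not accepted
def crawl(res: requests.Response) -> dict:
    """The crawl function receives a single response, not a list of them."""
    return res.json()


if __name__ == "__main__":
    print(crawl())
```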