Skip to main content

Nasy Crawler Framework -- Never had such a pure crawler.

Project description

Table of Contents

Prologue

Never had such a pure crawler like this nacf.

Although I often write crawlers, I don’t like to use huge frameworks, such as scrapy, but prefer simple requests+bs4 or more general requests_html. However, these two are inconvenient for a crawler. E.g. Places, such as error retrying or parallel crawling, need to be handwritten by myself. It is not very difficult to write it while writing too much can be tedious. Hence I started writing this nacf (Nasy Crawler Framework), hoping to simplify some error retrying or parallel writing of crawlers.

Packages

Table 1: Packages
Package Version Description
requests-html 0.10.0 HTML Parsing for Humans.
nalude 0.3.0 A standard module. Inspired by Haskell’s Prelude.

Usage

see tests.

Development Process

DONE Http Functions

CLOSED: <Thu Feb 28 20:51:00 2019>

DONE Get

CLOSED: <Tue Dec 25 17:36:00 2018>

DONE Post

CLOSED: <Thu Feb 28 20:44:00 2019>

DONE Bugs

CLOSED: <Thu Feb 28 20:51:00 2019>

DONE Fix an error from inspect.Parameter which caused the function parallel down. :err:1:

CLOSED: <Wed Dec 26 20:26:00 2018>

NEXT Docs

NEXT Usage

Epoligue

History

Version 1.0.2

  • Data: <Sun Mar 10, 2019>
  • Changes: Update nalude.

Version 1.0.1

  • Data: <Sun Mar 10, 2019>
  • Changes: Update requests-html.

Version 1.0.0

  • Data: <Thu Feb 28, 2019>
  • Changes: Now, old HTTP methods (get and post) cannot accept multiple URLs. Instead, we can use gets and posts.
  • Adds: - nacf.html
    • nacf.json
    • nacf.gets
    • nacf.posts
  • Includes: - nalude

Version 0.1.2

  • Data: <Wed Dec 26, 2018>
  • Fixed: inspect.Parameter error in last version.

Version 0.1.1

  • Data: <Wed Dec 26, 2018>
  • Ignored: An error caused by inspect.Parameter
  • Help Wanted: Can someone help me about the Parameter?

Version 0.1.0

  • Date: <Sun Dec 23, 2018>
  • Commemorate Version: First Version
    • Basic Functions.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nacf-1.0.3.tar.gz (17.9 kB view details)

Uploaded Source

Built Distribution

nacf-1.0.3-py3-none-any.whl (20.8 kB view details)

Uploaded Python 3

File details

Details for the file nacf-1.0.3.tar.gz.

File metadata

  • Download URL: nacf-1.0.3.tar.gz
  • Upload date:
  • Size: 17.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/0.12.15 CPython/3.7.3 Darwin/18.5.0

File hashes

Hashes for nacf-1.0.3.tar.gz
Algorithm Hash digest
SHA256 845ca6adb3a8d977b4f3f2e14e7a83d6fce1178188bb8b10b50e0140692c0d47
MD5 d833d1b96815be57d5299feb99ad1bda
BLAKE2b-256 b4479b65af0294d957bd0966fec0146e53397eb0df7059103ef4508dc1633f88

See more details on using hashes here.

File details

Details for the file nacf-1.0.3-py3-none-any.whl.

File metadata

  • Download URL: nacf-1.0.3-py3-none-any.whl
  • Upload date:
  • Size: 20.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/0.12.15 CPython/3.7.3 Darwin/18.5.0

File hashes

Hashes for nacf-1.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 b6f54ecf39f80381885d9ea1b75d40bd85d0a347d95c1ae2cabbddbc0ba075c3
MD5 a18820e408b1f2278169931fe78073f3
BLAKE2b-256 42e69e3131f0f66272f9e734471c8820e49c9313be250e12cb145040d8f360f6

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page