OceanMonkey

A High-Level Distributed Web Crawling and Web Scraping framework

Overview

OceanMonkey is a high-level distributed web crawling and web scraping framework, used to crawl websites and extract structured data from their pages. It can be used for a wide range of purposes, from data mining to monitoring and automated testing.

OceanMonkey was created and is maintained by chenzhengqiang (WeChat: Pretty-Style, blog: http://www.chipscoco.com), who started it in 2021 while teaching web scraping with Python.

Requirements

Python 3.5+
Works on Linux, Windows, macOS, BSD

Install

The quick way:

pip install oceanmonkey

Quick start

First, run monkeys startproject on the command line to create an OceanMonkey project, for example:

monkeys startproject BeBe

Then write your crawling logic in gibbons.py under the monkeys directory, and your storage logic in orangutans.py.
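The split between crawling logic and storage logic can be illustrated with a minimal sketch. This is not OceanMonkey's API: the classes and hooks that gibbons.py and orangutans.py actually expect are not described here, so the names below (TitleLinkParser, extract_item, store_item, the pages table) are hypothetical and use only the Python standard library. The sketch only shows the general idea of keeping extraction separate from persistence.

```python
import sqlite3
from html.parser import HTMLParser


class TitleLinkParser(HTMLParser):
    """Collects the page title and every href found in <a> tags."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.links = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def extract_item(url, html):
    """Crawling side (hypothetical): turn raw HTML into a structured dict."""
    parser = TitleLinkParser()
    parser.feed(html)
    return {"url": url, "title": parser.title.strip(), "links": parser.links}


def store_item(conn, item):
    """Storage side (hypothetical): persist one item; knows nothing about HTML."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS pages "
        "(url TEXT PRIMARY KEY, title TEXT, link_count INTEGER)"
    )
    conn.execute(
        "INSERT OR REPLACE INTO pages VALUES (?, ?, ?)",
        (item["url"], item["title"], len(item["links"])),
    )
    conn.commit()


if __name__ == "__main__":
    sample = (
        "<html><head><title> Demo </title></head>"
        "<body><a href='/a'>A</a><a href='/b'>B</a></body></html>"
    )
    conn = sqlite3.connect(":memory:")
    store_item(conn, extract_item("http://example.com/", sample))
    print(conn.execute("SELECT * FROM pages").fetchall())
```

Because the extraction function returns a plain dict and the storage function accepts one, either side can be replaced (a different parser, a different database) without touching the other.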

Finally, once your code is complete, execute monkeys run from the project directory:

cd BeBe

monkeys run

Project details

Release history

Version 1.0.0, released Jan 4, 2022

Download files

Source Distribution

OceanMonkey-1.0.0.tar.gz (20.2 kB), uploaded Jan 4, 2022

Hashes for OceanMonkey-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`1f4172d50233b15f960f170e01b967e3b99698f63d60e6ffba193740c0b1c682`
MD5	`1aae4da1ea613bbca4d4fd8c1cc6c15c`
BLAKE2b-256	`ec2f98db33c6ce77cbd2694e7bc07e403c2bc89abf2b448ed2d0606fd52f8d64`