
Scraping as simple as SELECT * FROM anything on the network

Project description

byteflows

Simple data workflows

Byteflows is a microframework that makes it easier to retrieve information from APIs and regular websites.

Unlike full-featured frameworks such as Scrapy or low-level libraries such as BeautifulSoup, Byteflows aims to be easy to use by unifying the information extraction process, while still offering a fairly wide range of functionality.

Why use Byteflows?

  • 🚀 Byteflows is built on top of asyncio and asynchronous libraries, which can significantly speed up I/O-bound code.

  • 🔁 With Byteflows, there is no need to rework the data scraping process for every project. From project to project, you keep a single, transparent architecture.

  • Byteflows allows you to route data to any backend: S3-like storage, a database, a network file system, a message broker or bus, etc.

  • ⚙️ Byteflows lets you choose how the data is handled: hold it in memory until a configured threshold is reached or send it to the backend immediately, and pre-process it or leave it as is.
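
The ideas in the list above (concurrent I/O, a pluggable backend, and buffering up to a threshold) can be illustrated with a minimal sketch in plain asyncio. This is not the Byteflows API: every name here (fake_fetch, Buffer, scrape, memory_sink, max_bytes) is hypothetical, and the network request is simulated with asyncio.sleep so that the example runs offline.

```python
import asyncio
from typing import Awaitable, Callable

# Hypothetical names throughout; this is NOT the Byteflows API.
Sink = Callable[[list[bytes]], Awaitable[None]]


async def fake_fetch(url: str) -> bytes:
    """Stand-in for an HTTP request; sleeps instead of doing real I/O."""
    await asyncio.sleep(0.01)
    return f"payload from {url}".encode()


class Buffer:
    """Hold payloads in memory and flush them to a sink once a size limit is hit."""

    def __init__(self, sink: Sink, max_bytes: int) -> None:
        self.sink = sink
        self.max_bytes = max_bytes
        self.items: list[bytes] = []
        self.size = 0

    async def add(self, payload: bytes) -> None:
        self.items.append(payload)
        self.size += len(payload)
        if self.size >= self.max_bytes:
            await self.flush()

    async def flush(self) -> None:
        if self.items:
            batch, self.items, self.size = self.items, [], 0
            await self.sink(batch)


async def scrape(urls: list[str], sink: Sink, max_bytes: int) -> None:
    buffer = Buffer(sink, max_bytes)
    # The simulated requests run concurrently rather than one after another.
    payloads = await asyncio.gather(*(fake_fetch(u) for u in urls))
    for payload in payloads:
        await buffer.add(payload)
    await buffer.flush()  # send whatever is left below the threshold


async def demo() -> list[list[bytes]]:
    batches: list[list[bytes]] = []

    async def memory_sink(batch: list[bytes]) -> None:
        # A real sink might write to S3, a database, or a message broker.
        batches.append(batch)

    urls = [f"https://example.com/{i}" for i in range(5)]
    await scrape(urls, memory_sink, max_bytes=60)
    return batches


if __name__ == "__main__":
    for batch in asyncio.run(demo()):
        print(len(batch), "payload(s) flushed")
```

Setting max_bytes to 0 in this sketch makes every payload flush immediately, which corresponds to the "send it to the backend immediately" option; a larger value trades memory for fewer writes.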

Installation

Installation is as simple as:

pip install byteflows

Dependencies

Byteflows has the following core dependencies:

  • aiohttp
  • aioitertools
  • fsspec
  • more-itertools
  • regex
  • uvloop (for Unix platforms)
  • yarl
  • dateparser
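
Because uvloop is listed for Unix platforms only, code that should run everywhere typically falls back to the default asyncio event loop when uvloop is missing. The following is a minimal sketch of that pattern under stated assumptions; run_with_best_loop and main are hypothetical names, and this is not necessarily how Byteflows selects its event loop.

```python
import asyncio
import sys
from typing import Any, Coroutine


def run_with_best_loop(coro: Coroutine[Any, Any, Any]) -> Any:
    """Run a coroutine on uvloop when it is available, else on the default loop."""
    uvloop = None
    if sys.platform != "win32":
        try:
            import uvloop  # optional speed-up; not available on Windows
        except ImportError:
            uvloop = None
    if uvloop is not None:
        loop = uvloop.new_event_loop()
        try:
            return loop.run_until_complete(coro)
        finally:
            loop.close()
    # Fallback: the standard asyncio event loop.
    return asyncio.run(coro)


async def main() -> str:
    await asyncio.sleep(0)
    return "done"


if __name__ == "__main__":
    print(run_with_best_loop(main()))
```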

More information about the project

You can learn more about Byteflows in the project documentation, including the API and Tutorial sections. Changes are tracked in the Changelog section.

Project status

Byteflows is currently an early alpha project with an unstable API and limited functionality. Using it in production is strongly discouraged.

Download files


Source Distribution

byteflows-0.2.1a1.tar.gz (42.3 kB)

Uploaded Source

Built Distribution

byteflows-0.2.1a1-py3-none-any.whl (52.5 kB)

Uploaded Python 3

File details

Details for the file byteflows-0.2.1a1.tar.gz.

File metadata

  • Download URL: byteflows-0.2.1a1.tar.gz
  • Upload date:
  • Size: 42.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: pdm/2.18.1 CPython/3.11.7 Windows/10

File hashes

Hashes for byteflows-0.2.1a1.tar.gz
Algorithm Hash digest
SHA256 eab08bfa2f28dc22b6cfa366195580d074f67e0b78b92762c4fd0bc96c2e2bbd
MD5 96c3eb416329a8456e6cbcaa39c86fd7
BLAKE2b-256 037bc9f4e7c59326b93bc9a382a91d9b6716d12979dfaf2098ea07b99fc3a598


File details

Details for the file byteflows-0.2.1a1-py3-none-any.whl.

File metadata

  • Download URL: byteflows-0.2.1a1-py3-none-any.whl
  • Upload date:
  • Size: 52.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: pdm/2.18.1 CPython/3.11.7 Windows/10

File hashes

Hashes for byteflows-0.2.1a1-py3-none-any.whl
Algorithm Hash digest
SHA256 4ff6bae3ee7bfcf7a4ba43c856612e8fe8a96dd049e30bdc6d1cbc0e707223b9
MD5 6390fd1238d64dde37ba37b654f261c0
BLAKE2b-256 f714b6356f6f36ffa7b8473c01fd897753ec46290d68ea8abfa50bf5774f26dc

