Simple scrape as SELECT * FROM ANYTHING in network
Project description
Simple data workflows
Byteflows is a microframework that makes it easier to retrieve information from APIs and regular websites.
Byteflows, unlike complex projects like Scrapy or simple libraries like BeautifulSoup, is extremely easy to use due to the unification of the information extraction process and at the same time has quite a wide range of functionality.
Why use Byteflows?
-
🚀 Byteflows is built on top of asyncio and asynchronous libraries, which significantly speeds up your code in the context of I/O operations.
-
🔁 With Byteflows, there is no need to continuously customize the data scraping process. From project to project, you will have a single, transparent architecture.
-
Byteflows allows you to route data to any backend: s3-like storage, database, network file system, broker/message bus, etc.
-
⚙️ Byteflows allows the user to choose what to do with the data: hold it in memory until a certain critical value accumulates, or immediately send it to the backend, perform pre-processing, or leave it as is.
Installation
Installation is as simple as:
pip install byteflows
Dependencies
The list of core Byteflows dependencies is represented by the following libraries:
- aiohttp
- aioitertools
- fsspec
- more-itertools
- regex
- uvloop (for Unix platforms)
- yarl
- dateparser
More information about the project
You can learn more about Byteflows in the project documentation, including the API and Tutorial sections. Changes can be monitored in the Changelog section.
Project status
Byteflows is currently a deep alpha project with an unstable API and limited functionality. Its use in production is strictly not recommended.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file byteflows-0.2.1a1.tar.gz
.
File metadata
- Download URL: byteflows-0.2.1a1.tar.gz
- Upload date:
- Size: 42.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: pdm/2.18.1 CPython/3.11.7 Windows/10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | eab08bfa2f28dc22b6cfa366195580d074f67e0b78b92762c4fd0bc96c2e2bbd |
|
MD5 | 96c3eb416329a8456e6cbcaa39c86fd7 |
|
BLAKE2b-256 | 037bc9f4e7c59326b93bc9a382a91d9b6716d12979dfaf2098ea07b99fc3a598 |
File details
Details for the file byteflows-0.2.1a1-py3-none-any.whl
.
File metadata
- Download URL: byteflows-0.2.1a1-py3-none-any.whl
- Upload date:
- Size: 52.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: pdm/2.18.1 CPython/3.11.7 Windows/10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4ff6bae3ee7bfcf7a4ba43c856612e8fe8a96dd049e30bdc6d1cbc0e707223b9 |
|
MD5 | 6390fd1238d64dde37ba37b654f261c0 |
|
BLAKE2b-256 | f714b6356f6f36ffa7b8473c01fd897753ec46290d68ea8abfa50bf5774f26dc |