Python SDK for Crawlab
Project description
Crawlab Python SDK
Python SDK for Crawlab
Installation
pip install crawlab-sdk
Usage
CLI
crawlab-cli
Scrapy Integration
In settings.py
, add the following:
ITEM_PIPELINES = {
'crawlab.CrawlabPipeline': 300
}
Save Scraped Items
from crawlab import save_item
scraped_items = [
{
'name': 'item1',
'value': 'value1'
},
{
'name': 'item2',
'value': 'value2'
}
]
for item in scraped_items:
save_item(item)
Development
Pre-requisites
pip install poetry
Install dependencies
poetry install
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
crawlab_sdk-0.7.0.tar.gz
(7.5 kB
view details)
Built Distribution
File details
Details for the file crawlab_sdk-0.7.0.tar.gz
.
File metadata
- Download URL: crawlab_sdk-0.7.0.tar.gz
- Upload date:
- Size: 7.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 973c559311d5ed7ed40e48de55540cd69ff38c43c21737039fb0295bbfbe1e37 |
|
MD5 | bff48eded1dc6765c15c9967bc110344 |
|
BLAKE2b-256 | 38b8eef82a08ab193017626f24aa7c965a7dc17b64727ad1e5bc3a2d2d208d1f |
Provenance
The following attestation bundles were made for crawlab_sdk-0.7.0.tar.gz
:
Publisher:
publish_python.yaml
on crawlab-team/crawlab-python-sdk
-
Statement:
- Statement type:
https://in-toto.io/Statement/v1
- Predicate type:
https://docs.pypi.org/attestations/publish/v1
- Subject name:
crawlab_sdk-0.7.0.tar.gz
- Subject digest:
973c559311d5ed7ed40e48de55540cd69ff38c43c21737039fb0295bbfbe1e37
- Sigstore transparency entry: 171230599
- Sigstore integration time:
- Permalink:
crawlab-team/crawlab-python-sdk@ffa57a9089dd9596589c6344a152bab5ac9482cd
- Branch / Tag:
refs/heads/main
- Owner: https://github.com/crawlab-team
- Access:
public
- Token Issuer:
https://token.actions.githubusercontent.com
- Runner Environment:
github-hosted
- Publication workflow:
publish_python.yaml@ffa57a9089dd9596589c6344a152bab5ac9482cd
- Trigger Event:
push
- Statement type:
File details
Details for the file crawlab_sdk-0.7.0-py3-none-any.whl
.
File metadata
- Download URL: crawlab_sdk-0.7.0-py3-none-any.whl
- Upload date:
- Size: 11.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6ff4a31cec73bfa38f98953c9166df950e3c80ae9212c4cca2116517d81a4bcf |
|
MD5 | e421d255c03be8bf02891057b398a9e5 |
|
BLAKE2b-256 | e1a528016336f2a696996a7d0738c218d89e85206bafda6328c699e28a3a74c7 |
Provenance
The following attestation bundles were made for crawlab_sdk-0.7.0-py3-none-any.whl
:
Publisher:
publish_python.yaml
on crawlab-team/crawlab-python-sdk
-
Statement:
- Statement type:
https://in-toto.io/Statement/v1
- Predicate type:
https://docs.pypi.org/attestations/publish/v1
- Subject name:
crawlab_sdk-0.7.0-py3-none-any.whl
- Subject digest:
6ff4a31cec73bfa38f98953c9166df950e3c80ae9212c4cca2116517d81a4bcf
- Sigstore transparency entry: 171230600
- Sigstore integration time:
- Permalink:
crawlab-team/crawlab-python-sdk@ffa57a9089dd9596589c6344a152bab5ac9482cd
- Branch / Tag:
refs/heads/main
- Owner: https://github.com/crawlab-team
- Access:
public
- Token Issuer:
https://token.actions.githubusercontent.com
- Runner Environment:
github-hosted
- Publication workflow:
publish_python.yaml@ffa57a9089dd9596589c6344a152bab5ac9482cd
- Trigger Event:
push
- Statement type: