ASIMOV module for data import powered by the Bright Data web data platform.
Project description
ASIMOV Bright Data Module
ASIMOV module for data import powered by the Bright Data web data platform.
✨ Features
- Imports structured data from Airbnb, Amazon, Crunchbase, eBay, Facebook, Google, Indeed, Instagram, LinkedIn, Walmart, X (aka Twitter), Yahoo, and YouTube.
- Collects the raw JSON data via the Bright Data API (requires an API key).
- Constructs a semantic knowledge graph based on the KNOW ontology.
- Supports plain JSON output as well as RDF output in the form of JSON-LD.
- Distributed as a standalone static binary with zero runtime dependencies.
🛠️ Prerequisites
- Rust 1.85+ (2024 edition) if building from source code
⬇️ Installation
Installation from PyPI
pip install -U asimov-brightdata-module
Installation from RubyGems
gem install asimov-brightdata-module
Installation from NPM
npm install -g asimov-brightdata-module
Installation from Source Code
cargo install asimov-brightdata-module
👉 Examples
export BRIGHTDATA_API_KEY="..."
Fetching X Profiles
asimov-brightdata-fetcher https://x.com/bright_init # JSON
asimov-brightdata-importer https://x.com/bright_init # JSON-LD
Fetching LinkedIn Profiles
asimov-brightdata-fetcher https://www.linkedin.com/in/orlenchner/
asimov-brightdata-fetcher https://www.linkedin.com/company/bright-data/
Fetching Crunchbase Profiles
asimov-brightdata-fetcher https://www.crunchbase.com/organization/brightdata
Fetching Amazon Products
asimov-brightdata-fetcher https://www.amazon.com/Master-Algorithm-Ultimate-Learning-Machine/dp/0465094279
⚙ Configuration
Environment Variables
BRIGHTDATA_API_KEY: (required) the Bright Data API key to use
📚 Reference
Installed Binaries
asimov-brightdata-cataloger: discovers entities via the Bright Data API (not implemented yet)asimov-brightdata-fetcher: collects JSON data from the Bright Data APIasimov-brightdata-importer: collects and transforms JSON into JSON-LD (not implemented yet)
Supported Datasets
| Dataset | URL Prefix | JSON | RDF |
|---|---|---|---|
| Airbnb | https://www.airbnb.com/rooms/ |
✅ | 🚧 |
| Amazon | https://www.amazon.com/ |
✅ | 🚧 |
https://www.amazon.com/sp?seller= |
✅ | 🚧 | |
| Crunchbase | https://www.crunchbase.com/organization/ |
✅ | 🚧 |
| eBay | https://www.ebay.com/itm/ |
✅ | 🚧 |
https://www.facebook.com/events/ |
✅ | 🚧 | |
https://www.facebook.com/groups/ |
✅ | 🚧 | |
https://www.facebook.com/marketplace/item/ |
✅ | 🚧 | |
https://www.facebook.com/share/p/ |
✅ | 🚧 | |
https://www.google.com/shopping/product/ |
✅ | 🚧 | |
| Indeed | https://www.indeed.com/cmp/ |
✅ | 🚧 |
https://www.instagram.com/ |
✅ | 🚧 | |
https://www.instagram.com/p/ |
✅ | 🚧 | |
https://www.instagram.com/reel/ |
✅ | 🚧 | |
https://www.linkedin.com/company/ |
✅ | 🚧 | |
https://www.linkedin.com/in/ |
✅ | 🚧 | |
https://www.linkedin.com/jobs/ |
✅ | 🚧 | |
https://www.linkedin.com/posts/ |
✅ | 🚧 | |
https://www.linkedin.com/pulse/ |
✅ | 🚧 | |
| Walmart | https://www.walmart.com/global/seller/ |
✅ | 🚧 |
https://www.walmart.com/ip/ |
✅ | 🚧 | |
| X (Twitter) | https://x.com/ |
✅ | ✅ |
| Yahoo | https://finance.yahoo.com/quote/ |
✅ | 🚧 |
| YouTube | https://www.youtube.com/@ |
✅ | 🚧 |
https://www.youtube.com/watch?v= |
✅ | 🚧 | |
👨💻 Development
git clone https://github.com/asimov-modules/asimov-brightdata-module.git
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file asimov_brightdata_module-0.0.3-py3-none-win_amd64.whl.
File metadata
- Download URL: asimov_brightdata_module-0.0.3-py3-none-win_amd64.whl
- Upload date:
- Size: 2.6 MB
- Tags: Python 3, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f67ebb84b94cd1dcef0bbccfa1d7f4843048534cbb7f1de1bf4a8643a5065673
|
|
| MD5 |
aeac43603e7fc36c475fadc2ee326950
|
|
| BLAKE2b-256 |
183f32a79a69a914e865f334380aecc3b7c427409e2d23007911057ee4cda522
|
File details
Details for the file asimov_brightdata_module-0.0.3-py3-none-musllinux_1_1_x86_64.whl.
File metadata
- Download URL: asimov_brightdata_module-0.0.3-py3-none-musllinux_1_1_x86_64.whl
- Upload date:
- Size: 2.9 MB
- Tags: Python 3, musllinux: musl 1.1+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fd2e37907d3aa3033c50e9a51b447c8a2236c58ac3e7628b116ad1682245f058
|
|
| MD5 |
23e37a2829da3eabe3b406217db04cee
|
|
| BLAKE2b-256 |
7b1533138a97f6a4a89b5c893c52566c7384f161490ba16e492aaab3e629fd96
|
File details
Details for the file asimov_brightdata_module-0.0.3-py3-none-musllinux_1_1_aarch64.whl.
File metadata
- Download URL: asimov_brightdata_module-0.0.3-py3-none-musllinux_1_1_aarch64.whl
- Upload date:
- Size: 2.7 MB
- Tags: Python 3, musllinux: musl 1.1+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fc5c7fbb258e88f5518fd3d017bc5773daad670e2186cd1bfd8b0b914e3e6af3
|
|
| MD5 |
f3f467c2badd45260c5f8cde056347c3
|
|
| BLAKE2b-256 |
44d31f7a80a739da164dbcd2a1a870c4013135bac0a789c73803ba195a1bb6a0
|
File details
Details for the file asimov_brightdata_module-0.0.3-py3-none-manylinux_2_28_x86_64.whl.
File metadata
- Download URL: asimov_brightdata_module-0.0.3-py3-none-manylinux_2_28_x86_64.whl
- Upload date:
- Size: 2.9 MB
- Tags: Python 3, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b34663ebd366e332debb87b0ad1a19dabb178cb0b2ec31e0118eb9f4862d7312
|
|
| MD5 |
950622644fb5ca6cac55cc2729980f0a
|
|
| BLAKE2b-256 |
d913c95038a46ff1ddd7145cb26c3dec2863de22d4668ef3bbbb307d3bbe464b
|
File details
Details for the file asimov_brightdata_module-0.0.3-py3-none-manylinux_2_28_aarch64.whl.
File metadata
- Download URL: asimov_brightdata_module-0.0.3-py3-none-manylinux_2_28_aarch64.whl
- Upload date:
- Size: 2.7 MB
- Tags: Python 3, manylinux: glibc 2.28+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2bb497cfe63b9093a6efdfebc657a50db56f648968fe8b7a2de75de1c578c5d5
|
|
| MD5 |
cbf9eec388b13d551eece7ac826c8417
|
|
| BLAKE2b-256 |
8d588a0d51e3df143ac9a7bc499be2d623b8b10b71a7eb3ef016a863deb5b3ad
|
File details
Details for the file asimov_brightdata_module-0.0.3-py3-none-macosx_12_0_x86_64.whl.
File metadata
- Download URL: asimov_brightdata_module-0.0.3-py3-none-macosx_12_0_x86_64.whl
- Upload date:
- Size: 2.8 MB
- Tags: Python 3, macOS 12.0+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a76df59242697d16275abc34355317845b877428663a6494de1b760141ac56ab
|
|
| MD5 |
679d70ee8bc053a43c6ced80d5699748
|
|
| BLAKE2b-256 |
1ffc92633d6e1ebd3240a32e90d9f2e7f322ced44fda0d824d5e83cf96aee753
|
File details
Details for the file asimov_brightdata_module-0.0.3-py3-none-macosx_12_0_arm64.whl.
File metadata
- Download URL: asimov_brightdata_module-0.0.3-py3-none-macosx_12_0_arm64.whl
- Upload date:
- Size: 2.7 MB
- Tags: Python 3, macOS 12.0+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f95871cbb75b2152b9b365149c2f5733740ca810e0c5077e151da1aef79f29f0
|
|
| MD5 |
2df5e106953e342e5c69325fa95b895f
|
|
| BLAKE2b-256 |
a32a9a4507d74c9aafd8b0702a9a0337721b2ac7ef44737a13589f9901d42a01
|