Skip to main content

ASIMOV module for data import powered by the Bright Data web data platform.

Project description

ASIMOV Bright Data Module

License Package on Crates.io Package on PyPI Package on RubyGems Package on NPM

ASIMOV module for data import powered by the Bright Data web data platform.

✨ Features

  • Imports structured data from Airbnb, Amazon, Crunchbase, eBay, Facebook, Google, Indeed, Instagram, LinkedIn, Walmart, X (aka Twitter), Yahoo, and YouTube.
  • Collects the raw JSON data via the Bright Data API (requires an API key).
  • Constructs a semantic knowledge graph based on the KNOW ontology.
  • Supports plain JSON output as well as RDF output in the form of JSON-LD.
  • Distributed as a standalone static binary with zero runtime dependencies.

🛠️ Prerequisites

  • Rust 1.85+ (2024 edition) if building from source code

⬇️ Installation

Installation from PyPI

pip install -U asimov-brightdata-module

Installation from RubyGems

gem install asimov-brightdata-module

Installation from NPM

npm install -g asimov-brightdata-module

Installation from Source Code

cargo install asimov-brightdata-module

👉 Examples

export BRIGHTDATA_API_KEY="..."

Fetching X Profiles

asimov-brightdata-fetcher https://x.com/bright_init   # JSON
asimov-brightdata-importer https://x.com/bright_init  # JSON-LD

Fetching LinkedIn Profiles

asimov-brightdata-fetcher https://www.linkedin.com/in/orlenchner/
asimov-brightdata-fetcher https://www.linkedin.com/company/bright-data/

Fetching Crunchbase Profiles

asimov-brightdata-fetcher https://www.crunchbase.com/organization/brightdata

Fetching Amazon Products

asimov-brightdata-fetcher https://www.amazon.com/Master-Algorithm-Ultimate-Learning-Machine/dp/0465094279

⚙ Configuration

Environment Variables

📚 Reference

Installed Binaries

  • asimov-brightdata-cataloger: discovers entities via the Bright Data API (not implemented yet)
  • asimov-brightdata-fetcher: collects JSON data from the Bright Data API
  • asimov-brightdata-importer: collects and transforms JSON into JSON-LD (not implemented yet)

Supported Datasets

Dataset URL Prefix JSON RDF
Airbnb https://www.airbnb.com/rooms/ 🚧
Amazon https://www.amazon.com/ 🚧
  https://www.amazon.com/sp?seller= 🚧
Crunchbase https://www.crunchbase.com/organization/ 🚧
eBay https://www.ebay.com/itm/ 🚧
Facebook https://www.facebook.com/events/ 🚧
  https://www.facebook.com/groups/ 🚧
  https://www.facebook.com/marketplace/item/ 🚧
  https://www.facebook.com/share/p/ 🚧
Google https://www.google.com/shopping/product/ 🚧
Indeed https://www.indeed.com/cmp/ 🚧
Instagram https://www.instagram.com/ 🚧
  https://www.instagram.com/p/ 🚧
  https://www.instagram.com/reel/ 🚧
LinkedIn https://www.linkedin.com/company/ 🚧
  https://www.linkedin.com/in/ 🚧
  https://www.linkedin.com/jobs/ 🚧
  https://www.linkedin.com/posts/ 🚧
  https://www.linkedin.com/pulse/ 🚧
Walmart https://www.walmart.com/global/seller/ 🚧
  https://www.walmart.com/ip/ 🚧
X (Twitter) https://x.com/
Yahoo https://finance.yahoo.com/quote/ 🚧
YouTube https://www.youtube.com/@ 🚧
  https://www.youtube.com/watch?v= 🚧

👨‍💻 Development

git clone https://github.com/asimov-modules/asimov-brightdata-module.git

Share on X Share on Reddit Share on Hacker News Share on Facebook Share on LinkedIn

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

asimov_brightdata_module-0.0.3-py3-none-win_amd64.whl (2.6 MB view details)

Uploaded Python 3Windows x86-64

asimov_brightdata_module-0.0.3-py3-none-musllinux_1_1_x86_64.whl (2.9 MB view details)

Uploaded Python 3musllinux: musl 1.1+ x86-64

asimov_brightdata_module-0.0.3-py3-none-musllinux_1_1_aarch64.whl (2.7 MB view details)

Uploaded Python 3musllinux: musl 1.1+ ARM64

asimov_brightdata_module-0.0.3-py3-none-manylinux_2_28_x86_64.whl (2.9 MB view details)

Uploaded Python 3manylinux: glibc 2.28+ x86-64

asimov_brightdata_module-0.0.3-py3-none-manylinux_2_28_aarch64.whl (2.7 MB view details)

Uploaded Python 3manylinux: glibc 2.28+ ARM64

asimov_brightdata_module-0.0.3-py3-none-macosx_12_0_x86_64.whl (2.8 MB view details)

Uploaded Python 3macOS 12.0+ x86-64

asimov_brightdata_module-0.0.3-py3-none-macosx_12_0_arm64.whl (2.7 MB view details)

Uploaded Python 3macOS 12.0+ ARM64

File details

Details for the file asimov_brightdata_module-0.0.3-py3-none-win_amd64.whl.

File metadata

File hashes

Hashes for asimov_brightdata_module-0.0.3-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 f67ebb84b94cd1dcef0bbccfa1d7f4843048534cbb7f1de1bf4a8643a5065673
MD5 aeac43603e7fc36c475fadc2ee326950
BLAKE2b-256 183f32a79a69a914e865f334380aecc3b7c427409e2d23007911057ee4cda522

See more details on using hashes here.

File details

Details for the file asimov_brightdata_module-0.0.3-py3-none-musllinux_1_1_x86_64.whl.

File metadata

File hashes

Hashes for asimov_brightdata_module-0.0.3-py3-none-musllinux_1_1_x86_64.whl
Algorithm Hash digest
SHA256 fd2e37907d3aa3033c50e9a51b447c8a2236c58ac3e7628b116ad1682245f058
MD5 23e37a2829da3eabe3b406217db04cee
BLAKE2b-256 7b1533138a97f6a4a89b5c893c52566c7384f161490ba16e492aaab3e629fd96

See more details on using hashes here.

File details

Details for the file asimov_brightdata_module-0.0.3-py3-none-musllinux_1_1_aarch64.whl.

File metadata

File hashes

Hashes for asimov_brightdata_module-0.0.3-py3-none-musllinux_1_1_aarch64.whl
Algorithm Hash digest
SHA256 fc5c7fbb258e88f5518fd3d017bc5773daad670e2186cd1bfd8b0b914e3e6af3
MD5 f3f467c2badd45260c5f8cde056347c3
BLAKE2b-256 44d31f7a80a739da164dbcd2a1a870c4013135bac0a789c73803ba195a1bb6a0

See more details on using hashes here.

File details

Details for the file asimov_brightdata_module-0.0.3-py3-none-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for asimov_brightdata_module-0.0.3-py3-none-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 b34663ebd366e332debb87b0ad1a19dabb178cb0b2ec31e0118eb9f4862d7312
MD5 950622644fb5ca6cac55cc2729980f0a
BLAKE2b-256 d913c95038a46ff1ddd7145cb26c3dec2863de22d4668ef3bbbb307d3bbe464b

See more details on using hashes here.

File details

Details for the file asimov_brightdata_module-0.0.3-py3-none-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for asimov_brightdata_module-0.0.3-py3-none-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 2bb497cfe63b9093a6efdfebc657a50db56f648968fe8b7a2de75de1c578c5d5
MD5 cbf9eec388b13d551eece7ac826c8417
BLAKE2b-256 8d588a0d51e3df143ac9a7bc499be2d623b8b10b71a7eb3ef016a863deb5b3ad

See more details on using hashes here.

File details

Details for the file asimov_brightdata_module-0.0.3-py3-none-macosx_12_0_x86_64.whl.

File metadata

File hashes

Hashes for asimov_brightdata_module-0.0.3-py3-none-macosx_12_0_x86_64.whl
Algorithm Hash digest
SHA256 a76df59242697d16275abc34355317845b877428663a6494de1b760141ac56ab
MD5 679d70ee8bc053a43c6ced80d5699748
BLAKE2b-256 1ffc92633d6e1ebd3240a32e90d9f2e7f322ced44fda0d824d5e83cf96aee753

See more details on using hashes here.

File details

Details for the file asimov_brightdata_module-0.0.3-py3-none-macosx_12_0_arm64.whl.

File metadata

File hashes

Hashes for asimov_brightdata_module-0.0.3-py3-none-macosx_12_0_arm64.whl
Algorithm Hash digest
SHA256 f95871cbb75b2152b9b365149c2f5733740ca810e0c5077e151da1aef79f29f0
MD5 2df5e106953e342e5c69325fa95b895f
BLAKE2b-256 a32a9a4507d74c9aafd8b0702a9a0337721b2ac7ef44737a13589f9901d42a01

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page