Skip to main content

Quickly extract metadata from URLs

Project description

RapidUnfurl

RapidUnfurl is a Python library designed to pull and process metadata very quickly to unfurl URL contents into a JSON object that can the be used by other programs for portraying that data, similar to how link expansion works in apps like Slack.

This library was originally forked from Loftie Ellis' pyunfurl library, which is an awesome project. I just wanted to do some things to speed up the process, and drop away the html rendering, which I didn't need.

Features

Installation

Use the package manager pip to install pyunfurl.

pip install rapidunfurl

Usage

import rapidunfurl
rapidunfurl.unfurl('https://davintaddeo.com') 

This will return a dict similar to the oembed spec:

{
  "type": "website",
  "url": "https://davintaddeo.com",
  "title": "Davin Taddeo | DevOps Advocate",
  "site_name": "@tdarwin",
  "description": "Homepage of Davin Taddeo, DevOps Advocate, Senior Customer Architect for Chef",
  "image": "https://davintaddeo.com/assets/images/round_headshot.png",
  "card": "summary",
  "favicon": "https://davintaddeo.com/favicon.ico"
}

Contributing

Pull requests are welcome. RapidUnfurl supports some custom integrations for sites that doesnt return any meta tags, if you want to improve the integration for a specific site you can look at the hackernews example.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rapidunfurl-0.1.1.tar.gz (16.4 kB view details)

Uploaded Source

Built Distribution

rapidunfurl-0.1.1-py3-none-any.whl (15.9 kB view details)

Uploaded Python 3

File details

Details for the file rapidunfurl-0.1.1.tar.gz.

File metadata

  • Download URL: rapidunfurl-0.1.1.tar.gz
  • Upload date:
  • Size: 16.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.10.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10

File hashes

Hashes for rapidunfurl-0.1.1.tar.gz
Algorithm Hash digest
SHA256 cea35cbd39db7a78d8a3ee0dbfc35e8e5a55d6d14e7e464731e0d778a61023a5
MD5 e4a9e440d8262ebdb1925f935b0a61f8
BLAKE2b-256 27e4b3c204727e1a21290f0096dcd05edd43b1222a4e182e02f02eacabe489bb

See more details on using hashes here.

File details

Details for the file rapidunfurl-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: rapidunfurl-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 15.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.10.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10

File hashes

Hashes for rapidunfurl-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 e993112c847c89aa191d41a995ff7ef224893cb5752b8a4f6cf901d4c5f0e3d8
MD5 e8dcc2854b0592d9a16a93e763131814
BLAKE2b-256 b1d4d1846507e7997f46af62093cba21f1bf0e14cc8a0b9ce89b2dd180816459

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page