Skip to main content

Quickly extract metadata from URLs

Project description

RapidUnfurl

RapidUnfurl is a Python library designed to pull and process metadata very quickly to unfurl URL contents into a JSON object that can the be used by other programs for portraying that data, similar to how link expansion works in apps like Slack.

This library was originally forked from Loftie Ellis' pyunfurl library, which is an awesome project. I just wanted to do some things to speed up the process, and drop away the html rendering, which I didn't need.

Features

Installation

Use the package manager pip to install pyunfurl.

pip install rapidunfurl

Usage

import rapidunfurl
rapidunfurl.unfurl('https://davintaddeo.com') 

This will return a dict similar to the oembed spec:

{
  "type": "website",
  "url": "https://davintaddeo.com",
  "title": "Davin Taddeo | DevOps Advocate",
  "site_name": "@tdarwin",
  "description": "Homepage of Davin Taddeo, DevOps Advocate, Senior Customer Architect for Chef",
  "image": "https://davintaddeo.com/assets/images/round_headshot.png",
  "card": "summary",
  "favicon": "https://davintaddeo.com/favicon.ico"
}

Contributing

Pull requests are welcome. RapidUnfurl supports some custom integrations for sites that doesnt return any meta tags, if you want to improve the integration for a specific site you can look at the hackernews example.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rapidunfurl-1.1.0.tar.gz (16.7 kB view details)

Uploaded Source

Built Distribution

rapidunfurl-1.1.0-py3-none-any.whl (16.2 kB view details)

Uploaded Python 3

File details

Details for the file rapidunfurl-1.1.0.tar.gz.

File metadata

  • Download URL: rapidunfurl-1.1.0.tar.gz
  • Upload date:
  • Size: 16.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.9.12

File hashes

Hashes for rapidunfurl-1.1.0.tar.gz
Algorithm Hash digest
SHA256 78f7dc4090a0a4dbb57ea81a58c91d9ddf8094ad3d0e19121249f65f892634bf
MD5 43e37fa41cf2c593fa244f240d4a8dab
BLAKE2b-256 d6a4004812b73f43b6230f66700de3db1aee08705235b666a66ab69ee83b94a2

See more details on using hashes here.

File details

Details for the file rapidunfurl-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: rapidunfurl-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 16.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.9.12

File hashes

Hashes for rapidunfurl-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c2b39b4156763fd265592e7cbf20e41aa86da3be48a0d8dcb60625c1b738161d
MD5 47a07f2be0f6c351e9b59f92a2c61deb
BLAKE2b-256 61c28d6c4a94ab0d088d148ca151cc186fc3b1371db414ac24cda6f555c76130

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page