Skip to main content

Extracting the content from the webpage

Project description

content-fetch

This package helps you parse the text from the webpage. The input to the function can be a path to an HTML file on your disk or a URL to a web page.

You can install the package by running the following command

pip install contentfetch

Please refer to the code snippet for parsing the text from the HTML file

import contentfetch

results_json = contentfetch.extract_content(html=<html_file>)

results_json

### OR

from contentfetch import extract_content

results_json = extract_content(html=<html_file>)

results_json

Please refer to the code snippet for parsing the text from the webpage through URL

import contentfetch

results_json = contentfetch.extract_content(url=<webpage_url>)

results_json

### OR

from contentfetch import extract_content

results_json = extract_content(url=<webpage_url>)

results_json

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

contentfetch-0.0.5.tar.gz (12.1 kB view details)

Uploaded Source

Built Distribution

contentfetch-0.0.5-py3-none-any.whl (20.1 kB view details)

Uploaded Python 3

File details

Details for the file contentfetch-0.0.5.tar.gz.

File metadata

  • Download URL: contentfetch-0.0.5.tar.gz
  • Upload date:
  • Size: 12.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.3 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.10.1 urllib3/1.26.12 tqdm/4.64.1 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.4 CPython/3.6.9

File hashes

Hashes for contentfetch-0.0.5.tar.gz
Algorithm Hash digest
SHA256 d4f17fdfebf2afa4715fa6b56be11c8903a6839c006dde156b3b801042425952
MD5 9ad3d86e8d1d96ce02d767641bee0e36
BLAKE2b-256 741e668f3e7b887cb971e5cbebde5510cbdb8a46a8b866f04887b434dcce99fd

See more details on using hashes here.

File details

Details for the file contentfetch-0.0.5-py3-none-any.whl.

File metadata

  • Download URL: contentfetch-0.0.5-py3-none-any.whl
  • Upload date:
  • Size: 20.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.3 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.10.1 urllib3/1.26.12 tqdm/4.64.1 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.4 CPython/3.6.9

File hashes

Hashes for contentfetch-0.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 64226e8e0d58ac8c57963322179a493bc3e6840673a8f7cb2745b5df877b82bf
MD5 8552c8104d84f40b23a822ee51479ac2
BLAKE2b-256 eecb3407c3d67e79c307517328b46e8754fcef403bf456146a5eeab88697a75b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page