Tools for parsing, extracting, reconciling, and unshortening urls

Project description

A newslynx-opinionated collection of utilities for dealing with urls.

Install

pip install newslynx-url

Test

Requires nose. From the project root, run:

nosetests

Usage

This module contains various methods that are used throughout newslynx-core, but the main functions are unshorten_url, is_article_url, and prepare_url:

import re

from newslynx_url import (
    unshorten_url, is_article_url, prepare_url
)

# Resolve a shortened link to its final destination.
print(unshorten_url('bit.ly/1j3SrUC'))
# http://towcenter.org/blog/tow-fellows-brian-abelson-and-michael-keller-to-study-the-impact-of-journalism/

# Check whether a url looks like an article.
print(is_article_url(
    'http://towcenter.org/blog/tow-fellows-brian-abelson-and-michael-keller-to-study-the-impact-of-journalism'
))
# True

# The pattern can be passed as a regex string...
print(is_article_url(
    'http://towcenter.org/blog/tow-fellows-brian-abelson-and-michael-keller-to-study-the-impact-of-journalism',
    pattern=r'.*towcenter\.org/blog/.*'
))
# True

# ...or as a compiled regex.
pattern = re.compile(r'.*towcenter\.org/blog/.*')
print(is_article_url(
    'http://towcenter.org/blog/tow-fellows-brian-abelson-and-michael-keller-to-study-the-impact-of-journalism',
    pattern=pattern
))
# True

# Strip the query string and trailing slash.
print(prepare_url(
    'http://towcenter.org/blog/tow-fellows-brian-abelson-and-michael-keller-to-study-the-impact-of-journalism/?q=lfjad&f=lkfdjsal'
))
# http://towcenter.org/blog/tow-fellows-brian-abelson-and-michael-keller-to-study-the-impact-of-journalism
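
To make the behaviour in the examples above more concrete, here is a minimal standard-library sketch of the same two ideas: dropping the query string and trailing slash from a url, and testing a url against a pattern given either as a string or as a compiled regex. This is not how newslynx-url is implemented. The helper names strip_query_and_slash and matches_pattern are hypothetical, and the sketch assumes Python 3 (urllib.parse).

```python
import re
from urllib.parse import urlsplit, urlunsplit


def strip_query_and_slash(url):
    """Hypothetical helper: drop the query string, fragment, and trailing slash."""
    parts = urlsplit(url)
    path = parts.path.rstrip('/')
    # Rebuild the url with an empty query and an empty fragment.
    return urlunsplit((parts.scheme, parts.netloc, path, '', ''))


def matches_pattern(url, pattern):
    """Hypothetical helper: accept a regex string or a compiled pattern."""
    if isinstance(pattern, str):
        pattern = re.compile(pattern)
    return pattern.match(url) is not None


url = (
    'http://towcenter.org/blog/tow-fellows-brian-abelson-and-michael-keller'
    '-to-study-the-impact-of-journalism/?q=lfjad&f=lkfdjsal'
)
clean = strip_query_and_slash(url)
print(clean)
print(matches_pattern(clean, r'.*towcenter\.org/blog/.*'))
```

The real prepare_url and is_article_url may apply further newslynx-specific rules, so treat this only as an illustration of the general technique.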

Download files

Source Distribution

newslynx-url-0.1.0.tar.gz (7.8 kB)

Uploaded Source

Built Distribution

newslynx-url-0.1.0.macosx-10.9-intel.exe (78.8 kB)

Uploaded Source

File details

Details for the file newslynx-url-0.1.0.tar.gz.

File hashes

Hashes for newslynx-url-0.1.0.tar.gz
Algorithm Hash digest
SHA256 8fb0c17840ba1f27adf44402f1fd0eadf5426ebee38054c1ee0295c9779cacbd
MD5 0b6049af8e74e172894f6dfc2c76214d
BLAKE2b-256 99c6b5d4ab7ea5e73b699acc23d7f6dc51c87bbd30770a971a92d35f551dc587
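
A downloaded archive can be checked against the SHA256 digest listed above. The following is a minimal sketch using only Python's standard hashlib module. The helper names sha256_of_file and verify are hypothetical, and the sketch assumes the archive has been saved under its original filename in the current directory.

```python
import hashlib

# SHA256 digest listed above for newslynx-url-0.1.0.tar.gz.
EXPECTED_SHA256 = '8fb0c17840ba1f27adf44402f1fd0eadf5426ebee38054c1ee0295c9779cacbd'


def sha256_of_file(path, chunk_size=65536):
    """Hypothetical helper: hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, 'rb') as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b''):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected=EXPECTED_SHA256):
    """Hypothetical helper: True if the file's digest matches `expected`."""
    return sha256_of_file(path) == expected.lower()


# Example (assumes the file exists locally):
# print(verify('newslynx-url-0.1.0.tar.gz'))
```

Reading in chunks keeps memory use flat regardless of file size. pip can also enforce hashes itself through its hash-checking mode.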

File details

Details for the file newslynx-url-0.1.0.macosx-10.9-intel.exe.

File hashes

Hashes for newslynx-url-0.1.0.macosx-10.9-intel.exe
Algorithm Hash digest
SHA256 10882293c4cc71b218605df61eb58113fb6a7b97a60c1dc8eacb8e5d7f459ccc
MD5 ed2cff1b8d2b5b0a4a532b37b9e65e7b
BLAKE2b-256 7bc8f6685a419260d27d94de5801e85f24e8c1b84046c1f740544969d3715d4a
