Lassie is a Python library for retrieving basic content from websites

https://i.imgur.com/QrvNfAX.gif

Usage

>>> import lassie
>>> lassie.fetch('http://www.youtube.com/watch?v=dQw4w9WgXcQ')
{
    'description': u'Music video by Rick Astley performing Never Gonna Give You Up. YouTube view counts pre-VEVO: 2,573,462 (C) 1987 PWL',
    'videos': [{
        'src': u'http://www.youtube.com/v/dQw4w9WgXcQ?autohide=1&version=3',
        'height': 480,
        'type': u'application/x-shockwave-flash',
        'width': 640
    }, {
        'src': u'https://www.youtube.com/embed/dQw4w9WgXcQ',
        'height': 480,
        'width': 640
    }],
    'title': u'Rick Astley - Never Gonna Give You Up',
    'url': u'http://www.youtube.com/watch?v=dQw4w9WgXcQ',
    'keywords': [u'Rick', u'Astley', u'Sony', u'BMG', u'Music', u'UK', u'Pop'],
    'images': [{
        'src': u'http://i1.ytimg.com/vi/dQw4w9WgXcQ/hqdefault.jpg?feature=og',
        'type': u'og:image'
    }, {
        'src': u'http://i1.ytimg.com/vi/dQw4w9WgXcQ/hqdefault.jpg',
        'type': u'twitter:image'
    }, {
        'src': u'http://s.ytimg.com/yts/img/favicon-vfldLzJxy.ico',
        'type': u'favicon'
    }, {
        'src': u'http://s.ytimg.com/yts/img/favicon_32-vflWoMFGx.png',
        'type': u'favicon'
    }],
    'locale': u'en_US'
}
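
The result shown above is a plain dictionary, so it can be post-processed with ordinary Python. The following is a minimal sketch, assuming a dictionary shaped like that output; `first_image` is a hypothetical helper written for illustration, not part of Lassie, and `sample` is hand-written data so the example runs without network access.

```python
def first_image(data, preferred=("og:image", "twitter:image", "favicon")):
    """Return the src of the first image matching the preferred types, in order.

    `data` is assumed to be a dict shaped like the fetch output shown above.
    Returns None when no image of a preferred type is present.
    """
    images = data.get("images", [])
    for wanted in preferred:
        for image in images:
            if image.get("type") == wanted:
                return image.get("src")
    return None


# Hand-written sample mirroring the structure of the output above.
sample = {
    "title": "Rick Astley - Never Gonna Give You Up",
    "images": [
        {"src": "http://s.ytimg.com/yts/img/favicon-vfldLzJxy.ico", "type": "favicon"},
        {"src": "http://i1.ytimg.com/vi/dQw4w9WgXcQ/hqdefault.jpg?feature=og", "type": "og:image"},
    ],
}

print(first_image(sample))
```

Using `dict.get` throughout keeps the helper tolerant of missing keys, which may matter because not every page provides every field.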

Install

Install Lassie via pip:

$ pip install lassie

or with easy_install:

$ easy_install lassie

But, hey… that’s up to you.

Documentation

Documentation can be found here: https://lassie.readthedocs.org/

History

0.5.1 (2014-08-05)

  • Fix issue where headers didn’t always have ‘Content-Type’ key

0.5.0 (2014-06-23)

  • Added ability to fetch links that are image files (jpg, gif, png, bmp)

  • Renamed _retreive_content to _retrieve_content because I evidently don’t know how to spell correctly
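
The 0.5.0 and 0.5.1 entries concern links that point directly at image files and responses that lack a 'Content-Type' header. The sketch below shows one way such a check could be written. It is an illustration under stated assumptions, not Lassie's actual implementation: `looks_like_image` and `IMAGE_EXTENSIONS` are hypothetical names, and `headers` is assumed to be a plain dict.

```python
from urllib.parse import urlparse

# Extensions taken from the 0.5.0 changelog entry (jpg, gif, png, bmp).
IMAGE_EXTENSIONS = (".jpg", ".gif", ".png", ".bmp")


def looks_like_image(url, headers):
    """Guess whether a URL points at an image file.

    The header is read with .get() and a default, so a response without
    a 'Content-Type' key does not raise KeyError.
    """
    content_type = headers.get("Content-Type", "")
    if content_type.lower().startswith("image/"):
        return True
    # Fall back to the file extension of the URL path (query string ignored).
    path = urlparse(url).path.lower()
    return path.endswith(IMAGE_EXTENSIONS)


print(looks_like_image("http://example.com/photo.PNG?size=large", {}))
print(looks_like_image("http://example.com/page", {"Content-Type": "text/html"}))
```

Note that a plain dict is case-sensitive, so a real HTTP client's header mapping may behave differently from this sketch.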

0.4.0 (2013-09-30)

  • Updated requests and beautifulsoup4 library versions

  • Added support for manipulating the request, see Advanced Usage docs

  • Fixed issue where lassie.fetch would break if the page had no title

  • Lassie is now more lenient when it comes to width and height values of images: it now accepts an integer (100) or an integer with px (100px)

  • Image URLs for all images are now absolute
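
The two image-related entries above (lenient width and height values, absolute image URLs) can be sketched as follows. This is a minimal illustration in Python 3, not Lassie's own code: `parse_dimension` and `absolute_image_url` are hypothetical helpers, and the `example.com` URLs are placeholders.

```python
import re
from urllib.parse import urljoin


def parse_dimension(value):
    """Return an int for 100, '100' or '100px'; return None for anything else."""
    if isinstance(value, bool):
        # bool is a subclass of int; treat it as invalid here.
        return None
    if isinstance(value, int):
        return value
    if isinstance(value, str):
        match = re.fullmatch(r"\s*(\d+)\s*(?:px)?\s*", value, flags=re.IGNORECASE)
        if match:
            return int(match.group(1))
    return None


def absolute_image_url(page_url, src):
    """Resolve an image src against the URL of the page it appeared on."""
    return urljoin(page_url, src)


print(parse_dimension("100px"))
print(absolute_image_url("http://example.com/a/b.html", "/img/x.png"))
```

Values such as percentages are deliberately rejected in this sketch, since they cannot be converted to a pixel count without knowing the layout.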

0.3.0 (2013-08-15)

  • Added support for locale to be returned. If lang is specified in the html tag and it normalizes to an actual locale, it will be added to the returned data.

  • Fixed bug where height was not being returned for body images

  • Added test coverage, we’re 100% covered! :D

0.2.1 (2013-08-13)

  • Remove spaces from the returned keywords list

  • Fixed issue where favicon was not being retrieved

  • Fixed priority for class level vs method level params

0.2.0 (2013-08-06)

  • Fix package error when importing

0.1.0 (2013-08-05)

  • Initial Release
