Lassie is a Python library for retrieving basic content from websites
Usage
>>> import lassie
>>> lassie.fetch('http://www.youtube.com/watch?v=dQw4w9WgXcQ')
{
    'description': u'Music video by Rick Astley performing Never Gonna Give You Up. YouTube view counts pre-VEVO: 2,573,462 (C) 1987 PWL',
    'videos': [{
        'src': u'http://www.youtube.com/v/dQw4w9WgXcQ?autohide=1&version=3',
        'height': 480,
        'type': u'application/x-shockwave-flash',
        'width': 640
    }, {
        'src': u'https://www.youtube.com/embed/dQw4w9WgXcQ',
        'height': 480,
        'width': 640
    }],
    'title': u'Rick Astley - Never Gonna Give You Up',
    'url': u'http://www.youtube.com/watch?v=dQw4w9WgXcQ',
    'keywords': [u'Rick', u'Astley', u'Sony', u'BMG', u'Music', u'UK', u'Pop'],
    'images': [{
        'src': u'http://i1.ytimg.com/vi/dQw4w9WgXcQ/hqdefault.jpg?feature=og',
        'type': u'og:image'
    }, {
        'src': u'http://i1.ytimg.com/vi/dQw4w9WgXcQ/hqdefault.jpg',
        'type': u'twitter:image'
    }, {
        'src': u'http://s.ytimg.com/yts/img/favicon-vfldLzJxy.ico',
        'type': u'favicon'
    }, {
        'src': u'http://s.ytimg.com/yts/img/favicon_32-vflWoMFGx.png',
        'type': u'favicon'
    }],
    'locale': u'en_US'
}
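Since the result is a plain dictionary, it can be consumed with ordinary dict and list operations. The following is a minimal sketch of picking one preview image from the 'images' list, not part of Lassie itself. It uses a trimmed copy of the output above rather than a live fetch, and the names pick_preview_image, summarize and PREFERRED_TYPES, as well as the preference order, are assumptions made for illustration.

```python
# Trimmed sample shaped like the lassie.fetch() output shown above.
# In real use this would come from lassie.fetch(url).
data = {
    'title': 'Rick Astley - Never Gonna Give You Up',
    'url': 'http://www.youtube.com/watch?v=dQw4w9WgXcQ',
    'images': [
        {'src': 'http://i1.ytimg.com/vi/dQw4w9WgXcQ/hqdefault.jpg?feature=og',
         'type': 'og:image'},
        {'src': 'http://i1.ytimg.com/vi/dQw4w9WgXcQ/hqdefault.jpg',
         'type': 'twitter:image'},
        {'src': 'http://s.ytimg.com/yts/img/favicon-vfldLzJxy.ico',
         'type': 'favicon'},
    ],
}

# Hypothetical preference order; Lassie does not define one.
PREFERRED_TYPES = ('og:image', 'twitter:image', 'favicon')


def pick_preview_image(data, preferred=PREFERRED_TYPES):
    """Return the 'src' of the first image matching the preferred types, or None."""
    images = data.get('images', [])
    for wanted in preferred:
        for image in images:
            if image.get('type') == wanted:
                return image.get('src')
    return None


def summarize(data):
    """Reduce a fetched result to the fields a link preview typically needs."""
    return {
        'title': data.get('title'),
        'url': data.get('url'),
        'image': pick_preview_image(data),
    }


print(summarize(data))
```

Using .get() throughout keeps the sketch tolerant of pages that lack a title or images, in which case the corresponding fields come back as None.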
Install
Install Lassie via pip
$ pip install lassie
or, with easy_install
$ easy_install lassie
But, hey… that’s up to you.
Documentation
Documentation can be found here: https://lassie.readthedocs.org/
History
0.5.2 (2015-04-16)
Updated requests and beautifulsoup4 library versions
0.5.1 (2014-08-05)
Fix issue where headers didn’t always have ‘Content-Type’ key
0.5.0 (2014-06-23)
Added ability to fetch links that are image files (jpg, gif, png, bmp)
Renamed _retreive_content to _retrieve_content because I evidently don’t know how to spell correctly
0.4.0 (2013-09-30)
Updated requests and beautifulsoup4 library versions
Added support for manipulating the request, see Advanced Usage docs
Fixed issue where lassie.fetch would break if the page had no title
Lassie is now more lenient when it comes to width and height values of images: it now accepts integers (100) or integers with a px suffix (100px)
Image URLs for all images are now absolute
0.3.0 (2013-08-15)
Added support for locale to be returned. If lang is specified in the html tag and it normalizes to an actual locale, it will be added to the returned data.
Fixed bug where height was not being returned for body images
Added test coverage, we’re 100% covered! :D
0.2.1 (2013-08-13)
Remove spaces from the returned keywords list
Fixed issue where favicon was not being retrieved
Fixed priority for class level vs method level params
0.2.0 (2013-08-06)
Fix package error when importing
0.1.0 (2013-08-05)
Initial Release