Extract, clean, transform, hyphenate and get metadata for ISBNs (International Standard Book Number).
Info
isbntools provides several useful methods and functions to validate, clean, transform, hyphenate and get metadata for ISBN strings.
Typical usage (as library):
#!/usr/bin/env python
import isbntools
...
Several scripts are provided to use from the command line:
$ to_isbn10 ISBN13
transforms an ISBN13 number to ISBN10.
$ to_isbn13 ISBN10
transforms an ISBN10 number to ISBN13.
$ isbn_info ISBN
gives you the group identifier of the ISBN.
$ isbn_mask ISBN
masks (hyphenates) an ISBN (splits it by identifiers).
$ isbn_meta ISBN
gives you the main metadata associated with the ISBN (uses worldcat.org). To use the Google Books service instead, enter isbn_meta ISBN goob.
$ isbn_editions ISBN
gives you the ISBNs of other editions of the given ISBN (uses worldcat.org), that is, the collection of ISBNs that represent a given book.
$ isbn_validate ISBN
validates ISBN10 and ISBN13.
$ ... | isbn_stdin_validate
for use with POSIX pipes (e.g. cat FILE_WITH_ISBNs | isbn_stdin_validate).
$ isbn_from_words "words from title and author name"
a fuzzy script that returns the most probable ISBN from a set of words! (You can verify the result with isbn_meta.)
$ isbntools
prints the version and copyright notice.
Many more scripts could be written with the isbntools library, using the methods for extraction, cleaning, validation and standardization of ISBNs.
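To make the validation and conversion steps concrete, here is a minimal pure-Python sketch of the standard ISBN check-digit arithmetic. It is not the isbntools implementation, and all function names below are hypothetical, chosen only for illustration. It assumes the input has already been cleaned (no hyphens or spaces).

```python
DIGITS = set("0123456789")


def _all_digits(s):
    # Stricter than str.isdigit(), which also accepts non-ASCII digits.
    return bool(s) and all(c in DIGITS for c in s)


def isbn10_check_digit(first9):
    """Check character for the first nine digits of an ISBN-10."""
    # Weights run 10, 9, ..., 2; the check value makes the sum divisible by 11.
    total = sum((10 - i) * int(d) for i, d in enumerate(first9))
    check = (11 - total % 11) % 11
    return "X" if check == 10 else str(check)


def isbn13_check_digit(first12):
    """Check digit for the first twelve digits of an ISBN-13."""
    # Weights alternate 1, 3, 1, 3, ...; the check value makes the sum divisible by 10.
    total = sum((3 if i % 2 else 1) * int(d) for i, d in enumerate(first12))
    return str((10 - total % 10) % 10)


def is_valid_isbn10(isbn):
    return (len(isbn) == 10 and _all_digits(isbn[:9])
            and isbn10_check_digit(isbn[:9]) == isbn[9].upper())


def is_valid_isbn13(isbn):
    return (len(isbn) == 13 and _all_digits(isbn)
            and isbn13_check_digit(isbn[:12]) == isbn[12])


def isbn10_to_isbn13(isbn10):
    """Prefix 978, drop the old check character, recompute the check digit."""
    if not is_valid_isbn10(isbn10):
        return None
    body = "978" + isbn10[:9]
    return body + isbn13_check_digit(body)


def isbn13_to_isbn10(isbn13):
    """Only ISBN-13s with the 978 prefix have an ISBN-10 equivalent."""
    if not is_valid_isbn13(isbn13) or not isbn13.startswith("978"):
        return None
    body = isbn13[3:12]
    return body + isbn10_check_digit(body)
```

For example, isbn10_to_isbn13("0143105981") gives "9780143105985", and isbn13_to_isbn10 returns None for any ISBN-13 that starts with 979.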
Just for fun, suppose I want the most talked-about book with certain words in its title. For a quick-and-dirty solution, enter the following code in a file and save it as isbn_tmsa_book.py.
#!/usr/bin/env python
# Python 2 script (urllib2 is not available in Python 3).
import sys
import urllib2
from isbntools import *

# Build a Google search URL from the words given on the command line.
query = sys.argv[1].replace(' ', '+')
SEARCH_URL = "http://www.google.com/search?q=%s+ISBN" % query
headers = {'User-Agent': 'w3m/0.5.2'}
request = urllib2.Request(SEARCH_URL, headers=headers)
response = urllib2.urlopen(request)
content = response.read()

# Keep the first ISBN-like string on the page that yields a canonical ISBN-13.
isbns = get_isbnlike(content)
ib = None
for item in isbns:
    ib = get_canonical_isbn(item, output='ISBN-13')
    if ib:
        break

print("The ISBN of the most `spoken-about` book with this title is %s" % ib)
print("")
print("... and the book is:")
print("")
print(meta(ib))
Then in a command line (in the same directory):
$ python isbn_tmsa_book.py 'noise'
In my case I get:
The ISBN of the most `spoken-about` book with this title is 9780143105985

... and the book is:

{'Publisher': u'Penguin Books', 'Language': u'eng', 'Title': u'White noise', 'Year': u'2009', 'ISBN-13': '9780143105985', 'Authors': u'Don DeLillo ; introduction by Richard Powers.'}
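The script relies on get_isbnlike to pull candidate strings out of the page text. As a rough idea of what such an extraction step can look like, here is a small regex-based sketch in plain Python. It is an approximation for illustration, not the library's own pattern, and the names ISBN_LIKE, find_isbn_like and clean_isbn are hypothetical.

```python
import re

# A digit, then 8 to 15 digits, hyphens or spaces, then a final digit or X.
# This is deliberately loose: it finds candidates, it does not validate them.
ISBN_LIKE = re.compile(r"\b\d[\d\- ]{8,15}[\dXx]\b")


def find_isbn_like(text):
    """Return every substring of text that looks like it could be an ISBN."""
    return ISBN_LIKE.findall(text)


def clean_isbn(candidate):
    """Strip hyphens and whitespace; keep only 10- or 13-character results."""
    stripped = re.sub(r"[\s-]", "", candidate).upper()
    return stripped if len(stripped) in (10, 13) else None
```

Candidates found this way still need a check-digit validation before they can be trusted, which is the job of get_canonical_isbn in the script.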
Have fun!
Install
From the command line enter (in some cases you have to precede the command with sudo):
$ pip install isbntools
or:
$ easy_install isbntools
or:
$ pip install isbntools-1.0.3.tar.gz
(first you have to download the file!)
For Devs
If you need high-quality metadata in your app, the only solution is to poll several providers (soon I will provide a framework, ``databee``, to make this fast and easy).
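Until such a framework exists, polling several providers can be sketched in a few lines. The example below is only an illustration under stated assumptions: each provider is assumed to be a callable that takes an ISBN and returns a dict of metadata (or nothing), and merge_metadata is a hypothetical helper, not part of isbntools.

```python
def merge_metadata(isbn, providers):
    """Query every provider and merge the records field by field.

    providers is a list of callables (an assumption for this sketch).
    Earlier providers win: a field is filled by the first provider that
    returns a non-empty value for it.
    """
    merged = {}
    for fetch in providers:
        try:
            record = fetch(isbn) or {}
        except Exception:
            # A failing service should not abort the whole poll.
            continue
        for key, value in record.items():
            if value and not merged.get(key):
                merged[key] = value
    return merged
```

In practice the callables could wrap the services mentioned above (worldcat.org, Google Books), ordered by how much you trust each one.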
Known Issues
The meta method and the isbn_meta script sometimes give a wrong result (this is due to errors in the worldcat.org service). As an alternative, you can use the Google Books service (e.g. isbn_meta 9780143105985 goob).
isbntools works internally with unicode; however, this doesn’t fix information already lost to bad encoding/decoding at the source!
ISBN
To know about ISBN: