Unified abstraction for handling xls, xlsx and CSV files in Python
Project description
# TableReader
TableReader is an unified abstraction for handling xls, xlsx and CSV files in Python. It also reads csv tables as unicode on request.
If you are familiar with csv.DictReader from the standard library, think of TableReader as DictReader on steroids.
from tablereader import TableReader
reader = TableReader("some_input.xlsx", sheet="The Data")
for row in reader:
print row['valuecolumn']
In case you have csv and need unicode support:
reader = TableReader("unicode_input.csv", force_type="unicodecsv")
for row in reader:
print row['valuecolumn']
If you want to strip out leading and trailing spaces while reading rows, you can:
reader = TableReader("input_with_whitespaces.csv", strip_whitespaces=True)
for row in reader:
print row['valuecolumn']
And if for some reason you have a file with wrong type by line-ending (commonly found when sharing xls), override auto-detection:
reader = TableReader("wrong_named.xls", force_type="xlsx")
for row in reader:
print row['valuecolumn']
Sometimes headers are not in the first line. So specify some header row search text and the entire row will be used as column name and all rows after are returned:
from tablereader import OffsetTableReader
reader = OffsetTableReader("wrong_named.xls", "BEGIN_DATA")
for row in reader:
print row['valuecolumn']
The library has been tested on CPython 2.6, 2.7 and 3.4 as well as PyPy 2.4.1.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.