Loads data from various formats
==============
data-dispenser
==============
.. image:: https://travis-ci.org/catherinedevlin/data-dispenser.svg?branch=master
   :alt: Travis build status

Given a source of row-like data, data-dispenser acts as a generator of ``OrderedDict`` objects, one per row.
Usage::

    # Source is defined in the data_dispenser.sources module
    from data_dispenser.sources import Source

    src = Source('mydata.csv')
    for row in src:
        print(row)

data-dispenser thus serves as a single API for a variety of data sources.
* Free software: MIT license
Data source types supported
...........................
* file names / paths
* open file objects
* pymongo Collection objects
* strings interpretable as data
* URLs beginning with http:// or https://
data-dispenser works most reliably with file names whose extensions indicate
the data format; otherwise it may guess the input format incorrectly.
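To illustrate why a file extension helps, here is a minimal sketch of extension-based format detection. It is not data-dispenser's actual logic: the ``EXTENSION_FORMATS`` table and the ``guess_format`` function are hypothetical names invented for this example.

```python
import os

# Hypothetical mapping for illustration only; not data-dispenser's own table.
EXTENSION_FORMATS = {
    ".csv": "csv",
    ".yaml": "yaml",
    ".yml": "yaml",
    ".json": "json",
    ".pickle": "pickle",
    ".py": "python",
    ".xls": "xls",
    ".xml": "xml",
    ".html": "html",
}


def guess_format(path):
    """Return a format name based on the file extension, or None if unknown."""
    extension = os.path.splitext(path)[1].lower()
    return EXTENSION_FORMATS.get(extension)
```

When the extension is missing or unrecognized, a loader has nothing to go on but the content itself, which is where a wrong guess becomes possible.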
Data input formats supported
............................
* csv
* yaml (requires ``pyyaml``)
* json
* pickle
* ``eval``-able Python
* xls
* xml (experimental)
* HTML containing ``<table>`` elements
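The idea of a single row-oriented API over several formats can be sketched with the standard library alone. The example below is an assumption-laden illustration, not data-dispenser's implementation: ``rows_from_csv`` and ``rows_from_json`` are hypothetical helpers that each turn a string of data into ``OrderedDict`` rows.

```python
import csv
import io
import json
from collections import OrderedDict


def rows_from_csv(text):
    """Yield one OrderedDict per CSV data row, keyed by the header line."""
    for row in csv.DictReader(io.StringIO(text)):
        yield OrderedDict(row)


def rows_from_json(text):
    """Yield one OrderedDict per JSON object; a lone object counts as one row."""
    data = json.loads(text, object_pairs_hook=OrderedDict)
    if isinstance(data, dict):
        data = [data]
    for item in data:
        yield item
```

Both helpers produce the same shape of output, so code that consumes the rows does not need to know which format they came from. Note that CSV values arrive as strings, while JSON values keep their parsed types.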
Multiple files
..............
File paths containing wildcards are expanded, and the matching files are
effectively concatenated into one large data source.
Load limits
...........
Large data sources could overwhelm your system's memory. Passing a ``limit``
keyword to the ``Source`` constructor limits the rows returned from each
source. For file paths with wildcards, the limit applies to each file
source, not to the number of file sources.
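The per-file behavior of ``limit`` can be sketched as follows. This is a standard-library illustration of the semantics described above, assuming CSV input; ``rows_from_files`` is a hypothetical function, not part of data-dispenser.

```python
import csv
import glob
import itertools


def rows_from_files(pattern, limit=None):
    """Yield dict rows from every CSV file matching ``pattern``.

    ``limit`` caps the rows taken from each file, not the number of files.
    """
    for path in sorted(glob.glob(pattern)):
        with open(path, newline="") as handle:
            reader = csv.DictReader(handle)
            # islice stops after ``limit`` rows; None means no cap.
            yield from itertools.islice(reader, limit)
```

With two matching files of three rows each and ``limit=2``, this sketch yields four rows in total: two from each file.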
Code
----
https://pypi.python.org/pypi/data_dispenser
Source and bug tracker
----------------------
https://github.com/catherinedevlin/data-dispenser
History
-------
0.1.0 (2014-05-21)
++++++++++++++++++
* First release on PyPI.
0.1.1 (2014-05-23)
++++++++++++++++++
* Fixed bugs in handling non-listlike YAML files
0.2.0 (2014-07-14)
++++++++++++++++++
* Support .xls
* Support URLs
* Support wildcards
0.2.1 (2014-27-14)
++++++++++++++++++
* Support .html
0.2.2 (2014-27-14)
++++++++++++++++++
* Fix bug with .html tables with ``<thead>``
0.2.3 (2014-09-14)
++++++++++++++++++
* Fix for URLs with no file extension
0.2.4 (2014-09-14)
++++++++++++++++++
* Better error messages on parse failure