Loads data from various formats
Project description
Given a source of rowlike data, acts as a generator of OrderedDicts.
Usage:
src = Source('mydata.csv') for row in src: print(row)
data-dispenser thus serves as a single API for a variety of data sources.
Free software: MIT license
Data source types supported
file names / paths
open file objects
pymongo Collection objects
strings interpretable as data
Will work most reliably against filenames with extensions that indicate the data format; otherwise data-dispenser may guess the input format wrong.
Data input formats supported
csv
yaml (requires pyyaml)
json
pickle
eval-able Python
xls
xml (experimental)
Multiple files
File paths with wildcards will be effectively concatenated into one large data source.
Load limits
Large data sources could overwhelm your system’s memory. Passing a limit keyword to the Source constructor limits the rows returned from each source. For file paths with wildcards, the limit applies to each file source, not to the number of file sources.
Code
Source and bug tracker
History
0.1.0 (2014-05-21)
First release on PyPI.
0.1.1 (2014-05-23)
Fixed bugs in handling non-listlike YAML files
0.2.0 (2014-07-14)
Support .xls
Support URLs
Support wildcards
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.