Skip to main content

Read DBF Files with Python

Project description

DBF is a file format used by databases such dBase, Visual FoxPro, and FoxBase+. This library reads DBF files and returns the data as native Python data types for further processing. It is primarily intended for batch jobs and one-off scripts.

>>> from dbfread import DBF
>>> for record in DBF('people.dbf'):
...     print(record)
OrderedDict([('NAME', 'Alice'), ('BIRTHDATE', datetime.date(1987, 3, 1))])
OrderedDict([('NAME', 'Bob'), ('BIRTHDATE', datetime.date(1980, 11, 12))])

By default records are streamed directly from the file. If you have enough memory you can instead load them into a list. This allows for random access:

>>> table = DBF('people.dbf', load=True)
>>> print(table.records[1]['NAME'])
Bob
>>> print(table.records[0]['NAME'])
Alice

Full documentation at https://dbfread.readthedocs.io/

See docs/changes.rst for a full list of changes in each version.

Main Features

  • written for Python 3, but also works in 2.7

  • simple but flexible API

  • data is returned as native Python data types

  • records are ordered dictionaries, but can be reconfigured to be of any type

  • aims to handle all variants of DBF files. (Currently only widely tested with Visual FoxPro, but should work well with other variants.)

  • support for 18 field types. Custom types can be added by subclassing FieldParser

  • reads FPT and DBT memo files, both text and binary data

  • handles mixed case file names gracefully on case sensitive file systems

  • can retrieve deleted records

Installing

Requires Python 3.2 or 2.7.

pip install dbfread

dbfread is a pure Python module and doesn’t depend on any packages outside the standard library.

To build documentation locally:

python setup.py docs

This requires Sphinx. The resulting files can be found in docs/_build/.

Source code

http://github.com/olemb/dbfread/

API Changes

dbfread.open() and dbfread.read() are deprecated as of version 2.0, and will be removed in 2.2.

The DBF class is no longer a subclass of list. This makes the API a lot cleaner and easier to understand, but old code that relied on this behaviour will be broken. Iteration and record counting works the same as before. Other list operations can be rewritten using the record attribute. For example:

table = dbfread.read('people.dbf')
print(table[1])

can be rewritten as:

table = DBF('people.dbf', load=True)
print(table.records[1])

open() and read() both return DeprecatedDBF, which is a subclass of DBF and list and thus backward compatible.

License

dbfread is released under the terms of the MIT license.

Contact

Ole Martin Bjorndalen - ombdalen@gmail.com

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

td_dbfread-2.0.8.tar.gz (34.6 kB view details)

Uploaded Source

File details

Details for the file td_dbfread-2.0.8.tar.gz.

File metadata

  • Download URL: td_dbfread-2.0.8.tar.gz
  • Upload date:
  • Size: 34.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: Python-urllib/2.7

File hashes

Hashes for td_dbfread-2.0.8.tar.gz
Algorithm Hash digest
SHA256 d1e15fb77108e25e4172afc344590a1afffbb1b709c6ff56ca4047c09756d825
MD5 d524a511ff784edc98e411c7d5ec5f1c
BLAKE2b-256 d524d20665ed12f79553ceea3d73c7cebf01f991917ae602b42a38a7b4c42a35

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page