Powerful data structures for data analysis and statistics

Project description

pandas is a Python package providing fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real-world data analysis in Python. It also has the broader goal of becoming the most powerful and flexible open source data analysis / manipulation tool available in any language, and it is already well on its way toward this goal.

pandas is well suited for many different kinds of data:

  • Tabular data with heterogeneously-typed columns, as in an SQL table or Excel spreadsheet

  • Ordered and unordered (not necessarily fixed-frequency) time series data

  • Arbitrary matrix data (homogeneously typed or heterogeneous) with row and column labels

  • Any other form of observational / statistical data sets. The data need not be labeled at all to be placed into a pandas data structure

The two primary data structures of pandas, Series (1-dimensional) and DataFrame (2-dimensional), handle the vast majority of typical use cases in finance, statistics, social science, and many areas of engineering. For R users, DataFrame provides everything that R’s data.frame provides and much more. pandas is built on top of NumPy and is intended to integrate well within a scientific computing environment with many other third-party libraries.
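As a minimal sketch of the two structures described above, the following builds a Series and a DataFrame. It assumes a recent pandas installation rather than the 0.6.1 release listed on this page, and all variable names and values are purely illustrative.

```python
import pandas as pd

# A Series is a 1-dimensional array of values with a label for each entry.
prices = pd.Series([10.0, 11.5, 9.75], index=["a", "b", "c"])

# A DataFrame is a 2-dimensional table; its columns may hold different types.
table = pd.DataFrame(
    {"name": ["x", "y", "z"], "count": [1, 2, 3], "score": [0.5, 0.25, 1.0]},
    index=["a", "b", "c"],
)

# Selecting one column of a DataFrame yields a Series that shares its index.
score_column = table["score"]
```

Both objects carry their row labels with them, which is what makes the label-based alignment and selection features listed on this page possible.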

Here are just a few of the things that pandas does well:

  • Easy handling of missing data (represented as NaN) in floating point as well as non-floating point data

  • Size mutability: columns can be inserted into and deleted from DataFrame and higher-dimensional objects

  • Automatic and explicit data alignment: objects can be explicitly aligned to a set of labels, or the user can simply ignore the labels and let Series, DataFrame, etc. automatically align the data for you in computations

  • Powerful, flexible group by functionality to perform split-apply-combine operations on data sets, for both aggregating and transforming data

  • Easy conversion of ragged, differently-indexed data in other Python and NumPy data structures into DataFrame objects

  • Intelligent label-based slicing, fancy indexing, and subsetting of large data sets

  • Intuitive merging and joining of data sets

  • Flexible reshaping and pivoting of data sets

  • Hierarchical labeling of axes (possible to have multiple labels per tick)

  • Robust IO tools for loading data from flat files (CSV and delimited), Excel files, databases, and saving / loading data from the ultrafast HDF5 format

  • Time series-specific functionality: date range generation and frequency conversion, moving window statistics, moving window linear regressions, date shifting and lagging, etc.
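A few of the features in the list above can be sketched briefly. This example assumes a recent pandas installation (the API of the 0.6.1 release listed on this page may differ), and the data and variable names are hypothetical.

```python
import pandas as pd

# Automatic alignment: the two Series share only the labels "b" and "c".
left = pd.Series([1.0, 2.0, 3.0], index=["a", "b", "c"])
right = pd.Series([10.0, 20.0, 30.0], index=["b", "c", "d"])
total = left + right  # labels present on only one side become NaN

# Missing data: fill or drop the NaN entries produced by the alignment.
filled = total.fillna(0.0)
present = total.dropna()

# Group by: split rows by key, aggregate each group, combine the results.
sales = pd.DataFrame(
    {"region": ["east", "west", "east", "west"], "amount": [5, 7, 11, 13]}
)
per_region = sales.groupby("region")["amount"].sum()

# Time series: generate a daily date range, then lag the values by one step.
dates = pd.date_range("2012-01-01", periods=4, freq="D")
series = pd.Series([1.0, 2.0, 4.0, 8.0], index=dates)
lagged = series.shift(1)  # the first entry has no predecessor and is NaN
```

Note that the addition never requires the caller to reorder or trim either Series; the labels decide which values are combined.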

Many of these principles address shortcomings frequently experienced when using other languages / scientific research environments. For data scientists, working with data is typically divided into multiple stages: munging and cleaning data, analyzing / modeling it, then organizing the results of the analysis into a form suitable for plotting or tabular display. pandas is the ideal tool for all of these tasks.

Note

Windows binaries built against NumPy 1.6.1

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pandas-0.6.1.tar.gz (1.5 MB)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

pandas-0.6.1.win-amd64-py2.7.exe (886.0 kB)

Uploaded Source

pandas-0.6.1.win-amd64-py2.6.exe (885.4 kB)

Uploaded Source

pandas-0.6.1.win32-py3.2.exe (784.0 kB)

Uploaded Source

pandas-0.6.1.win32-py3.1.exe (783.9 kB)

Uploaded Source

pandas-0.6.1.win32-py2.7.exe (781.4 kB)

Uploaded Source

pandas-0.6.1.win32-py2.6.exe (778.1 kB)

Uploaded Source

pandas-0.6.1.win32-py2.5.exe (644.1 kB)

Uploaded Source

File details

Details for the file pandas-0.6.1.tar.gz.

File metadata

  • Download URL: pandas-0.6.1.tar.gz
  • Upload date:
  • Size: 1.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pandas-0.6.1.tar.gz
Algorithm Hash digest
SHA256 14373abd1be39be6da27bb4b49130908378c97ab89611059541029ea688d4f74
MD5 32fedd9e89ee3d788f3b34eac9c6f2e7
BLAKE2b-256 dac1eaf25c22d31a1f2f8072ae34b60bc2362d7d0f8261485c134bc0c1698f52

See more details on using hashes here.
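One way to check a downloaded file against the digests listed above is to compute its SHA256 locally and compare. The sketch below uses only the Python standard library; the function names and the chunk size are illustrative choices, not part of pandas or PyPI tooling.

```python
import hashlib

# SHA256 digest listed above for pandas-0.6.1.tar.gz.
EXPECTED_SHA256 = "14373abd1be39be6da27bb4b49130908378c97ab89611059541029ea688d4f74"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in fixed-size chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes (EOF).
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of_file(path) == expected.lower()
```

Calling `matches_expected("pandas-0.6.1.tar.gz")` on a local copy of the source distribution should return True only if the file is byte-for-byte the one described here; the same helper works for the other files by passing their listed SHA256 value as `expected`.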

File details

Details for the file pandas-0.6.1.win-amd64-py2.7.exe.

File hashes

Hashes for pandas-0.6.1.win-amd64-py2.7.exe
Algorithm Hash digest
SHA256 383a35454229a26ab9904964c8fbeda226f09901e4903dad284749d2fdc24bfa
MD5 0ec6681fdfa351b79ade4cab21948f5f
BLAKE2b-256 dafeb3a38b70b833f2e1c5fd98d490a441b56b51ed5d8f354e789435bf24eb8c

File details

Details for the file pandas-0.6.1.win-amd64-py2.6.exe.

File hashes

Hashes for pandas-0.6.1.win-amd64-py2.6.exe
Algorithm Hash digest
SHA256 4ca2d9a30578b3e82f7bed4580df048769bbf16ebf1eb3432bd5ea560a055815
MD5 34da1f1d4513781300f9bfea21ce8c63
BLAKE2b-256 c45a69bbda6801034003323cb7f8c45931d9ac921b349571c8e1bd9f7ec4baec

File details

Details for the file pandas-0.6.1.win32-py3.2.exe.

File hashes

Hashes for pandas-0.6.1.win32-py3.2.exe
Algorithm Hash digest
SHA256 d9e5ba02aa70c3340b528c330527ca34496eccf2d52301581c43977f8cb0f2a7
MD5 e7d84cbae6e8ae80d78e97ad49d90154
BLAKE2b-256 abb66d0538f9852e61d2905605251faac43c57d7f74485411bf54784cb3f1bf8

File details

Details for the file pandas-0.6.1.win32-py3.1.exe.

File hashes

Hashes for pandas-0.6.1.win32-py3.1.exe
Algorithm Hash digest
SHA256 3579b12474160c5b39957611ecb9c5330efec9292e059043b74829e2f1ecdff2
MD5 7bc80b6fa73a5b72c39d5858b70fba2e
BLAKE2b-256 131c1aa3f635176093cc8660f0dd162a9c7461a694d8a2da34a59462539e6ad9

File details

Details for the file pandas-0.6.1.win32-py2.7.exe.

File hashes

Hashes for pandas-0.6.1.win32-py2.7.exe
Algorithm Hash digest
SHA256 fb7a9e3ff75adec5b1918fd191efb965533e1eeffe274ebf57c8a6b53e9987ea
MD5 9933f6040411c7cc5150a5acc50e7d9e
BLAKE2b-256 487de2d7c0b6f8a9070082b0a3381ef3296b476cf1e8f0b507b9180dad26ef8b

File details

Details for the file pandas-0.6.1.win32-py2.6.exe.

File hashes

Hashes for pandas-0.6.1.win32-py2.6.exe
Algorithm Hash digest
SHA256 c612a6c267053b0288913efc24e294677ba693ef8099b6011a21b1894ed65a3b
MD5 a1abdcd8417742e843a174e0f970c6ef
BLAKE2b-256 199d20e31d2203af97c7784343dccaf0b6f05a428fc03165da22020a7ec33d4e

File details

Details for the file pandas-0.6.1.win32-py2.5.exe.

File hashes

Hashes for pandas-0.6.1.win32-py2.5.exe
Algorithm Hash digest
SHA256 ab81d0a0bbafa7e328d051922f1889ea0d87c7667e14611a23f1af7bb5feae90
MD5 cc52987544e3e08742e513393fa7b08f
BLAKE2b-256 56d6cd7e93416e8961ef4d830448adf79cfff3033400c3c89eb93edfe6b2d66c
