Powerful data structures for data analysis, time series, and statistics

Project description

pandas is a Python package providing fast, flexible, and expressive data structures designed to make working with structured (tabular, multidimensional, potentially heterogeneous) and time series data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real-world data analysis in Python. Additionally, it has the broader goal of becoming the most powerful and flexible open source data analysis / manipulation tool available in any language. It is already well on its way toward this goal.

pandas is well suited for many different kinds of data:

  • Tabular data with heterogeneously-typed columns, as in an SQL table or Excel spreadsheet

  • Ordered and unordered (not necessarily fixed-frequency) time series data

  • Arbitrary matrix data (homogeneously typed or heterogeneous) with row and column labels

  • Any other form of observational / statistical data sets. The data actually need not be labeled at all to be placed into a pandas data structure

The two primary data structures of pandas, Series (1-dimensional) and DataFrame (2-dimensional), handle the vast majority of typical use cases in finance, statistics, social science, and many areas of engineering. For R users, DataFrame provides everything that R’s data.frame provides and much more. pandas is built on top of NumPy and is intended to integrate well within a scientific computing environment with many other third-party libraries.
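As a rough illustration of the two structures described above, the following minimal sketch builds a Series and a DataFrame and passes a column to a NumPy function. The variable names and sample values are made up for this example, and it assumes a reasonably recent pandas and NumPy installation; details may differ in 0.13.0.

```python
import numpy as np
import pandas as pd

# A Series is a 1-dimensional labeled array (hypothetical sample values).
prices = pd.Series([10.0, 12.5, 11.0], index=["a", "b", "c"])

# A DataFrame is a 2-dimensional table whose columns may have different types.
frame = pd.DataFrame(
    {
        "name": ["x", "y", "z"],
        "count": [1, 2, 3],
        "weight": [0.5, 1.5, 2.5],
    },
    index=["a", "b", "c"],
)

# Selecting one column of a DataFrame gives a Series that shares the row index.
weights = frame["weight"]

# NumPy functions operate on the values and the result keeps the labels.
log_weights = np.log(weights)
```

Because pandas objects wrap NumPy arrays, element-wise NumPy functions such as `np.log` can usually be applied to them directly.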

Here are just a few of the things that pandas does well:

  • Easy handling of missing data (represented as NaN) in floating point as well as non-floating point data

  • Size mutability: columns can be inserted and deleted from DataFrame and higher dimensional objects

  • Automatic and explicit data alignment: objects can be explicitly aligned to a set of labels, or the user can simply ignore the labels and let Series, DataFrame, etc. automatically align the data for you in computations

  • Powerful, flexible group by functionality to perform split-apply-combine operations on data sets, for both aggregating and transforming data

  • Easy conversion of ragged, differently-indexed data in other Python and NumPy data structures into DataFrame objects

  • Intelligent label-based slicing, fancy indexing, and subsetting of large data sets

  • Intuitive merging and joining of data sets

  • Flexible reshaping and pivoting of data sets

  • Hierarchical labeling of axes (possible to have multiple labels per tick)

  • Robust IO tools for loading data from flat files (CSV and delimited), Excel files, databases, and saving / loading data from the ultrafast HDF5 format

  • Time series-specific functionality: date range generation and frequency conversion, moving window statistics, moving window linear regressions, date shifting and lagging, etc.
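A few of the features listed above can be sketched in a handful of lines. This is only an illustrative example with invented data and names, written against a recent pandas release; some call signatures may differ in 0.13.0, and the moving-window and frequency-conversion tools are left out because their API has changed between versions.

```python
import pandas as pd

# Automatic alignment: values are matched by label, not by position.
s1 = pd.Series([1.0, 2.0, 3.0], index=["a", "b", "c"])
s2 = pd.Series([10.0, 20.0, 30.0], index=["b", "c", "d"])
total = s1 + s2  # labels present on only one side yield NaN

# Missing data: fill or drop the NaN entries produced above.
filled = total.fillna(0.0)
dropped = total.dropna()

# Split-apply-combine with group by (hypothetical sales table).
sales = pd.DataFrame(
    {
        "region": ["north", "south", "north", "south"],
        "amount": [100, 150, 200, 50],
    }
)
per_region = sales.groupby("region")["amount"].sum()

# Merging two tables on a shared key column.
managers = pd.DataFrame(
    {"region": ["north", "south"], "manager": ["Ann", "Bob"]}
)
merged = pd.merge(sales, managers, on="region")

# Time series: generate a daily date range, then lag the series by one step.
dates = pd.date_range("2014-01-01", periods=4, freq="D")
daily = pd.Series([1.0, 2.0, 4.0, 8.0], index=dates)
lagged = daily.shift(1)
change = daily - lagged  # first entry is NaN because it has no predecessor
```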

Many of these principles are here to address the shortcomings frequently experienced using other languages / scientific research environments. For data scientists, working with data is typically divided into multiple stages: munging and cleaning data, analyzing / modeling it, then organizing the results of the analysis into a form suitable for plotting or tabular display. pandas is the ideal tool for all of these tasks.

Note

Windows binaries built against NumPy 1.7.1

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pandas-0.13.0.tar.gz (3.6 MB view details)

Uploaded Source

Built Distributions

pandas-0.13.0.win-amd64-py3.3.exe (2.9 MB view details)

Uploaded Source

pandas-0.13.0.win-amd64-py3.2.exe (2.9 MB view details)

Uploaded Source

pandas-0.13.0.win-amd64-py2.7.exe (2.9 MB view details)

Uploaded Source

pandas-0.13.0.win-amd64-py2.6.exe (2.9 MB view details)

Uploaded Source

pandas-0.13.0.win32-py3.3.exe (2.7 MB view details)

Uploaded Source

pandas-0.13.0.win32-py3.2.exe (2.7 MB view details)

Uploaded Source

pandas-0.13.0.win32-py2.7.exe (2.8 MB view details)

Uploaded Source

pandas-0.13.0.win32-py2.6.exe (2.8 MB view details)

Uploaded Source

File details

Details for the file pandas-0.13.0.tar.gz.

File metadata

  • Download URL: pandas-0.13.0.tar.gz
  • Upload date:
  • Size: 3.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pandas-0.13.0.tar.gz
Algorithm Hash digest
SHA256 5bb3f7a5f24fe8280e84e5407dc129364e7825cbe366b79676bd5f37a9b2bf0e
MD5 92e285bd29f37254017c7613a5e0bc90
BLAKE2b-256 588532e945b75108cf159f73446360a5a86aaaaf7efe46d999a9fb1c8b0b5bf6

See more details on using hashes here.
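One way to use the published digests is to recompute the SHA256 of a downloaded file and compare it with the value listed above. The sketch below uses only the standard library; the helper names `sha256_of_file` and `matches_published_hash` are hypothetical, and the file path is whatever location the archive was saved to.

```python
import hashlib

# SHA256 digest listed on this page for pandas-0.13.0.tar.gz.
EXPECTED_SHA256 = (
    "5bb3f7a5f24fe8280e84e5407dc129364e7825cbe366b79676bd5f37a9b2bf0e"
)


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 equals the expected hex digest."""
    return sha256_of_file(path) == expected.lower()
```

For example, `matches_published_hash("pandas-0.13.0.tar.gz")` would return `True` only if the local file is byte-for-byte identical to the one the digest was computed from.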

File details

Details for the file pandas-0.13.0.win-amd64-py3.3.exe.

File hashes

Hashes for pandas-0.13.0.win-amd64-py3.3.exe
Algorithm Hash digest
SHA256 d0a4ca1e0ff7ca2aedb925feef62ff04ae70988523e25f1e1db28c2422293a3b
MD5 504d555112a54c2b1b93f7931fa9fcc1
BLAKE2b-256 c6f7abbd814dc7c628209a8e897cccca9b22e41eb115d3bf14eb156430dff0a5

File details

Details for the file pandas-0.13.0.win-amd64-py3.2.exe.

File hashes

Hashes for pandas-0.13.0.win-amd64-py3.2.exe
Algorithm Hash digest
SHA256 061cd989d4260b912289c6ba20c21b8ad621c378feb9bd8964694ca05ea386a2
MD5 d598d487793b943fd227ca6341397666
BLAKE2b-256 9458618db4741aeca8c4ca06dd8d0215e7cd7c4e7ba840d1f99e6de8cb4e3c9b

File details

Details for the file pandas-0.13.0.win-amd64-py2.7.exe.

File hashes

Hashes for pandas-0.13.0.win-amd64-py2.7.exe
Algorithm Hash digest
SHA256 d8bb0cb043cd169505e2f3b6f557672a12fbcae70992a7a8e9920043c7ac7b66
MD5 df9b5b2dcf8a4b9eeb567d3c154ee7d7
BLAKE2b-256 88470a57af6ea179c2934d43a72d87062838bebf2682abd4a8c789aa450f6015

File details

Details for the file pandas-0.13.0.win-amd64-py2.6.exe.

File hashes

Hashes for pandas-0.13.0.win-amd64-py2.6.exe
Algorithm Hash digest
SHA256 c130aa0443b435edeada5c97b969def2119e555b95f9746b8b4252aeff49f6df
MD5 2461f4567ea6dc92f3ff9db517e05bc7
BLAKE2b-256 4e65dc79865ff9997ac524d5b60cc6e250c166bd7bf6d86d2f96b8214f97eaa2

File details

Details for the file pandas-0.13.0.win32-py3.3.exe.

File hashes

Hashes for pandas-0.13.0.win32-py3.3.exe
Algorithm Hash digest
SHA256 8e5f17f7bd414224be0addc447585d4f919313f200cdeaf0dea8e81227d71b2e
MD5 f42e26614e23aff4876d92f6439804df
BLAKE2b-256 6a7dd2c00c26840ed9d480591386211a5e9a148ec4ab0f94139a5b74c8b68fb8

File details

Details for the file pandas-0.13.0.win32-py3.2.exe.

File hashes

Hashes for pandas-0.13.0.win32-py3.2.exe
Algorithm Hash digest
SHA256 3b769a201542321016b5a414917c607e6b4aafc992eebd3bdf3ee250179ee629
MD5 34ac65ba9d9907b1370a92d38ebe6f04
BLAKE2b-256 0e7758f84519cdb463355f7c698193859a3a0d148bb43ceec41dd2726ea58b30

File details

Details for the file pandas-0.13.0.win32-py2.7.exe.

File hashes

Hashes for pandas-0.13.0.win32-py2.7.exe
Algorithm Hash digest
SHA256 fde4275111606926edca75e2e5cdceb2fee149238480e7cf55d51982ace03322
MD5 cfb1c04906d2d3a83459f1f31ba45f72
BLAKE2b-256 fed6915e6c71ab2c3b9333b5e3e77af870fe8a3fb92e9f862bdbee9bb171a39b

File details

Details for the file pandas-0.13.0.win32-py2.6.exe.

File hashes

Hashes for pandas-0.13.0.win32-py2.6.exe
Algorithm Hash digest
SHA256 ecc7ffde07b629477e32b16d99c0e29eea99e3d7c5f1a330c43cbce7f46e13fd
MD5 7922364e20f1bd05e6eb11cb9ce22654
BLAKE2b-256 73f279c4ab219327462a2a9af6068f0ad8c4c7abff66dddbf2ec3dedb3103a48
