Powerful data structures for data analysis and statistics

Project description

pandas is a Python package providing fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real-world data analysis in Python. Additionally, it has the broader goal of becoming the most powerful and flexible open source data analysis / manipulation tool available in any language. It is already well on its way toward this goal.

pandas is well suited for many different kinds of data:

  • Tabular data with heterogeneously-typed columns, as in an SQL table or Excel spreadsheet

  • Ordered and unordered (not necessarily fixed-frequency) time series data

  • Arbitrary matrix data (homogeneously typed or heterogeneous) with row and column labels

  • Any other form of observational / statistical data sets; the data need not be labeled at all to be placed into a pandas data structure

The two primary data structures of pandas, Series (1-dimensional) and DataFrame (2-dimensional), handle the vast majority of typical use cases in finance, statistics, social science, and many areas of engineering. For R users, DataFrame provides everything that R’s data.frame provides and much more. pandas is built on top of NumPy and is intended to integrate well within a scientific computing environment with many other third-party libraries.
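As a rough illustration of the two structures, the sketch below builds a small Series and DataFrame. It assumes a recent pandas release rather than the 0.4.3 API this page describes, and the labels and values are made up for the example.

```python
import pandas as pd

# A Series is a 1-dimensional labeled array.
prices = pd.Series([10.0, 11.5, 9.8], index=["a", "b", "c"])

# A DataFrame is a 2-dimensional table whose columns may have different types.
frame = pd.DataFrame(
    {"name": ["x", "y", "z"], "value": [1.0, 2.0, 3.0]},
    index=["a", "b", "c"],
)

# Selecting one column of a DataFrame gives a Series that shares the row index.
value_column = frame["value"]

# The values can be handed to NumPy-based code as a plain array
# (to_numpy is a method of recent pandas releases).
as_array = prices.to_numpy()
```

Both objects carry their labels with them, which is what the alignment and slicing features listed below rely on.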

Here are just a few of the things that pandas does well:

  • Easy handling of missing data (represented as NaN) in floating point as well as non-floating point data

  • Size mutability: columns can be inserted and deleted from DataFrame and higher dimensional objects

  • Automatic and explicit data alignment: objects can be explicitly aligned to a set of labels, or the user can simply ignore the labels and let Series, DataFrame, etc. automatically align the data for you in computations

  • Powerful, flexible group by functionality to perform split-apply-combine operations on data sets, for both aggregating and transforming data

  • Easy conversion of ragged, differently-indexed data in other Python and NumPy data structures into DataFrame objects

  • Intelligent label-based slicing, fancy indexing, and subsetting of large data sets

  • Intuitive merging and joining of data sets

  • Flexible reshaping and pivoting of data sets

  • Hierarchical labeling of axes (possible to have multiple labels per tick)

  • Robust IO tools for loading data from flat files (CSV and delimited), Excel files, and databases, and for saving data to and loading it from the ultrafast HDF5 format

  • Time series-specific functionality: date range generation and frequency conversion, moving window statistics, moving window linear regressions, date shifting and lagging, etc.
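A few of the features listed above can be sketched in a handful of lines. The example is a minimal illustration only: it assumes a recent pandas release (several of these calls may not exist in this form in 0.4.3), and all data and variable names are invented for the purpose.

```python
import numpy as np
import pandas as pd

# Two Series with partially overlapping labels.
s1 = pd.Series([1.0, 2.0, 3.0], index=["a", "b", "c"])
s2 = pd.Series([10.0, 20.0, 30.0], index=["b", "c", "d"])

# Automatic alignment: addition matches on labels, not positions.
# Labels present in only one operand produce NaN (missing data).
total = s1 + s2

# Missing-data handling: drop the NaN entries or fill them with a default.
dropped = total.dropna()
filled = total.fillna(0.0)

# Group by: split rows by key, apply a sum, combine the results.
df = pd.DataFrame({"key": ["x", "y", "x", "y"], "val": [1, 2, 3, 4]})
sums = df.groupby("key")["val"].sum()

# Merging: join a second table onto the first using the shared "key" column.
labels = pd.DataFrame({"key": ["x", "y"], "label": ["first", "second"]})
merged = pd.merge(df, labels, on="key")

# Time series: generate a daily date range, lag the values by one step,
# and compute a two-period moving average.
dates = pd.date_range("2011-01-01", periods=4, freq="D")
ts = pd.Series([1.0, 2.0, 3.0, 4.0], index=dates)
shifted = ts.shift(1)
rolling_mean = ts.rolling(2).mean()
```

Note that the alignment step introduces NaN for labels "a" and "d", so the missing-data tools are usually needed right after combining differently-indexed objects.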

Many of these principles address shortcomings frequently experienced when using other languages and scientific research environments. For data scientists, working with data is typically divided into multiple stages: munging and cleaning data, analyzing / modeling it, then organizing the results of the analysis into a form suitable for plotting or tabular display. pandas is the ideal tool for all of these tasks.
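The three stages described above might look roughly like this in practice. This is a sketch under the assumption of a recent pandas release; the city and temperature data are hypothetical and exist only to show the flow from raw input to a display table.

```python
import pandas as pd

# Hypothetical raw input: temperatures arrive as strings, one is unusable.
raw = pd.DataFrame(
    {
        "city": ["A", "A", "B", "B", "B"],
        "year": [2010, 2011, 2010, 2011, 2011],
        "temp": ["20.5", "21.0", "n/a", "18.0", "19.0"],
    }
)

# Stage 1: munging and cleaning. Unparseable strings become NaN, then are dropped.
clean = raw.copy()
clean["temp"] = pd.to_numeric(clean["temp"], errors="coerce")
clean = clean.dropna(subset=["temp"])

# Stage 2: analysis. Mean temperature per city and year.
means = clean.groupby(["city", "year"])["temp"].mean()

# Stage 3: organizing results. Move the years into columns for tabular display.
table = means.unstack("year")
```

Here `table` has one row per city and one column per year, with NaN where no valid reading exists for that combination.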

Note

Windows binaries built against NumPy 1.6.1


This version

0.4.3

Download files

Download the file for your platform.

Source Distribution

  • pandas-0.4.3.tar.gz (1.8 MB)

Built Distributions

  • pandas-0.4.3.win-amd64-py2.7.exe (744.2 kB)
  • pandas-0.4.3.win-amd64-py2.6.exe (744.0 kB)
  • pandas-0.4.3.win32-py3.2.exe (652.2 kB)
  • pandas-0.4.3.win32-py3.1.exe (652.2 kB)
  • pandas-0.4.3.win32-py2.7.exe (648.5 kB)
  • pandas-0.4.3.win32-py2.6.exe (648.2 kB)
  • pandas-0.4.3.win32-py2.5.exe (513.7 kB)

File details

Details for the file pandas-0.4.3.tar.gz.

File metadata

  • Download URL: pandas-0.4.3.tar.gz
  • Upload date:
  • Size: 1.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pandas-0.4.3.tar.gz
Algorithm Hash digest
SHA256 56bc17ed08f0fb8605ba73d4f36987b719028a0972c93f40d66f6c470c5dba8e
MD5 ac862af17b569d125a896b6c9a045ccb
BLAKE2b-256 32e92b39765648a8a9a24f8aa6b7c61d320c17e3515a8856bb75d4a4f784b0e2
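
A downloaded file can be checked against a published digest with Python's standard `hashlib` module. The sketch below is one possible way to do it; the helper names `sha256_of_file` and `matches_published_digest` are hypothetical and not part of pandas or PyPI tooling, and the local file path is whatever location the file was saved to.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA-256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected_hex):
    """Compare a file's SHA-256 digest with a published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, `matches_published_digest("pandas-0.4.3.tar.gz", "<SHA256 value listed above>")` would return True only if the local copy is byte-for-byte identical to the uploaded file.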

File details

Details for the file pandas-0.4.3.win-amd64-py2.7.exe.

File hashes

Hashes for pandas-0.4.3.win-amd64-py2.7.exe
Algorithm Hash digest
SHA256 0e84f1cc0cffb205850eabb71d8fb0189a12fb44895e7be44a029401097ce978
MD5 e565629ecda723ccf9ee964a39c043c3
BLAKE2b-256 aa602a0f9d64d3d8f1dcb4913e2bd68dc733d61f113129fe993c8dea2ac341a1

File details

Details for the file pandas-0.4.3.win-amd64-py2.6.exe.

File hashes

Hashes for pandas-0.4.3.win-amd64-py2.6.exe
Algorithm Hash digest
SHA256 9faffee273c1568ca330f3c14394c805a617f6bc64c6ef130a0758b61048764c
MD5 5e9d9506c5632b752060f545628ed8dd
BLAKE2b-256 d515d10a2cae386a179278158e9e742926cd1d70ee81d9448f82ad76ba481a58

File details

Details for the file pandas-0.4.3.win32-py3.2.exe.

File hashes

Hashes for pandas-0.4.3.win32-py3.2.exe
Algorithm Hash digest
SHA256 b738a26f8e70700743f5e05f161664cea17cfec1b07078225c3c8b94ea02c24e
MD5 60aeac438448365c022374cdcd7cb960
BLAKE2b-256 4e59805b8c80bcadc8fd5bbdb8416475f9192f496c4119dbc474eb9cdaa19f0f

File details

Details for the file pandas-0.4.3.win32-py3.1.exe.

File hashes

Hashes for pandas-0.4.3.win32-py3.1.exe
Algorithm Hash digest
SHA256 1243367082cc828c7a13f33a0414868b34177a4e147322ed078223996c2c1b6e
MD5 f82af3e1223db5e5d2e92d08ee20802a
BLAKE2b-256 3e04db5998015391a2c5c2afbb4e82507462a1f78e52272995cca64aa579626b

File details

Details for the file pandas-0.4.3.win32-py2.7.exe.

File hashes

Hashes for pandas-0.4.3.win32-py2.7.exe
Algorithm Hash digest
SHA256 7101406388674d5f4db50b56612569b3abca09c4e54ed324b969fbc48117fa1c
MD5 cb46ffff718d92a767c605007a4d5f1b
BLAKE2b-256 43f5accf9d477eae64bb1d7a425d52921c00d77a3d89888e9fa7a811591b9b25

File details

Details for the file pandas-0.4.3.win32-py2.6.exe.

File hashes

Hashes for pandas-0.4.3.win32-py2.6.exe
Algorithm Hash digest
SHA256 700bdbdcd13c6c1d3959ff4a09cfe93135939c22844e90cca0272ac6859c0988
MD5 647f66219a59cce01d23b307cb67ff0f
BLAKE2b-256 e9a07102c0c2da760f465ed98d512af0f56e10b54e10a0e35537bf7f2d2f5dbe

File details

Details for the file pandas-0.4.3.win32-py2.5.exe.

File hashes

Hashes for pandas-0.4.3.win32-py2.5.exe
Algorithm Hash digest
SHA256 a2d0f2a709549699e0bae8a31cc78dd3960051394ca6a123ba408df88889b840
MD5 c38fb08dad381ee566131b25ef402ba0
BLAKE2b-256 ecaa58fd22b55df5efab6b5e7f1ec0ae0befb84c8beebdfc716fb19bcb3e30a3
