Skip to main content

Powerful data structures for data analysis and statistics

Project description

pandas is a Python package providing fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python. Additionally, it has the broader goal of becoming the most powerful and flexible open source data analysis / manipulation tool available in any language. It is already well on its way toward this goal.

pandas is well suited for many different kinds of data:

  • Tabular data with heterogeneously-typed columns, as in an SQL table or Excel spreadsheet

  • Ordered and unordered (not necessarily fixed-frequency) time series data.

  • Arbitrary matrix data (homogeneously typed or heterogeneous) with row and column labels

  • Any other form of observational / statistical data sets. The data actually need not be labeled at all to be placed into a pandas data structure

The two primary data structures of pandas, Series (1-dimensional) and DataFrame (2-dimensional), handle the vast majority of typical use cases in finance, statistics, social science, and many areas of engineering. For R users, DataFrame provides everything that R’s data.frame provides and much more. pandas is built on top of NumPy and is intended to integrate well within a scientific computing environment with many other 3rd party libraries.

Here are just a few of the things that pandas does well:

  • Easy handling of missing data (represented as NaN) in floating point as well as non-floating point data

  • Size mutability: columns can be inserted and deleted from DataFrame and higher dimensional objects

  • Automatic and explicit data alignment: objects can be explicitly aligned to a set of labels, or the user can simply ignore the labels and let Series, DataFrame, etc. automatically align the data for you in computations

  • Powerful, flexible group by functionality to perform split-apply-combine operations on data sets, for both aggregating and transforming data

  • Make it easy to convert ragged, differently-indexed data in other Python and NumPy data structures into DataFrame objects

  • Intelligent label-based slicing, fancy indexing, and subsetting of large data sets

  • Intuitive merging and joining data sets

  • Flexible reshaping and pivoting of data sets

  • Hierarchical labeling of axes (possible to have multiple labels per tick)

  • Robust IO tools for loading data from flat files (CSV and delimited), Excel files, databases, and saving / loading data from the ultrafast HDF5 format

  • Time series-specific functionality: date range generation and frequency conversion, moving window statistics, moving window linear regressions, date shifting and lagging, etc.

Many of these principles are here to address the shortcomings frequently experienced using other languages / scientific research environments. For data scientists, working with data is typically divided into multiple stages: munging and cleaning data, analyzing / modeling it, then organizing the results of the analysis into a form suitable for plotting or tabular display. pandas is the ideal tool for all of these tasks.

Note

Windows binaries built against NumPy 1.6.1

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pandas-0.4.0.tar.gz (1.5 MB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

pandas-0.4.0.win-amd64-py2.7.exe (658.0 kB view details)

Uploaded Source

pandas-0.4.0.win-amd64-py2.6.exe (658.0 kB view details)

Uploaded Source

pandas-0.4.0.win32-py2.7.exe (590.0 kB view details)

Uploaded Source

pandas-0.4.0.win32-py2.6.exe (589.6 kB view details)

Uploaded Source

pandas-0.4.0.win32-py2.5.exe (455.1 kB view details)

Uploaded Source

File details

Details for the file pandas-0.4.0.tar.gz.

File metadata

  • Download URL: pandas-0.4.0.tar.gz
  • Upload date:
  • Size: 1.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pandas-0.4.0.tar.gz
Algorithm Hash digest
SHA256 6e8315e1b8096829040e16277db7b697df4bc1329d9cfdfa46c6b2210e096662
MD5 668c1147187aa41f65c07c006b006465
BLAKE2b-256 cf6bf6b7035dcb12a43ec6e8826aaa5daf327a638a553909a9eac0b26c61e9d4

See more details on using hashes here.

File details

Details for the file pandas-0.4.0.win-amd64-py2.7.exe.

File metadata

File hashes

Hashes for pandas-0.4.0.win-amd64-py2.7.exe
Algorithm Hash digest
SHA256 645eb1ee91fbb3fa3fc827b2a49b2cd8638435d3e7a93d50a2d94fce3ad16796
MD5 f49e4ad54b55cc815822dd5cb2a97896
BLAKE2b-256 88e4dbc9208fed3ccefec988339ccd0b91fec73b3069e8221aab7555bc35807c

See more details on using hashes here.

File details

Details for the file pandas-0.4.0.win-amd64-py2.6.exe.

File metadata

File hashes

Hashes for pandas-0.4.0.win-amd64-py2.6.exe
Algorithm Hash digest
SHA256 28d0757fcf58f65cf5a465f940e4c5fea5a3b0b4742a0f03b2d0113c62c0d045
MD5 fffc2dde6b6ba9e5b488a37935dbcb2a
BLAKE2b-256 a1882d2413204141ce9b2c25f179ff5e5a5fd7d27a2fa42174cd868f5d7c29be

See more details on using hashes here.

File details

Details for the file pandas-0.4.0.win32-py2.7.exe.

File metadata

File hashes

Hashes for pandas-0.4.0.win32-py2.7.exe
Algorithm Hash digest
SHA256 3ac1aff4763c194dc429f9ae1c45476ac6152de66b9945c9f431f051c4aa30b6
MD5 1eb817a2d755437dc0c6b38502ad7fc9
BLAKE2b-256 0615219c5afc2318f74b990c1a02db45b83952ada3dff5004f57b2b62e250c0b

See more details on using hashes here.

File details

Details for the file pandas-0.4.0.win32-py2.6.exe.

File metadata

File hashes

Hashes for pandas-0.4.0.win32-py2.6.exe
Algorithm Hash digest
SHA256 1bec28fc41b5598c76147355de635dbb082a2ccc634d5c03cd3ed844c38f6ea4
MD5 b2ac43ffbc1c3e29dd6700bb73b184ae
BLAKE2b-256 12e2814ce34c13c49634ebbb81f3c4a314f22413ae7e5fd7ac023434e35cc7fd

See more details on using hashes here.

File details

Details for the file pandas-0.4.0.win32-py2.5.exe.

File metadata

File hashes

Hashes for pandas-0.4.0.win32-py2.5.exe
Algorithm Hash digest
SHA256 88239a4218a25db6df1aed8c139d678df0bbb8757c021af34c3835e4992ecae8
MD5 4dc5cf23e84d3efc1a060b9a34876b79
BLAKE2b-256 328497d250bf309d2670e3504e80b2c4e978aba569ca5e9ce8c60b93ab5c8a72

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page