Skip to main content

mindpandas platform: linux, cpu: x86_64, git version: [sha1]:e07dbf1, [branch]: (HEAD -> master, origin/master, origin/HEAD)

Project description

Welcome to MindPandas

查看中文

What Is MindPandas

Overview

MindPandas uses distributed computing engine to accelerate pandas operations, seamlessly integrated and compatible with existing pandas code. Using MindPandas for calculations can use all CPU cores on the computer, which makes MindPandas works especially well on larger datasets.

Architecture

MindPandas is implemented based on distribution, while native pandas is implemented based on single thread. This means that only one CPU core can be used at a time.

However, MindPandas can use more threads and cores on the machine, or all cores of the entire cluster.

For detailed architecture design, please refer to official website document.

Installation Methods

Confirming System Environment Information

The following table lists the environment required for installing, compiling and running MindPandas:

software version
Linux-x86_64 Ubuntu >=18.04
Euler >=2.9
Python 3.8
glibc >=2.25
  • Make sure libxml2-utils is installed in your environment.
  • Please refer to requirements for other third party dependencies.

Installing from pip command

If you use the pip, please download the whl package from MindPandas page and install it.

Installing whl package will download MindPandas dependencies automatically (detail of dependencies is shown in requirements.txt) in the networked state, and other dependencies should be installed manually.

Installing from source code

Download source code, then enter the mindpandas directory to run build.sh script.

git clone https://gitee.com/mindspore/mindpandas.git
cd mindpandas
bash build.sh

The package is in output directory after compiled and you can install with pip.

pip install output/mindpandas-0.1.0-cp38-cp38-linux_x86_64.whl

Installation Verification

Execute the following command in shell. If no No module named 'mindpandas' error is reported, the installation is successful.

python -c "import mindpandas"

Quickstart

First import MindPandas with the following command.

import mindpandas as pd

Set the running mode of MindPandas with the following command, which can speed up your MindPandas workflow.

pd.set_concurrency_mode('multithread')

The complete example is as follows:

>>> import mindpandas as pd
>>> pd.set_concurrency_mode('multithread')
>>> pd.set_partition_shape((16, 2))
>>> pd_df = pd.DataFrame([[1, 2, 3], [4, 5, 6]])
>>> sum = pd_df.sum()
>>> print(sum)
0 5
1 7
2 9
Name: sum, dtype: int64

Docs

More details about installation guide, tutorials and APIs, please see the User Documentation.

Contributing

Welcome contributions. See our Contributor Wiki for more details.

Release Notes

The release notes, see our RELEASE.

License

Apache License 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

mindpandas-0.2.0-cp39-cp39-manylinux1_x86_64.whl (94.8 MB view details)

Uploaded CPython 3.9

mindpandas-0.2.0-cp38-cp38-manylinux1_x86_64.whl (74.5 MB view details)

Uploaded CPython 3.8

File details

Details for the file mindpandas-0.2.0-cp39-cp39-manylinux1_x86_64.whl.

File metadata

  • Download URL: mindpandas-0.2.0-cp39-cp39-manylinux1_x86_64.whl
  • Upload date:
  • Size: 94.8 MB
  • Tags: CPython 3.9
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.63.0 importlib-metadata/4.11.2 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.7.1

File hashes

Hashes for mindpandas-0.2.0-cp39-cp39-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 3bc2f0d2e5f06d1fc39db998366c3342312829c28c9159f71e22b806973dbf68
MD5 61aa84989cacef35bba287799b272536
BLAKE2b-256 cc22161441994373067a2cf0f640bb6bf4e8c21fc4c51748523ac795d60204c8

See more details on using hashes here.

File details

Details for the file mindpandas-0.2.0-cp38-cp38-manylinux1_x86_64.whl.

File metadata

  • Download URL: mindpandas-0.2.0-cp38-cp38-manylinux1_x86_64.whl
  • Upload date:
  • Size: 74.5 MB
  • Tags: CPython 3.8
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.63.0 importlib-metadata/4.11.2 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.7.1

File hashes

Hashes for mindpandas-0.2.0-cp38-cp38-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 a7c24241caa91add38600cbd100954935aa40738db81f2614b35f20731468035
MD5 913066ab048a46f7ef0f8549644032a4
BLAKE2b-256 82fd9cb77d1c7e121cea2e29657f365339cbc09f1ce36e193e4e2c75e552d795

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page