mindpandas platform: linux, cpu: x86_64, git version: [sha1]:e07dbf1, [branch]: (HEAD -> master, origin/master, origin/HEAD)
Project description
Welcome to MindPandas
- What Is MindPandas
- Installation Methods
- Installation Verification
- Quickstart
- Docs
- Contributing
- Release Notes
- License
What Is MindPandas
Overview
MindPandas uses distributed computing engine to accelerate pandas operations, seamlessly integrated and compatible with existing pandas code. Using MindPandas for calculations can use all CPU cores on the computer, which makes MindPandas works especially well on larger datasets.
Architecture
MindPandas is implemented based on distribution, while native pandas is implemented based on single thread. This means that only one CPU core can be used at a time.
However, MindPandas can use more threads and cores on the machine, or all cores of the entire cluster.
For detailed architecture design, please refer to official website document.
Installation Methods
Confirming System Environment Information
The following table lists the environment required for installing, compiling and running MindPandas:
software | version |
---|---|
Linux-x86_64 | Ubuntu >=18.04 Euler >=2.9 |
Python | 3.8 |
glibc | >=2.25 |
- Make sure libxml2-utils is installed in your environment.
- Please refer to requirements for other third party dependencies.
Installing from pip command
If you use the pip, please download the whl package from MindPandas page and install it.
Installing whl package will download MindPandas dependencies automatically (detail of dependencies is shown in requirements.txt) in the networked state, and other dependencies should be installed manually.
Installing from source code
Download source code, then enter the mindpandas
directory to run build.sh script.
git clone https://gitee.com/mindspore/mindpandas.git
cd mindpandas
bash build.sh
The package is in output directory after compiled and you can install with pip.
pip install output/mindpandas-0.1.0-cp38-cp38-linux_x86_64.whl
Installation Verification
Execute the following command in shell. If no No module named 'mindpandas'
error is reported, the installation is successful.
python -c "import mindpandas"
Quickstart
First import MindPandas with the following command.
import mindpandas as pd
Set the running mode of MindPandas with the following command, which can speed up your MindPandas workflow.
pd.set_concurrency_mode('multithread')
The complete example is as follows:
>>> import mindpandas as pd
>>> pd.set_concurrency_mode('multithread')
>>> pd.set_partition_shape((16, 2))
>>> pd_df = pd.DataFrame([[1, 2, 3], [4, 5, 6]])
>>> sum = pd_df.sum()
>>> print(sum)
0 5
1 7
2 9
Name: sum, dtype: int64
Docs
More details about installation guide, tutorials and APIs, please see the User Documentation.
Contributing
Welcome contributions. See our Contributor Wiki for more details.
Release Notes
The release notes, see our RELEASE.
License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
File details
Details for the file mindpandas-0.2.0-cp39-cp39-manylinux1_x86_64.whl
.
File metadata
- Download URL: mindpandas-0.2.0-cp39-cp39-manylinux1_x86_64.whl
- Upload date:
- Size: 94.8 MB
- Tags: CPython 3.9
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.63.0 importlib-metadata/4.11.2 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.7.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3bc2f0d2e5f06d1fc39db998366c3342312829c28c9159f71e22b806973dbf68 |
|
MD5 | 61aa84989cacef35bba287799b272536 |
|
BLAKE2b-256 | cc22161441994373067a2cf0f640bb6bf4e8c21fc4c51748523ac795d60204c8 |
File details
Details for the file mindpandas-0.2.0-cp38-cp38-manylinux1_x86_64.whl
.
File metadata
- Download URL: mindpandas-0.2.0-cp38-cp38-manylinux1_x86_64.whl
- Upload date:
- Size: 74.5 MB
- Tags: CPython 3.8
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.63.0 importlib-metadata/4.11.2 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.7.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a7c24241caa91add38600cbd100954935aa40738db81f2614b35f20731468035 |
|
MD5 | 913066ab048a46f7ef0f8549644032a4 |
|
BLAKE2b-256 | 82fd9cb77d1c7e121cea2e29657f365339cbc09f1ce36e193e4e2c75e552d795 |