Streamline your data science setup with dsbundle in one effortless install.
Project description
dsbundle
Enhance your Python data science workflow with dsbundle, an all-in-one package that consolidates essential libraries and tools to empower users in data manipulation, visualization, statistical analysis, machine learning, and beyond. This comprehensive bundle simplifies the setup process, ensuring that crucial dependencies are readily available for immediate use.
Comprehensive Library Coverage
Data Manipulation and Analysis
- numpy: Efficient numerical computing with powerful array operations and linear algebra capabilities.
- pandas: Data structures and tools for data manipulation and analysis, ideal for handling structured data.
- polars: A fast DataFrame library in Rust, focusing on performance and ease of use for data manipulation tasks.
- xarray: N-D labeled arrays and datasets, extending pandas to support multidimensional data.
- pyarrow: Columnar data format for efficient storage and processing of large datasets.
- h5py: Interface to HDF5, a versatile file format and data model for scientific computing.
- openpyxl: Read/write Excel files in Python, useful for integrating with spreadsheet data.
Visualization and Plotting
- matplotlib: Comprehensive 2D plotting library for creating static, animated, and interactive visualizations.
- seaborn: Statistical data visualization based on matplotlib, providing a high-level interface for drawing informative statistical graphics.
- plotly: Interactive plotting library for creating web-based charts and dashboards.
- bokeh: Interactive visualization library that targets modern web browsers for presentation.
- altair: Declarative statistical visualization library for creating interactive visualizations in a concise syntax.
- plotnine: Implementation of the grammar of graphics in Python, based on ggplot2.
Machine Learning and Deep Learning
- scikit-learn: Simple and efficient tools for data mining and data analysis, including classification, regression, and clustering algorithms.
- tensorflow: End-to-end open-source platform for machine learning, with extensive support for deep learning.
- pytorch: Deep learning framework that facilitates research and production deployment with flexibility and speed.
- keras: High-level neural networks API, capable of running on top of TensorFlow, Theano, or CNTK.
- fastai: Simplified deep learning library built on top of PyTorch, focusing on usability and best practices.
Natural Language Processing
- nltk: Natural Language Toolkit for symbolic and statistical natural language processing.
- spacy: Industrial-strength natural language processing library with pre-trained models and support for over 50 languages.
- gensim: Topic modeling for human-readable data, providing efficient implementations of common algorithms like Word2Vec. NOTE: You have to install
Gensimmanually with tar.gz file, click here to download & install.
Big Data and Distributed Computing
- dask: Parallel computing library for scaling out Python computations across multiple cores and clusters.
- pyspark: Apache Spark Python API, enabling large-scale data processing with distributed computing.
- ray: Distributed computing framework that supports both task and actor models for scalable and efficient execution.
Additional Tools and Utilities
- jupyter: Interactive computing environment for creating notebooks that integrate code execution, rich text, mathematics, plots, and media.
- pytest: Framework for building simple and scalable test cases in Python.
- mkdocs: Static site generator for creating beautiful project documentation.
- streamlit: Framework for turning data scripts into shareable web apps.
- dash: Framework for building analytical web applications in Python.
- gradio: GUI platform for sharing machine learning models as web apps.
Installation and Usage
To install dsbundle and gain access to this extensive suite of libraries and tools, simply execute the following command using pip:
Copy code
pip install dsbundle
This command automates the installation process, ensuring all included libraries are installed and ready for use in your Python environment.
License
This package is licensed under the MIT License, granting users the freedom to use, modify, and distribute the software. For detailed license terms, please refer to the LICENSE file included in the repository.
This detailed description provides an extensive overview of dsbundle, highlighting its comprehensive coverage of essential libraries and tools for data science, machine learning, visualization, and beyond. It emphasizes ease of installation, robust functionality, and community-driven development, making it an invaluable resource for data scientists, researchers, and developers seeking a unified solution for Python-based data analysis and machine learning projects.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file dsbundle-1.0.tar.gz.
File metadata
- Download URL: dsbundle-1.0.tar.gz
- Upload date:
- Size: 5.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5e9839fa4b16c0b23eed2926c3e438679e9bc12d2e69595289e111e5200b051d
|
|
| MD5 |
538189c30003acc1bca6e06c72f9af75
|
|
| BLAKE2b-256 |
6fe87dd3589e73821c50612fec435e25dfa75b7032f425e9f3463e370413dc76
|
File details
Details for the file dsbundle-1.0-py3-none-any.whl.
File metadata
- Download URL: dsbundle-1.0-py3-none-any.whl
- Upload date:
- Size: 4.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1a0e43f626732b517e50e73e4d8b6fa6fe62ed75d7f2d3db4ad6cc8d40067577
|
|
| MD5 |
c5379c7a47f362b98e2bef78eb475cb8
|
|
| BLAKE2b-256 |
139df9d698188f4059d543b42c5ead77cbdf315a7c217e382b2a7e173552a463
|