Beam Datascience package

These details have not been verified by PyPI

Project links

Project description

BeamDS (Beam Data Science)

What is Beam for? ✨

Beam was created by data-science practitioners for data-science practitioners. It is designed as an ecosystem for developing and deploying data-driven algorithms in Python. It aims to increase productivity, efficiency, and performance in the research phase and to provide production-grade tools in the deployment part.

Our Guiding Principles ✍

Support all phases of data-driven algorithm development:
1. Data exploration
2. Data manipulation, preprocessing, and ETLs (Extract, Transform and Load)
3. Algorithm selection
4. Algorithm training
5. Hyperparameter tuning
6. Model deployment
7. Lifelong learning
Production level coding from the first line of code: no more quick and dirty Proof Of Concepts (POC). Every line of code counts toward a production model.
Consume effectively all resources: use multi-core, multi-GPUs, distributed computing, remote storage solutions, and databases to enable as much as possible productivity by the resources at hand.
Be agile: Development and production environments can change rapidly. Beam minimizes the friction of changing environments, filesystems, and computing resources to almost zero.
Be efficient: every line of code in Beam is optimized to be as efficient as possible and to avoid unnecessary overheads.
Easy to deploy and use algorithms: make deployment as easy as a line of code, import remote algorithms and services by their URI, and no more.
Excel your algorithms: Beam comes with some state-of-the-art deep neural network implementations. Beam will help you store, analyze, and return to your running experiments with ease. When you are done, with development, beam will help you optimize your hyperparameters on your GPU machines.
Data can be a hassle: beam can manipulate complex and nested data structures, including reading, processing, chunking, multi-processing, error handling, and writing.
Be relevant: Beam is committed to staying relevant and updating towards the future of AI, adding support for Large Language Models (LLMs) and more advanced algorithms.
Beam is the Swiss army knife that gets into your pocket: it is easy to install and maintain and it comes with the Beam Docker Image s.t. you can start developing and creating with zero effort even without an internet connection.

Installation 🧷

To install the full package from PyPi use:

pip install beam-ds[all]

If you want to install only the data-science related components use:

pip install beam-ds[ds]

To install only the LLM (Large Language Model) related components use:

pip install beam-ds[llm]

The prerequisite packages will be installed automatically, they can be found in the setup.cfg file.

Build from source 🚂

This BeamDS implementation follows the guide at https://packaging.python.org/tutorials/packaging-projects/

install the build package:

python -m pip install --upgrade build

to reinstall the package after updates use:

Now run this command from the same directory where pyproject.toml is located:

python -m build

reinstall the package with pip:

pip install dist/*.whl --force-reinstall

Getting Started 🚀

There are several examples both in .py files (in the examples folder) and in jupyter notebooks (in the notebooks folder). Specifically, you can start by looking into the beam_resources.ipynb notebook which makes you familiar the different resources available in Beam.

Go To the beam_resource.ipynb page

The Beam-DS Docker Image 🛸

We provide a Docker Image which contains all the necessary packages to run Beam-DS as well as many other data-science related packages which are useful for data-science development. We use it as our base image in our daily development process. It is based on the official NVIDIA PyTorch image.

To pull the image from Docker Hub use:

docker pull eladsar/beam:20240708

Building the Beam-DS docker image from source 🌱

The docker image is based on the latest official NVIDIA pytorch image. To build the docker image from Ubuntu host, you need to:

update nvidia drivers to the latest version: https://linuxconfig.org/how-to-install-the-nvidia-drivers-on-ubuntu-20-04-focal-fossa-linux
install docker: https://docs.docker.com/desktop/linux/install/ubuntu/
Install NVIDIA container toolkit: https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html#install-guide
Install and configure NVIDIA container runtime: https://stackoverflow.com/a/61737404

Build the sphinx documentation

Follow https://github.com/cimarieta/sphinx-autodoc-example

Profiling your code with Scalene

Scalene is a high-performance python profiler that supports GPU profiling. To analyze your code with Scalene use the following arguments:

scalene --reduced-profile --outfile OUTFILE.html --html --- your_prog.py <your additional arguments>

Uploading the package to PyPi 🌏

Install twine:

python -m pip install --user --upgrade twine

Build the package:

python -m build

Upload the package:

python -m twine upload --repository pypi dist/*

Upload the package with poetry

# poetry config pypi-token.pypi YOUR_PYPI_API_TOKEN
bash build.sh
bash init.sh
poetry lock
poetry publish --build

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

2.8.3

Aug 3, 2025

2.8.2

Jul 17, 2025

2.8.1rc2 pre-release

Jun 26, 2025

2.8.1rc0 pre-release

Jun 25, 2025

2.8.0rc0 pre-release

May 28, 2025

2.8.0b0 pre-release

May 28, 2025

2.7.7

Feb 9, 2025

2.7.6

Feb 6, 2025

2.7.5

Feb 3, 2025

2.7.2

Nov 14, 2024

2.7.0

Oct 7, 2024

2.6.7

Aug 1, 2024

2.6.6

Aug 1, 2024

2.6.4

Jul 18, 2024

2.6.4b1 pre-release

Jul 18, 2024

2.6.3

Jul 16, 2024

2.6.3b0 pre-release

Jul 16, 2024

2.6.2

Jul 16, 2024

2.6.1b0 pre-release

Jul 15, 2024

2.6.0

Jul 7, 2024

2.5.11

Jun 19, 2024

2.5.9

Jun 13, 2024

2.5.8b1 pre-release

Jun 6, 2024

2.5.7

Jun 4, 2024

2.5.4

May 27, 2024

2.5.0b0 pre-release

Mar 31, 2024

2.4.9b0 pre-release

Mar 19, 2024

2.4.0b1 pre-release

Jan 6, 2024

2.3.8 yanked

Dec 27, 2023

Reason this release was yanked:

contains import error

2.3.3

Nov 16, 2023

2.3.2

Nov 14, 2023

2.3.0

Oct 26, 2023

2.2.0a1 pre-release

Sep 6, 2023

2.0.3

May 9, 2023

2.0.2

Apr 16, 2023

2.0.1

Apr 16, 2023

2.0.0

Apr 16, 2023

0.2.1

Nov 22, 2022

0.2.0

Sep 29, 2022

0.1.1

Sep 5, 2022

0.0.10

Jun 18, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

beam_ds-2.8.3.tar.gz (569.0 kB view details)

Uploaded Aug 3, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

beam_ds-2.8.3-py3-none-any.whl (652.4 kB view details)

Uploaded Aug 3, 2025 Python 3

File details

Details for the file beam_ds-2.8.3.tar.gz.

File metadata

Download URL: beam_ds-2.8.3.tar.gz
Upload date: Aug 3, 2025
Size: 569.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.1.1 CPython/3.13.5 Darwin/24.5.0

File hashes

Hashes for beam_ds-2.8.3.tar.gz
Algorithm	Hash digest
SHA256	`c011ab887c4d5da1d9b2b78cdbe4bcdbe41c263ad25c778595a3cde888507875`
MD5	`52d1b3d6f6d2f044db904595724bc425`
BLAKE2b-256	`1cd975f003bb01ad4ffd0e02d7c497d45f12ee56de96806aa79172533ea3e98a`

See more details on using hashes here.

File details

Details for the file beam_ds-2.8.3-py3-none-any.whl.

File metadata

Download URL: beam_ds-2.8.3-py3-none-any.whl
Upload date: Aug 3, 2025
Size: 652.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.1.1 CPython/3.13.5 Darwin/24.5.0

File hashes

Hashes for beam_ds-2.8.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6a07ff71a3765ce9e7773562f4c5eb04d779a3100f2ad6025e3dafac25ca85bd`
MD5	`ac3acec5fbffcd45cbb245a2f410459a`
BLAKE2b-256	`f36ae5f623b0d64da49f98095de6fbfe6b3b4e65339a10592b920c567c56ae23`

See more details on using hashes here.

beam-ds 2.8.3

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

BeamDS (Beam Data Science)

What is Beam for? ✨

Our Guiding Principles ✍

Installation 🧷

Build from source 🚂

Getting Started 🚀

The Beam-DS Docker Image 🛸

Building the Beam-DS docker image from source 🌱

Build the sphinx documentation

Profiling your code with Scalene

Uploading the package to PyPi 🌏

Upload the package with poetry

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes