Skip to main content

Distributed computing for everyone in modern Python.

Project description

achilles

Extremely accessible distributed/parallel computing (BYOB - Build Your Own Beowulf) in modern Python based on:

  • Twisted;
  • cloudpickle;
  • multiprocessing; and,
  • lessons learned from studying Hadoop, Apache Spark, Apache Arrow, Parallel Python, Ray, Jug, Pathos, Disco, SCOOP, Cluster-Workers and the questions asked regarding these projects on StackOverflow.

The goal of the achilles framework is to drastically reduce the barriers to entry for developers to use all of the computational power available on their networks and provide engingeers with an accessible, highly flexible framework for building distributed/parallel computing into their applications applications.

As achilles continues to be actively developed, the project will continue to be designed in alignment with our four demands for the project:

  • achilles must be simple to use;
  • achilles must 'just work' at all scales, from running all of the components on an individual's laptop to running on powerful clusters in distributed systems;
  • achilles must be scalable and accommodate 'big data' workloads (i.e. files too large to open in memory on a single machine);
  • achilles should be fast as hell and regularly benchmarked against alternative solutions; and,
  • achilles must be SECURE. Currently, achilles is not suitable for deployment outside of localhost. We are currently working on an SSH implementation.

Consists of:

  • achilles_server.py
    • Runs a Twisted TCP server listening at the endpoint described in the .env configuration file. Establishes and maintains persistent connections with multiple achilles_nodes (potentially running across numerous machines), distributes computations among them at the instruction of the achilles_controller, and sends the result(s) to the achilles_controller in the response_mode specified in achilles_compute().
  • achilles_node.py
    • Connect to the achilles_server and receive ID assignment. Wait for a job to be started, receive a function over the wire, receive an argument over the wire and then keep feeding the server results/receiving new arguments until all arguments are exhausted.
  • achilles_controller.py
    • Connect to the achilles_server and instruct it which function to perform against which arguments with which response_mode. Verify the job and wait for the results to be returned - achilles_server will do the rest.
    • Available commands in achilles_controller command interface:
      • achilles_compute, cluster_status, kill_cluster, help

Configuration files:

  • achilles_config.yaml
    • Use pyYaml to load the configuration file for a job into achilles_controller.py. See below for additional information on how to structure this file.
  • achilles_function.py
    • achilles_function() is cloudpickled and distributed to all connected clients as a part of the self.startJob() handshake initiated after achilles_controller.py verifies the commencement of a job.
  • .env
    • Use python-dotenv to load the local file into the system's environmental variables for configuration and authentication.
    • Basic security precaution. To-do: explore encryption.

Installation

pip install achilles

Usage

Deploy achilles server:

  1. from achilles.lineReceiver.achilles_server import genConfig, runAchillesServer
  2. genConfig() -> enter preferred settings or use default
  3. runAchillesServer() -> run an achilles_server using the specified HOST and PORT in the generated .env file

achilles is in the early stages of active development and your suggestions/contributions are kindly welcomed. achilles is written and maintained by Alejandro Peña. Email me at adpena<3gmail.com.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

achilles-0.0.14.tar.gz (15.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

achilles-0.0.14-py3-none-any.whl (25.6 kB view details)

Uploaded Python 3

File details

Details for the file achilles-0.0.14.tar.gz.

File metadata

  • Download URL: achilles-0.0.14.tar.gz
  • Upload date:
  • Size: 15.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.7.3

File hashes

Hashes for achilles-0.0.14.tar.gz
Algorithm Hash digest
SHA256 768f323efa2560b8a00bbc199c12330d21b81c5a7a18abb1dc5f55b869834ee3
MD5 d9482df66ad7526fdc2a89d683b9f69a
BLAKE2b-256 fa38a87cc1b47c333d13e8cc57741fc6216dd59e8288c764399201e3f6b32657

See more details on using hashes here.

File details

Details for the file achilles-0.0.14-py3-none-any.whl.

File metadata

  • Download URL: achilles-0.0.14-py3-none-any.whl
  • Upload date:
  • Size: 25.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.7.3

File hashes

Hashes for achilles-0.0.14-py3-none-any.whl
Algorithm Hash digest
SHA256 232e251b73c064d2e958548bfd051ba4b63d1263ac899da33971ad6039d94e4e
MD5 4d8f15eb45d4f6e48bd8161c3483fc43
BLAKE2b-256 09815f0ab577d116bd915012179f7d40ce9b32a5d962fe08381f445a50352e58

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page