Skip to main content

A framework for research code

Project description

pyfra

The Python Framework for Research Applications.

Documentation Status

Design Philosophy

Research code has some of the fastest shifting requirements of any type of code. It's nearly impossible to plan ahead of time the proper abstractions, because it is exceedingly likely that in the course of the project what you originally thought was your main focus suddenly no longer is. Further, research code (especially in ML) often involves big and complicated pipelines, typically involving many different machines, which are either run by hand or using shell scripts that are far more complicated than any shell script ever should be.

Therefore, the objective of pyfra is to make it as fast and low-friction as possible to write research code involving complex pipelines over many machines. This entails making it as easy as possible to implement a research idea in reality, at the cost of fine-grained control and the long-term maintainability of the system. In other words, pyfra expects that code will either be rapidly obsoleted by newer code, or rewritten using some other framework once it is no longer a research project and requirements have settled down.

Pyfra is in its very early stages of development. The interface may change rapidly and without warning.

Features:

  • Spin up an internal webserver complete with a permissions system using only a few lines of code
  • Extremely elegant shell integration—run commands on any server seamlessly. All the best parts of bash and python combined
  • Automated remote environment setup, so you never have to worry about provisioning machines by hand again
  • (WIP) Tools for painless functional programming in python
  • (Coming soon) High level API for experiment management/scheduling and resource provisioning
  • (Coming soon) Idempotent resumable data pipelines with no cognitive overhead

Want to dive in? See the documentation.

Example code

from pyfra import *

loc = Remote()
rem = Remote("user@example.com")
nas = Remote("user@example2.com")

@page("Run experiment", dropdowns={'server': ['local', 'remote']})
def run_experiment(server: str, config_file: str, some_numerical_value: int, some_checkbox: bool):
    r = loc if server == 'local' else rem

    env = r.env("neox", "https://github.com/EleutherAI/gpt-neox")

    # rsync as a function can do local-local, local-remote, and remote-remote
    rsync(config_file, env.file("configs/my-config.yml"))
    rsync(nas.file('some_data_file'), env.file('data/whatever'))

    return env.sh('python main.py')

@page("Write example file and copy")
def example():
    rem.file("testing.txt").fwrite("hello world")

    # tlocal files can be specified as just a string
    rsync(rem.file('testing123.txt'), 'test1.txt')
    rsync(rem.file('testing123.txt'), loc.file('test2.txt'))

    loc.sh('cat test1.txt')

    assert loc.file('test1.txt').read() == loc.file('test2.txt').read()
    assert loc.file('test1.txt').read() == rem.file('testing123.txt').read()

    # ls as a function returns a list of files (with absolute paths) on the selected remote.
    # the returned value is displayed on the webpage.
    return '\n'.join(rem.ls('/'))

@page("List files in some directory")
def list_files(directory):
    return sh(f"ls -la {directory | quote}")


# start internal webserver
webserver()

Installation

pip3 install pyfra

Webserver screenshots

image image

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyfra-0.1.3.tar.gz (516.0 kB view details)

Uploaded Source

Built Distribution

pyfra-0.1.3-py3-none-any.whl (552.4 kB view details)

Uploaded Python 3

File details

Details for the file pyfra-0.1.3.tar.gz.

File metadata

  • Download URL: pyfra-0.1.3.tar.gz
  • Upload date:
  • Size: 516.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.5.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.4

File hashes

Hashes for pyfra-0.1.3.tar.gz
Algorithm Hash digest
SHA256 266bda2762f4a0a5088f62cdb5feca6ce05d06af90785754f95fca08469e7e3e
MD5 346df7a90cf34b58011005b22294cee9
BLAKE2b-256 9ddb6657cb7a669e77037c79428d90e28ef7e15c30d7be4ed6dbb3a7807f09a8

See more details on using hashes here.

File details

Details for the file pyfra-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: pyfra-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 552.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.5.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.4

File hashes

Hashes for pyfra-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 ec645bc10d43055d5b1e06f7b0dd63065706b808c952616a8cafb01e9d05bcbc
MD5 463dc63989f8acc47ffc38fea81b25f5
BLAKE2b-256 04014829229b23fffde8baeb61d6874cf373d15752f4504646c7af0267b2bd94

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page