Client library to download and publish environments, agents on the huggingbutt.com hub

These details have not been verified by PyPI

Project links

Project description

HuggingButt

Client library to download and publish reinforcement learning environments/agents on the huggingbutt.com hub

Installation

The base dependency mlagents-envs has been update to version 1.0.0. However, this version may cause error(on Windows 10). This is due to the package specifying numpy version 1.21.2, which might fail to build on Python 3.10 (Windows 10). I have had two colleagues test it and they encountered the same issues. So mlagents-envs==0.30.0, NumPy==1.21.2 in this project. MacOS is OK.

Create a new python environment using anaconda/miniconda. Only Python 3.9.

conda create -n hb python==3.9.18

activate the new python environment.

conda activate hb

install huggingbutt from pypi

pip install huggingbutt==0.0.5

or from source code

git clone https://github.com/huggingbutt/huggingbutt.git
cd huggingbutt
python -m pip install .

If there is no error message printed during the installation, congratulations, you have successfully installed this package. Next, you need to apply an access token from the official website https://huggingbutt.com.

Click new token button generate a new token. This access token is mainly used to restrict the download times of each user, as the server cost is relatively high.

Congratulations, you now have an access token!

Just put the generated token in the task code and you're gooooood to go.

Here is a simple training code:

from huggingbutt import Env, Agent, set_access_token

if __name__ == '__main__':
    set_access_token('YOUR_TOKEN')
    env = Env.get("huggingbutt/juggle", 'mac', startup_args=['--time_scale', '10'])

    agent = Agent(
        env=env,
        algorithm='PPO',
        policy='MlpPolicy',
        batch_size=64
    )
    agent.learn(total_timesteps=10000)
    agent.save()
    env.close()

Inference:

from huggingbutt import Env, Agent, set_access_token

if __name__ == '__main__':
    set_access_token('YOUR_TOKEN')
    env = Env.get("huggingbutt/juggle", 'mac', startup_args=['--time_scale', '1'])
    agent = Agent.get(15, env)

    obs = env.reset()
    for i in range(100):
        act, _status_ = agent.predict(obs)
        obs, reward, done, info = env.step(act)
        if done:
            obs = env.reset()
    env.close()

todo

Support more types learning environment, such as native game wrapped by python, pygame, class gym...
Develop a framework, user can customize the observation, action and reward of the environment developed under this framework. We hope this makes it easier for everyone to iterate the environment's agent.
There are still many ideas, I will add them later when I think about them...

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.0.5

Oct 20, 2023

This version

0.0.5b0 pre-release

Oct 19, 2023

0.0.5a0 pre-release

Oct 19, 2023

0.0.4

Oct 17, 2023

0.0.4b0 pre-release

Oct 16, 2023

0.0.4a0 pre-release

Oct 16, 2023

0.0.3

Oct 12, 2023

0.0.3a0 pre-release

Oct 12, 2023

0.0.2

Sep 23, 2023

0.0.2a0 pre-release

Sep 23, 2023

0.0.1

Aug 24, 2023

0.0.1rc2 pre-release

Aug 24, 2023

0.0.1rc0 pre-release

Aug 24, 2023

0.0.1b0 pre-release

Aug 24, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

huggingbutt-0.0.5b0.tar.gz (16.9 kB view hashes)

Uploaded Oct 19, 2023 Source

Built Distribution

huggingbutt-0.0.5b0-py3-none-any.whl (18.2 kB view hashes)

Uploaded Oct 19, 2023 Python 3

Hashes for huggingbutt-0.0.5b0.tar.gz

Hashes for huggingbutt-0.0.5b0.tar.gz
Algorithm	Hash digest
SHA256	`9c99662c4f24aea70098fbee7f936fb2aa3d3912c3d2e4cdf286e02b8266eede`
MD5	`0d9ff413a260496e79097908c7f12c1d`
BLAKE2b-256	`0afbf506d6de84ccbfd8a5b25f9ace75ccf37a29e863544b54341698f505bea5`

Hashes for huggingbutt-0.0.5b0-py3-none-any.whl

Hashes for huggingbutt-0.0.5b0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8901e8a9465cb2be824d9b9977dec55750ad0fed184ecd9f31d1f179d9743e4e`
MD5	`8b730b9d9338269baacb02c6263cf718`
BLAKE2b-256	`55ad17e6840afc6a930963338fcaac5f966077290af8e3e81ef274627ef8a72d`