Additional code for Stable-baselines3 to load and upload models from the Hub.
Project description
Hugging Face x Stable-baselines3
A library to load and upload Stable-baselines3 models from the Hub.
Installation
With pip
pip install huggingface-sb3
Examples
[Todo: add colab tutorial]
Case 1: I want to download a model from the Hub
import gym
from huggingface_sb3 import load_from_hub
from stable_baselines3 import PPO
from stable_baselines3.common.evaluation import evaluate_policy
# Retrieve the model from the hub
## repo_id = id of the model repository from the Hugging Face Hub (repo_id = {organization}/{repo_name})
## filename = name of the model zip file from the repository
checkpoint = load_from_hub(repo_id="ThomasSimonini/stable-baselines3-ppo-CartPole-v1", filename="CartPole-v1")
model = PPO.load(checkpoint)
# Evaluate the agent
eval_env = gym.make('CartPole-v1')
mean_reward, std_reward = evaluate_policy(model, eval_env, n_eval_episodes=10, deterministic=True)
print(f"mean_reward={mean_reward:.2f} +/- {std_reward}")
# Watch the agent play
obs = eval_env.reset()
for i in range(1000):
action, _state = model.predict(obs)
obs, reward, done, info = eval_env.step(action)
#eval_env.render()
if done:
obs = eval_env.reset()
eval_env.close()
Case 2: I trained an agent and want to upload it to the Hub
First you need to be logged in to Hugging Face:
- If you're using Colab/Jupyter Notebooks:
from huggingface_hub import notebook_login
notebook_login()
- Else:
huggingface-cli login
Then:
import gym
from huggingface_sb3 import push_to_hub
from stable_baselines3 import PPO
# Create the environment
env = gym.make('CartPole-v1')
# Define a PPO MLpPolicy architecture
model = PPO('MlpPolicy', env, verbose=1)
# Train it for 10000 timesteps
model.learn(total_timesteps=10000)
# Save the model
model.save("CartPole-v1")
# Push this saved model to the hf repo
# If this repo does not exists it will be created
## repo_id = id of the model repository from the Hugging Face Hub (repo_id = {organization}/{repo_name})
## filename: the name of the file == "name" inside model.save("CartPole-v1")
push_to_hub(repo_id = "ThomasSimonini/test-CartPole-v1",
filename = "CartPole-v1",
commit_message = "Added Cartpole-v1 trained model")
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
huggingface_sb3-1.0.3.tar.gz
(3.8 kB
view hashes)
Built Distribution
Close
Hashes for huggingface_sb3-1.0.3-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 86181c18ebe217d26eb90c95fb5c1520112b5d72d222624949e974672c628982 |
|
MD5 | 26302564886a4207e2b972babfe977e7 |
|
BLAKE2b-256 | 635fdff641516d43f7146fc595460c18b13373c8160b9ed2353545444059d211 |