gpt4o - PyTorch
Project description
GPT4o
Community Open Source Implementation of GPT4o in PyTorch
Install
Architecture
- TikToken tokenizer: the tokenizer is the one component we know for sure. It is published in OpenAI's tiktoken library.
- The model understands images and audio natively. There are two approaches: process the raw modalities natively, or use a separate encoder for each. I think they're using encoders such as Whisper and ViT here, for simplicity and brevity.
- DALL-E 3 as the output head to generate images
- Tokens to denote when to generate an image or audio
- Whisper output head for the audio outputs
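The encoder-based approach in the list above can be sketched in PyTorch roughly as follows. This is a minimal illustration under stated assumptions, not the package's actual code: the class name `MultimodalPrefixLM`, the function `select_output_head`, the special token ids, and all dimensions are hypothetical, and single linear layers stand in for the ViT and Whisper encoders.

```python
import torch
from torch import nn

# Hypothetical special token ids that signal a switch of output modality.
IMAGE_TOKEN_ID = 998
AUDIO_TOKEN_ID = 999


class MultimodalPrefixLM(nn.Module):
    """Toy decoder over [image features, audio features, text tokens].

    Assumption: image and audio inputs arrive as precomputed feature
    sequences. A real system would produce them with ViT and Whisper encoders.
    """

    def __init__(self, vocab_size=1000, dim=64, image_feat_dim=32, audio_feat_dim=16):
        super().__init__()
        self.token_emb = nn.Embedding(vocab_size, dim)
        # Stand-ins for pretrained encoders: project features to the model width.
        self.image_proj = nn.Linear(image_feat_dim, dim)
        self.audio_proj = nn.Linear(audio_feat_dim, dim)
        layer = nn.TransformerEncoderLayer(d_model=dim, nhead=4, batch_first=True)
        self.backbone = nn.TransformerEncoder(layer, num_layers=2)
        self.lm_head = nn.Linear(dim, vocab_size)

    def forward(self, text_ids, image_feats, audio_feats):
        text = self.token_emb(text_ids)        # (batch, T, dim)
        image = self.image_proj(image_feats)   # (batch, I, dim)
        audio = self.audio_proj(audio_feats)   # (batch, A, dim)
        # Image and audio embeddings act as a prefix before the text tokens.
        x = torch.cat([image, audio, text], dim=1)
        seq_len = x.size(1)
        # Causal mask: True marks positions a token may NOT attend to.
        mask = torch.triu(
            torch.ones(seq_len, seq_len, dtype=torch.bool, device=x.device),
            diagonal=1,
        )
        hidden = self.backbone(x, mask=mask)
        # Return next-token logits for the text positions only.
        return self.lm_head(hidden[:, -text_ids.size(1):])


def select_output_head(token_id: int) -> str:
    """Pick which output head should handle what follows a generated token."""
    if token_id == IMAGE_TOKEN_ID:
        return "image"
    if token_id == AUDIO_TOKEN_ID:
        return "audio"
    return "text"


model = MultimodalPrefixLM()
logits = model(
    torch.randint(0, 1000, (2, 5)),  # text token ids
    torch.randn(2, 3, 32),           # 3 image feature vectors per sample
    torch.randn(2, 4, 16),           # 4 audio feature vectors per sample
)
```

Here the special tokens only name the head to dispatch to. Wiring an image generator or an audio decoder behind those names is left out, since the notes above do not specify how the heads are connected.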
License
MIT
Download files
Source Distribution
gpt4o-0.0.1.tar.gz (3.1 kB)
Built Distribution
gpt4o-0.0.1-py3-none-any.whl (3.1 kB)