Kosmos - PyTorch
Kosmos2.5
My implementation of Kosmos-2.5 from Microsoft Research, as described in the paper "KOSMOS-2.5: A Multimodal Literate Model".
Appreciation
- Lucidrains
- Agorians
Install
Dataset Strategy
The table below summarizes the datasets used in the paper "KOSMOS-2.5: A Multimodal Literate Model", with each dataset's modality, size, domain, and source:
Dataset | Modality | # Samples | Domain | Source |
---|---|---|---|---|
IIT-CDIP | Text + Layout | 27.6M pages | Scanned documents | Link |
arXiv papers | Text + Layout | 20.9M pages | Research papers | Link |
PowerPoint slides | Text + Layout | 6.2M pages | Presentation slides | Web crawl |
General PDF | Text + Layout | 155.2M pages | Diverse PDF files | Web crawl |
Web screenshots | Text + Layout | 100M pages | Webpage screenshots | Link |
README | Text + Markdown | 2.9M files | GitHub README files | Link |
DOCX | Text + Markdown | 1.1M pages | Word documents | Web crawl |
LaTeX | Text + Markdown | 3.7M pages | Research papers | Link |
HTML | Text + Markdown | 6.3M pages | Webpages | Link |
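As a rough illustration of how the table above could be turned into a data-mixing plan, here is a minimal sketch that encodes each row as a record and computes per-modality totals and size-proportional sampling weights. The names `DatasetSpec`, `DATASETS`, `totals_by_modality`, and `sampling_weights` are hypothetical and not part of this package, and size-proportional sampling is an assumption made for the example, not necessarily the mixing strategy used in the paper.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class DatasetSpec:
    """One row of the dataset table (hypothetical helper, not part of the package)."""
    name: str
    modality: str      # "Text + Layout" or "Text + Markdown"
    samples_m: float   # size in millions, as listed in the table
    unit: str          # "pages" or "files"


# Values copied from the table above.
DATASETS = [
    DatasetSpec("IIT-CDIP", "Text + Layout", 27.6, "pages"),
    DatasetSpec("arXiv papers", "Text + Layout", 20.9, "pages"),
    DatasetSpec("PowerPoint slides", "Text + Layout", 6.2, "pages"),
    DatasetSpec("General PDF", "Text + Layout", 155.2, "pages"),
    DatasetSpec("Web screenshots", "Text + Layout", 100.0, "pages"),
    DatasetSpec("README", "Text + Markdown", 2.9, "files"),
    DatasetSpec("DOCX", "Text + Markdown", 1.1, "pages"),
    DatasetSpec("LaTeX", "Text + Markdown", 3.7, "pages"),
    DatasetSpec("HTML", "Text + Markdown", 6.3, "pages"),
]


def totals_by_modality(datasets):
    """Sum the listed sizes (in millions) for each modality."""
    totals = defaultdict(float)
    for d in datasets:
        totals[d.modality] += d.samples_m
    return dict(totals)


def sampling_weights(datasets):
    """Assumed strategy: sample each dataset in proportion to its listed size."""
    total = sum(d.samples_m for d in datasets)
    return {d.name: d.samples_m / total for d in datasets}


if __name__ == "__main__":
    for modality, total in totals_by_modality(DATASETS).items():
        print(f"{modality}: {total:.1f}M samples")
    for name, weight in sampling_weights(DATASETS).items():
        print(f"{name}: {weight:.3f}")
```

With size-proportional weights the layout datasets dominate the mix, so a real training run might up-weight the smaller markdown datasets; that choice is left open here.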
License
MIT
Citations
@misc{2309.11419,
Author = {Tengchao Lv and Yupan Huang and Jingye Chen and Lei Cui and Shuming Ma and Yaoyao Chang and Shaohan Huang and Wenhui Wang and Li Dong and Weiyao Luo and Shaoxiang Wu and Guoxin Wang and Cha Zhang and Furu Wei},
Title = {Kosmos-2.5: A Multimodal Literate Model},
Year = {2023},
Eprint = {arXiv:2309.11419},
}