Kosmos - Pytorch
Project description
Kosmos2.5
My implementation of Kosmos2.5 from Microsoft research and the paper: "KOSMOS-2.5: A Multimodal Literate Model"
Appreciation
- Lucidrains
- Agorians
Install
Dataset Strategy
Here is a table summarizing the datasets used in the paper KOSMOS-2.5: A Multimodal Literate Model with metadata and source links:
| Dataset | Modality | # Samples | Domain | Source |
|---|---|---|---|---|
| IIT-CDIP | Text + Layout | 27.6M pages | Scanned documents | Link |
| arXiv papers | Text + Layout | 20.9M pages | Research papers | Link |
| PowerPoint slides | Text + Layout | 6.2M pages | Presentation slides | Web crawl |
| General PDF | Text + Layout | 155.2M pages | Diverse PDF files | Web crawl |
| Web screenshots | Text + Layout | 100M pages | Webpage screenshots | Link |
| README | Text + Markdown | 2.9M files | GitHub README files | Link |
| DOCX | Text + Markdown | 1.1M pages | WORD documents | Web crawl |
| LaTeX | Text + Markdown | 3.7M pages | Research papers | Link |
| HTML | Text + Markdown | 6.3M pages | Webpages | Link |
License
MIT
Citations
@misc{2309.11419,
Author = {Tengchao Lv and Yupan Huang and Jingye Chen and Lei Cui and Shuming Ma and Yaoyao Chang and Shaohan Huang and Wenhui Wang and Li Dong and Weiyao Luo and Shaoxiang Wu and Guoxin Wang and Cha Zhang and Furu Wei},
Title = {Kosmos-2.5: A Multimodal Literate Model},
Year = {2023},
Eprint = {arXiv:2309.11419},
}
bold italics
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
kosmos2_torch-0.0.1.tar.gz
(4.3 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file kosmos2_torch-0.0.1.tar.gz.
File metadata
- Download URL: kosmos2_torch-0.0.1.tar.gz
- Upload date:
- Size: 4.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.3.2 CPython/3.11.0 Darwin/22.4.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c8b426518c224c052a2387ee2e33c8da2ed64a4a02a7d244bf9a51d127df2ff3
|
|
| MD5 |
897ce443f60a1a0f0521ef709a6b217e
|
|
| BLAKE2b-256 |
f0e5fda311172cbc46c8c2b6ce196dc5fafd379e29185f88105e960c589ef5fb
|
File details
Details for the file kosmos2_torch-0.0.1-py3-none-any.whl.
File metadata
- Download URL: kosmos2_torch-0.0.1-py3-none-any.whl
- Upload date:
- Size: 4.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.3.2 CPython/3.11.0 Darwin/22.4.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cbabda0ddfddeef7db1370b311f4e8a7fd5adc67df53bbe8bc6faef9347be159
|
|
| MD5 |
426abd2596828d89322500c70e6f4851
|
|
| BLAKE2b-256 |
994a5f0b4cf15224cf011da1ba9dde583bbc614dbc4615af2705412f6a788321
|