No project description provided
Project description
FBGEMM_GPU
FBGEMM_GPU (FBGEMM GPU Kernels Library) is a collection of high-performance PyTorch GPU operator libraries for training and inference. The library provides efficient table batched embedding bag, data layout transformation, and quantization supports.
FBGEMM_GPU is currently tested with cuda 12.1.0 and 11.8 in CI, and with PyTorch packages (2.1+) that are built against those CUDA versions.
Only Intel/AMD CPUs with AVX2 extensions are currently supported.
See our Documentation for more information.
Installation
The full installation instructions for the CUDA, ROCm, and CPU-only variants of FBGEMM_GPU can be found here. In addition, instructions for running example tests and benchmarks can be found here.
Build Instructions
This section is intended for FBGEMM_GPU developers only. The full build instructions for the CUDA, ROCm, and CPU-only variants of FBGEMM_GPU can be found here.
Join the FBGEMM_GPU Community
For questions, support, news updates, or feature requests, please feel free to:
- File a ticket in GitHub Issues
- Post a discussion in GitHub Discussions
- Reach out to us on the
#fbgemm
channel in PyTorch Slack
For contributions, please see the CONTRIBUTING
file for
ways to help out.
License
FBGEMM_GPU is BSD licensed, as found in the LICENSE
file.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
Hashes for fbgemm_gpu_cpu-0.6.0-cp312-cp312-manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3beec17be632a1374bacc58f088ae72d49592817010c2314e42b49cd13fc86ae |
|
MD5 | b3c1fc8a88cb6b8ec1f84fa963f6487d |
|
BLAKE2b-256 | 58e8a02541fb64bd20f173d0f476141212ba0182e48c7f7a750fdb53f372a0e5 |
Hashes for fbgemm_gpu_cpu-0.6.0-cp312-cp312-manylinux2014_aarch64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 203a98027df58f679076420fd964f25a400b4f7d3d4c0297fee0296d6640bc61 |
|
MD5 | 6a9493d02b2aae5b41f5fa4c9725fa34 |
|
BLAKE2b-256 | 818a1c7b4577533baedb502f944be2c9e65b202c94ef03669204f84ac4334150 |
Hashes for fbgemm_gpu_cpu-0.6.0-cp311-cp311-manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | d921ffde365ea1857f900c0e6daf4630f5cc44ed3e84239395cdee4fa5948f6b |
|
MD5 | 8c906df535f4d206208ae88e29547511 |
|
BLAKE2b-256 | 55bfaa338d2d4d8d8e3fed0204f69620dd08d76e772b0ff620ab95b04461de87 |
Hashes for fbgemm_gpu_cpu-0.6.0-cp311-cp311-manylinux2014_aarch64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 42fe0788e07715b35c93351378d05471d482b5d81001b2be543dedaa458affb4 |
|
MD5 | f1206f39e78f6c71f3c19db4c37427ab |
|
BLAKE2b-256 | 8c8fb94f6b7abb24051376e39ed13d1a7b4add418217957194a48612747667f0 |
Hashes for fbgemm_gpu_cpu-0.6.0-cp310-cp310-manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | c824d8a3f18d8dba25b9b39cddd9cb9046e43bfa136999180c1b7cbb11695558 |
|
MD5 | 45e3830441c0d1d4fe977767e946fc18 |
|
BLAKE2b-256 | a63867e6421d339dd169c36fbb4323f897fa5d286a5be97783d4f3cf446777ef |
Hashes for fbgemm_gpu_cpu-0.6.0-cp310-cp310-manylinux2014_aarch64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b78601f57b24d7e62c24030b1662f0df5569b933eb471cbe966822a59d03dbbe |
|
MD5 | 8242f59da42b63b05443f63517e54c5f |
|
BLAKE2b-256 | 4810300aa05a92d8e9e7a745cf47838ea418b7f95a1a2371605a5b436b3bd45c |
Hashes for fbgemm_gpu_cpu-0.6.0-cp39-cp39-manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3319422ed7d33ea592631ad8e7e9a78989bdcf267ecb1f2c58f9939092bd0b23 |
|
MD5 | 6858691336dd98142402014e05fce7b6 |
|
BLAKE2b-256 | 814421b710b99702c5bd8a5ccad2fa9aa2381eaaa22c4d4f688f1965e1789b8d |
Hashes for fbgemm_gpu_cpu-0.6.0-cp39-cp39-manylinux2014_aarch64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 900a6fba6feeaa8dda1f91f58e745d324f36b9d72b68bf346472cbdc98d5e2f2 |
|
MD5 | 0cd40143729498226f84c86c4e99048b |
|
BLAKE2b-256 | 7821f062d613c67830f9b881fc9793833e0bbee6c3b2eda7bc63be4b71ddde8e |
Hashes for fbgemm_gpu_cpu-0.6.0-cp38-cp38-manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a1b14a4d8022d36279a7fc0d0ab7ad559567dad38f5035dcc3bb9c2aca02f8ad |
|
MD5 | 759328d7a454d6a29e591bb212cf689d |
|
BLAKE2b-256 | 4ff7157b11a1f6fda85b544fb4752976360d0b778d99a055d7933f592fb7038b |
Hashes for fbgemm_gpu_cpu-0.6.0-cp38-cp38-manylinux2014_aarch64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 857879281d90d2868c0ea771047cb88862db3f0b446646001a1ea65ead2dde73 |
|
MD5 | 82a367b85f1f3b4332ede440d10cf200 |
|
BLAKE2b-256 | c7ee88b253dad59b5d3480dc06de936bce460aba4d33818e69ad7c45f025f570 |