calculate-flops.pytorch
This tool (calflops) is designed to compute the theoretical number of FLOPs (floating-point operations), MACs (multiply-accumulate operations) and parameters in various neural networks such as Linear, CNN, RNN, GCN and Transformer models (BERT, LLaMA and other large language models), including any custom model built with torch.nn.functional.*, as long as it is implemented in PyTorch.
The implementation of this package was inspired by the ptflops and deepspeed libraries, and I am very grateful for their efforts; both are excellent works. This package also improves on them in some aspects (simpler usage and broader model support).
Install the latest version
From PyPI:
pip install calflops
Example
from calflops import calculate_flops
# Deep Learning Model, such as alexnet.
from torchvision import models
model = models.alexnet()
batch_size = 1
flops, macs, params = calculate_flops(model=model,
                                      input_shape=(batch_size, 3, 224, 224),
                                      print_results=False)
print("alexnet FLOPs:%s MACs:%s Params:%s \n" %(flops, macs, params))
# alexnet FLOPs:1.43 GFLOPS MACs:714.188 MMACs Params:61.101 M
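
# The same call also works for models you define yourself. The toy module below
# is purely illustrative (it is not part of calflops or of the examples above);
# it only shows that any torch.nn.Module implemented in PyTorch can be measured.
import torch
import torch.nn as nn

class TinyNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 16, kernel_size=3, padding=1)
        self.pool = nn.AdaptiveAvgPool2d(1)
        self.fc = nn.Linear(16, 10)

    def forward(self, x):
        x = self.pool(torch.relu(self.conv(x))).flatten(1)
        return self.fc(x)

flops, macs, params = calculate_flops(model=TinyNet(),
                                      input_shape=(1, 3, 224, 224),
                                      print_results=False)
print("TinyNet FLOPs:%s MACs:%s Params:%s \n" % (flops, macs, params))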
# Transformers Model, such as bert.
from transformers import AutoModel
from transformers import AutoTokenizer
batch_size = 1
max_seq_length = 128
model_name = "hfl/chinese-roberta-wwm-ext/"
model_save = "../pretrain_models/" + model_name
model = AutoModel.from_pretrained(model_save)
tokenizer = AutoTokenizer.from_pretrained(model_save)
flops, macs, params = calculate_flops(model=model,
                                      input_shape=(batch_size, max_seq_length),
                                      transformer_tokenizer=tokenizer,
                                      print_results=False)
print("bert(hfl/chinese-roberta-wwm-ext) FLOPs:%s MACs:%s Params:%s \n" %(flops, macs, params))
#bert(hfl/chinese-roberta-wwm-ext) FLOPs:22.363 GFLOPS MACs:11.174 GMACs Params:102.268 M
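
# Instead of letting calflops build inputs from input_shape and the tokenizer,
# you can also tokenize a sample yourself and pass the resulting tensors
# directly. Note: the kwargs argument used below is an assumption about the
# calculate_flops signature; check the calflops documentation of your installed
# version before relying on it.
text = "A sample sentence for counting FLOPs."
inputs = tokenizer(text,
                   padding="max_length",
                   truncation=True,
                   max_length=max_seq_length,
                   return_tensors="pt")
flops, macs, params = calculate_flops(model=model,
                                      kwargs=dict(inputs),
                                      print_results=False)
print("bert(hfl/chinese-roberta-wwm-ext) FLOPs:%s MACs:%s Params:%s \n" % (flops, macs, params))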
# Large Language Model, such as llama-7b.
from transformers import LlamaTokenizer
from transformers import LlamaForCausalLM
batch_size = 1
max_seq_length = 128
model_name = "original_llama2_hf_7B"
model_save = "../model/" + model_name
model = LlamaForCausalLM.from_pretrained(model_save)
tokenizer = LlamaTokenizer.from_pretrained(model_save)
flops, macs, params = calculate_flops(model=model,
                                      input_shape=(batch_size, max_seq_length),
                                      transformer_tokenizer=tokenizer,
                                      print_results=False)
print("llama2(7B) FLOPs:%s MACs:%s Params:%s \n" %(flops, macs, params))
#llama2(7B) FLOPs:1.7 TFLOPS MACs:850.001 GMACs Params:6.738 B
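
As a rough sanity check on the llama2 numbers above (a back-of-the-envelope estimate of mine, not output from calflops): one MAC counts as two floating-point operations, and for a decoder-only LLM the forward pass costs roughly 2 × parameters FLOPs per token.

params = 6.738e9    # parameters reported above
macs = 850.001e9    # MACs reported above
tokens = 1 * 128    # batch_size * max_seq_length

print(macs * 2 / 1e12)             # ~1.70 TFLOPs, matches the reported FLOPs
print(2 * params * tokens / 1e12)  # ~1.72 TFLOPs, close to the measured value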
Contact Author
Author: MrYXJ
Mail: code_job@163.com