Skip to main content

a tool to fix and simplify bib automatically.

Project description

SimBiber: A tool for simplifying bibtex with official info.

version Status-building PRs-Welcome stars FORK Issues


Motivation

We often need to simplify the official bib that consists of many information into a shorter version that only maintains necessary information (e.g., author, title, conference/journal name and etc) due to page limitation.

We introduce SimBiber, a simple tool in Python to simplify them automatically. Hope it's helpful for you.

We also highly recommend another wonderful tool for you Rebiber, which is a tool for normalizing bibtex with official info.

Tips: If you use first Rebiber and then Simbiber, you can get a better experience.

Disclaimer

SimBiber is a fairly new project and it is under active development. We hope that it will be quite useful in a variety of cases, but there is no guarantee that the results it produces will necessarily be strictly compliant with the official specification.

So you'd better check the accuracy of simplified bib files again.

All icons are collected from the Internet, if there is any infringement, please contact us to delete.

Changelog

  • 2023.02.33
    • Fix some bugs about with -keep parameter.
  • 2021.05.02
    • Fix some bugs about without -keep parameter.
  • 2021.05.01
    • Support to customize the keys you want to reserve.
  • 2021.04.23
    • Support IJCAI (Survey Track).
    • Unified README.
  • 2021.04.11
    • Support to pip install.
    • Simplify input args.
    • Add disclaimer.
  • 2021.03.02
    • Fix some bugs if remove duplications.
  • 2021.02.15
    • Fix a bug simplify ACL (like EACL) conference to ACL.
    • Support ACL Findings and EMNLP findings.
  • 2021.01.21
    • Support to remove duplication if your bib has some bibitems with same title. (automatically choose Conference citation)
    • Fix some bugs about some conferences.
    • Add more categories of conferences. (now support 113 conferences)
  • 2021.01.11
    • Fix a bug if output path is the same as input path.
    • Support to remove duplication if your bib has both of arXiv or Conference citation.
    • Support to simplify files by folder.
    • Support to use default output path.
    • Add more categories of conferences. (now support 112 conferences)
  • 2021.01.08 We fix a bug if booktitle contains { or } and add more categories of conferences. (now support 105 conferences)
  • 2021.01.06 We fix a few minor bugs and add more categories of conferences. (now support 84 conferences)
  • 2021.12.31 We build the first version and release it.

Installation

git clone https://github.com/MLNLP-World/Simbiber.git
cd Simbiber/
pip install -e .

OR

pip install simbiber

If you would like to use the latest github version with more bug fixes, please use the first installation method.

Finally, if you run simbiber without any args, you get the following result, then the installation is successful!

Usage(v0.8.0)

simbiber -i [input bib path] -o [output bib path] 

Tips: All path args support absolute and relative paths

simplified argument usage
-i --input_path The path to the input bib file or directory that you want to simplify.
-o --output_path [Optional] The path to the output bib file that you want to save.
PLEASE ATTENTION:
  • It only works in simplify single bib file.
    • If output_path==input_path, it will rewrite input file.
  • Without this param, it will be auto filled:
    • If simplifying single bib file, it will rewrite input file;
    • If simplifying bib directory, it will output to ./out dir.
-c --config_path [Optional]The path to the mapper config file. The path can be a file directory path, like config or a single file path, like config.json.
PLEASE ATTENTION: If you want to simplify a huge bib file, you'd better extract external json config file to achieve satisfactory speed.
-a --if_append_output [Optional] Whether append simplified data to output bib file.
-r --remove_duplicate [Optional] Whether remove duplication if your bib has both of arXiv or Conference citation.
PLEASE ATTENTION: If True, it might cost more time to write simplified bib file. Please keep patient.
-cch --cache_num [Optional]The number of bib items you want to simplify at once.
PLEASE ATTENTION: If you want to simplify a huge bib file, you'd better change it to achieve satisfactory speed.
-keep --keep_keys [Optional]The keys you want to keep in every bib item.
The total form is like -keep "pages,doi". NOTE: if raise unrecognized arguments error, it might be better to use --keep_keys

Example Input and Output

An example simplified output entry with the official information (The forms of bibitem like xxx="..." or xxx={...} are both supported):

@inproceedings{li-etal-2019-survey,
    title = "A Sophisticated Survey about Chinese Poem and Beers",
    author = "Li, Bai  and
     Ha, Pi  and
     Jin, Shibai  and
     Xue, Hua  and
     Mao, Tai",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)",
    month = nov,
    year = "2019",
    address = "Hong Kong, China",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/D19-1214",
    doi = "10.18653/v1/D19-1214",
    pages = "2078--2087",
    abstract = "Intent detection and slot filling are two main tasks for building a spoken language understanding (SLU) system. The two tasks are closely tied and the slots often highly depend on the intent. In this paper, we propose a novel framework for SLU to better incorporate the intent information, which further guiding the slot filling. In our framework, we adopt a joint model with Stack-Propagation which can directly use the intent information as input for slot filling, thus to capture the intent semantic knowledge. In addition, to further alleviate the error propagation, we perform the token-level intent detection for the Stack-Propagation framework. Experiments on two publicly datasets show that our model achieves the state-of-the-art performance and outperforms other previous methods by a large margin. Finally, we use the Bidirectional Encoder Representation from Transformer (BERT) model in our framework, which further boost our performance in SLU task.",
}

An example simplified output entry from the official information:

@inproceedings{li-etal-2019-survey,
    author = {Li, Bai  and
     Ha, Pi  and
     Jin, Shibai  and
     Xue, Hua  and
     Mao, Tai},
    booktitle = {Proc. of EMNLP},
    title = {A Sophisticated Survey about Chinese Poem and Beers},
    year = {2019}
}

Supported Conferences

The config dir contains a list of converted json files of the mapper between official full name and simplified name.

AI

Full Name Name
Association for the Advance of Artificial Intelligence AAAI
International Joint Conference on Autonomous Agents and Multiagent Systems AAMAS
ACM International Conference on Multimedia ACM MM
Artificial Intelligence and Statistics AISTATS
International Conference on Algorithmic Learning Theory ALT
IEEE Congress on Evolutionary Computation CEC
European Conference on Artificial Intelligence ECAI
IEEE International Conference on Fuzzy Systems FUZZ IEEE
Genetic and Evolutionary Computation Conference GECCO
International Conference on Artificial Neural Networks ICANN
International Conference on Automated Planning and Scheduling ICAPS
International Conference on Case-Based Reasoning and Development ICCBR
International Conference on Neural Information Processing ICONIP
International Conference on Robotics and Automation ICRA
International Conference on Tools with Artificial Intelligence ICTAI
International Joint Conference on Artificial Intelligence IJCAI
International Joint Conference on Artificial Intelligence (Survey Track) IJCAI(Survey Track)
International Joint Conference on Neural Networks IJCNN
International Conference on Intelligent Robots and Systems IROS
International Conference on Principles of Knowledge Representation and Reasoning KR
International conference on Knowledge Science, Engineering and Management KSEM
ACM SIGGRAPH Annual Conference SIGGRAPH
ACM Symposium on Theory of Computing STOC
International Conference on Uncertainty in Artificial Intelligence UAI
Parallel Problem Solving from Nature PPSN
Pacific Rim International Conference on Artificial Intelligence PRICAI
International Conference on Technologies and Applications of Artificial Intelligence TAAI

CV

Full Name Name
International Conference on 3D Vision 3DV
Asian Conference on Computer Vision ACCV
ACM International Conference on Multimedia ACM MM
British machine vision conference BMVC
International Conference on Computer Vision and Pattern Recogintion CVPR
European Conference on Computer Vision ECCV
International Conference on Computer Vision ICCV
International Conference on Document Analysis and Recognition ICDAR
IEEE International Conference on Image Processing ICIP
International conference on multimedia and expo ICME
International Conference on Pattern Recognition ICPR
IEEE visualization conference IEEE VIS
International Conference on Medical Image Computing and Computer Assisted Intervention Society MICCAI
ACM SIGGRAPH Annual Conference SIGGRAPH
IEEE Winter Conference on Applications of Computer Vision WACV

DM

Full Name Name
Automated Knowledge Base Construction AKBC
Asia Pacific Web Conference APWeb
International Conference on Information and Knowledge Management CIKM
Database Systems for Advanced Applications DASFAA
The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases ECML-PKDD
IEEE International Conference on Data Engineering ICDE
IEEE International Conference on Data Mining ICDM
International Conference on Database Theory ICDT
ACM SIGKDD Conference on Knowledge Discovery and Data Mining KDD
Language Resources and Evaluation Conference LREC
International Conference on Mobile Data Management MDM
Pacific-Asia Conference on Knowledge Discovery and Data Mining PAKDD
ACM Symposium on Principles of Database Systems PODS
The ACM Conference Series on Recommender Systems RecSys
SIAM International Conference on Data Mining SDM
ACM SIGMOD international conference on Management of data SIGMOD
International Conference on Very Large Data Base VLDB
ACM International Conference on Web Search and Data Mining WSDM
The Web Conference WWW
International Conference on Extending DB Technology EDBT
International Conference on Innovative Data Systems Research CIDR

IR

Full Name Name
European Conference on IR Research ECIR
Extended Semantic Web Conference ESWC
ACM International Conference on Multimedia Retrieval ICMR
The ACM SIGIR International Conference on the Theory of Information Retrieval ICTIR
International Semantic Web Conference ISWC
International Conference on Research on Development in Information Retrieval SIGIR

ML

Full Name Name
Asian Conference on Machine Learning ACML
International Conference on Artificial Intelligence and Statistics AISTATS
European Conference on Machine Learning ECML
International Conference on Learning Representations ICLR
International Conference on Machine Learning ICML
Machine Learning for Health ML4H
Neural Information Processing Systems NeurIPS
Conference on Uncertainty in Artificial Intelligence UAI

NLP

Full Name Name
Asian Chapter of the Association for Computational Linguistics AACL
Association for Computational Linguistics ACL
Chinese Computational Linguistics CCL
International Conference on Computational Linguistics COLING
Annual Conference on Computational Learning Theory COLT
Conference on Computational Natural Language Learning CoNLL
European Chapter of the Association for Computational Linguistics EACL
Empirical Methods in Natural Language Processing EMNLP
International Conference on Acoustics, Speech and Signal Processing ICASSP
International Conference on Document Analysis and Recognition ICDAR
International Conference on Neural Information Processing ICONIP
Conference of the International Speech Communication Association INTERSPEECH
Language Resources and Evaluation Conference LREC
North American Chapter of the Association for Computational Linguistics NAACL
Natural Language Processing and Chinese Computing NLPCC
Workshop on Representation Learning for NLP RepL4NLP
SIGdial Meeting on Discourse and Dialogue SIGDIAL
International Workshop on Semantic Evaluation SemEval
Workshop on Arabic natural language processing WANLP
Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis WASSA
Workshop on Online Abuse and Harms WOAH

Arch

Full Name Name
International Conference on Architectural Support for Programming Languages and Operating Systems ASPLOS
USENIX Annul Technical Conference ATC
Design, Automation & Test in Europe DATE
European Conference on Computer Systems EuroSys
Conference on File and Storage Technologies FAST
High Performance Computer Architecture HPCA
International Symposium on Computer Architecture ISCA
IEEE/ACM International Symposium on Microarchitecture MICRO
ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming PPoPP
International Conference for High Performance Computing, Networking, Storage, and Analysis SC
ACM Symposium on Cloud Computing SoCC

System

Full Name Name
ACM SIGSOFT Symposium on the Foundation of Software Engineering/ European Software Engineering Conference FSE/ESEC
International Conference on Software Engineering ICSE
International Symposium on Software Testing and Analysis ISSTA
USENIX Symposium on Operating Systems Design and Implementations OSDI
ACM Symposium on Operating Systems Principles SOSP

Security

Full Name Name
Annual Computer Security Applications Conference ACSA
ACM Asia Conference on Computer and Communications Security AsiaCCS
ACM Conference on Computer and Communications Security CCS
Dependable Systems and Networks DSN
European Symposium on Research in Computer Security ESORICS
European Symposium on Security and Privacy EuroS&P
International Conference on Information and Communication Security ICICS
Network and Distributed System Security Symposium NDSS
International Symposium on Recent Advances in Intrusion Detection RAID
IEEE Symposium on Security and Privacy SP
Usenix Security Symposium USENIX Security

Adding a new conference

You can manually add any conferences from DBLP to config map.

Take ICLR as an example:

  • Step 1: Go to DBLP
  • Step 2: Find the full name of Conference
  • Step 3: Add map to config/ML.json or parserConfig.json(You should specify the config path)
{"International Conference on Learning Representations": "ICLR"}

Contact

Please email Libo Qin or Qiguang Chen to create Github issues here if you have any questions or suggestions.

And we welcome you to join us and update conferences at https://docs.qq.com/sheet/DWFF1aWlVV1hISU12?tab=h2idmj

Organizers

Contributors

Thanks to the contributors:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

simbiber-0.8.1.tar.gz (21.4 kB view details)

Uploaded Source

Built Distribution

simbiber-0.8.1-py3-none-any.whl (19.2 kB view details)

Uploaded Python 3

File details

Details for the file simbiber-0.8.1.tar.gz.

File metadata

  • Download URL: simbiber-0.8.1.tar.gz
  • Upload date:
  • Size: 21.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.15

File hashes

Hashes for simbiber-0.8.1.tar.gz
Algorithm Hash digest
SHA256 169580814c853906cef8caaa48a625abf2bc5af6098a8b7ed5935ad8520fb50d
MD5 c00ea5242ce6f355b77fdf7591c53dcb
BLAKE2b-256 fdef6bd0c8451c1d5c0ba0d66326b60b4f8047a38147fb62a0b820a1f2814464

See more details on using hashes here.

File details

Details for the file simbiber-0.8.1-py3-none-any.whl.

File metadata

  • Download URL: simbiber-0.8.1-py3-none-any.whl
  • Upload date:
  • Size: 19.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.15

File hashes

Hashes for simbiber-0.8.1-py3-none-any.whl
Algorithm Hash digest
SHA256 32e968ae5b1b1cbf5a70e28d23c4808053b5910f5043e7d15a8ef387d1df2d34
MD5 a38c75e302dfb06bc0d9a85036b1e313
BLAKE2b-256 af85dacae673b8a3ea3c2466cfddd9166b56fe45d21aac39f19df15a8c5b702d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page