This tool provides the state-of-the-art models for aspect term extraction (ATE), aspect polarity classification (APC), and text classification.

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

PyABSA - Open Framework for Aspect-based Sentiment Analysis

PyPI - Python Version License

PyABSA is a personal project which received many contributions from all the contributors. Please feel free to help make it developing, with regards for all the people who contribute to PyABSA. I am glad if PyABSA helps you, please star this repo as Each Star helps PyABSA go further, many thanks.

Annotate Your Own Dataset

The repo ABSADatasets provides an open-source dataset annotating tool, you can easily annotate your dataset before using PyABSA.

Fit on Your Existing Dataset

First, refer to ABSADatasets to prepare your dataset into acceptable format.
You can PR to contribute your dataset and use it like ABDADatasets.your_dataset (All the datasets are for research only, shall not danger your data copyright)

Training based on Existing Checkpoints

Have no enough data to train your model, here are what you can do:

Combine multiple datasets with your dataset to train your model
Resume training from shared checkpoints, see train_based_on_checkpoint.py, train_atepc_based_on_checkpoint.py

Learn to Use FindFile

PyABSA uses FindFile to locate the target file(s) so you can specify a dataset/checkpoint path by keywords instead of using absolute path. e.g.,

dataset = './laptop' # relative path
dataset = 'ABSOLUTE_PATH/laptop/' # absolute path
dataset = 'laptop' # dataset name, char-case un-sensitive
dataset = 'lapto' # search any path containing the 'lapto' or 'aptop' string

checkpoint = 'lcfs' # checkpoint path assignment is similar to above methods

Learn to Use AutoCuda

Auto select the free cuda for training & inference PyABSA use the AutoCUDA to support automatic cuda assignment, but you can still set a preferred device.

auto_device = True  # to auto assign a cuda device for training / inference
auto_device = False  # to use cpu
auto_device = 'cuda:1'  # to specify a preferred device
auto_device = 'cpu'  # to specify a preferred device
auto_device = 'allcuda'  # use all cuda to train

Use Human-readable Labels in Your Dataset

PyABSA encourages you to use string labels instead of numbers. e.g., sentiment labels = {negative, positive, Neutral, unknown}

What labels you use in the dataset, what labels will be output in inference
You can train a model using multiple datasets with same sentiment labels, and you can even contribute and define a combination of datasets here!
The version information of PyABSA is also available in the output while loading checkpoints training args.

For Syntax-Parsing Models

The default SpaCy english model is en_core_web_sm, if you didn't install it, PyABSA will download/install it automatically.

If you would like to change english model (or other pre-defined options), you can get/set as following:

from pyabsa.functional.config.apc_config_manager import APCConfigManager
from pyabsa.functional.config.atepc_config_manager import ATEPCConfigManager
from pyabsa.functional.config.classification_config_manager import ClassificationConfigManager

# Set
APCConfigManager.set_apc_config_english({'spacy_model': 'en_core_web_lg'})
ATEPCConfigManager.set_atepc_config_english({'spacy_model': 'en_core_web_lg'})
ClassificationConfigManager.set_classification_config_english({'spacy_model': 'en_core_web_lg'})

# Get
APCConfigManager.get_apc_config_english()
ATEPCConfigManager.get_atepc_config_english()
ClassificationConfigManager.get_classification_config_english()

# Manually Set spaCy nlp Language object
from pyabsa.core.apc.dataset_utils.apc_utils import configure_spacy_model

nlp = configure_spacy_model(APCConfigManager.get_apc_config_english())

Package Overview

pyabsa	package root (including all interfaces)
pyabsa.functional	recommend interface entry
pyabsa.functional.checkpoint	checkpoint manager entry, inference model entry
pyabsa.functional.dataset	datasets entry
pyabsa.functional.config	predefined config manager
pyabsa.functional.trainer	training module, every trainer return a inference model

Installation

Please do not install the version without corresponding release note to avoid installing a test version.

install via pip

To use PyABSA, install the latest version from pip or source code:

pip install -U pyabsa

install via source

git clone https://github.com/yangheng95/PyABSA --depth=1
cd PyABSA 
python setup.py install

Quick Start

Create a new python environment (Recommended) and install latest pyabsa
Find a suitable demo script (ATEPC , APC , Text Classification) to prepare your training script. (Welcome to share your demo script)
Format or Annotate your dataset referring to ABSADatasets or use public dataset in ABSADatasets
Init your config to specify Model, Dataset, hyper-parameters
Training your model and get checkpoints
Share your checkpoint and dataset

Learning to Use Checkpoint

Get available checkpoints from Google Drive

PyABSA will check the latest available checkpoints before and load the latest checkpoint from Google Drive. To view available checkpoints, you can use the following code and load the checkpoint by name:

from pyabsa import available_checkpoints

checkpoint_map = available_checkpoints()  # show available checkpoints of PyABSA of current version

If you can not access to Google Drive, you can download our checkpoints and load the unzipped checkpoint manually. 如果您无法访问谷歌Drive，您可以从此处 (提取码：ABSA) 下载我们预训练的模型，并加载模型（本仓库为个人业余项目，没有精力再维护百度云，如果您可以帮助管理国内checkpoint的保存和下载请联系我）。

How to use our pretrained checkpoints on your dataset

How to share checkpoints (e.g., checkpoints trained on your custom dataset) with community

Datasets

More datasets are available at ABSADatasets.

Twitter
Laptop14
Restaurant14
Restaurant15
Restaurant16
Phone
Car
Camera
Notebook
MAMS
TShirt
Television
MOOC
Shampoo
Multilingual (The sum of all datasets.)

You don't have to download the datasets, as the datasets will be downloaded automatically.

Model Support

Except for the following models, we provide a template model involving LCF vec, you can develop your model based on the LCF-APC model template or LCF-ATEPC model template.

ATEPC

APC

Bert-based APC models

SLIDE-LCF-BERT (Faster & Performs Better than LCF/LCFS-BERT)
SLIDE-LCFS-BERT (Faster & Performs Better than LCF/LCFS-BERT)
LCF-BERT (Reimplemented & Enhanced)
LCFS-BERT (Reimplemented & Enhanced)
FAST-LCF-BERT (Faster with slightly performance loss)
FAST_LCFS-BERT (Faster with slightly performance loss)
LCF-DUAL-BERT (Dual BERT)
LCFS-DUAL-BERT (Dual BERT)
BERT-BASE
BERT-SPC
LCA-Net
DLCF-DCA-BERT *

Bert-based APC baseline models

GloVe-based APC baseline models

Contribution

We expect that you can help us improve this project, and your contributions are welcome. You can make a contribution in many ways, including:

Share your custom dataset in PyABSA and ABSADatasets
Integrates your models in PyABSA. (You can share your models whether it is or not based on PyABSA. if you are interested, we will help you)
Raise a bug report while you use PyABSA or review the code (PyABSA is a individual project driven by enthusiasm so your help is needed)
Give us some advice about feature design/refactor (You can advise to improve some feature)
Correct/Rewrite some error-messages or code comment (The comments are not written by native english speaker, you can help us improve documents)
Create an example script in a particular situation (Such as specify a SpaCy model, pretrained-bert type, some hyperparameters)
Star this repository to keep it active

Notice

The LCF is a simple and adoptive mechanism proposed for ABSA. Many models based on LCF has been proposed and achieved SOTA performance. Developing your models based on LCF will significantly improve your ABSA models. If you are looking for the original proposal of local context focus, please redirect to the introduction of LCF. If you are looking for the original codes of the LCF-related papers, please redirect to LC-ABSA / LCF-ABSA or LCF-ATEPC.

Acknowledgement

This work build from LC-ABSA/LCF-ABSA and LCF-ATEPC, and other impressive works such as PyTorch-ABSA and LCFS-BERT.

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

2.4.1.post1

Feb 28, 2024

2.4.1

Feb 23, 2024

2.4.0

Dec 27, 2023

2.3.4

Oct 8, 2023

2.3.4rc1 pre-release

Nov 7, 2023

2.3.4rc0 pre-release

Oct 8, 2023

2.3.3

Aug 11, 2023

2.3.2.2

Aug 9, 2023

2.3.2.1

Aug 7, 2023

2.3.2

Aug 5, 2023

2.3.1

Apr 15, 2023

2.3.1b0 pre-release

Apr 15, 2023

2.3.0

Apr 14, 2023

2.2.3

Apr 14, 2023

2.2.2

Apr 5, 2023

2.2.1

Apr 4, 2023

2.2.0

Mar 22, 2023

2.1.12

Mar 18, 2023

2.1.11

Mar 16, 2023

2.1.10.1

Mar 16, 2023

2.1.10

Mar 16, 2023

2.1.9

Mar 15, 2023

2.1.8.1

Mar 14, 2023

2.1.8

Mar 14, 2023

2.1.7.1

Mar 13, 2023

2.1.6

Mar 10, 2023

2.1.5

Mar 8, 2023

2.1.4

Mar 8, 2023

2.1.3

Mar 8, 2023

2.1.2

Mar 8, 2023

2.1.2a0 pre-release

Mar 8, 2023

2.1.1

Mar 6, 2023

2.1.0

Mar 6, 2023

2.0.28

Feb 13, 2023

2.0.28a2 pre-release

Feb 5, 2023

2.0.27

Feb 1, 2023

2.0.27a2 pre-release

Jan 31, 2023

2.0.27a1 pre-release

Jan 31, 2023

2.0.27a0 pre-release

Jan 30, 2023

2.0.26

Jan 28, 2023

2.0.25.2

Jan 27, 2023

2.0.25.1

Jan 27, 2023

2.0.25

Jan 27, 2023

2.0.24.1

Jan 24, 2023

2.0.24

Jan 24, 2023

2.0.23

Jan 3, 2023

2.0.22

Dec 27, 2022

2.0.20

Dec 25, 2022

2.0.19

Dec 15, 2022

2.0.18

Dec 13, 2022

2.0.17

Dec 6, 2022

2.0.16

Dec 4, 2022

2.0.15

Dec 2, 2022

2.0.14

Dec 2, 2022

2.0.13 yanked

Dec 2, 2022

2.0.12b0 pre-release

Dec 2, 2022

2.0.11

Nov 23, 2022

2.0.10

Nov 21, 2022

2.0.9a0 pre-release

Nov 21, 2022

2.0.8

Nov 21, 2022

2.0.7

Nov 19, 2022

2.0.6

Nov 19, 2022

2.0.5

Nov 18, 2022

2.0.4

Nov 17, 2022

2.0.3

Nov 17, 2022

2.0.2

Nov 8, 2022

2.0.0

Nov 7, 2022

1.16.28

Sep 15, 2023

1.16.27

Nov 7, 2022

1.16.25

Oct 31, 2022

1.16.24

Oct 28, 2022

1.16.23

Oct 27, 2022

1.16.22

Oct 26, 2022

1.16.21

Oct 24, 2022

1.16.20

Oct 24, 2022

1.16.19a0 pre-release

Oct 12, 2022

1.16.18

Sep 30, 2022

1.16.17

Sep 22, 2022

1.16.16

Sep 9, 2022

1.16.15

Sep 6, 2022

1.16.14

Aug 20, 2022

1.16.13

Aug 20, 2022

1.16.12

Aug 20, 2022

1.16.11

Aug 17, 2022

1.16.10

Aug 16, 2022

1.16.8

Aug 9, 2022

1.16.7

Aug 8, 2022

1.16.6.1

Aug 4, 2022

1.16.6

Aug 4, 2022

1.16.5

Jul 15, 2022

1.16.4

Jul 9, 2022

1.16.3

Jul 8, 2022

1.16.2

Jul 7, 2022

1.16.1

Jul 7, 2022

1.16.0

Jul 6, 2022

1.15.7

Jul 4, 2022

1.15.6

Jun 30, 2022

1.15.6a0 pre-release

Jun 29, 2022

1.15.5

Jun 23, 2022

1.15.4

Jun 22, 2022

1.15.3

Jun 20, 2022

1.15.0

Jun 16, 2022

1.14.8

May 29, 2022

1.14.7

May 20, 2022

1.14.6

May 19, 2022

1.14.5.post1

May 17, 2022

1.14.5

May 17, 2022

1.14.4

May 13, 2022

1.14.3

May 6, 2022

1.14.2

May 3, 2022

1.14.1

May 3, 2022

1.14.0

May 1, 2022

1.13.3

May 1, 2022

1.13.2

Apr 30, 2022

1.13.1

Apr 30, 2022

1.13.0

Apr 30, 2022

1.11.0

Apr 28, 2022

1.10.6

Apr 27, 2022

1.10.5

Apr 26, 2022

1.10.4

Apr 21, 2022

1.10.3

Apr 16, 2022

1.10.2

Apr 14, 2022

1.9.6

Apr 9, 2022

1.9.5

Apr 5, 2022

1.9.4

Apr 4, 2022

1.9.3

Mar 31, 2022

1.9.2

Mar 29, 2022

1.9.1

Mar 28, 2022

1.9.0

Mar 26, 2022

1.8.43

Mar 25, 2022

1.8.42

Mar 25, 2022

1.8.41

Mar 24, 2022

1.8.40

Mar 22, 2022

1.8.39

Mar 18, 2022

1.8.38

Mar 18, 2022

1.8.37

Mar 13, 2022

1.8.36

Mar 12, 2022

1.8.35

Mar 10, 2022

1.8.34

Mar 10, 2022

1.8.33

Mar 10, 2022

1.8.32

Mar 9, 2022

1.8.30

Mar 4, 2022

1.8.29

Feb 23, 2022

1.8.28

Feb 20, 2022

1.8.26

Feb 18, 2022

1.8.25

Feb 18, 2022

1.8.24

Feb 14, 2022

1.8.23

Feb 14, 2022

1.8.22

Feb 14, 2022

1.8.21

Feb 14, 2022

1.8.20

Feb 6, 2022

1.8.18

Feb 5, 2022

1.8.15

Jan 26, 2022

1.8.14

Jan 24, 2022

1.8.13

Jan 22, 2022

This version

1.8.12

Jan 19, 2022

1.8.11

Jan 19, 2022

1.8.10

Jan 16, 2022

1.8.9

Jan 15, 2022

1.8.8 yanked

Jan 14, 2022

Reason this release was yanked:

bug found

1.8.5

Jan 7, 2022

1.8.4

Jan 5, 2022

1.8.2

Dec 31, 2021

1.8.1

Dec 31, 2021

1.6.17

Dec 29, 2021

1.6.16

Dec 28, 2021

1.6.15

Dec 28, 2021

1.6.14.1

Dec 31, 2021

1.6.14

Dec 25, 2021

1.6.12

Dec 17, 2021

1.6.11

Dec 16, 2021

1.6.10

Dec 14, 2021

1.6.8

Dec 7, 2021

1.6.7

Dec 6, 2021

1.6.6

Dec 6, 2021

1.6.5

Dec 6, 2021

1.6.4

Dec 6, 2021

1.6.3

Dec 6, 2021

1.6.2

Dec 6, 2021

1.6.1

Dec 5, 2021

1.6

Dec 5, 2021

1.5.4

Dec 4, 2021

1.5.3

Dec 3, 2021

1.5.2

Dec 3, 2021

1.5.1

Dec 3, 2021

1.5

Dec 3, 2021

1.3.15

Dec 1, 2021

1.3.14

Nov 30, 2021

1.3.13

Nov 30, 2021

1.3.12

Nov 29, 2021

1.3.11

Nov 28, 2021

1.3.5

Nov 20, 2021

1.2.13

Nov 11, 2021

1.2.12

Oct 29, 2021

1.2.10

Oct 24, 2021

1.2.9

Oct 22, 2021

1.2.8

Oct 11, 2021

1.2.7

Oct 10, 2021

1.2.6

Oct 3, 2021

1.2.5

Oct 3, 2021

1.2.3

Oct 2, 2021

1.2.2

Oct 2, 2021

1.2.0

Sep 30, 2021

1.1.24

Sep 30, 2021

1.1.23

Sep 29, 2021

1.1.22

Sep 27, 2021

1.1.20

Sep 26, 2021

1.1.19

Sep 23, 2021

1.1.18

Sep 23, 2021

1.1.17

Sep 21, 2021

1.1.16

Sep 18, 2021

1.1.14

Sep 15, 2021

1.1.13

Sep 9, 2021

1.1.12

Sep 3, 2021

1.1.9

Aug 25, 2021

1.1.8

Aug 25, 2021

1.1.7a1 pre-release

Aug 24, 2021

1.1.3

Aug 21, 2021

1.1.1

Aug 19, 2021

0.9.9a0 pre-release

Apr 7, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

pyabsa-1.8.12-py3-none-any.whl (218.8 kB view hashes)

Uploaded Jan 19, 2022 Python 3

Hashes for pyabsa-1.8.12-py3-none-any.whl

Hashes for pyabsa-1.8.12-py3-none-any.whl
Algorithm	Hash digest
SHA256	`cba9036b34b96c92ef24e5c7a6c814302c0f3d9d91d62b50a2365dcf31125a80`
MD5	`ed05692fe100b2afa6c5bb4c013a99b2`
BLAKE2b-256	`36d1560e7946e6c1a23d8523bb49c46951fea70049c6c1a1191ff4ba1dbd4890`

pyabsa 1.8.12

Navigation

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Project description

PyABSA - Open Framework for Aspect-based Sentiment Analysis

Annotate Your Own Dataset

Fit on Your Existing Dataset

Training based on Existing Checkpoints

Learn to Use FindFile

Learn to Use AutoCuda

Use Human-readable Labels in Your Dataset

For Syntax-Parsing Models

Package Overview

Installation

install via pip

install via source

Quick Start

Learning to Use Checkpoint

Get available checkpoints from Google Drive

How to use our pretrained checkpoints on your dataset

How to share checkpoints (e.g., checkpoints trained on your custom dataset) with community

Datasets

Model Support

ATEPC

APC

Bert-based APC models

Bert-based APC baseline models

GloVe-based APC baseline models

Contribution

Notice

Acknowledgement

Project details

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution