Utilities for training and working with nlp models in pytorch
Project description
xt-nlp
Description
This repo contains common NLP pre/post processing functions, loss functions, metrics, and helper functions.
Installation
From PyPI:
pip install xt-nlp
From source:
git clone https://github.com/XtractTech/xt-nlp.git
pip install ./xt-nlp
Usage
See specific help on a class or function using help
. E.g., help(SESLoss)
.
Defining SES Metrics and Loss
from xt_nlp.metrics import SESF1
from xt_nlp.metrics import SESLoss
eval_metrics = {
'f1': SESF1(threshold=0.8)
}
loss_fn = SESLoss()
Read BRAT annotations for sequence extraction into data loader
from xt_nlp.utils import get_brat_examples, split_examples, get_features, build_ses_dataloader
# tokenizer =
# max_sequence_length =
# doc_stride =
# class_dict = Dictionary mapping classname ==> list of classes to group into this class
# classes =
# batch_size =
# workers =
examples = get_brat_examples(
datadir='./data/datadir',
classes=classes
)
train_examples, val_examples = split_examples(examples, train_prop=.9, seed=4000)
train_features = get_features(
examples=train_examples,
tokenizer=tokenizer,
all_ans_types=classes,
max_seq_len=max_sequence_length,
doc_stride=doc_stride,
mode='train'
)
train_loader = build_ses_dataloader(
train_features,
classes,
class_dict,
batch_size=batch_size,
workers=workers,
max_seq_length=max_sequence_length,
shuffle=True
)
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
xt-nlp-0.2.5.tar.gz
(18.8 kB
view details)
Built Distribution
xt_nlp-0.2.5-py3-none-any.whl
(21.2 kB
view details)
File details
Details for the file xt-nlp-0.2.5.tar.gz
.
File metadata
- Download URL: xt-nlp-0.2.5.tar.gz
- Upload date:
- Size: 18.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/44.0.0.post20200106 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.7.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b877db5f1db3d1be82c4a8eb47e573920f5cfe71dea4808bc807925d45671070 |
|
MD5 | ce8fca79074596186c810fe496d36991 |
|
BLAKE2b-256 | c236a88ab953d78fb719707db1032d520343f9a7fcb225184ec8be7c32135c9c |
File details
Details for the file xt_nlp-0.2.5-py3-none-any.whl
.
File metadata
- Download URL: xt_nlp-0.2.5-py3-none-any.whl
- Upload date:
- Size: 21.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/44.0.0.post20200106 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.7.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 88be43de64a56cf0d3cf24a2b58e535ffb46e3e61fa7b5b37ab6b78c8bfb83b8 |
|
MD5 | fe5b3f55046ae2dc29a3a90ed788d29a |
|
BLAKE2b-256 | 63082ea757d776c77fb8d99370137bf03b6e88df768e4d2850e3aea2c396eb58 |