Skip to main content

Easy-2-use long text classifier trainers.

Project description

DeepLoTX: Easy2UseLongTextClassifierTrainers

Installation

  • Install with pip

    pip install git+https://github.com/vortezwohl/DeepLoTX.git
    
  • Install with uv

    uv add git+https://github.com/vortezwohl/DeepLoTX.git
    

Quick Start

To train a binary classifier for text files:

from deeplotx.util import get_files, read_file
from deeplotx import TextBinaryClassifierTrainer, LongTextEncoder

long_text_encoder = LongTextEncoder(
  max_length=2048,
  chunk_size=512,
  overlapping=128
)

trainer = TextBinaryClassifierTrainer(
  long_text_encoder=long_text_encoder,
  batch_size=4,
  train_ratio=0.9
)

pos_data_path = './data/pos'
neg_data_path = './data/neg'
pos_data = [read_file(x) for x in get_files(pos_data_path)]
neg_data = [read_file(x) for x in get_files(neg_data_path)]
model = trainer.train(pos_data, neg_data, num_epochs=20, learning_rate=2e-5, train_loss_threshold=1)
model.save()

model = model.load()
model.predict(long_text_encoder.encode('这是一个测试文本.').squeeze())

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deeplotx-0.2.17.tar.gz (19.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

deeplotx-0.2.17-py3-none-any.whl (22.7 kB view details)

Uploaded Python 3

File details

Details for the file deeplotx-0.2.17.tar.gz.

File metadata

  • Download URL: deeplotx-0.2.17.tar.gz
  • Upload date:
  • Size: 19.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.6.14

File hashes

Hashes for deeplotx-0.2.17.tar.gz
Algorithm Hash digest
SHA256 c613d2095a464725c415ed2a474189746bbe006f75b171edf17b5bcfc8d71430
MD5 594fdb667420233d7f91097be63a8774
BLAKE2b-256 b03a22d327205a757a319b84154cea5ae697209e1ee181c454b4e62cd717bec4

See more details on using hashes here.

File details

Details for the file deeplotx-0.2.17-py3-none-any.whl.

File metadata

  • Download URL: deeplotx-0.2.17-py3-none-any.whl
  • Upload date:
  • Size: 22.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.6.14

File hashes

Hashes for deeplotx-0.2.17-py3-none-any.whl
Algorithm Hash digest
SHA256 1103bbd4ae7467f042975db33a90d450c2d9052d7184960313d3d281077d875b
MD5 136d759b4c5de7b58434afec9f6bb511
BLAKE2b-256 6ab4ffd8501223122690a8c485f9b87112a622963e06fd26c52e891e889fd7e5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page