Skip to main content

Easy-2-use long text classifier trainers.

Project description

DeepLoTX: Easy2UseLongTextClassifierTrainers

Installation

  • Install with pip

    pip install -U deeplotx
    
  • Install with uv

    uv add -U deeplotx
    
  • Install from github

    pip install -U git+https://github.com/vortezwohl/DeepLoTX.git
    

Quick Start

To train a binary classifier for text files:

from deeplotx.util import get_files, read_file
from deeplotx import TextBinaryClassifierTrainer, LongTextEncoder

long_text_encoder = LongTextEncoder(
  max_length=2048,
  chunk_size=512,
  overlapping=128
)

trainer = TextBinaryClassifierTrainer(
  long_text_encoder=long_text_encoder,
  batch_size=4,
  train_ratio=0.9
)

pos_data_path = './data/pos'
neg_data_path = './data/neg'
pos_data = [read_file(x) for x in get_files(pos_data_path)]
neg_data = [read_file(x) for x in get_files(neg_data_path)]
model = trainer.train(pos_data, neg_data, num_epochs=20, learning_rate=2e-5, train_loss_threshold=1)
model.save()

model = model.load()
model.predict(long_text_encoder.encode('这是一个测试文本.').squeeze())

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deeplotx-0.4.1.tar.gz (21.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

deeplotx-0.4.1-py3-none-any.whl (26.4 kB view details)

Uploaded Python 3

File details

Details for the file deeplotx-0.4.1.tar.gz.

File metadata

  • Download URL: deeplotx-0.4.1.tar.gz
  • Upload date:
  • Size: 21.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.6.9

File hashes

Hashes for deeplotx-0.4.1.tar.gz
Algorithm Hash digest
SHA256 b28570c9f1cb578f674eb6eda098397544b8c882f21c11db4b40d7f0b6d9b399
MD5 51b1045c38ada2699c3ca00f3bdcb552
BLAKE2b-256 fe8d2d8244381195b06805b4056bd7c727a28d7f83c4dad72ccb5378ecdfb690

See more details on using hashes here.

File details

Details for the file deeplotx-0.4.1-py3-none-any.whl.

File metadata

  • Download URL: deeplotx-0.4.1-py3-none-any.whl
  • Upload date:
  • Size: 26.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.6.9

File hashes

Hashes for deeplotx-0.4.1-py3-none-any.whl
Algorithm Hash digest
SHA256 f93a6bbe5a16fec344791f92cb4d620408654d39f5fc2bd224f1c204ca900732
MD5 62d99dd414be4283544fdd828f943b02
BLAKE2b-256 c521eded7676add2bf82e5333d3a945fb67ded5fb974ad968cb30861d0f0a356

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page