Skip to main content

Easy-2-use long text classifier trainers.

Project description

DeepLoTX: Easy2UseLongTextClassifierTrainers

Installation

  • Install with pip

    pip install git+https://github.com/vortezwohl/DeepLoTX.git
    
  • Install with uv

    uv add git+https://github.com/vortezwohl/DeepLoTX.git
    

Quick Start

To train a binary classifier for text files:

from deeplotx.util import get_files, read_file
from deeplotx import TextBinaryClassifierTrainer, LongTextEncoder

long_text_encoder = LongTextEncoder(
  max_length=2048,
  chunk_size=512,
  overlapping=128
)

trainer = TextBinaryClassifierTrainer(
  long_text_encoder=long_text_encoder,
  batch_size=4,
  train_ratio=0.9
)

pos_data_path = './data/pos'
neg_data_path = './data/neg'
pos_data = [read_file(x) for x in get_files(pos_data_path)]
neg_data = [read_file(x) for x in get_files(neg_data_path)]
model = trainer.train(pos_data, neg_data, num_epochs=20, learning_rate=2e-5, train_loss_threshold=1)
model.save()

model = model.load()
model.predict(long_text_encoder.encode('这是一个测试文本.').squeeze())

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deeplotx-0.2.18.tar.gz (19.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

deeplotx-0.2.18-py3-none-any.whl (22.7 kB view details)

Uploaded Python 3

File details

Details for the file deeplotx-0.2.18.tar.gz.

File metadata

  • Download URL: deeplotx-0.2.18.tar.gz
  • Upload date:
  • Size: 19.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.6.14

File hashes

Hashes for deeplotx-0.2.18.tar.gz
Algorithm Hash digest
SHA256 7b2f92293820a955595bee1d406420c9a175a7e0192a169896c855f963881bd0
MD5 7202655d1d26126b18e409e9997faf32
BLAKE2b-256 61d60607d79490b75e2d5d57da99b3b10de8a226dce358d2bfb46eb55267bb23

See more details on using hashes here.

File details

Details for the file deeplotx-0.2.18-py3-none-any.whl.

File metadata

  • Download URL: deeplotx-0.2.18-py3-none-any.whl
  • Upload date:
  • Size: 22.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.6.14

File hashes

Hashes for deeplotx-0.2.18-py3-none-any.whl
Algorithm Hash digest
SHA256 a51abe3023e0a92bc57549c2a344899a2d2f3bed93b6617da624f0540a3487ee
MD5 57ffa14a6652b9abbd9d6772c7c23585
BLAKE2b-256 84f2427b524fcb7e7f03276bde07d78e7f725a9e36784a0ccae381fc65c33bbe

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page