Skip to main content

Easy-2-use long text NLP toolkit.

Project description

Ask DeepWiki

DeepLoTX

An Easy-2-use long text NLP toolkit

Installation

  • Install with pip

    pip install -U deeplotx
    
  • Install with uv

    uv add -U deeplotx
    
  • Install from github

    pip install -U git+https://github.com/vortezwohl/DeepLoTX.git
    

Quick Start

To train a binary classifier from text files:

from deeplotx.util import get_files, read_file
from deeplotx import TextBinaryClassifierTrainer, LongTextEncoder

long_text_encoder = LongTextEncoder(
  max_length=2048,
  chunk_size=512,
  overlapping=128
)

trainer = TextBinaryClassifierTrainer(
  long_text_encoder=long_text_encoder,
  batch_size=4,
  train_ratio=0.9
)

pos_data_path = './data/pos'
neg_data_path = './data/neg'
pos_data = [read_file(x) for x in get_files(pos_data_path)]
neg_data = [read_file(x) for x in get_files(neg_data_path)]
model = trainer.train(pos_data, neg_data, num_epochs=20, learning_rate=2e-5, train_loss_threshold=1)
model.save()

model = model.load()
model.predict(long_text_encoder.encode('这是一个测试文本.').squeeze())

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deeplotx-0.4.5.tar.gz (21.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

deeplotx-0.4.5-py3-none-any.whl (26.6 kB view details)

Uploaded Python 3

File details

Details for the file deeplotx-0.4.5.tar.gz.

File metadata

  • Download URL: deeplotx-0.4.5.tar.gz
  • Upload date:
  • Size: 21.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.6.9

File hashes

Hashes for deeplotx-0.4.5.tar.gz
Algorithm Hash digest
SHA256 665da7eb00c1e81f06db034cf849743f18ce4c5131257bde9939562cfd42ac41
MD5 f3a10f7e0a63c2d14f6d1ef1f2bc7af6
BLAKE2b-256 27d97748f8103a97cfaabc8e7d69236cf6f90c4a4f47366c7a20b244c5ffa325

See more details on using hashes here.

File details

Details for the file deeplotx-0.4.5-py3-none-any.whl.

File metadata

  • Download URL: deeplotx-0.4.5-py3-none-any.whl
  • Upload date:
  • Size: 26.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.6.9

File hashes

Hashes for deeplotx-0.4.5-py3-none-any.whl
Algorithm Hash digest
SHA256 053ceeb06880c86deae7a16cf4005761cbafc3b7b9a513632a2ab457e6aed84e
MD5 6755b777b34259404559c081fd39527b
BLAKE2b-256 30009729477f8ad7703d7a364bb365ee8359bec57354b1e62f48fd003a9f35b2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page