A python library based on transformers for transfer learning

These details have not been verified by PyPI

Project links

Homepage

Project description

Predacons

Predacons is a Python library based on transformers used for transfer learning. It offers a suite of tools for data processing, model training, and text generation, making it easier to apply advanced machine learning techniques to your projects.

PyPI Downloads License Python Version

Features

Predacons provides a comprehensive set of features for working with transformer models, including:

Data Loading: Easily load data from directories or files.
Text Cleaning: Clean your text data with built-in functions.
Model Training: Train transformer models with custom data.
Text Generation: Generate text using trained models.
Text Streaming: Stream text generation using trained models.
Chat Generation: Generate chat responses using trained models.
Chat Streaming: Stream chat generation using trained models.
Embeddings: Generate embeddings for sentences using pre-trained transformer models. and is fully compatible with langchain methods

Installation

To install Predacons, use the following pip command:

pip install predacons

Usage

Here's a quick start guide to using Predacons in your Python projects:

from predacons import predacons

# Initialize the library
predacons.rollout()

# Load documents from a directory
predacons.read_documents_from_directory('your/directory/path')

# Clean text data
cleaned_text = predacons.clean_text("Your dirty text here")

# Train a model with your data
predacons.train(train_file_path="path/to/train/file",
                model_name="your_model_name",
                output_dir="path/to/output/dir",
                overwrite_output_dir=True,
                per_device_train_batch_size=4,
                num_train_epochs=3,
                save_steps=100)

# Generate text using a trained model
generated_text = predacons.generate_text(model_path="path/to/your/model",
                                         sequence="Seed text for generation",
                                         max_length=50)
# 

# Stream text generation using a trained model
for text in predacons.text_stream(model_path="path/to/your/model",
                                  sequence="Seed text for generation",
                                  max_length=50):
    print(text)

# Get text streamer
thread,streamer = predacons.text_generate(model=model, tokenizer=tokenizer, sequence=seq, max_length=100, temperature=0.1, stream=True)

# You can also use a processor instead of a tokenizer for model-based generation:
thread,streamer = predacons.text_generate(model=model, processor=processor, sequence=seq, max_length=100, temperature=0.1, stream=True)

thread.start()
try:
    out = ""
    for new_text in streamer:
        out = out + new_text
        print(new_text, end=" ")
finally:
    thread.join()

# Generate chat using a trained model
chat = [
    {"role": "user", "content": "Hey, what is a car?"}
]
chat_output = predacons.chat_generate(model=model,
        sequence=chat,
        max_length=50,
        tokenizer=tokenizers,
        trust_remote_code=True)
# You can also use a processor instead of a tokenizer for chat generation:
chat_output = predacons.chat_generate(model=model,
        sequence=chat,
        max_length=50,
        processor=processor,
        trust_remote_code=True)

# Stream chat generation using a trained model
for chat in predacons.chat_stream(model=model,
                                  sequence=chat,
                                  max_length=50,
                                  tokenizer=tokenizers,
                                  trust_remote_code=True):
    print(chat)
# You can also use a processor instead of a tokenizer for chat streaming:
for chat in predacons.chat_stream(model=model,
                                  sequence=chat,
                                  max_length=50,
                                  processor=processor,
                                  trust_remote_code=True):
    print(chat)

# get chat streamer
thread,streamer = predacons.chat_generate(model=model, tokenizer = tokenizer, sequence = chat, max_length=500, temperature=0.1,stream=True)

thread.start()
try:
    out = ""
    for new_text in streamer:
        out = out + new_text
        print(new_text, end="")
finally:
    thread.join()
# Generate embeddings for sentences
from predacons.src.embeddings import PredaconsEmbedding

# this embedding_model object can be used directly in every method langchain   
embedding_model = PredaconsEmbedding(model_name="sentence-transformers/paraphrase-MiniLM-L6-v2")
sentence_embeddings = embedding_model.get_embedding(["Your sentence here", "Another sentence here"])

Contributing

Contributions to the Predacons library are welcome! If you have suggestions for improvements or new features, please open an issue first to discuss your ideas. For code contributions, please submit a pull request.

License

This project is licensed under multiple licenses:

For free users, the project is licensed under the terms of the GNU Affero General Public License (AGPL). See LICENSE-AGPL for more details.
For paid users, there are two options:
- A perpetual commercial license. See LICENSE-COMMERCIAL-PERPETUAL for more details.
- A yearly commercial license. See LICENSE-COMMERCIAL-YEARLY for more details.

Please ensure you understand and comply with the license that applies to you.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

0.0.130

Jun 2, 2025

0.0.129

Feb 27, 2025

0.0.128

Nov 2, 2024

0.0.127

Nov 2, 2024

0.0.126

Oct 11, 2024

0.0.125

Sep 29, 2024

0.0.124

Sep 18, 2024

0.0.123

Sep 14, 2024

0.0.122

Sep 9, 2024

0.0.121

Sep 8, 2024

0.0.120

Jul 7, 2024

0.0.119

Jun 9, 2024

0.0.118

Jun 9, 2024

0.0.117

Mar 23, 2024

0.0.116

Mar 14, 2024

0.0.115

Mar 9, 2024

0.0.114

Mar 8, 2024

0.0.113

Feb 24, 2024

0.0.112

Feb 23, 2024

0.0.109

Feb 21, 2024

0.0.108

Dec 7, 2023

0.0.107

Nov 19, 2023

0.0.106

Nov 12, 2023

0.0.105

Nov 11, 2023

0.0.104

Nov 1, 2023

0.0.103

Nov 1, 2023

0.0.102

Oct 29, 2023

0.0.101

Oct 29, 2023

0.0.1

Oct 21, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

predacons-0.0.130.tar.gz (38.7 kB view details)

Uploaded Jun 2, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

predacons-0.0.130-py3-none-any.whl (45.1 kB view details)

Uploaded Jun 2, 2025 Python 3

File details

Details for the file predacons-0.0.130.tar.gz.

File metadata

Download URL: predacons-0.0.130.tar.gz
Upload date: Jun 2, 2025
Size: 38.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.9.22

File hashes

Hashes for predacons-0.0.130.tar.gz
Algorithm	Hash digest
SHA256	`9fdd23371f861e793ba3bee0c20fc99707d6932fd0a0de7785e2fc5c93a670ab`
MD5	`ea232ca3db06bcd9eee25ddc31765fad`
BLAKE2b-256	`8699dff9bec66d00975e3d96686f218e69f2cfbc008bab5874dc233e9414d13b`

See more details on using hashes here.

File details

Details for the file predacons-0.0.130-py3-none-any.whl.

File metadata

Download URL: predacons-0.0.130-py3-none-any.whl
Upload date: Jun 2, 2025
Size: 45.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.9.22

File hashes

Hashes for predacons-0.0.130-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2db5bbf6d0d2146411780b91884fbc263aff5f8fb53f0663eb8e091048db8002`
MD5	`43ae35d56eb7d485ae03ce1971e77e1a`
BLAKE2b-256	`54a67ed4cfcdaa55c42d73f9c6eb183e070290881b602787c0945b54cfaddd34`

See more details on using hashes here.

predacons 0.0.130

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Predacons

Features

Installation

Usage

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes