
Straightforward RNN model

Project description

sentimentizer


Lightweight PyTorch models for sentiment analysis. Small models can be very effective for classification tasks at a fraction of the deployment cost of larger ones — all models were trained on a single 2080Ti GPU in minutes, and inference requires less than 1 GB of memory.

Beta release — API is subject to change.

Install

pip install sentimentizer

Quick Start

from sentimentizer.tokenizer import get_trained_tokenizer
from sentimentizer.models.rnn import get_trained_model

model = get_trained_model(64, "cpu")
tokenizer = get_trained_tokenizer()

review_text = "greatest pie ever, best in town!"
positive_ids = tokenizer.tokenize_text(review_text)
model.predict(positive_ids)
# >> tensor(0.9701)

Scores range from 0 (very negative) to 1 (very positive).
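
The served API (shown later) also turns this score into a label; assuming the conventional 0.5 cutoff, which is a guess rather than documented behavior, the mapping is:

score = model.predict(positive_ids).item()          # 0-dim tensor -> float
label = "positive" if score >= 0.5 else "negative"  # 0.5 cutoff is an assumption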

Models

Three architectures are available:

| Model | Module | Description |
| --- | --- | --- |
| Encoder | sentimentizer.models.encoder | Transformer encoder with CLS token + positional encoding (4 layers, d_model=256) — recommended |
| RNN | sentimentizer.models.rnn | Bidirectional 2-layer LSTM (hidden=256) with GloVe embeddings — solid baseline |
| Decoder | sentimentizer.models.decoder | Encoder-Decoder Transformer with learnable query token + cross-attention (2 encoder + 4 decoder layers) |

Why Encoder? Self-attention over the full token sequence with a CLS token is the most natural fit for sentence-level classification. The RNN processes tokens sequentially and can miss long-range dependencies, though bidirectionality helps. The Decoder uses cross-attention (a query token attends to encoded text), which is effective but adds encoder overhead — best reserved for experimenting with that cross-attention pattern.
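
For intuition, the CLS-token pattern can be sketched in plain PyTorch (an illustration only, not the package's encoder.py; the dimensions follow the table above and the class and argument names are invented):

import torch
import torch.nn as nn

class ClsEncoderClassifier(nn.Module):
    """Illustrative CLS-token encoder classifier, not the package's actual encoder."""

    def __init__(self, vocab_size, d_model=256, n_heads=4, n_layers=4, max_len=512):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        self.cls = nn.Parameter(torch.zeros(1, 1, d_model))            # learnable CLS token
        self.pos = nn.Parameter(torch.zeros(1, max_len + 1, d_model))  # learned positions (sinusoidal also works)
        layer = nn.TransformerEncoderLayer(d_model, n_heads, dim_feedforward=4 * d_model, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, n_layers)
        self.head = nn.Linear(d_model, 1)

    def forward(self, token_ids):                                      # token_ids: (batch, seq_len)
        tok = self.embed(token_ids)
        x = torch.cat([self.cls.expand(tok.size(0), -1, -1), tok], dim=1)  # prepend CLS
        x = self.encoder(x + self.pos[:, : x.size(1)])
        return torch.sigmoid(self.head(x[:, 0])).squeeze(-1)           # sentiment score read from the CLS position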

Each module exposes get_trained_model(batch_size, device) to load pre-trained weights.
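
For example, loading the recommended encoder on CPU follows the same pattern as the RNN quick start (the batch size here is arbitrary):

from sentimentizer.models.encoder import get_trained_model as get_encoder_model

encoder = get_encoder_model(64, "cpu")  # pre-trained Transformer encoder weights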

Serving

Ray Serve (Python)

The serve.py entry point deploys a Ray Serve application that loads all three models (RNN, Encoder, Decoder) at startup. You can select which model to use per request via the model field.

serve run serve:app --host 0.0.0.0 --port 8000
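
For reference, the deployment inside serve.py follows the standard Ray Serve pattern, roughly as in the sketch below (simplified: only the RNN is loaded here, and the 0.5 cutoff and score rounding are assumptions):

from ray import serve
from starlette.requests import Request

from sentimentizer.tokenizer import get_trained_tokenizer
from sentimentizer.models.rnn import get_trained_model as get_rnn  # encoder/decoder loaded the same way

@serve.deployment
class Sentimentizer:
    def __init__(self):
        self.tokenizer = get_trained_tokenizer()
        self.models = {"rnn": get_rnn(64, "cpu")}  # the real app also loads encoder and decoder

    async def __call__(self, request: Request) -> dict:
        body = await request.json()
        name = body.get("model", "rnn")            # per-request model selection
        score = float(self.models[name].predict(self.tokenizer.tokenize_text(body["text"])))
        return {
            "text": body["text"],
            "model": name,
            "sentiment_score": round(score, 4),
            "prediction": "positive" if score >= 0.5 else "negative",
        }

app = Sentimentizer.bind()  # `serve run serve:app` picks this up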

Send a prediction request (defaults to RNN):

curl -X POST http://localhost:8000 \
  -H "Content-Type: application/json" \
  -d '{"text": "the food was terrific"}'

Use a specific model:

# Transformer Encoder (recommended)
curl -X POST http://localhost:8000 \
  -H "Content-Type: application/json" \
  -d '{"text": "the food was terrific", "model": "encoder"}'

# Encoder-Decoder Transformer
curl -X POST http://localhost:8000 \
  -H "Content-Type: application/json" \
  -d '{"text": "the food was terrific", "model": "decoder"}'

Response:

{
  "text": "the food was terrific",
  "model": "encoder",
  "sentiment_score": 0.9701,
  "prediction": "positive"
}

List all available models:

curl http://localhost:8000/models

Go CLI Client

A Go CLI client is included for interacting with the serve endpoint:

# Build and run
go run main.go -text "the food was terrific"

# Pipe input
echo "terrible service" | go run main.go

# Positional arguments
go run main.go "best restaurant in town"

# Raw JSON output
go run main.go -raw -text "amazing pasta"

# Custom endpoint
go run main.go -host http://remote:8000 -text "great coffee"

The client outputs colorized results with emoji indicators:

Text:       the food was terrific
Prediction: positive 👍
Score:      0.9701
Latency:    12ms

Training

Prerequisites

To retrain the model:

  1. Get the Yelp dataset — download yelp_dataset.tar and place it in ../data/ (one level above the project root)
  2. Get the GloVe 6B 100D embeddings — download glove.6B.zip and place it in ../data/ (one level above the project root)

The expected directory structure:

data/                            # one level above project root
├── yelp_dataset.tar             # Yelp dataset (downloaded)
└── glove.6B.zip                 # GloVe embeddings (downloaded)

torch-sentiment/                 # project root
├── sentimentizer/
│   └── data/
│       ├── yelp.dictionary      # Generated during training
│       ├── weights.pth          # Generated during training
│       └── ...
└── ...

Single-node training (recommended for laptops and single-GPU machines)

# NVIDIA GPU
python workflows/driver.py --device cuda --type new --save True

# Apple Silicon (M1/M2/M3/M4) — uses Metal Performance Shaders
python workflows/driver.py --device mps --type new --save True

# CPU only (slowest)
python workflows/driver.py --device cpu --type new --save True

# Quick iteration with less data
python workflows/driver.py --device mps --type new --save True --stop 5000

Tip: On a single GPU, single-node training is faster than distributed training, which only adds overhead. Use --distributed only when you have multiple GPUs or machines.

Distributed training with Ray Train (multi-GPU or multi-machine only)

# Run with 2 workers (default)
python workflows/driver.py --device cuda --distributed --save True

# Run with 4 workers
python workflows/driver.py --device cuda --distributed --num-workers 4 --save True

# Run on CPU only
python workflows/driver.py --device cpu --distributed --num-workers 2

The --distributed flag enables Ray Train, which distributes data and model training across multiple workers. Each worker gets a shard of the dataset and runs the training loop with PyTorch Distributed Data Parallel (DDP). Checkpoints and metrics are aggregated automatically by Ray Train.

Distributed training adds overhead (process group init, gradient sync, actor management) and is slower than single-node on a single GPU. Only use it when you have multiple GPUs or machines.
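
Conceptually, the distributed path wraps the same training loop in a Ray Train TorchTrainer, roughly as sketched below (illustrative only: build_model, build_loader, and run_one_epoch are placeholders, not functions from this package):

import ray.train
from ray.train import ScalingConfig
from ray.train.torch import TorchTrainer, prepare_model, prepare_data_loader

def train_loop_per_worker(config):
    model = prepare_model(build_model(config))          # wraps the model in DDP on the right device
    loader = prepare_data_loader(build_loader(config))  # adds a DistributedSampler so each worker gets a shard
    for epoch in range(config["epochs"]):
        loss = run_one_epoch(model, loader)             # placeholder for the per-epoch training step
        ray.train.report({"epoch": epoch, "loss": loss})  # metrics aggregated by Ray Train

trainer = TorchTrainer(
    train_loop_per_worker,
    train_loop_config={"epochs": 5},
    scaling_config=ScalingConfig(num_workers=2, use_gpu=True),  # mirrors --num-workers / --device cuda
)
result = trainer.fit()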

CLI arguments

| Flag | Default | Description |
| --- | --- | --- |
| --device | cuda | Device to use: cuda, mps, or cpu |
| --model | rnn | Model type: rnn, encoder, or decoder |
| --type | new | Run type: new (from scratch) or update (resume) |
| --stop | 10000 | Number of lines to load from the dataset |
| --save | False | Save model weights after training |
| --distributed | False | Enable distributed training with Ray Train |
| --num-workers | 2 | Ray Train workers (distributed mode only; single-node ignores this) |
| --agent-tune | False | Use Pydantic AI + LangGraph agent for hyperparameter tuning (GLM 5.1 via Ollama) |
| --agent-config | None | Path to agent config YAML (default: sentimentizer/agent/config.yaml) |
| --checkpoint-dir | "" | Directory to save training checkpoints (empty = no checkpointing) |
| --resume | False | Resume training from the latest checkpoint in --checkpoint-dir |

Checkpointing

Model checkpoints save the full training state (model weights, optimizer state, scheduler state, epoch number) so you can resume training after interruptions.
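
Concretely, each checkpoint file is a single torch.save dict along these lines (a sketch of the stated contents; the exact key names used by trainer.py are an assumption):

import torch

# model, optimizer, scheduler, epoch, and val_loss come from the training loop
checkpoint = {
    "epoch": epoch,
    "model_state_dict": model.state_dict(),
    "optimizer_state_dict": optimizer.state_dict(),
    "scheduler_state_dict": scheduler.state_dict(),
    "val_loss": val_loss,
}
torch.save(checkpoint, "checkpoints/checkpoint_epoch_1.pth")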

Enable checkpointing

# Save checkpoints every epoch to a directory
python workflows/driver.py --device mps --type new --checkpoint-dir checkpoints/

# Save checkpoints every N epochs (e.g., every 2 epochs)
python workflows/driver.py --device cuda --type new --checkpoint-dir checkpoints/ --checkpoint-every 2

This creates two types of checkpoints in --checkpoint-dir:

  • Periodic checkpoints: checkpoint_epoch_1.pth, checkpoint_epoch_2.pth, etc.
  • Best model checkpoint: best_model.pth (lowest validation loss seen so far)

Resume from a checkpoint

# Resume from the latest checkpoint
python workflows/driver.py --device mps --type new --checkpoint-dir checkpoints/ --resume

The --resume flag loads the latest periodic checkpoint and restores model weights, optimizer state, and scheduler state before continuing training.

Programmatic API

from sentimentizer.trainer import save_checkpoint, load_checkpoint, latest_checkpoint

# Save a checkpoint
save_checkpoint(model, optimizer, epoch=5, path="checkpoints/ckpt.pth", val_loss=0.32)

# Find the latest checkpoint
ckpt_path = latest_checkpoint("checkpoints/")

# Load and resume
checkpoint = load_checkpoint(ckpt_path, model, optimizer, scheduler, device="cpu")
print(f"Resuming from epoch {checkpoint['epoch']}")

Agent Tuning

An LLM-guided hyperparameter tuning agent that uses Pydantic AI Slim (GLM 5.1 via Ollama) for reasoning, LangGraph for workflow orchestration, and Ray Tune + Optuna for the search backend.

Architecture

analyze (GLM 5.1) → decide (GLM 5.1) → tune (Ray Tune + Optuna) → evaluate
     ↑                                                              │
     └──────────────────────────────────────────────────────────────┘
                          (loop until converged)
  1. analyze — GLM 5.1 examines training metrics, detects overfitting/underfitting, assesses learning rate
  2. decide — GLM 5.1 chooses a strategy (widen, narrow, change_focus, increase_epochs, stop) and produces a validated TuningDecision with an updated search space
  3. tune — Ray Tune + Optuna executes the hyperparameter search with ASHA scheduling
  4. evaluate — Checks convergence (improvement below threshold for 3 iterations, max iterations reached, or agent decides to stop)
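
The loop above maps directly onto a LangGraph StateGraph. A sketch of how graph.py plausibly wires it together (node and module names follow the project structure; the converged state field and exact edge routing are assumptions):

from langgraph.graph import StateGraph, END

from sentimentizer.agent.state import AgentState
from sentimentizer.agent.nodes import analyze, decide, tune, evaluate

graph = StateGraph(AgentState)
for name, fn in [("analyze", analyze), ("decide", decide), ("tune", tune), ("evaluate", evaluate)]:
    graph.add_node(name, fn)

graph.set_entry_point("analyze")
graph.add_edge("analyze", "decide")
graph.add_edge("decide", "tune")
graph.add_edge("tune", "evaluate")
graph.add_conditional_edges(                       # loop back until converged or the agent stops
    "evaluate",
    lambda state: END if state["converged"] else "analyze",
)
app = graph.compile()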

Prerequisites

Install Ollama and pull the GLM 5.1 model:

ollama pull glm5.1

Usage

# Run the tuning agent with default config
python workflows/driver.py --model rnn --agent-tune

# With a custom agent config
python workflows/driver.py --model encoder --agent-tune --agent-config path/to/custom.yaml

# Save the best configuration to JSON
python workflows/driver.py --model rnn --agent-tune --save

Configuration

Agent settings are defined in sentimentizer/agent/config.yaml:

agent:
  model_name: glm5.1                    # Ollama model name
  ollama_base_url: http://localhost:11434/v1
  max_iterations: 5                      # Max agent loop iterations
  convergence_threshold: 0.005           # Stop if avg improvement < threshold over 3 iterations
  temperature: 0.3                       # LLM sampling temperature
  max_tokens: 2048                       # Max LLM output tokens
  checkpointing:
    enabled: true
    db_path: agent_checkpoints.db
  human_in_the_loop: false               # Require human approval (future)

tuner:
  scheduler: asha                        # asha, hyperband, or median
  metric: val_accuracy
  mode: max
  num_samples: 20                        # Trials per tuning iteration
  grace_period: 2
  reduction_factor: 3
  search_spaces:
    rnn:
      lr: { type: loguniform, low: 1e-5, high: 1e-2 }
      hidden_size: { type: choice, values: [128, 256, 512] }
      ...

Override the config path via the SENTIMENTIZER_AGENT_CONFIG environment variable.

Model Configuration

All model architecture parameters are configured via dataclasses in sentimentizer/config.py. To change layer dimensions, update the config and retrain:

from sentimentizer.config import RNNConfig, EncoderConfig, DecoderConfig

# Customize RNN — e.g., larger hidden state and 3 layers
rnn_config = RNNConfig(hidden_size=512, num_layers=3, dropout=0.3)

# Customize Encoder — e.g., wider model with 8 heads
encoder_config = EncoderConfig(d_model=512, n_heads=8, n_layers=6, ff_multiplier=4)

# Customize Decoder — e.g., deeper decoder
decoder_config = DecoderConfig(d_model=512, n_heads=8, n_encoder_layers=4, n_decoder_layers=8)

The config flows: config.py → DriverConfig → new_model() / get_trained_model() → model __init__ sets layer dimensions.

| Config | Default parameters | Notes |
| --- | --- | --- |
| RNNConfig | hidden_size=256, num_layers=2, dropout=0.2 | Bidirectional LSTM |
| EncoderConfig | d_model=256, n_heads=4, n_layers=4, dropout=0.2, ff_multiplier=4 | Transformer encoder + CLS token |
| DecoderConfig | d_model=256, n_heads=4, n_encoder_layers=2, n_decoder_layers=4, dropout=0.2, ff_multiplier=4 | Encoder-decoder + query token |

Architecture

The pipeline consists of three stages, all powered by Ray:

  1. Extract — Reads raw JSON data from .zip or .tar archives using ray.data and tokenizes text
  2. Transform — Converts tokens to numeric sequences using ray.data.map_batches() and writes processed parquet
  3. Train — Fits the model using either single-node PyTorch or distributed Ray Train with TorchTrainer

Inference is served via Ray Serve (see serve.py and sentimentizer/serve.py).
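
A condensed view of the Extract and Transform stages with Ray Data (illustrative only: the paths and the tokenize_batch helper are assumptions, and the real extractor reads the tar archive rather than pre-extracted JSON):

import ray
from sentimentizer.tokenizer import get_trained_tokenizer

tokenizer = get_trained_tokenizer()

# Extract: read raw review JSON (path is an assumption for illustration)
ds = ray.data.read_json("../data/yelp_extracted/")

# Transform: tokenize in batches and persist parquet for the Train stage
def tokenize_batch(batch):
    batch["token_ids"] = [tokenizer.tokenize_text(t) for t in batch["text"]]
    return batch

ds = ds.map_batches(tokenize_batch, batch_format="pandas")
ds.write_parquet("sentimentizer/data/processed/")  # output location is an assumption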

Docker

Build and run the containerized service:

# Build
docker build -t sentimentizer .

# Run
docker run -p 8000:8000 -p 8265:8265 sentimentizer

The image uses a multi-stage build with Python 3.11-slim and CPU-only PyTorch. Port 8000 serves predictions; port 8265 exposes the Ray dashboard.

Kubernetes

Kubernetes manifests are in the k8s/ directory:

| File | Resource | Purpose |
| --- | --- | --- |
| deployment.yaml | Deployment | Pod template with the sentimentizer container |
| service.yaml | Service | ClusterIP service for internal routing |
| hpa.yaml | HorizontalPodAutoscaler | Auto-scaling based on CPU/memory usage |
| ingress.yaml | Ingress | HTTP ingress routing |
| pdb.yaml | PodDisruptionBudget | Minimum available replicas during disruptions |

Development

With uv (recommended)

This project uses uv for dependency management:

# Install uv (if not already installed)
curl -LsSf https://astral.sh/uv/install.sh | sh

# Install dependencies
uv sync

# Install with dev dependencies
uv sync --extra dev

With conda

conda create -n sentimentizer
conda activate sentimentizer
conda install pip
pip install -e .

Testing

# Run all tests
uv run pytest tests/ -v

# Run only Ray Train tests
uv run pytest tests/ -v -k "Ray"

# Run with coverage
uv run pytest tests/ -v --cov=sentimentizer --cov-report=term-missing

Project Structure

sentimentizer/
├── __init__.py          # Logging and timing utilities
├── config.py            # Configuration dataclasses, enums, and constants
├── extractor.py         # Ray Data extraction from zip/tar archives
├── loader.py            # Data loading utilities
├── tokenizer.py         # Text tokenizer with pre-trained support
├── trainer.py           # Training logic
├── tuner.py             # Ray Tune + Optuna hyperparameter search
├── serve.py             # Ray Serve deployment app
├── data/                # Training data (Yelp, GloVe)
├── agent/               # LLM-guided tuning agent
│   ├── __init__.py      # Package exports
│   ├── config.yaml      # Agent + tuner configuration (YAML)
│   ├── loader.py        # YAML → dataclass config loader
│   ├── models.py        # Pydantic models (AnalysisResult, TuningDecision, etc.)
│   ├── agents.py        # Pydantic AI agents (GLM 5.1 via Ollama)
│   ├── prompts.py       # System prompts for analysis & strategy agents
│   ├── state.py         # LangGraph AgentState TypedDict
│   ├── nodes.py         # LangGraph node functions (analyze, decide, tune, evaluate)
│   └── graph.py         # LangGraph StateGraph + run_agent_tuning() entry point
└── models/
    ├── __init__.py
    ├── rnn.py           # RNN model with GloVe embeddings
    ├── encoder.py       # Transformer encoder model
    └── decoder.py       # Transformer decoder model

License

MIT
