Mathematical algorithms module for LitePolis platform, providing PCA and KMeans implementations for user vote data analysis

Project description

LitePolis Math Default Module

This module provides mathematical algorithms, specifically PCA and KMeans, for the LitePolis platform based on LitePolis-math. It is designed to work with user vote data stored in a database (primarily StarRocks, with potential support for PostgreSQL via the litepolis_database_default module) to produce outputs for API endpoints.

Installation

This module will be available on PyPI. You can install it using pip:

pip install litepolis-math-default

Configuration

This module relies on the database connection configured for the litepolis_database_default module. Ensure that the litepolis_database_default section within your ~/litepolis/litepolis.config file is correctly set up to connect to your database (StarRocks or PostgreSQL).

Example ~/litepolis/litepolis.config snippet:

[litepolis_database_default]
database_url = starrocks://user:password@host:port/database
# Or for PostgreSQL:
# database_url = postgresql://user:password@host:port/database
sqlalchemy_engine_pool_size = 10
sqlalchemy_pool_max_overflow = 20

The module uses the litepolis_database_default.utils.connect_db() function to obtain the database engine based on this configuration.

Usage

The primary use case for this module is to perform PCA on a user-vote matrix derived from your database. The results can then be used for further analysis or as input for other algorithms like KMeans clustering.

Here's a quick example demonstrating how to use the core components:

import logging
from litepolis_math_default import fetch_r_matrix
from litepolis_math_default.algorithms import PCA
# from litepolis_math_default.algorithms import KMeans # Uncomment if needed

# Configure logging (optional)
logging.basicConfig(level=logging.INFO, format='%(asctime)s - %(levelname)s - %(message)s')

try:
    # 1. Fetch and build the user-vote matrix (R matrix)
    r_matrix = fetch_r_matrix(engine)
    logging.info(f"Fetched R matrix with shape: {r_matrix.shape}")

    # Optional: Validate the matrix
    # from litepolis_math_default.validation import validate_matrix
    # validate_matrix(r_matrix)
    # logging.info("R matrix validated.")

    # 2. Apply PCA
    # The PCA algorithm expects a NumPy array
    pca = PCA(n_components=2)
    pca_result = pca.fit_transform(r_matrix.values)
    logging.info(f"PCA applied. Result shape: {pca_result.shape}")

    # 3. (Optional) Apply KMeans clustering on PCA results
    # kmeans = KMeans(n_clusters=3)
    # user_clusters = kmeans.fit_predict(pca_result)
    # logging.info("KMeans clustering applied.")
    # print("User clusters:", user_clusters)

    # 'pca_result' contains the PCA output (user coordinates in the reduced dimension space)
    # You can now use 'pca_result' in your API endpoint response or for further processing.

except Exception as e:
    logging.error(f"An error occurred: {e}")

Incremental PCA Updates

The PCA class now supports incremental updates, allowing you to update the principal components with new data without refitting the model on the entire dataset. This can be useful when your data (like the R matrix) changes over time and you want to update the PCA model efficiently.

To use the incremental update feature:

Initialize and fit the PCA model with an initial batch of data using fit_transform.
When new data becomes available, use the update method with the new data batch.

import numpy as np
from litepolis_math_default.algorithms import PCA

# Assume pca is already initialized and fitted with initial data
# pca = PCA(n_components=2)
# initial_data = np.random.rand(100, 10) # Example initial data
# pca.fit_transform(initial_data)

# When new data arrives
new_data_batch = np.random.rand(10, 10) # Example new data batch

# Update the PCA model with the new data
pca.update(new_data_batch)

# You can now transform new data using the updated components
transformed_new_data = pca.transform(new_data_batch)
print("Transformed new data shape:", transformed_new_data.shape)

# You can also transform the original data or any other data using the updated components
# transformed_initial_data = pca.transform(initial_data)
# print("Transformed initial data shape:", transformed_initial_data.shape)

This incremental update process can help maintain a relevant PCA model as your data evolves without the computational cost of refitting on the entire cumulative dataset.

Extending the Module

The module is structured to allow for easy extension. Additional algorithms can be added to the litepolis_math_default/algorithms/ directory.

Development

For development and running tests, you will need pytest, pandas, numpy, sqlalchemy, and potentially scikit-learn (for comparison/validation of custom algorithms).

pip install pytest pandas numpy sqlalchemy scikit-learn

Project details

Release history Release notifications | RSS feed

This version

0.1.0

Mar 15, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

litepolis_math_default-0.1.0.tar.gz (13.3 kB view details)

Uploaded Mar 15, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

litepolis_math_default-0.1.0-py3-none-any.whl (13.4 kB view details)

Uploaded Mar 15, 2026 Python 3

File details

Details for the file litepolis_math_default-0.1.0.tar.gz.

File metadata

Download URL: litepolis_math_default-0.1.0.tar.gz
Upload date: Mar 15, 2026
Size: 13.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for litepolis_math_default-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`0e7a573941a2dd9fafa71bce9c3cce6193321d571d717fd064445b20c2c77772`
MD5	`c013f32f2786853ec3e7427157bfded0`
BLAKE2b-256	`16062e5bd514edb36952fa71780310283b8d31dcceeeb31cf5884226bb1b253d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for litepolis_math_default-0.1.0.tar.gz:

Publisher: publish.yml on NewJerseyStyle/LitePolis-math-default

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: litepolis_math_default-0.1.0.tar.gz
- Subject digest: 0e7a573941a2dd9fafa71bce9c3cce6193321d571d717fd064445b20c2c77772
- Sigstore transparency entry: 1108520527
- Sigstore integration time: Mar 15, 2026
Source repository:
- Permalink: NewJerseyStyle/LitePolis-math-default@045e823c7851b8959e4c56b60b1db3b856990e47
- Branch / Tag: refs/heads/main
- Owner: https://github.com/NewJerseyStyle
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@045e823c7851b8959e4c56b60b1db3b856990e47
- Trigger Event: workflow_dispatch

File details

Details for the file litepolis_math_default-0.1.0-py3-none-any.whl.

File metadata

Download URL: litepolis_math_default-0.1.0-py3-none-any.whl
Upload date: Mar 15, 2026
Size: 13.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for litepolis_math_default-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4368e52223763f3c485484ed0cf2a47d358cce35038c5cee735b2f463a976ad5`
MD5	`b259310bc765f4ef56dacea52e9f4b80`
BLAKE2b-256	`c79da64d530e580e764871e696fe939cd143715f3d6d6c1b85f35692cd35ccf6`

See more details on using hashes here.

Provenance

The following attestation bundles were made for litepolis_math_default-0.1.0-py3-none-any.whl:

Publisher: publish.yml on NewJerseyStyle/LitePolis-math-default

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: litepolis_math_default-0.1.0-py3-none-any.whl
- Subject digest: 4368e52223763f3c485484ed0cf2a47d358cce35038c5cee735b2f463a976ad5
- Sigstore transparency entry: 1108520534
- Sigstore integration time: Mar 15, 2026
Source repository:
- Permalink: NewJerseyStyle/LitePolis-math-default@045e823c7851b8959e4c56b60b1db3b856990e47
- Branch / Tag: refs/heads/main
- Owner: https://github.com/NewJerseyStyle
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@045e823c7851b8959e4c56b60b1db3b856990e47
- Trigger Event: workflow_dispatch

litepolis-math-default 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

LitePolis Math Default Module

Installation

Configuration

Usage

Incremental PCA Updates

Extending the Module

Development

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance