Skip to main content

A machine learning–based method for transforming text into instrumental variables (IV).

Project description

TexIV


A machine learning–based package for transforming text into instrumental variables (IV).

PyPI version License: MIT Issue


icon

✨ Feature

  • Support multiple Chinese word segmentation and embedding methods
  • Customizable stopwords
  • Support keyword relevance filtering and two-stage filtering
  • Output includes frequency, total count, and ratio statistics

📦 Requirements

  • Python 3.11+
  • Recommended to use virtual environment (e.g., venv or conda)

🚀 Quickly Start

Install

pip install texiv

Usage

from typing import List

from texiv import TexIV

texiv = TexIV()
content: str = "This is a test text..."
keywords: List[str] = ["keyword1", "keyword2", "keyword3"]
texiv.texiv_it(content, keywords)

Output example:

{'freq': 7, 'count': 34, 'rate': 0.20588235294117646}

🖥️ Command Line Tool

The project also provides a command-line interface that can be used directly after installation:

tiv --help

🛠️ Configuration

All models and parameters can be adjusted through configuration files in ~/.texiv/config.toml.

📄 License

This project is licensed under the MIT License. See LICENSE for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

texiv-0.1.0.tar.gz (1.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

texiv-0.1.0-py3-none-any.whl (25.5 kB view details)

Uploaded Python 3

File details

Details for the file texiv-0.1.0.tar.gz.

File metadata

  • Download URL: texiv-0.1.0.tar.gz
  • Upload date:
  • Size: 1.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.7.17

File hashes

Hashes for texiv-0.1.0.tar.gz
Algorithm Hash digest
SHA256 ccb7f969b7c6cb33b5c2529b197330dcfad37a7cb85242454a117a5afeb70b4b
MD5 46d4ba3e67bf5e46ef8545e722769d4a
BLAKE2b-256 72977066f9bca5b31151cde4a0fe74194604ca53562f720d9f7d8985f6b526b5

See more details on using hashes here.

File details

Details for the file texiv-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: texiv-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 25.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.7.17

File hashes

Hashes for texiv-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ee9996a0fdead9f8ff0c2554ccf8f83e1e566289c6b4497f06bf350e164344d1
MD5 af35ee97ca5e3d6692ebb0c6212e6ef5
BLAKE2b-256 c57ef94c2d92fedf681e585506b3249687ece2d754f1ae8f84e4f2dda121cbdc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page