Skip to main content

A machine learning–based method for transforming text into instrumental variables (IV).

Project description

TexIV


A machine learning–based package for transforming text into instrumental variables (IV).

PyPI version License: MIT Issue


icon

✨ Feature

  • Support multiple Chinese word segmentation and embedding methods
  • Customizable stopwords
  • Support keyword relevance filtering and two-stage filtering
  • Output includes frequency, total count, and ratio statistics

📦 Requirements

  • Python 3.11+
  • Recommended to use virtual environment (e.g., venv or conda)

🚀 Quickly Start

Install

pip install texiv

Usage

from typing import List

from texiv import TexIV

texiv = TexIV()
content: str = "This is a test text..."
keywords: List[str] = ["keyword1", "keyword2", "keyword3"]
texiv.texiv_it(content, keywords)

Output example:

{'freq': 7, 'count': 34, 'rate': 0.20588235294117646}

🖥️ Command Line Tool

The project also provides a command-line interface that can be used directly after installation:

texiv --help

🛠️ Configuration

All models and parameters can be adjusted through configuration files in ~/.texiv/config.toml.

📄 License

This project is licensed under the MIT License. See LICENSE for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

texiv-0.1.1.tar.gz (1.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

texiv-0.1.1-py3-none-any.whl (26.2 kB view details)

Uploaded Python 3

File details

Details for the file texiv-0.1.1.tar.gz.

File metadata

  • Download URL: texiv-0.1.1.tar.gz
  • Upload date:
  • Size: 1.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.7.17

File hashes

Hashes for texiv-0.1.1.tar.gz
Algorithm Hash digest
SHA256 40f42326cbb67813836a29fbce32fba150818e4d9aded528f82f06914a8c05b8
MD5 3f455f619db37850a79658177f5d3b5d
BLAKE2b-256 f76101a2d3f8633eb7272c19fc86109e6d9c1191080a4c9ef37d29aaff08367b

See more details on using hashes here.

File details

Details for the file texiv-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: texiv-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 26.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.7.17

File hashes

Hashes for texiv-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 655e3872ed5b0c4e3913d9aa852959e75e6bbbd56eb1cfe5bb73b9f1991cd77f
MD5 27703365bfa0a13e737d835abe100727
BLAKE2b-256 745d8e3546824f2f76d47abc216cff4cacceb56e33516176c1de17e1aea11e4e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page