A machine learning–based method for transforming text into instrumental variables (IV).


🌰 Example

See the Stata Example File and the Python Example File.

A step-by-step example is also available.

✨ Features

  • Supports multiple Chinese word segmentation and embedding methods
  • Customizable stopwords
  • Supports keyword relevance filtering and two-stage filtering
  • Output includes frequency, total count, and ratio statistics
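
To make the idea of keyword relevance filtering concrete, here is a minimal sketch of one common approach: keep a text chunk when its embedding is close enough (by cosine similarity) to at least one keyword embedding. This is an illustration under assumptions, not TexIV's actual implementation. The function names, the toy 2-D vectors, the threshold of 0.5, and the mapping of the results onto `freq`, `count`, and `rate` are all hypothetical.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity of two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def filter_relevant(
    chunk_vectors: list[list[float]],
    keyword_vectors: list[list[float]],
    threshold: float,
) -> list[int]:
    """Return indices of chunks whose best keyword similarity meets the threshold."""
    kept = []
    for index, chunk in enumerate(chunk_vectors):
        best = max(
            (cosine_similarity(chunk, kw) for kw in keyword_vectors),
            default=0.0,
        )
        if best >= threshold:
            kept.append(index)
    return kept


# Toy embeddings (hypothetical): real embeddings would come from a model.
chunks = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]
keywords = [[1.0, 0.1]]

kept = filter_relevant(chunks, keywords, threshold=0.5)

# Assumed mapping onto the reported statistics: matched chunks, total chunks, ratio.
stats = {"freq": len(kept), "count": len(chunks), "rate": len(kept) / len(chunks)}
print(kept, stats)
```

A two-stage variant would presumably apply a second, stricter check to the chunks that survive this first pass, but the details of how TexIV does this are not described here.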

📦 Requirements

  • Python 3.11+
  • A virtual environment (e.g., venv or conda) is recommended

🚀 Quick Start

Install

pip install texiv

Usage

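from texiv import TexIV

# Create a TexIV instance.
texiv = TexIV()

# The text to analyze and the keywords to measure it against.
content: str = "This is a test text..."
keywords: list[str] = ["keyword1", "keyword2", "keyword3"]

# Compute the statistics and print them.
result = texiv.texiv_it(content, keywords)
print(result)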

Output example:

{'freq': 7, 'count': 34, 'rate': 0.20588235294117646}
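
The three fields in this sample are numerically consistent with `rate` being `freq` divided by `count`. That reading is an inference from the numbers shown, not something stated by the package, so treat the check below as a sanity test rather than a specification.

```python
import math

# Sample output copied from the example above.
result = {"freq": 7, "count": 34, "rate": 0.20588235294117646}

# Assumption: rate = freq / count (inferred from the sample values).
recomputed_rate = result["freq"] / result["count"]

# Compare with a tolerance instead of exact float equality.
matches = math.isclose(recomputed_rate, result["rate"], rel_tol=1e-12)
print(recomputed_rate, matches)
```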

🖥️ Command Line Tool

The project also provides a command-line interface that can be used directly after installation:

texiv --help

🛠️ Configuration

All models and parameters can be adjusted through the configuration file at ~/.texiv/config.toml.

📄 License

This project is licensed under the MIT License. See LICENSE for details.
