A machine learning–based method for transforming text into instrumental variables (IV).

🌰 Example

See the Stata example file and the Python example file.

A step-by-step example is also available.

✨ Features

  • Supports multiple Chinese word segmentation and embedding methods
  • Customizable stopwords
  • Supports keyword relevance filtering and two-stage filtering
  • Output includes frequency, total count, and ratio statistics

📦 Requirements

  • Python 3.11+
  • A virtual environment (e.g., venv or conda) is recommended

🚀 Quick Start

Install

pip install texiv

Usage

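from texiv import TexIV

# Create an instance; settings are read from the configuration file
texiv = TexIV()
content: str = "This is a test text..."
keywords: list[str] = ["keyword1", "keyword2", "keyword3"]
# Compute the keyword statistics for the text
result = texiv.texiv_it(content, keywords)
print(result)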

Output example:

{'freq': 7, 'count': 34, 'rate': 0.20588235294117646}

🖥️ Command Line Tool

The project also provides a command-line interface that can be used directly after installation:

texiv --help

🛠️ Configuration

All models and parameters can be adjusted through the configuration file at ~/.texiv/config.toml.

📄 License

This project is licensed under the MIT License. See LICENSE for details.
