Skip to main content

A machine learning–based method for transforming text into instrumental variables (IV).

Project description

TexIV Logo


A machine learning–based package for transforming text into instrumental variables (IV).

PyPI version License: MIT Issue Wiki Ask DeepWiki


🌰 Example

Visit Stata Example File and Python Example File.

There is a step-by-step example.

✨ Feature

  • Support multiple Chinese word segmentation and embedding methods
  • Customizable stopwords
  • Support keyword relevance filtering and two-stage filtering
  • Output includes frequency, total count, and ratio statistics

📦 Requirements

  • Python 3.11+
  • Recommended to use virtual environment (e.g., venv or conda)

🚀 Quickly Start

Install

pip install texiv

Usage

from typing import List

from texiv import TexIV

texiv = TexIV()
content: str = "This is a test text..."
keywords: List[str] = ["keyword1", "keyword2", "keyword3"]
texiv.texiv_it(content, keywords)

Output example:

{'freq': 7, 'count': 34, 'rate': 0.20588235294117646}

🖥️ Command Line Tool

The project also provides a command-line interface that can be used directly after installation:

texiv --help

🛠️ Configuration

All models and parameters can be adjusted through configuration files in ~/.texiv/config.toml.

📄 License

This project is licensed under the MIT License. See LICENSE for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

texiv-0.1.10.tar.gz (3.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

texiv-0.1.10-py3-none-any.whl (37.1 kB view details)

Uploaded Python 3

File details

Details for the file texiv-0.1.10.tar.gz.

File metadata

  • Download URL: texiv-0.1.10.tar.gz
  • Upload date:
  • Size: 3.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.7.21

File hashes

Hashes for texiv-0.1.10.tar.gz
Algorithm Hash digest
SHA256 c07e86285ee4e0f36bfa6f1d3813883f6d10d224d57de3a3f311aa0634c7ddf8
MD5 24a32a866924aa57ef99f11947b868a8
BLAKE2b-256 e9cc08c2142a9f00bec53cd958837302c4d95e28e78ded53e068cecc89e3f0a4

See more details on using hashes here.

File details

Details for the file texiv-0.1.10-py3-none-any.whl.

File metadata

  • Download URL: texiv-0.1.10-py3-none-any.whl
  • Upload date:
  • Size: 37.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.7.21

File hashes

Hashes for texiv-0.1.10-py3-none-any.whl
Algorithm Hash digest
SHA256 099dc44eb7bb51a72f6bd45f825060e552f2edcb5606bbc1a45e6ea6584fd0ab
MD5 8e2be71649814f3f6a395f1244d1aa9d
BLAKE2b-256 fa0040aa3a83e5ff95773f1e52a85d001c76856c9c38df8874ded0cc8d7e79f0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page