Skip to main content

Text processing utilities for scientific workflows — LaTeX formatting, colored output, string parsing, and plot text helpers

Project description

SciTeX Str (scitex-str)

SciTeX

Text processing utilities for scientific workflows.

Full Documentation · uv pip install scitex-str[all]

PyPI Python Tests Coverage Docs License: AGPL v3


Problem and Solution

# Problem Solution
1 LaTeX labels crash matplotlib when TeX isn't installed — CI runners, laptops without MacTeX, Colab without !apt install texlive all fail safe_latex_render(s) — auto-detects LaTeX; falls back to mathtext then unicode silently
2 ANSI color codes + grep/parse sprinkled as ad-hoc re patterns — each script reinvents the wheel Grab-bag of helpersprintc, color_text, grep, parse, replace, mask_api, readable_bytes — boring but consistent across 33 packages

Installation

Requires Python >= 3.10.

pip install scitex-str

Architecture

scitex_str/
├── _to_latex_style.py / _safe_to_latex_style.py   # LaTeX rendering with fallback
├── _color_text.py / _printc.py                     # ANSI color helpers
├── _parse.py / _grep.py / _search.py / _replace.py # text search & template parse
├── _format_plot_text.py                            # axis-label formatter
├── _readable_bytes.py / _factor_out_digits.py      # numeric formatting
├── _mask_api.py / _remove_ansi.py                  # sanitization
├── _squeeze_space.py / _title.py / _decapitalize.py# small string ops
└── ...                                              # ~20 boring helpers, one per file
flowchart LR
    LX[LaTeX/mathtext<br/>fallback] --> A[to_latex_style<br/>safe_latex_render]
    ANSI[ANSI tooling] --> B[printc / ct / remove_ansi]
    Tmpl[Template parsing] --> C[parse / grep / search / replace]
    Fmt[Numeric formatting] --> D[readable_bytes<br/>factor_out_digits]
    Plot[Plot text] --> E[format_plot_text]
    Sanit[Sanitization] --> F[mask_api_key]

Figure 1. Module layout. Each helper is a single-file leaf — boring on purpose, consistent across 33 ecosystem packages.

1 Interfaces

Python API
import scitex_str as ss

# LaTeX-style formatting (with safe fallback)
ss.to_latex_style("theta")              # r"$\theta$"
ss.safe_to_latex_style("unknown")       # "unknown" (no error)

# Colored terminal output
ss.printc("Success!", color="green")
ss.ct("Warning", color="yellow")        # returns colored string

# Parse structured paths
ss.parse("./data/Patient_23/Hour_12",
         "./data/Patient_{id}/Hour_{hour}")  # {'id': 23, 'hour': 12}

# Plot text formatting
ss.format_plot_text("amplitude_mv")     # "Amplitude [mV]"

# Numeric formatting
ss.readable_bytes(1_500_000)            # "1.43 MB"
ss.factor_out_digits([1000, 2000, 3000])

# Misc
ss.grep(pattern, lines)
ss.search(...)
ss.replace(...)
ss.mask_api_key("sk-...")
ss.remove_ansi(text)
ss.squeeze_space("a  b   c")            # "a b c"
ss.title_case("hello world")
ss.decapitalize("Hello")

Demo

import scitex_str as ss

# 1) LaTeX-safe label rendering — no crash if TeX missing
label = ss.safe_to_latex_style("theta")    # "$\\theta$" or unicode fallback

# 2) Colored terminal status
ss.printc("[ok] tunnel established", color="green")
ss.printc("[warn] retry in 3s",      color="yellow")

# 3) Parse a structured directory
ss.parse("./data/Patient_23/Hour_12",
         "./data/Patient_{id}/Hour_{hour}")  # → {'id': 23, 'hour': 12}

# 4) Human-readable byte size
ss.readable_bytes(1_500_000)               # → "1.43 MB"

# 5) Mask credentials before logging
ss.mask_api_key("sk-abcdef1234567890")     # → "sk-***7890"
flowchart LR
    A[Raw value] --> B{kind?}
    B -- bytes --> RB[readable_bytes]
    B -- path --> P[parse]
    B -- math --> L[safe_to_latex_style]
    B -- secret --> M[mask_api_key]
    B -- log line --> PC[printc]
    RB --> O[Pretty output]
    P --> O
    L --> O
    M --> O
    PC --> O
    style O fill:#27ae60,stroke:#2c3e50,color:#fff

Figure 2. Demo. Pick the helper by what you have, not by where it lives.

Part of SciTeX

scitex-str is part of SciTeX. Install via the umbrella with pip install scitex[str] to use as scitex.str (Python) or scitex str ... (CLI).

Four Freedoms for Research

  1. The freedom to run your research anywhere — your machine, your terms.
  2. The freedom to study how every step works — from raw data to final manuscript.
  3. The freedom to redistribute your workflows, not just your papers.
  4. The freedom to modify any module and share improvements with the community.

AGPL-3.0 — because we believe research infrastructure deserves the same freedoms as the software it runs on.

License

AGPL-3.0-only.


SciTeX

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scitex_str-0.1.12.tar.gz (8.7 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

scitex_str-0.1.12-py3-none-any.whl (8.4 MB view details)

Uploaded Python 3

File details

Details for the file scitex_str-0.1.12.tar.gz.

File metadata

  • Download URL: scitex_str-0.1.12.tar.gz
  • Upload date:
  • Size: 8.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for scitex_str-0.1.12.tar.gz
Algorithm Hash digest
SHA256 8c34e7341ccbfa0b0aa180f0acdcdd0f645a589a139f068b2335245bc89c612b
MD5 c4c10fed62f678bbfd7cfa248911a0b4
BLAKE2b-256 c836bb3fe4a414f87f5308ea01bd69230cbb7ef25d432ef7510c7f0facf1d15a

See more details on using hashes here.

Provenance

The following attestation bundles were made for scitex_str-0.1.12.tar.gz:

Publisher: pypi-publish-and-github-release-on-tag.yml on ywatanabe1989/scitex-str

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file scitex_str-0.1.12-py3-none-any.whl.

File metadata

  • Download URL: scitex_str-0.1.12-py3-none-any.whl
  • Upload date:
  • Size: 8.4 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for scitex_str-0.1.12-py3-none-any.whl
Algorithm Hash digest
SHA256 dcc07377b2c23c66fd483795ac51a00e786b6eea5f2ebb396e0833a2e8a5aea0
MD5 1bea2e56a4fe0132245d59f3323adb54
BLAKE2b-256 96d664cc99468ed9858330024e7a0baff09816d7d349dce880b6c4188c874c7e

See more details on using hashes here.

Provenance

The following attestation bundles were made for scitex_str-0.1.12-py3-none-any.whl:

Publisher: pypi-publish-and-github-release-on-tag.yml on ywatanabe1989/scitex-str

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page