Skip to main content

A Python library for generating and simplifying fuzzy linguistic summaries of data.

Project description

FuzzyLinguistics

PyPI version License

A powerful and flexible Python library for generating and simplifying fuzzy linguistic summaries of data. Turn complex datasets into human-readable, natural language insights.

This library implements a complete system for creating summaries of one or two datasets (e.g., comparing models), evaluating their truth and quality, and simplifying them from thousands of potential statements into a handful of concise, meaningful insights.

Key Features

  • Generate Summaries: Create summaries for single datasets or compare two datasets against each other.
  • Flexible Configuration: Define your linguistic variables programmatically in Python or through a clean JSON interface.
  • Powerful Simplification: A multi-stage graph-based algorithm prunes and combines thousands of raw statements into a coherent, high-level summary.
  • Quality Metrics: Evaluates summaries based on truth, focus, simplicity, and relevance.
  • Visualization: Includes tools to plot membership functions and export the summary graph for analysis.

Installation

Install fuzzylinguistics directly from PyPI:

pip install fuzzylinguistics

Quick Start

The easiest way to get started is with a JSON configuration file.

1. Create your configuration file (config.json)

This file defines your data, linguistic variables (summarizers, qualifiers), and quantifiers.

{
    "datasets": [
        {
            "category_name": "Cars",
            "model_name": "Car Example",
            "uses_qualifier": true,
            "input_dimension_labels": ["color"],
            "input_dimension_units": [null],
            "input_data": [
                [0],
                [0],
                [0],
                [0],
                [0],
                [0],
                [0],
                [0],
                [0],
                [1]
            ],
            "output_dimension_labels": ["speed"],
            "output_dimension_units": [null],
            "output_data": [
                [0],
                [0],
                [0],
                [0],
                [0],
                [0],
                [0],
                [0],
                [0],
                [1]
            ]
        }
    ],
    "summarizers": [
        {
            "dimension_name": "color",
            "predicates": ["Red", "Green", "Blue"],
            "trapezoidal_x_vals": [
                [0, 0, 0, 0],
                [0.5, 0.5, 0.5, 0.5],
                [1, 1, 1, 1]
            ],
            "relevancy_weights": [1, 1, 1]
        }
    ],
    "qualifiers": [
        {
            "dimension_name": "speed",
            "predicates": ["Slow", "Normal", "Fast"],
            "trapezoidal_x_vals": [
                [0, 0, 0, 0],
                [0.5, 0.5, 0.5, 0.5],
                [1, 1, 1, 1]
            ],
            "relevancy_weights": [1, 1, 1]
        }
    ],
    "quantifiers": {
        "dimension_name": "quantifier",
        "predicates": ["None", "None", "A Few", "Some", "Many", "All"],
        "trapezoidal_x_vals": [
            [null, null, null, null],
            [0, 0, 0, 0],
            [0.00001, 0.00001, 0.00001, 0.5],
            [0.0, 0.5, 0.5, 0.99999],
            [0.5, 0.99999, 0.99999, 0.99999],
            [1, 1, 1, 1]
        ],
        "relevancy_weights": [1.0, 1.0, 1.0, 1.0, 1.0, 1.0]
    }
}

2. Run the summarization in Python

# main.py
from fuzzylinguistics import setup_fls

# 1. Setup the system from the config file
fls = setup_fls(config_json_path="config.json")

# 2. Generate and simplify the summary
# Results will be saved to the specified directory
fls.generate_fls_one_model(results_dir='./car_summary_results')

# 3. Print the final, simplified summary
print("--- Final Simplified Summary ---")
for statement in fls.results["simplified_stage_4_summary"]:
    print(statement)

Example Output:

--- Final Simplified Summary ---
None are green color or normal speed.
Of the cars with blue color none are slow speed, and all are fast speed.
Of the cars with red color none are fast speed, and all are slow speed.

Examples

For more detailed examples, including how to compare two models and how to set up the system programmatically without JSON, please see the /examples directory in the project repository.

License

This project is licensed under the Apache License, Version 2.0. See the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fuzzylinguistics-0.1.0.tar.gz (30.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

fuzzylinguistics-0.1.0-py3-none-any.whl (28.2 kB view details)

Uploaded Python 3

File details

Details for the file fuzzylinguistics-0.1.0.tar.gz.

File metadata

  • Download URL: fuzzylinguistics-0.1.0.tar.gz
  • Upload date:
  • Size: 30.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for fuzzylinguistics-0.1.0.tar.gz
Algorithm Hash digest
SHA256 6596e0b844687020a3341d51135c76925cc94fc1d2285520d77490190445d5d2
MD5 37bda2dad40dab620775b96bda7f9e4b
BLAKE2b-256 f37e7125e57d8048d2cb3a9b63a0d3f157efc84a653461ae696250ad7e2877e1

See more details on using hashes here.

File details

Details for the file fuzzylinguistics-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for fuzzylinguistics-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 541daff8dfe2286341ee795e31a1e3afb141fb2a1661bb5bd05242861018f5d7
MD5 8d401608015b5855e292ad4a9ce054eb
BLAKE2b-256 97aa6ea92353b59c5bf583fd4cd436a497a51aba74b9585fba1cfc5685f450dd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page