A library for taxonomy management.
Project description
Taxonomy Library README
This module provides functions for taxonomy creation, text classification, and header translation using AI models and vector similarity search.
Usage
Import the module with:
from bee_taxonomy import taxonomy
Installation
Install the library using pip:
pip install bee-taxonomy
Main Functions
1. taxonomy.propose_taxonomy(field: str, description: str, discrete_fields: list[str] = None)
Purpose: Generate taxonomy suggestions using OpenAI
Parameters:
field: Name of the field to categorizedescription: Description of the field's purposediscrete_fields: Optional specific values to consider
Example:
taxonomy.propose_taxonomy(
field="Color",
description="Vehicle paint color classification",
discrete_fields=["Red", "Blue", "Green", "Custom"]
)
# Returns: ["Red", "Blue", "Green", "Other"]
2. taxonomy.apply_taxonomy_similarity(discrete_fields: list[str], taxonomy: list[str], category_type: str = None)
Purpose: Classify values using semantic similarity with vector database
Parameters:
discrete_fields: Values to classifytaxonomy: List of allowed classification termscategory_type: Special processing for categories like 'streets'
Example:
taxonomy.apply_taxonomy_similarity(
discrete_fields=["Rd", "Street", "Ave"],
taxonomy=["Road", "Street", "Avenue"],
category_type="streets"
)
# Returns: {'Rd': {'match': 'Road', 'score': 0.92}, ...}
3. taxonomy.apply_taxonomy_reasoning(discrete_fields: list[str], taxonomy: list[str], classification_description: str, hash_file: str = None)
Purpose: Use AI reasoning to classify values into taxonomy
Parameters:
discrete_fields: List of values to classifytaxonomy: List of allowed categoriesclassification_description: Context for classificationhash_file: Optional file hash for progress tracking
Example:
taxonomy.apply_taxonomy_reasoning(
discrete_fields=["Quick Brown Fox", "Lazy Dog"],
taxonomy=["Animal", "Object", "Action"],
classification_description="Classify animal-related phrases"
)
# Returns: {'Quick Brown Fox': 'Animal', 'Lazy Dog': 'Animal'
4. taxonomy.translate_headers_reasoning(src_lang, dest_lang, headers)
Purpose: Translate headers between languages using AI reasoning
Parameters:
src_lang: Source language codedest_lang: Target language codeheaders: List of headers to translate
Example:
taxonomy.translate_headers_reasoning(
src_lang="en",
dest_lang="es",
headers=["Street Name", "Zip Code"]
)
# Returns: {'Street Name': 'Nombre de la Calle', 'Zip Code': 'Código Postal'
5. taxonomy.analyze_text_field(field_name: str, field_value: str, task: Literal["label", "summarize"] = "label")
Purpose: Analyze text fields for classification or summarization
Parameters:
field_name: Name of the text fieldfield_value: Text to analyzetask: "label" for classification or "summarize" for text summary
Example:
taxonomy.analyze_text_field(
field_name="Product Description",
field_value="This ergonomic chair provides lumbar support and adjustable height",
task="label"
)
# Returns: "Office Furniture"
Environment Variables
Users must rename .env.example to .env and fill in all the required fields with their specific values:
MODEL_NAME: Hugging Face model identifierSERVER_URL: Base URL for OpenAI-compatible APIAPI_KEY: Authentication token for the APIEMBEDDER_MODEL: Embedding model for semantic similarity
Features
- Validation workflow with Pydantic models
- Progress checkpointing for large datasets
- Google search integration for ambiguous classifications
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file bee_taxonomy-0.0.14.tar.gz.
File metadata
- Download URL: bee_taxonomy-0.0.14.tar.gz
- Upload date:
- Size: 26.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
751b240008443eafcb706f94230490c9806f8d954ac570ed8c0a22d7e79ebf97
|
|
| MD5 |
ecd2faf5d7f4d0e00dd6ecbff5b080cc
|
|
| BLAKE2b-256 |
5c8a7c8a0687b61bf1e8ef0d547e782b6a4dc3f3914be13294819a51f64861c7
|
File details
Details for the file bee_taxonomy-0.0.14-py3-none-any.whl.
File metadata
- Download URL: bee_taxonomy-0.0.14-py3-none-any.whl
- Upload date:
- Size: 14.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f97a8dfc83795a740212bcec7e9692d11d7af5bfb46a26159551bbed41aba7b0
|
|
| MD5 |
cdc10b74833c520d7f6d47819c8efcf9
|
|
| BLAKE2b-256 |
0bdbe80fc34369a557c2ae3468883f75102cf9bdafa4a060450c089f6c967151
|