Skip to main content

A professional evolutionary discovery tool for Humanin-like peptides (sORFs) using a Hybrid AI approach.

Project description

HumaninFinder v1.0.7 ๐Ÿงฌ๐Ÿค–

HumaninFinder Logo

DOI University: UMC Laboratory: LaBiOmicS Bioinformatics

PyPI Version Open Source Open Science License: MIT JOSS Status CI Status

Python Version Powered by Ollama Deep Learning


HumaninFinder is a professional, high-performance Python framework designed for the discovery and classification of Humanin-like peptides (sORFs) within mitochondrial genomes. It employs a Hybrid AI Engine that integrates deep structural embeddings from the ESM-2 Protein Language Model with explicit biophysical analysis to identify functional, non-canonical, and pseudogenic sequences across any taxonomic group.


HumaninFinder


๐Ÿ“‚ Repository Structure

.
โ”œโ”€โ”€ conda/                   # Bioconda recipe and metadata
โ”œโ”€โ”€ deploy/                  # Containerization (Dockerfile, Singularity.def)
โ”œโ”€โ”€ docs/                    # Technical documentation and user guides
โ”œโ”€โ”€ examples/                # Quick-start samples (FASTA genomes)
โ”œโ”€โ”€ galaxy/                  # Galaxy Tool wrapper and integration
โ”œโ”€โ”€ paper/                   # Publication manuscripts
โ”‚   โ”œโ”€โ”€ joss/                # Software description for JOSS
โ”‚   โ””โ”€โ”€ primate_study/       # Scientific case study on 61 primate genomes
โ”œโ”€โ”€ src/
โ”‚   โ””โ”€โ”€ humaninfinder/       # Main Python Package
โ”‚       โ”œโ”€โ”€ cli.py           # Subcommand-based Command-line interface
โ”‚       โ”œโ”€โ”€ core.py          # Locus localization and ORF finding logic
โ”‚       โ”œโ”€โ”€ classifier.py    # Hybrid AI Engine (ESM-2 + Biophysical)
โ”‚       โ”œโ”€โ”€ agent.py         # AI Research Agent (Ollama integration)
โ”‚       โ”œโ”€โ”€ data/            # HMM models and 16S probes
โ”‚       โ””โ”€โ”€ models/          # Pre-trained hybrid classifier weights
โ”œโ”€โ”€ tests/                   # Unit and biological validation tests
โ”œโ”€โ”€ pyproject.toml           # Build system and PyPI definitions
โ””โ”€โ”€ pixi.toml                # Modern environment management

๐Ÿงฉ Core Components Detail

  • Hybrid AI Engine: Combines mean-pooled embeddings from the ESM-2 transformer model (esm2_t6_8M_UR50D) with charge, pI, and hydrophobicity metrics.
  • Evolutionary Rescue: A high-sensitivity sliding-window scanner that "rescues" non-canonical and pseudogenic relics in diverged lineages.
  • AI Research Agent: An integrated specialist assistant powered by local LLMs (via Ollama) to provide biological interpretation of results in the context of mitochondrial aging and cytoprotection.
  • Biological Deduplication: A specialized filter that ensures independent evolutionary signals by removing technical windowing artifacts.

๐Ÿš€ Key Features

  • Organism Agnostic: Supports all 33 NCBI genetic codes, enabling MDP discovery in any mitochondria-bearing taxon.
  • Expert AI Agent: Built-in specialist in Humanin, mitochondrial signaling, and aging biology to interpret your findings.
  • High-Throughput Ready: Parallelized processing and optimized inference for large genomic collections.
  • Validated Science: Built-in reproduction of the 61-primate evolutionary case study.

๐Ÿ› ๏ธ Quick Start

1. Installation

Option A: via pip (Fastest)

pip install "humaninfinder[agent]"
humanin-finder setup

Note: Ensure HMMER3 is installed on your system.

Option B: via Conda / Mamba (Recommended)

Perfect for an isolated scientific environment:

# Create environment from the provided file
mamba env create -f environment.yml
mamba activate humanin_env

# Finalize setup
humanin-finder setup

2. Run Discovery Pipeline

humanin-finder predict -i genome.fasta -o results --hmm --rescue

3. Biological Interpretation

# Get a summary of your findings from the AI Specialist
humanin-finder agent --results results_results.csv

๐Ÿ“– Documentation

Detailed information is available in the paper/ directory:

  • ๐Ÿ› ๏ธ Software Paper: Architecture and methodology for JOSS.
  • ๐Ÿ—๏ธ Scientific Report: Evolutionary dynamics of Humanin in Primates.

๐Ÿค Contributing

Contributions are welcome! Please see our CONTRIBUTING.md for details.

๐Ÿ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.


Developed by LaBiOmicS - Laboratory of Bioinformatics and Omics Sciences. Institution: Universidade de Mogi das Cruzes (UMC)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

humaninfinder-1.0.8.tar.gz (175.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

humaninfinder-1.0.8-py3-none-any.whl (171.5 kB view details)

Uploaded Python 3

File details

Details for the file humaninfinder-1.0.8.tar.gz.

File metadata

  • Download URL: humaninfinder-1.0.8.tar.gz
  • Upload date:
  • Size: 175.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for humaninfinder-1.0.8.tar.gz
Algorithm Hash digest
SHA256 ce2f1eb31b5c44280070c91dc9b223ea922e88371bc659b53ca7a0076e6dc89d
MD5 e132a5fd4839e0d1594dbc14b1768f2a
BLAKE2b-256 3638617b06d9d7d8bd3c6b4775d0a963f1c0d9a3b605f1eca1eec16951191b32

See more details on using hashes here.

File details

Details for the file humaninfinder-1.0.8-py3-none-any.whl.

File metadata

  • Download URL: humaninfinder-1.0.8-py3-none-any.whl
  • Upload date:
  • Size: 171.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for humaninfinder-1.0.8-py3-none-any.whl
Algorithm Hash digest
SHA256 5e5f4a21b572fae3119e71a9f470ed2b9837cf7cbdfb0a490ca912233e030232
MD5 4ce77d623b347d28416507009c1fd0b7
BLAKE2b-256 04b95316fc8b1f7b65d4621b85a30f357802248bfe95bd9474dd0c625499820f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page