Skip to main content

A professional evolutionary discovery tool for Humanin-like peptides (sORFs) using a Hybrid AI approach.

Project description

HumaninFinder v1.0.7 ๐Ÿงฌ๐Ÿค–

HumaninFinder Logo

DOI University: UMC Laboratory: LaBiOmicS Bioinformatics

PyPI Version Open Source Open Science License: MIT JOSS Status CI Status

Python Version Powered by Ollama Deep Learning


HumaninFinder is a professional, high-performance Python framework designed for the discovery and classification of Humanin-like peptides (sORFs) within mitochondrial genomes. It employs a Hybrid AI Engine that integrates deep structural embeddings from the ESM-2 Protein Language Model with explicit biophysical analysis to identify functional, non-canonical, and pseudogenic sequences across any taxonomic group.


๐Ÿ“‚ Repository Structure

.
โ”œโ”€โ”€ conda/                   # Bioconda recipe and metadata
โ”œโ”€โ”€ deploy/                  # Containerization (Dockerfile, Singularity.def)
โ”œโ”€โ”€ docs/                    # Technical documentation and user guides
โ”œโ”€โ”€ examples/                # Quick-start samples (FASTA genomes)
โ”œโ”€โ”€ galaxy/                  # Galaxy Tool wrapper and integration
โ”œโ”€โ”€ paper/                   # Publication manuscripts
โ”‚   โ”œโ”€โ”€ joss/                # Software description for JOSS
โ”‚   โ””โ”€โ”€ primate_study/       # Scientific case study on 61 primate genomes
โ”œโ”€โ”€ src/
โ”‚   โ””โ”€โ”€ humaninfinder/       # Main Python Package
โ”‚       โ”œโ”€โ”€ cli.py           # Subcommand-based Command-line interface
โ”‚       โ”œโ”€โ”€ core.py          # Locus localization and ORF finding logic
โ”‚       โ”œโ”€โ”€ classifier.py    # Hybrid AI Engine (ESM-2 + Biophysical)
โ”‚       โ”œโ”€โ”€ agent.py         # AI Research Agent (Ollama integration)
โ”‚       โ”œโ”€โ”€ data/            # HMM models and 16S probes
โ”‚       โ””โ”€โ”€ models/          # Pre-trained hybrid classifier weights
โ”œโ”€โ”€ tests/                   # Unit and biological validation tests
โ”œโ”€โ”€ pyproject.toml           # Build system and PyPI definitions
โ””โ”€โ”€ pixi.toml                # Modern environment management

๐Ÿงฉ Core Components Detail

  • Hybrid AI Engine: Combines mean-pooled embeddings from the ESM-2 transformer model (esm2_t6_8M_UR50D) with charge, pI, and hydrophobicity metrics.
  • Evolutionary Rescue: A high-sensitivity sliding-window scanner that "rescues" non-canonical and pseudogenic relics in diverged lineages.
  • AI Research Agent: An integrated specialist assistant powered by local LLMs (via Ollama) to provide biological interpretation of results in the context of mitochondrial aging and cytoprotection.
  • Biological Deduplication: A specialized filter that ensures independent evolutionary signals by removing technical windowing artifacts.

๐Ÿš€ Key Features

  • Organism Agnostic: Supports all 33 NCBI genetic codes, enabling MDP discovery in any mitochondria-bearing taxon.
  • Expert AI Agent: Built-in specialist in Humanin, mitochondrial signaling, and aging biology to interpret your findings.
  • High-Throughput Ready: Parallelized processing and optimized inference for large genomic collections.
  • Validated Science: Built-in reproduction of the 61-primate evolutionary case study.

๐Ÿ› ๏ธ Quick Start

1. Installation

Option A: via pip (Fastest)

pip install "humaninfinder[agent]"
humanin-finder setup

Note: Ensure HMMER3 is installed on your system.

Option B: via Conda / Mamba (Recommended)

Perfect for an isolated scientific environment:

# Create environment from the provided file
mamba env create -f environment.yml
mamba activate humanin_env

# Finalize setup
humanin-finder setup

2. Run Discovery Pipeline

humanin-finder predict -i genome.fasta -o results --hmm --rescue

3. Biological Interpretation

# Get a summary of your findings from the AI Specialist
humanin-finder agent --results results_results.csv

๐Ÿ“– Documentation

Detailed information is available in the paper/ directory:

  • ๐Ÿ› ๏ธ Software Paper: Architecture and methodology for JOSS.
  • ๐Ÿ—๏ธ Scientific Report: Evolutionary dynamics of Humanin in Primates.

๐Ÿค Contributing

Contributions are welcome! Please see our CONTRIBUTING.md for details.

๐Ÿ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.


Developed by LaBiOmicS - Laboratory of Bioinformatics and Omics Sciences. Institution: Universidade de Mogi das Cruzes (UMC)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

humaninfinder-1.0.7.tar.gz (175.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

humaninfinder-1.0.7-py3-none-any.whl (171.5 kB view details)

Uploaded Python 3

File details

Details for the file humaninfinder-1.0.7.tar.gz.

File metadata

  • Download URL: humaninfinder-1.0.7.tar.gz
  • Upload date:
  • Size: 175.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for humaninfinder-1.0.7.tar.gz
Algorithm Hash digest
SHA256 9f58e926cd62962b8e4c18c28250ceb3a37481e4d2b8f15a040695f933b7696d
MD5 a3b94e055634ca9843428dc24ce9e7f0
BLAKE2b-256 a81eed75eb19f2a3e7f8117a8092f5f004f55dbfc196ca9fd94212a86b9b10af

See more details on using hashes here.

File details

Details for the file humaninfinder-1.0.7-py3-none-any.whl.

File metadata

  • Download URL: humaninfinder-1.0.7-py3-none-any.whl
  • Upload date:
  • Size: 171.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for humaninfinder-1.0.7-py3-none-any.whl
Algorithm Hash digest
SHA256 46daad47da22cbc8a92df52ae799918edc961e0ddd7c538554d702e7147faaef
MD5 49e7c5af15196deb20bd1c4fe9ac31d5
BLAKE2b-256 9cbccdab75bbcb10eecae8c888b282fde3c842a0b0ee6376cbdc6f40ddb6f30e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page