Skip to main content

A professional evolutionary discovery tool for Humanin-like peptides (sORFs) using a Hybrid AI approach.

Project description

HumaninFinder v1.0.6 ๐Ÿงฌ๐Ÿค–

HumaninFinder Logo

DOI University: UMC Laboratory: LaBiOmicS Bioinformatics

PyPI Version Open Source Open Science License: MIT JOSS Status CI Status

Python Version Powered by Ollama Deep Learning


HumaninFinder is a professional, high-performance Python framework designed for the discovery and classification of Humanin-like peptides (sORFs) within mitochondrial genomes. It employs a Hybrid AI Engine that integrates deep structural embeddings from the ESM-2 Protein Language Model with explicit biophysical analysis to identify functional, non-canonical, and pseudogenic sequences across any taxonomic group.


๐Ÿ“‚ Repository Structure

.
โ”œโ”€โ”€ conda/                   # Bioconda recipe and metadata
โ”œโ”€โ”€ deploy/                  # Containerization (Dockerfile, Singularity.def)
โ”œโ”€โ”€ docs/                    # Technical documentation and user guides
โ”œโ”€โ”€ examples/                # Quick-start samples (FASTA genomes)
โ”œโ”€โ”€ galaxy/                  # Galaxy Tool wrapper and integration
โ”œโ”€โ”€ paper/                   # Publication manuscripts
โ”‚   โ”œโ”€โ”€ joss/                # Software description for JOSS
โ”‚   โ””โ”€โ”€ primate_study/       # Scientific case study on 61 primate genomes
โ”œโ”€โ”€ src/
โ”‚   โ””โ”€โ”€ humaninfinder/       # Main Python Package
โ”‚       โ”œโ”€โ”€ cli.py           # Subcommand-based Command-line interface
โ”‚       โ”œโ”€โ”€ core.py          # Locus localization and ORF finding logic
โ”‚       โ”œโ”€โ”€ classifier.py    # Hybrid AI Engine (ESM-2 + Biophysical)
โ”‚       โ”œโ”€โ”€ agent.py         # AI Research Agent (Ollama integration)
โ”‚       โ”œโ”€โ”€ data/            # HMM models and 16S probes
โ”‚       โ””โ”€โ”€ models/          # Pre-trained hybrid classifier weights
โ”œโ”€โ”€ tests/                   # Unit and biological validation tests
โ”œโ”€โ”€ pyproject.toml           # Build system and PyPI definitions
โ””โ”€โ”€ pixi.toml                # Modern environment management

๐Ÿงฉ Core Components Detail

  • Hybrid AI Engine: Combines mean-pooled embeddings from the ESM-2 transformer model (esm2_t6_8M_UR50D) with charge, pI, and hydrophobicity metrics.
  • Evolutionary Rescue: A high-sensitivity sliding-window scanner that "rescues" non-canonical and pseudogenic relics in diverged lineages.
  • AI Research Agent: An integrated specialist assistant powered by local LLMs (via Ollama) to provide biological interpretation of results in the context of mitochondrial aging and cytoprotection.
  • Biological Deduplication: A specialized filter that ensures independent evolutionary signals by removing technical windowing artifacts.

๐Ÿš€ Key Features

  • Organism Agnostic: Supports all 33 NCBI genetic codes, enabling MDP discovery in any mitochondria-bearing taxon.
  • Expert AI Agent: Built-in specialist in Humanin, mitochondrial signaling, and aging biology to interpret your findings.
  • High-Throughput Ready: Parallelized processing and optimized inference for large genomic collections.
  • Validated Science: Built-in reproduction of the 61-primate evolutionary case study.

๐Ÿ› ๏ธ Quick Start

1. Installation

Option A: via pip (Fastest)

pip install "humaninfinder[agent]"
humanin-finder setup

Note: Ensure HMMER3 is installed on your system.

Option B: via Conda / Mamba (Recommended)

Perfect for an isolated scientific environment:

# Create environment from the provided file
mamba env create -f environment.yml
mamba activate humanin_env

# Finalize setup
humanin-finder setup

2. Run Discovery Pipeline

humanin-finder predict -i genome.fasta -o results --hmm --rescue

3. Biological Interpretation

# Get a summary of your findings from the AI Specialist
humanin-finder agent --results results_results.csv

๐Ÿ“– Documentation

Detailed information is available in the paper/ directory:

  • ๐Ÿ› ๏ธ Software Paper: Architecture and methodology for JOSS.
  • ๐Ÿ—๏ธ Scientific Report: Evolutionary dynamics of Humanin in Primates.

๐Ÿค Contributing

Contributions are welcome! Please see our CONTRIBUTING.md for details.

๐Ÿ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.


Developed by LaBiOmicS - Laboratory of Bioinformatics and Omics Sciences. Institution: Universidade de Mogi das Cruzes (UMC)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

humaninfinder-1.0.6.tar.gz (175.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

humaninfinder-1.0.6-py3-none-any.whl (171.5 kB view details)

Uploaded Python 3

File details

Details for the file humaninfinder-1.0.6.tar.gz.

File metadata

  • Download URL: humaninfinder-1.0.6.tar.gz
  • Upload date:
  • Size: 175.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for humaninfinder-1.0.6.tar.gz
Algorithm Hash digest
SHA256 550101f23062cf4b154d32e209898f8eb91947ba49c8ca22040112ca112cd9eb
MD5 aba0654ec350d66d0d48c7186052d2e0
BLAKE2b-256 b6c5b7a49d514879fb77aa5c1db3169a509b01af906bda1e4bd5d2f42e868619

See more details on using hashes here.

File details

Details for the file humaninfinder-1.0.6-py3-none-any.whl.

File metadata

  • Download URL: humaninfinder-1.0.6-py3-none-any.whl
  • Upload date:
  • Size: 171.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for humaninfinder-1.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 b3a0b03c089f9e759e2fa878a65a607536a111534ecff1ec8ea132157c5c8784
MD5 1ce20b14832e3c6608b7d4d69e7c781c
BLAKE2b-256 a46b5e6010b9c0816866861867c108c514f9dc11e451bb99f30f45ceca4686cd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page