Skip to main content

A professional evolutionary discovery tool for Humanin-like peptides (sORFs) using a Hybrid AI approach.

Project description

HumaninFinder v1.0.5 ๐Ÿงฌ๐Ÿค–

HumaninFinder Logo

DOI University: UMC Laboratory: LaBiOmicS Bioinformatics

PyPI Version Open Source Open Science License: MIT JOSS Status CI Status

Python Version Powered by Ollama Deep Learning


HumaninFinder is a professional, high-performance Python framework designed for the discovery and classification of Humanin-like peptides (sORFs) within mitochondrial genomes. It employs a Hybrid AI Engine that integrates deep structural embeddings from the ESM-2 Protein Language Model with explicit biophysical analysis to identify functional, non-canonical, and pseudogenic sequences across any taxonomic group.


๐Ÿ“‚ Repository Structure

.
โ”œโ”€โ”€ conda/                   # Bioconda recipe and metadata
โ”œโ”€โ”€ deploy/                  # Containerization (Dockerfile, Singularity.def)
โ”œโ”€โ”€ docs/                    # Technical documentation and user guides
โ”œโ”€โ”€ examples/                # Quick-start samples (FASTA genomes)
โ”œโ”€โ”€ galaxy/                  # Galaxy Tool wrapper and integration
โ”œโ”€โ”€ paper/                   # Publication manuscripts
โ”‚   โ”œโ”€โ”€ joss/                # Software description for JOSS
โ”‚   โ””โ”€โ”€ primate_study/       # Scientific case study on 61 primate genomes
โ”œโ”€โ”€ src/
โ”‚   โ””โ”€โ”€ humaninfinder/       # Main Python Package
โ”‚       โ”œโ”€โ”€ cli.py           # Subcommand-based Command-line interface
โ”‚       โ”œโ”€โ”€ core.py          # Locus localization and ORF finding logic
โ”‚       โ”œโ”€โ”€ classifier.py    # Hybrid AI Engine (ESM-2 + Biophysical)
โ”‚       โ”œโ”€โ”€ agent.py         # AI Research Agent (Ollama integration)
โ”‚       โ”œโ”€โ”€ data/            # HMM models and 16S probes
โ”‚       โ””โ”€โ”€ models/          # Pre-trained hybrid classifier weights
โ”œโ”€โ”€ tests/                   # Unit and biological validation tests
โ”œโ”€โ”€ pyproject.toml           # Build system and PyPI definitions
โ””โ”€โ”€ pixi.toml                # Modern environment management

๐Ÿงฉ Core Components Detail

  • Hybrid AI Engine: Combines mean-pooled embeddings from the ESM-2 transformer model (esm2_t6_8M_UR50D) with charge, pI, and hydrophobicity metrics.
  • Evolutionary Rescue: A high-sensitivity sliding-window scanner that "rescues" non-canonical and pseudogenic relics in diverged lineages.
  • AI Research Agent: An integrated specialist assistant powered by local LLMs (via Ollama) to provide biological interpretation of results in the context of mitochondrial aging and cytoprotection.
  • Biological Deduplication: A specialized filter that ensures independent evolutionary signals by removing technical windowing artifacts.

๐Ÿš€ Key Features

  • Organism Agnostic: Supports all 33 NCBI genetic codes, enabling MDP discovery in any mitochondria-bearing taxon.
  • Expert AI Agent: Built-in specialist in Humanin, mitochondrial signaling, and aging biology to interpret your findings.
  • High-Throughput Ready: Parallelized processing and optimized inference for large genomic collections.
  • Validated Science: Built-in reproduction of the 61-primate evolutionary case study.

๐Ÿ› ๏ธ Quick Start

1. Installation

Option A: via pip (Fastest)

pip install "humaninfinder[agent]"
humanin-finder setup

Note: Ensure HMMER3 is installed on your system.

Option B: via Conda / Mamba (Recommended)

Perfect for an isolated scientific environment:

# Create environment from the provided file
mamba env create -f environment.yml
mamba activate humanin_env

# Finalize setup
humanin-finder setup

2. Run Discovery Pipeline

humanin-finder predict -i genome.fasta -o results --hmm --rescue

3. Biological Interpretation

# Get a summary of your findings from the AI Specialist
humanin-finder agent --results results_results.csv

๐Ÿ“– Documentation

Detailed information is available in the paper/ directory:

  • ๐Ÿ› ๏ธ Software Paper: Architecture and methodology for JOSS.
  • ๐Ÿ—๏ธ Scientific Report: Evolutionary dynamics of Humanin in Primates.

๐Ÿค Contributing

Contributions are welcome! Please see our CONTRIBUTING.md for details.

๐Ÿ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.


Developed by LaBiOmicS - Laboratory of Bioinformatics and Omics Sciences. Institution: Universidade de Mogi das Cruzes (UMC)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

humaninfinder-1.0.5.tar.gz (175.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

humaninfinder-1.0.5-py3-none-any.whl (171.5 kB view details)

Uploaded Python 3

File details

Details for the file humaninfinder-1.0.5.tar.gz.

File metadata

  • Download URL: humaninfinder-1.0.5.tar.gz
  • Upload date:
  • Size: 175.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for humaninfinder-1.0.5.tar.gz
Algorithm Hash digest
SHA256 975a51b985fc84633f4d73288a77a4093df708b68f4b3d9a7e3a8820746fa27e
MD5 db1f44d6d4e675716c8a2bb06596b7af
BLAKE2b-256 20bedc96c9a0a6773982f34810247eb9891c3821b2272bb0308c500dd194b258

See more details on using hashes here.

File details

Details for the file humaninfinder-1.0.5-py3-none-any.whl.

File metadata

  • Download URL: humaninfinder-1.0.5-py3-none-any.whl
  • Upload date:
  • Size: 171.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for humaninfinder-1.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 7cbf32eca318b53145da2a2accb9caae0bd800561b4502937e33358733f149c4
MD5 aa41c7fa9974b2e42ded875600ddc9c4
BLAKE2b-256 14b5c0475bcee2feb63095d6d41c413ef8adb277468d5a735db69d7322937668

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page