Lightning-fast gene manipulation and analysis library.

These details have not been verified by PyPI

Project links

Homepage

Project description

SeqMat

Lightning-fast genomic sequence matrix library with mutation tracking

SeqMat is a comprehensive Python library for genomic sequence analysis, providing efficient tools for working with genes, transcripts, and genomic sequences. Features include full mutation tracking, splicing analysis, and high-performance sequence manipulation.

Key Features

Fast Sequence Operations: Vectorized genomic sequence matrix with efficient slicing and manipulation
Comprehensive Mutation Tracking: SNPs, insertions, deletions with full history and conflict detection
Gene & Transcript Analysis: Load and analyze gene structures, exons, introns, and splice sites
Conservation Integration: Built-in support for conservation scores and sequence analysis
Multi-organism Support: Human (hg38) and mouse (mm39) genome support
Data Inspection Tools: Comprehensive utilities to explore available data, gene counts, and biotypes
Command-line Interface: Full CLI for data management and inspection
Expression Data: Integration with GTEx tissue expression data
Flexible Installation: Core functionality with optional bioinformatics dependencies

Installation

Basic Installation

pip install seqmat

With Bioinformatics Features

# For protein translation
pip install seqmat[bio]

# For GTF parsing and genomics data setup
pip install seqmat[genomics]

# Install everything
pip install seqmat[all]

From Source

git clone https://github.com/yourusername/seqmat.git
cd seqmat
pip install -e .

Quick Start

1. Basic Sequence Operations

from seqmat import SeqMat

# Create a sequence
seq = SeqMat("ATCGATCGATCG", name="my_sequence")
print(f"Length: {len(seq)}")  # Length: 12
print(f"Sequence: {seq.seq}")  # Sequence: ATCGATCGATCG

# Apply mutations
seq.apply_mutations([
    (3, "C", "G"),       # SNP: C->G at position 3
    (6, "-", "AAA"),     # Insertion: insert AAA at position 6  
    (10, "TC", "-")      # Deletion: delete TC at position 10
])

print(f"Mutated: {seq.seq}")
print(f"Mutations: {len(seq.mutations)}")

2. Working with Genomic Coordinates

import numpy as np

# Create sequence with genomic coordinates
indices = np.arange(1000, 1012)  # Positions 1000-1011
seq = SeqMat("ATCGATCGATCG", indices=indices, name="chr1:1000-1011")

# Slice by genomic position
subseq = seq[1003:1008]  # Extract positions 1003-1007
print(f"Subsequence: {subseq.seq}")

# Access single positions
base = seq[1005]  # Get base at position 1005
print(f"Base at 1005: {base['nt'].decode()}")

3. Advanced Mutation Operations

# Multiple mutations with validation
mutations = [
    (1002, "T", "A"),     # SNP
    (1005, "-", "GGG"),   # Insertion
    (1008, "GAT", "-"),   # Deletion
    (1003, "CG", "AT")    # Complex substitution
]

seq.apply_mutations(mutations)

# Mutation history
for mut in seq.mutations:
    print(f"{mut['type']} at {mut['pos']}: {mut['ref']} -> {mut['alt']}")

# Check for conflicts (automatically validated)
conflicting_muts = [
    (1003, "C", "T"),
    (1003, "G", "A")  # Overlaps with previous
]
# This will warn about conflicts and skip invalid mutations

4. Sequence Transformations

# Complement and reverse complement
complement = seq.complement()
rev_comp = seq.clone()
rev_comp.reverse_complement()

print(f"Original:    {seq.seq}")
print(f"Complement:  {complement.seq}")
print(f"Rev comp:    {rev_comp.seq}")

# Remove regions (e.g., splice out introns)
introns = [(1003, 1005), (1008, 1009)]
spliced = seq.remove_regions(introns)
print(f"Spliced:     {spliced.seq}")

5. Working with Genes (requires genomics data setup)

from seqmat import Gene, setup_genomics_data

# One-time setup (downloads reference data)
setup_genomics_data("/path/to/data", organism="hg38")

# Load a gene
kras = Gene.from_file("KRAS", organism="hg38")
print(kras)  # Gene: KRAS, ID: ENSG00000133703, Chr: 12, Transcripts: 8

# Access transcripts
primary = kras.transcript()  # Primary transcript
all_transcripts = list(kras)  # All transcripts

# Analyze gene structure
acceptors, donors = kras.splice_sites()
print(f"Unique splice sites: {len(acceptors)} acceptors, {len(donors)} donors")

6. Transcript Analysis

# Get a transcript
transcript = kras.transcript()
print(f"Transcript: {transcript.transcript_id}")
print(f"Protein coding: {transcript.protein_coding}")

# Access exon/intron structure
print(f"Exons: {len(transcript.exons)}")
print(f"Introns: {len(transcript.introns)}")

# Generate mature mRNA (spliced)
transcript.generate_mature_mrna()
print(f"Mature mRNA length: {len(transcript.mature_mrna)} bp")

# Generate protein (if protein-coding and BioPython available)
if transcript.protein_coding:
    protein = transcript.generate_protein()
    print(f"Protein length: {len(transcript.protein)} aa")

7. Loading from FASTA Files

# Direct FASTA loading
genomic_seq = SeqMat.from_fasta_file(
    "chr12.fasta",
    "chr12", 
    start=25398284,
    end=25398384
)

# Apply mutations to genomic sequence
genomic_seq.apply_mutations([
    (25398290, "G", "A"),  # Pathogenic variant
    (25398300, "-", "T")   # Novel insertion
])

Data Setup and Configuration

SeqMat uses a flexible configuration system to manage organism data and file paths. This section explains how to set up data for existing organisms and add support for new ones.

Quick Setup for Supported Organisms

For the built-in organisms (hg38, mm39), setup is straightforward:

from seqmat import setup_genomics_data

# Download and setup human genome data
setup_genomics_data("/path/to/your/data", organism="hg38")

# Download and setup mouse genome data  
setup_genomics_data("/path/to/your/data", organism="mm39")

Or via command line:

# Setup human data
i /path/to/your/data --organism hg38

# Setup mouse data
seqmat setup --path /path/to/your/data --organism mm39 --force

Configuration System

SeqMat stores configuration in ~/.seqmat/config.json. This file contains:

Organism paths: Where each organism's data is stored
Default organism: Which organism to use when none specified
Directory structure: Customizable folder names
Data source URLs: Where to download reference data

Configuration File Structure

{
  "default_organism": "hg38",
  "directory_structure": {
    "chromosomes": "chromosomes",
    "annotations": "annotations"
  },
  "hg38": {
    "BASE": "/path/to/data/hg38",
    "CHROM_SOURCE": "/path/to/data/hg38/chromosomes",
    "MRNA_PATH": "/path/to/data/hg38/annotations",
    "fasta": "/path/to/data/hg38/chromosomes"
  },
  "mm39": {
    "BASE": "/path/to/data/mm39",
    "CHROM_SOURCE": "/path/to/data/mm39/chromosomes", 
    "MRNA_PATH": "/path/to/data/mm39/annotations",
    "fasta": "/path/to/data/mm39/chromosomes"
  }
}

Directory Structure

When you run setup_genomics_data, it creates this directory structure:

/your/data/path/
├── hg38/                          # Organism directory
│   ├── chromosomes/               # FASTA files (configurable)
│   │   ├── chr1.fasta
│   │   ├── chr2.fasta
│   │   └── ...
│   ├── annotations/               # Gene/transcript data (configurable)
│   │   ├── mrnas_ENSG00000133703_KRAS.pkl
│   │   ├── mrnas_ENSG00000141510_TP53.pkl
│   │   └── ...
│   ├── temp/                      # Temporary download files
│   ├── conservation.pkl           # Conservation scores
│   └── gtex_expression.gct.gz     # Expression data (human only)
└── mm39/                          # Mouse data (similar structure)
    ├── chromosomes/
    ├── annotations/
    └── ...

Adding Support for New Organisms

To add a new organism (e.g., dm6 for Drosophila), you have several options:

Option 1: Modify Configuration (Recommended)

Update the default organism data by editing your config or using the API:

from seqmat.config import load_config, save_config

# Load current config
config = load_config()

# Add new organism with data source URLs
config['dm6'] = {
    'name': 'Drosophila melanogaster (Fruit fly)',
    'urls': {
        'fasta': 'https://hgdownload.soe.ucsc.edu/goldenPath/dm6/bigZips/dm6.fa.gz',
        'gtf': 'https://ftp.ensembl.org/pub/release-109/gtf/drosophila_melanogaster/Drosophila_melanogaster.BDGP6.32.109.gtf.gz'
    }
}

# Save updated config
save_config(config)

# Now you can use it
setup_genomics_data("/path/to/data", organism="dm6")

Option 2: Extend Default Data Sources

For permanent additions, modify seqmat/config.py:

# In seqmat/config.py, add to DEFAULT_ORGANISM_DATA:
DEFAULT_ORGANISM_DATA = {
    'hg38': { ... },
    'mm39': { ... },
    'dm6': {
        'name': 'Drosophila melanogaster (Fruit fly)',
        'urls': {
            'fasta': 'https://hgdownload.soe.ucsc.edu/goldenPath/dm6/bigZips/dm6.fa.gz',
            'gtf': 'https://ftp.ensembl.org/pub/release-109/gtf/drosophila_melanogaster/Drosophila_melanogaster.BDGP6.32.109.gtf.gz'
        }
    }
}

Option 3: Manual Setup for Custom Data

For completely custom data sources:

# 1. Create directory structure manually
import os
from pathlib import Path

base_path = Path("/path/to/data/custom_genome")
base_path.mkdir(exist_ok=True)
(base_path / "chromosomes").mkdir(exist_ok=True)
(base_path / "annotations").mkdir(exist_ok=True)

# 2. Place your FASTA files in chromosomes/ directory
# Your files: chr1.fasta, chr2.fasta, etc.

# 3. Update configuration
from seqmat.config import load_config, save_config

config = load_config()
config['custom_genome'] = {
    'BASE': str(base_path),
    'CHROM_SOURCE': str(base_path / 'chromosomes'),
    'MRNA_PATH': str(base_path / 'annotations'),
    'fasta': str(base_path / 'chromosomes')
}
save_config(config)

# 4. Process your GTF file (if you have gene annotations)
from seqmat.utils import process_gtf_annotations
process_gtf_annotations(
    gtf_file="/path/to/your.gtf",
    output_dir=str(base_path / 'annotations'),
    organism="custom_genome"
)

Data Sources and URLs

SeqMat downloads data from these default sources:

Human (hg38):

FASTA: UCSC Genome Browser (latest hg38)
Annotations: Ensembl release-111 GTF
Conservation: Pre-computed conservation scores
Expression: GTEx v8 median TPM data

Mouse (mm39):

FASTA: UCSC Genome Browser (mm39)
Annotations: Ensembl release-112 GTF

Customizing Directory Structure

You can customize folder names by modifying the directory structure config:

from seqmat.config import load_config, save_config

config = load_config()
config['directory_structure'] = {
    'chromosomes': 'genomes',      # Custom name for FASTA directory
    'annotations': 'gene_data'     # Custom name for annotation directory
}
save_config(config)

# Future setups will use these custom names
setup_genomics_data("/path/to/data", organism="hg38")

Configuration Management

View current configuration:

from seqmat.config import load_config, get_available_organisms, get_default_organism

print("Default organism:", get_default_organism())
print("Available organisms:", get_available_organisms())
print("Full config:", load_config())

Reset to defaults:

from seqmat.config import DEFAULT_SETTINGS, save_config
save_config(DEFAULT_SETTINGS.copy())

Change default organism:

from seqmat.config import load_config, save_config

config = load_config()
config['default_organism'] = 'mm39'  # Switch default to mouse
save_config(config)

Troubleshooting Setup

Common issues:

"Organism not configured" - Run setup first or check config
Download failures - Check internet connection and URLs
Permission errors - Ensure write access to data directory
Disk space - Human genome data requires ~4GB, mouse ~3GB

Debugging:

# Check what's configured
from seqmat import list_available_organisms, print_data_summary

print("Configured organisms:", list_available_organisms()) 
print_data_summary()  # Detailed status

# Verify file paths
from seqmat.config import get_organism_config
config = get_organism_config("hg38")
print("Paths:", config)

Configuration Best Practices

For single-user systems:

Use the default ~/.seqmat/config.json location
Set up organisms as needed with setup_genomics_data()

For multi-user or shared systems:

Create a shared data directory: /shared/genomics_data/
Point all users' configs to the same data paths
Consider using environment variables for paths

For development/testing:

Use separate config files or directories
Set default_organism to your most-used organism
Keep test data in separate locations

Configuration sharing:

# Export configuration to share with others
from seqmat.config import load_config
import json

config = load_config()
with open('seqmat_config_template.json', 'w') as f:
    json.dump(config, f, indent=2)

This setup downloads and organizes:

Reference genome sequences (FASTA) → chromosomes/ directory
Gene annotations (GTF/processed) → annotations/ directory
Conservation scores → organism root directory
Expression data → organism root directory (human only)

Data Inspection and Management

Python API

Once data is set up, you can inspect what's available:

from seqmat import (
    list_supported_organisms, list_available_organisms,
    get_organism_info, list_gene_biotypes, count_genes,
    get_gene_list, search_genes, print_data_summary
)

# Check supported and configured organisms
print("Supported:", list_supported_organisms())  # ['hg38', 'mm39']  
print("Configured:", list_available_organisms())  # Organisms with data

# Get detailed organism information
info = get_organism_info('hg38')
print(f"Gene types: {info['data_available']['biotypes']}")
print(f"Chromosomes: {len(info['data_available']['chromosomes'])}")

# Explore gene biotypes and counts
biotypes = list_gene_biotypes('hg38')  # ['protein_coding', 'lncRNA', ...]
counts = count_genes('hg38')  # {'protein_coding': 19234, 'lncRNA': 7805, ...}

# Get gene lists
protein_genes = get_gene_list('hg38', 'protein_coding', limit=10)
print(f"First 10 protein-coding genes: {protein_genes}")

# Search for specific genes
results = search_genes('hg38', 'KRAS')  # Find genes matching 'KRAS'
kras_genes = search_genes('hg38', 'K', biotype='protein_coding', limit=5)

# Print comprehensive summary
print_data_summary()  # Formatted overview of all data

Command Line Interface

SeqMat provides a comprehensive CLI for data management. The CLI automatically detects available organisms from your configuration:

# Install data for supported organisms
seqmat setup --path /your/data/path --organism hg38
seqmat setup --path /your/data/path --organism mm39 --force

# The CLI will show all configured organisms as choices
seqmat setup --help  # Shows: --organism {hg38,mm39,dm6,...}

# Check what organisms are supported/configured
seqmat organisms

# Get comprehensive data summary
seqmat summary

# List gene biotypes for an organism  
seqmat biotypes --organism hg38

# Count genes by biotype
seqmat count --organism hg38                    # All biotypes
seqmat count --organism hg38 --biotype protein_coding  # Specific biotype

# List genes
seqmat list --organism hg38 --biotype protein_coding --limit 20

# Search for genes by name
seqmat search --organism hg38 --query KRAS
seqmat search --organism hg38 --query K --biotype protein_coding --limit 10

# Get detailed organism information
seqmat info --organism hg38

Example CLI Output

$ seqmat summary
🧬 SeqMat Genomics Data Summary
========================================
📊 Total: 2 organisms, 15 biotypes, 47,832 genes

🌍 Supported Organisms:
   hg38: Homo sapiens (Human) - ✅ Configured
   mm39: Mus musculus (Mouse) - ✅ Configured

📁 HG38 Data:
   Gene Types:
     protein_coding: 19,234 genes
     lncRNA: 7,805 genes
     pseudogene: 14,723 genes
     ...
   Chromosomes: 25 available (chr1, chr2, chr3, chr4, chr5...)

📁 MM39 Data:
   Gene Types:
     protein_coding: 21,815 genes
     lncRNA: 8,032 genes
     ...

API Reference

SeqMat Class

Core Methods:

SeqMat(nucleotides, indices=None, name="wild_type"): Create sequence
SeqMat.from_fasta_file(path, chrom, start, end): Load from FASTA
apply_mutations(mutations): Apply SNPs/indels
clone(start=None, end=None): Create copy
remove_regions(regions): Remove specified intervals
complement(): Get complement sequence
reverse_complement(): Reverse complement in place

Properties:

seq: Current sequence string
reference_seq: Original reference sequence
index: Genomic coordinates
mutations: List of applied mutations
mutated_positions: Set of mutated positions

Gene Class

Gene.from_file(gene_name, organism=None): Load gene from database (uses default organism if None)
transcript(tid=None): Get transcript by ID or primary
splice_sites(): Get all splice site positions
primary_transcript: Primary transcript ID

Transcript Class

generate_pre_mrna(): Create pre-mRNA sequence
generate_mature_mrna(): Create spliced mRNA
generate_protein(): Translate to protein (requires BioPython)
exons/introns: Genomic coordinates
protein_coding: Boolean flag

Configuration Management

Core Functions:

load_config(): Load configuration from ~/.seqmat/config.json
save_config(config): Save configuration to file
get_default_organism(): Get default organism from config
get_available_organisms(): Get list of all configured organisms
get_organism_config(organism=None): Get file paths for organism (uses default if None)
get_organism_info(organism): Get organism metadata including URLs
get_directory_config(): Get customizable directory structure

Configuration Examples:

from seqmat.config import *

# Check current settings
print("Default:", get_default_organism())         # 'hg38'
print("Available:", get_available_organisms())    # ['hg38', 'mm39']  
config = get_organism_config('hg38')              # File paths

# Modify settings
config = load_config()
config['default_organism'] = 'mm39'
save_config(config)

Data Inspection Utilities

Organism Management:

list_supported_organisms(): Get all supported organisms (dynamic from config)
list_available_organisms(): Get configured organisms
get_organism_info(organism): Detailed organism information
setup_genomics_data(basepath, organism=None, force=False): Download and setup data (uses default organism if None)

Gene Discovery:

list_gene_biotypes(organism): Get available gene types
count_genes(organism, biotype=None): Count genes by type
get_gene_list(organism, biotype, limit=None): List gene names
search_genes(organism, query, biotype=None, limit=10): Search genes by name

Data Summary:

data_summary(): Complete data overview (programmatic)
print_data_summary(): Formatted data summary (human-readable)

Command Line Interface

Setup Commands:

seqmat setup --path PATH --organism {dynamic_list} [--force]: Organism choices automatically detected from config
seqmat organisms: List organism status and availability

Exploration Commands:

seqmat summary: Data overview for all configured organisms
seqmat info --organism ORG: Detailed organism info
seqmat biotypes --organism ORG: List gene biotypes
seqmat count --organism ORG [--biotype TYPE]: Count genes
seqmat list --organism ORG --biotype TYPE [--limit N]: List genes
seqmat search --organism ORG --query PATTERN [--biotype TYPE] [--limit N]: Search genes

Note: All CLI commands automatically use your configured default organism when --organism is omitted, and available organism choices are dynamically loaded from your configuration.

Performance

SeqMat is optimized for performance:

Vectorized operations: NumPy-based sequence operations
Memory efficient: Structured arrays for sequence storage
Fast slicing: O(1) genomic coordinate access
Conflict detection: Efficient mutation validation
Lazy loading: Sequences loaded on demand

Dependencies

Core (always required):

numpy >= 1.20.0
pandas >= 1.3.0
pysam >= 0.19.0
requests >= 2.26.0
tqdm >= 4.62.0

Optional:

biopython >= 1.79 (for protein translation)
gtfparse >= 1.2.0 (for genomics data setup)

Examples

Check the examples/ directory for:

Basic sequence manipulation
Mutation analysis workflows
Gene structure analysis
Comparative genomics
Performance benchmarks

Contributing

Contributions welcome! Please see CONTRIBUTING.md for guidelines.

License

MIT License - see LICENSE file for details.

Citation

If you use SeqMat in your research:

SeqMat: Lightning-fast genomic sequence matrix library
[Your Name], 2024
GitHub: https://github.com/yourusername/seqmat

Support

Documentation: https://seqmat.readthedocs.io
Issues: https://github.com/yourusername/seqmat/issues
Discussions: https://github.com/yourusername/seqmat/discussions

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

1.5.1

May 15, 2026

1.5.0

May 15, 2026

1.4.0

May 15, 2026

1.3.1

Mar 9, 2026

1.3.0

Mar 9, 2026

1.2.2

Mar 9, 2026

1.2.1

Mar 9, 2026

1.2.0

Mar 9, 2026

1.1.1

Mar 7, 2026

1.1.0

Mar 3, 2026

1.0.0

Mar 1, 2026

0.1.59

Feb 28, 2026

0.1.58

Feb 28, 2026

0.1.57

Feb 28, 2026

0.1.56

Feb 28, 2026

0.1.55

Feb 28, 2026

0.1.54

Feb 21, 2026

0.1.53

Feb 21, 2026

0.1.52

Feb 21, 2026

0.1.51

Feb 21, 2026

0.1.50

Feb 21, 2026

0.1.49

Feb 21, 2026

0.1.48

Feb 20, 2026

0.1.47

Feb 2, 2026

0.1.46

Feb 1, 2026

0.1.45

Feb 1, 2026

0.1.44

Jan 31, 2026

0.1.43

Dec 5, 2025

0.1.42

Dec 5, 2025

This version

0.1.41

Nov 18, 2025

0.1.40

Nov 17, 2025

0.1.39

Nov 17, 2025

0.1.38

Nov 17, 2025

0.1.37

Nov 17, 2025

0.1.36

Nov 17, 2025

0.1.35

Nov 17, 2025

0.1.34

Nov 17, 2025

0.1.33

Nov 17, 2025

0.1.32

Nov 1, 2025

0.1.31

Oct 31, 2025

0.1.30

Aug 26, 2025

0.1.29

Aug 26, 2025

0.1.28

Aug 7, 2025

0.1.27

Aug 7, 2025

0.1.26

Aug 7, 2025

0.1.25

Jul 27, 2025

0.1.24

Jul 27, 2025

0.1.22

Jul 26, 2025

0.1.21

Jul 26, 2025

0.1.20

Jul 26, 2025

0.1.19

Jul 26, 2025

0.1.18

Jul 26, 2025

0.1.17

Jul 26, 2025

0.1.16

Jul 26, 2025

0.1.15

Jul 26, 2025

0.1.14

Jul 26, 2025

0.1.13

Jul 25, 2025

0.1.12

Jul 25, 2025

0.1.11

Jul 25, 2025

0.1.10

Jul 25, 2025

0.1.9

Jul 25, 2025

0.1.8

Jul 25, 2025

0.1.7

Jul 25, 2025

0.1.6

Jul 25, 2025

0.1.5

Jul 25, 2025

0.1.4

Jul 24, 2025

0.1.3

Jul 24, 2025

0.1.2

Jul 24, 2025

0.1.1

Jul 24, 2025

0.1.0

Jul 24, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

seqmat-0.1.41.tar.gz (48.5 kB view details)

Uploaded Nov 18, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

seqmat-0.1.41-py3-none-any.whl (41.3 kB view details)

Uploaded Nov 18, 2025 Python 3

File details

Details for the file seqmat-0.1.41.tar.gz.

File metadata

Download URL: seqmat-0.1.41.tar.gz
Upload date: Nov 18, 2025
Size: 48.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.10

File hashes

Hashes for seqmat-0.1.41.tar.gz
Algorithm	Hash digest
SHA256	`df62546467c8dbad5e69b452da1f30c49e258cdd4dc0ab680567c54b6c6e115c`
MD5	`365962497d621c039babd61a90868f32`
BLAKE2b-256	`fc5afa4704859f744d7cd77f9817069808bcc0cf5fe2f4459044bbd42b8d5ce9`

See more details on using hashes here.

File details

Details for the file seqmat-0.1.41-py3-none-any.whl.

File metadata

Download URL: seqmat-0.1.41-py3-none-any.whl
Upload date: Nov 18, 2025
Size: 41.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.10

File hashes

Hashes for seqmat-0.1.41-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f21c8c50a47be1fa391369d9f5a1768bdd213329b0dca85ec818b54e81f19781`
MD5	`bddb4d5bd47a8a433fd536bf72784d5e`
BLAKE2b-256	`b2e9c25e09b12c4ee1139d664339e203d695ad040f493bdf6c79d6359bf27dba`

See more details on using hashes here.

seqmat 0.1.41

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

SeqMat

Key Features

Installation

Basic Installation

With Bioinformatics Features

From Source

Quick Start

1. Basic Sequence Operations

2. Working with Genomic Coordinates

3. Advanced Mutation Operations

4. Sequence Transformations

5. Working with Genes (requires genomics data setup)

6. Transcript Analysis

7. Loading from FASTA Files

Data Setup and Configuration

Quick Setup for Supported Organisms

Configuration System

Configuration File Structure

Directory Structure

Adding Support for New Organisms

Option 1: Modify Configuration (Recommended)

Option 2: Extend Default Data Sources

Option 3: Manual Setup for Custom Data

Data Sources and URLs

Customizing Directory Structure

Configuration Management

Troubleshooting Setup

Configuration Best Practices

Data Inspection and Management

Python API

Command Line Interface

Example CLI Output

API Reference

SeqMat Class

Gene Class

Transcript Class

Configuration Management

Data Inspection Utilities

Command Line Interface

Performance

Dependencies

Examples

Contributing

License

Citation

Support

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes