Pipeline components to extract features from variants.

Project description

Bearclaw

Bearclaw provides components that extract features from variants stored in VCF files, for use as steps in a pipeline.

Installation

pip3 install bearclaw

Usage

from bearclaw.preprocessing import VariantDataGenerator
from bearclaw.transforms import spectrum
from pandas import DataFrame


# Dataframe containing locations of VCF files and labels.
dataframe = DataFrame({
    "vcf": [
        "src/test/resources/GRCh37/sample1.vcf",
        "src/test/resources/GRCh37/sample2.vcf",
    ],
    "class": [1, 0],
})
# Transform VCF files into features using `spectrum`, which counts the number of variants by flanking context.
dg = VariantDataGenerator(transform=spectrum)

# Convert dataframe to label `y` and features `X_spectrum` using `spectrum`.
X_spectrum, y = dg.flow_from_dataframe(dataframe, x_col="vcf")
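To make "counts the number of variants by flanking context" concrete, here is a minimal, self-contained sketch of the idea. It is not bearclaw's implementation of `spectrum`: the function name `count_by_context`, the `(position, ref, alt)` tuple layout, the 0-based positions, and the toy reference sequence are all assumptions made for illustration.

```python
from collections import Counter


def count_by_context(variants, reference):
    """Count single-base substitutions by their flanking bases.

    Hypothetical helper, not part of bearclaw. `variants` is an iterable of
    (position, ref, alt) tuples with 0-based positions into `reference`
    (VCF files themselves use 1-based positions).
    """
    counts = Counter()
    for pos, ref, alt in variants:
        # Skip variants at the sequence ends: one flank is missing there.
        if pos <= 0 or pos >= len(reference) - 1:
            continue
        if reference[pos] != ref:
            raise ValueError(f"Reference mismatch at position {pos}")
        # Key of the form 5'-base[ref>alt]3'-base, e.g. "A[C>T]G".
        context = f"{reference[pos - 1]}[{ref}>{alt}]{reference[pos + 1]}"
        counts[context] += 1
    return counts


# Toy data (assumed, not taken from the bearclaw test resources).
reference = "ACGTACGTAC"
variants = [(1, "C", "T"), (5, "C", "T"), (2, "G", "A")]
counts = count_by_context(variants, reference)
print(dict(counts))
```

Each distinct context becomes one feature column, and its count is the feature value for that sample; this is the general shape of feature matrix that a spectrum-style transform would be expected to produce.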

Reference documentation

https://hylkedonker.gitlab.io/bearclaw/

Download files

Source Distribution

bearclaw-0.0.2.tar.gz (131.5 kB)

Uploaded Source

Built Distribution

bearclaw-0.0.2-py3-none-any.whl (103.5 kB)

Uploaded Python 3

File details

Details for the file bearclaw-0.0.2.tar.gz.

File metadata

  • Download URL: bearclaw-0.0.2.tar.gz
  • Size: 131.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.6

File hashes

Hashes for bearclaw-0.0.2.tar.gz

Algorithm     Hash digest
SHA256        6914e335cae06509b5898cb30543079306c66ad2cf388b6e49da6ae6da707b5c
MD5           e591672bce54c5879138c41a0619157b
BLAKE2b-256   0f8d9bc1a4053f647c18be4c2ec0c32ebd12907160f90cc450ca2d117dcadb2a
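A downloaded file can be checked against the published SHA256 digest before installing it. The following is a minimal sketch using only the Python standard library; the helper names `sha256_of_file` and `matches_published_hash` are hypothetical, and the file path is whatever location the archive was saved to.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, `matches_published_hash("bearclaw-0.0.2.tar.gz", "6914e335cae06509b5898cb30543079306c66ad2cf388b6e49da6ae6da707b5c")` should return `True` for an intact copy of the source distribution, assuming the file sits in the current directory.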

File details

Details for the file bearclaw-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: bearclaw-0.0.2-py3-none-any.whl
  • Size: 103.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.6

File hashes

Hashes for bearclaw-0.0.2-py3-none-any.whl

Algorithm     Hash digest
SHA256        848beb3ad33fc3fef18d79f6f183fa7185a4ca15dbc0aa03eb7965a2ac70f7a1
MD5           0a909245b4de5b80918f33fcb2c4525d
BLAKE2b-256   ad636f0ed29a38f85702407b45fa2b737d39630af28242307a2e77688fd2c066
