Skip to main content

Pipeline components to extract features from variants.

Project description

Bearclaw

Components to help extract features from variants, to be used as part of a pipeline.

Installation

pip3 install bearclaw

Usage

from bearclaw.preprocessing import VariantDataGenerator
from bearclaw.transforms import spectrum


# Dataframe containing locations of VCF files and labels.
dataframe = DataFrame({
    "vcf": [
        "src/test/resources/GRCh37/sample1.vcf",
        "src/test/resources/GRCh37/sample2.vcf",
    ],
    "class": [1, 0],
})
# Transform VCF files into features using `spectrum`, which counts the number of variants by flanking context.
dg = VariantDataGenerator(transform=spectrum)

# Convert dataframe to label `y` and features `X_spectrum` using `spectrum`.
X_spectrum, y = dg.flow_from_dataframe(dataframe, x_col="vcf")

Reference documentation

https://hylkedonker.gitlab.io/bearclaw/

License

For open source projects, say how it is licensed.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bearclaw-0.0.3.tar.gz (120.9 kB view details)

Uploaded Source

Built Distribution

bearclaw-0.0.3-py3-none-any.whl (103.6 kB view details)

Uploaded Python 3

File details

Details for the file bearclaw-0.0.3.tar.gz.

File metadata

  • Download URL: bearclaw-0.0.3.tar.gz
  • Upload date:
  • Size: 120.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.2

File hashes

Hashes for bearclaw-0.0.3.tar.gz
Algorithm Hash digest
SHA256 4ad5f6fb531c054e5887f22f12a4499f44a3bb156250eb1e30d2775bc43e9b5d
MD5 d520ffeaf9d0dadf66546f06288a324c
BLAKE2b-256 bad7aa951001e69a1e493d74946b6c486f22bf564d84eca9776e54f7a69366d7

See more details on using hashes here.

File details

Details for the file bearclaw-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: bearclaw-0.0.3-py3-none-any.whl
  • Upload date:
  • Size: 103.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.2

File hashes

Hashes for bearclaw-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 b9265661baf98727b1094c826ae549e4b59028c5c59c2c9fe045a9e23c0c3da2
MD5 3033bd2deef9bdce5e154c5f3ceec9ee
BLAKE2b-256 27ffa6682adc7cf3027e904b154c5da4210b7497cb8664c9f4a62adf5859c198

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page