Skip to main content

An implementation of sequence alignment on the basis of cross-correlation technique

Project description

Contributors Forks Stargazers Issues MIT License LinkedIn


Logo

Sequence Alignment
by cross correlation

This is an implementation of the cross correlation algorithm for DNA/RNA alignment
Explore the docs »

Report Bug · Request Feature

Table of Contents
  1. About The Project
  2. Getting Started
  3. Usage
  4. Roadmap
  5. Contributing
  6. License
  7. Contact
  8. Acknowledgments

About The Project

The project is based on the paper Sequence Alignment by Cross-Correlation and bridges the gap between the knowledge and an useful implementation. The underlying technique is the cross correlation algorithm, which can be tested in this notebook.

Another notebook can be seen here. The focus of this notebook is to illustrate how we can use the cross correlation for genomic data.

The gene sequences were obtained using the NCBI database.

Example Screen Shot

(back to top)

Getting Started

This is an example of how you may give instructions on setting up your project locally. To get a local copy up and running follow these simple example steps.

Prerequisites

This is an example of how to list things you need to use the software and how to install them.

Installation

  1. Clone the repo
    git clone https://github.com/kuhjuice/sequence_alignment_by_cross_correlation.git
    
  2. Navigate into the repo
    cd sequence_alignment_by_cross_correlation
    
  3. Install requirements.txt
    pip install -r requirements.txt
    
    or
    pip3 install -r requirements.txt
    
  4. Ask the CLI for help with
    python3 sequence_alignment_by_cross_correlation.py --help
    

(back to top)

Usage

Example 1

If we want to get an understanding of what the tool can do for us, we can use the help function of the CLI.

python3 sequence_alignment_by_cross_correlation/sequence_alignment_by_cross_correlation.py --help

Help Screen Shot

Example 2

If we want to find the gene omcB in the genome of Chlamydia Trachomatis we can use the sequence files in sequencesToTest

Executing the following will search for the omcB Gene in the genome of Chlamydia Trachomatis

python3 sequence_alignment_by_cross_correlation/sequence_alignment_by_cross_correlation.py 'sequencesToTest/C_T_genome.fasta' 'sequencesToTest/omcB_C_T_100fit.fna'

Usage Screen Shot

(back to top)

Roadmap

  • Fix Bugs
  • Make it a real package
  • Evalueate if useful for other projects

See the open issues for a full list of proposed features (and known issues).

(back to top)

Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement". Don't forget to give the project a star! Thanks again!

  1. Fork the Project
  2. Create your Feature Branch (git checkout -b feature/AmazingFeature)
  3. Commit your Changes (git commit -m 'Add some AmazingFeature')
  4. Push to the Branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

(back to top)

License

Distributed under the MIT License. See LICENSE.txt for more information.

(back to top)

Contact

Bo Kern - @blood_in_the_water@chaos.social - bo@blackscript.de

Project Link: https://github.com/kuhjuice/sequence_alignment_by_cross_correlation

(back to top)

Acknowledgments

(back to top)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Built Distribution

File details

Details for the file sequence_alignment_by_cross_correlation-0.1.0.tar.gz.

File metadata

File hashes

Hashes for sequence_alignment_by_cross_correlation-0.1.0.tar.gz
Algorithm Hash digest
SHA256 292ca1505e32e01dd14b9d9bc38190e28313b1660071bbdc956a532ebc76459f
MD5 685b241d1d39b970e992a570ea15677e
BLAKE2b-256 8223b1ead3a419f7eb99245462925610aac901b1452830cc2d18c8d1bdbc684b

See more details on using hashes here.

File details

Details for the file sequence_alignment_by_cross_correlation-0.1.0-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for sequence_alignment_by_cross_correlation-0.1.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 6101ecd6f43f48f1068c5cc3393787c70049e79b434ce6d3619221d47aceb973
MD5 c1658a194b52de729954fb7c8dbf9737
BLAKE2b-256 231e9b02e7fa2375b1d23ff8bb0e21586a676cba24e3b7009e6105dfb882f954

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page