A command-line tool for BRAD enrichment analysis

Gene Enrichment with BRAD

  1. Note: openpyxl is an additional requirement beyond those of the BRAD-DEV environment.

Setup

Follow the instructions below to install the necessary dependencies and build the RAG database. This setup needs to be run only once, when installing the system. The dependencies are identical to those of the BRAD repository, with the addition of scrapetube and youtube_transcript_api.

Install Dependencies

Install BRAD

This code must be run from the video-rag branch of the BRAD repository, and the roots of both repositories must be located within the same directory. Run the commands below to clone both repositories:

git clone https://github.com/Jpickard1/BRAD.git
git clone https://github.com/Jpickard1/BRAD-Video.git
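
Because both repository roots must sit in the same directory, a quick check after cloning can catch a misplaced checkout. The following is a minimal sketch, assuming the default directory names that `git clone` derives from the URLs above (`BRAD` and `BRAD-Video`). The helper name `missing_repos` is hypothetical and is not part of either repository.

```python
from pathlib import Path


def missing_repos(parent, names=("BRAD", "BRAD-Video")):
    """Return the names from `names` that are not directories under `parent`.

    The default names are an assumption: they are what `git clone` creates
    when no target directory is given.
    """
    parent = Path(parent)
    return [name for name in names if not (parent / name).is_dir()]


# Example (run from the directory that holds both clones):
#   missing_repos(".")  returns an empty list when both roots are present
```

This only confirms the directory layout. It does not verify that the BRAD checkout is on the video-rag branch.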

Backend (python)

  1. Activate the BRAD-DEV or BRAD-1 conda environment used for developing BRAD, set up according to the specifications of that repository.

If you wish to expand the library of searchable videos beyond those provided, install the following dependencies:

  1. pip install scrapetube
  2. pip install youtube_transcript_api
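
To confirm that the optional packages are visible to the active environment, one option is to look them up without importing them. This is a small sketch using only the standard library. The helper name `missing_modules` is hypothetical, and the module names are assumed to match the package names given above.

```python
from importlib.util import find_spec


def missing_modules(names):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if find_spec(name) is None]


# Module names assumed to match the pip package names listed above.
OPTIONAL_MODULES = ["scrapetube", "youtube_transcript_api"]

if __name__ == "__main__":
    absent = missing_modules(OPTIONAL_MODULES)
    if absent:
        print("Not installed:", ", ".join(absent))
    else:
        print("All optional modules are available.")
```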

Frontend (javascript)

Run the following set of commands from the root of this repository.

curl -o- https://raw.githubusercontent.com/creationix/nvm/master/install.sh | bash
source ~/.nvm/nvm.sh
nvm install 20.18.0
nvm use 20.18.0
npm install --prefix ./brad-chat

Turn On

In separate terminals, execute the following commands from the root of this repository.

Terminal 1 (frontend):

cd brad-chat
npm start

Terminal 2 (backend):

export OPENAI_API_KEY=<PLACE YOUR OPENAI API KEY HERE>
flask --app app run --host=0.0.0.0 --port=5000

Note: Slight variations may be required if you are running this on Windows or other systems.
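
The backend reads `OPENAI_API_KEY` from the environment, so it can help to confirm the variable is set before starting Flask. The following is a minimal sketch of such a check. The helper name `api_key_problem` is hypothetical, and the placeholder test assumes the variable was left as angle-bracket text like the template in the launch commands.

```python
import os


def api_key_problem(env=None):
    """Return a short message if OPENAI_API_KEY looks unusable, else None."""
    env = os.environ if env is None else env
    value = env.get("OPENAI_API_KEY", "").strip()
    if not value:
        return "OPENAI_API_KEY is not set"
    if value.startswith("<"):
        # Assumption: a leading "<" means the template placeholder was not replaced.
        return "OPENAI_API_KEY still holds placeholder text"
    return None


if __name__ == "__main__":
    problem = api_key_problem()
    print(problem if problem else "OPENAI_API_KEY is set")
```

The check only inspects the variable. It does not contact the OpenAI API or validate the key.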

Build New Video RAG Databases from YouTube (optional)

To build the RAG database:

python youtube_database_construction.py

Modifications

  1. Specify the YouTube channel.
  2. Specify the videos (a command-line argument for this still needs to be added).
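
The two modifications listed above could be exposed as command-line arguments. The sketch below shows one possible shape using `argparse`. It is purely illustrative: the flags `--channel` and `--videos` are hypothetical and are not the current interface of `youtube_database_construction.py`.

```python
import argparse


def build_parser():
    """Build a hypothetical argument parser for selecting a channel and videos."""
    parser = argparse.ArgumentParser(
        description="Hypothetical options for building a video RAG database."
    )
    # Hypothetical flag: identifier or URL of the YouTube channel to index.
    parser.add_argument("--channel", required=True, help="YouTube channel to index")
    # Hypothetical flag: optional list of video IDs; empty means all videos.
    parser.add_argument("--videos", nargs="*", default=[], help="video IDs to include")
    return parser


# Parse an explicit argument list so the example does not depend on sys.argv.
args = build_parser().parse_args(["--channel", "EXAMPLE_CHANNEL", "--videos", "abc123", "def456"])
print(args.channel, args.videos)
```

Leaving `--videos` empty by default would preserve the whole-channel behavior while allowing a narrower selection.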

Cite As

@article{pickard2024language,
  title={Language Model Powered Digital Biology with BRAD},
  author={Pickard, Joshua and Prakash, Ram and Choi, Marc Andrew and Oliven, Natalie and
          Stansbury, Cooper and Cwycyshyn, Jillian
          and Gorodetsky, Alex and Velasquez, Alvaro and Rajapakse, Indika},
  journal={arXiv preprint arXiv:2409.02864},
  url={https://arxiv.org/abs/2409.02864},
  year={2024}
}

Download files

Source Distribution

brad_enrichment-0.1.0.tar.gz (16.4 kB view details)

Uploaded Source

Built Distribution

brad_enrichment-0.1.0-py3-none-any.whl (17.1 kB view details)

Uploaded Python 3

File details

Details for the file brad_enrichment-0.1.0.tar.gz.

File metadata

  • Download URL: brad_enrichment-0.1.0.tar.gz
  • Upload date:
  • Size: 16.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for brad_enrichment-0.1.0.tar.gz
Algorithm Hash digest
SHA256 6edbd4d371f8b0588bca02ea993aa084112306225e39c34130c7e3d63ac3ec40
MD5 5af99dbbd15600484263df1f618ab02e
BLAKE2b-256 2b6fc4c2adb81d52f06bf7299b353fc172d30715670fc4317db2afd29f9b5490
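
A downloaded file can be compared against the published SHA256 digest before installation. The following is a minimal sketch using the standard library's `hashlib`. The helper names `sha256_of` and `matches_published` are hypothetical.

```python
import hashlib


def sha256_of(path, chunk_size=1 << 16):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published(path, published_hex):
    """Return True if the file's SHA256 digest equals the published hex string."""
    return sha256_of(path) == published_hex.strip().lower()


# Example, assuming the archive was downloaded to the current directory:
#   matches_published(
#       "brad_enrichment-0.1.0.tar.gz",
#       "6edbd4d371f8b0588bca02ea993aa084112306225e39c34130c7e3d63ac3ec40",
#   )
```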

File details

Details for the file brad_enrichment-0.1.0-py3-none-any.whl.

File hashes

Hashes for brad_enrichment-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 5dc62959dc9267745fcec4830c5e042b111cc8af4a558bc5cf064a70ec90b1b5
MD5 1df62148dbfe6b753c30cba0b6077571
BLAKE2b-256 d9a487e90c86c33110cc5e17e92f2b4d7f9dfb0b2cd4330ded53f1164276166e
