A python package for integrating data from multiple resources

These details have not been verified by PyPI

Project links

Project description

pyBioDataFuse

💪 Getting Started

We introduce BioDataFuse, a query-based Python tool for seamless integration of biomedical databases. BioDataFuse establishes a modular framework for efficient data wrangling, enabling context-specific knowledge graph creation and supporting graph-based analyses. With a user-friendly interface, it enables users to dynamically create knowledge graphs from their input data. Supported by a robust Python package, pyBiodatafuse, this tool excels in data harmonization, aggregating diverse sources through modular queries. Moreover, BioDataFuse provides plugin capabilities for Cytoscape and Neo4j, allowing local graph hosting. Ongoing refinements enhance the graph utility through tasks like link prediction, making BioDataFuse a versatile solution for efficient and effective biological data integration.

To know more about the package, read our documentation here.

Creating your own graph

To generate your own graph, check out our tutorial notebook in examples.

We support exporting of the graphs in Cytoscape, Neo4J and GraphDB. You can use the following functions:

# on neo4j
neo4j.load_graph(pygraph, uri="bolt://localhost:7687", username="YOUR_USERNAME", password="YOUR_PASSWORD")  # change username and password

# on cytoscape
cytoscape.load_graph(pygraph, network_name="YOUR_CUSTOM_NAME")

# rdf ttl files
bdf = BDFGraph(
    base_uri="https://biodatafuse.org/YOUR_CUSTOM_NAME/",
    version_iri="https://biodatafuse.org/example/YOUR_CUSTOM_NAME.ttl",
    orcid="YOUR_ORCID",
    author="YOUR_NAME",
)

bdf.generate_rdf(combined_df, combined_metadata)  # Generate the RDF from the (meta)data files from the example runs
bdf.serialize(
    "YOUR_CUSTOM_NAME.ttl",
    format="ttl",
)

🚀 Installation

The most recent release can be installed from PyPI with:

$ pip install pyBiodatafuse

The most recent code and data can be installed directly from GitHub with:

$ pip install git+https://github.com/BioDataFuse/pyBiodatafuse.git

👐 Contributing

Contributions, whether filing an issue, making a pull request, or forking, are appreciated. See CONTRIBUTING.md for more information on getting involved.

👋 Attribution

⚖️ License

The code in this package is licensed under the MIT License.

📖 Citation

The work was started as part of the Elixir BioHackathon 2023 integrating and bringing together multiple Core Data Resources together.

Gadiya, Y., Ammar, A., Willighagen, E., Martinat, D., Sima, A. C., Balci, H., & Abbassi Daloii, T. (2023). BioHackEU23 report: Extending interoperability of experimental data using modular queries across biomedical resources. BioHackrXiv Preprints. https://doi.org/10.37044/osf.io/mhsqp

🍪 Cookiecutter

This package was created with @audreyfeldroy's cookiecutter package using @cthoyt's cookiecutter-snekpack template.

🛠️ For Developers

See developer instructions

The final section of the README is for if you want to get involved by making a code contribution.

Development Installation

To install in development mode, use the following:

$ git clone git+https://github.com/BioDataFuse/pyBiodatafuse.git
$ cd pyBiodatafuse
$ pip install -e .

🥼 Testing

After cloning the repository and installing tox with pip install tox, the unit tests in the tests/ folder can be run reproducibly with:

$ tox

Additionally, these tests are automatically re-run with each commit in a GitHub Action.

📖 Building the Documentation

The documentation can be built locally using the following:

$ git clone git+https://github.com/BioDataFuse/pyBiodatafuse.git
$ cd pyBiodatafuse
$ tox -e docs
$ open docs/build/html/index.html

The documentation automatically installs the package as well as the docs extra specified in the setup.cfg. sphinx plugins like texext can be added there. Additionally, they need to be added to the extensions list in docs/source/conf.py.

📦 Making a Release

After installing the package in development mode and installing tox with pip install tox, the commands for making a new release are contained within the finish environment in tox.ini. Run the following from the shell:

$ tox -e finish

This script does the following:

Uses Bump2Version to switch the version number in the setup.cfg, src/pyBiodatafuse/version.py, and docs/source/conf.py to not have the -dev suffix
Packages the code in both a tar archive and a wheel using build
Uploads to PyPI using twine. Be sure to have a .pypirc file configured to avoid the need for manual input at this step
Push to GitHub. You'll need to make a release going with the commit where the version was bumped.
Bump the version to the next patch. If you made big changes and want to bump the version by minor, you can use tox -e bumpversion -- minor after.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.3.0

Feb 3, 2026

This version

1.2.0

Jul 1, 2025

1.1.0

Jan 31, 2025

1.0.0

Dec 2, 2024

1.0.0.dev0 pre-release

Dec 2, 2024

0.0.4

Nov 29, 2024

0.0.3

Oct 31, 2023

0.0.3.dev0 pre-release

Oct 20, 2023

0.0.1

Oct 18, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pybiodatafuse-1.2.0.tar.gz (1.8 MB view details)

Uploaded Jul 1, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pyBiodatafuse-1.2.0-py3-none-any.whl (174.4 kB view details)

Uploaded Jul 1, 2025 Python 3

File details

Details for the file pybiodatafuse-1.2.0.tar.gz.

File metadata

Download URL: pybiodatafuse-1.2.0.tar.gz
Upload date: Jul 1, 2025
Size: 1.8 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.12

File hashes

Hashes for pybiodatafuse-1.2.0.tar.gz
Algorithm	Hash digest
SHA256	`439a78f16a015968ca0915d10bdc905b109084aecbba00b9d5547d208e9720a2`
MD5	`88dbffe06a0f63d205c51986c4f6a747`
BLAKE2b-256	`a16acadc7c6416eaa1ca9c46fd0b9a3b0dd0e00344511a85ed29d06fa49c3b64`

See more details on using hashes here.

File details

Details for the file pyBiodatafuse-1.2.0-py3-none-any.whl.

File metadata

Download URL: pyBiodatafuse-1.2.0-py3-none-any.whl
Upload date: Jul 1, 2025
Size: 174.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.12

File hashes

Hashes for pyBiodatafuse-1.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`156c7c2624464b672e9affebd3a29a2080b09550e1d30ee3cc4807176275bd99`
MD5	`a91ea02b24c6f6d7eb7360f09d52007b`
BLAKE2b-256	`216d53353f117f1f4d13c48da74d1eb69e6295c7d8aba119f035d224d4e008c4`

See more details on using hashes here.

pybiodatafuse 1.2.0

Navigation

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Project description

pyBioDataFuse

💪 Getting Started

Creating your own graph

🚀 Installation

👐 Contributing

👋 Attribution

⚖️ License

📖 Citation

🍪 Cookiecutter

🛠️ For Developers

Development Installation

🥼 Testing

📖 Building the Documentation

📦 Making a Release

Project details

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes