Python package interface for the RCSB PDB search API service

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
Intended Audience
- Science/Research
License
- OSI Approved :: BSD License
Natural Language
- English
Operating System
- OS Independent
Programming Language
Topic
- Scientific/Engineering :: Bio-Informatics
Typing
- Typed

Project description

⛔️ [DEPRECATED] Active at py-rcsb-api

Please migrate to our new and improved package, rcsb-api, which contains all the same functionalities as rcsbsearchapi and more! New features will only be added to the new rcsb-api package. For more details, see https://github.com/rcsb/py-rcsbsearchapi/issues/51.

rcsbsearchapi

Python interface for the RCSB PDB Search API.

This package requires Python 3.7 or later.

Quickstart

Installation

Get it from PyPI:

pip install rcsbsearchapi

Or, download from GitHub

Getting Started

Full documentation available at readthedocs

Basic Query Construction

Full-text search

To perform a "full-text" search for structures associated with the term "Hemoglobin", you can create a TextQuery:

from rcsbsearchapi import TextQuery

# Search for structures associated with the phrase "Hemoglobin"
query = TextQuery(value="Hemoglobin")

# Execute the query by running it as a function
results = query()

# Results are returned as an iterator of result identifiers.
for rid in results:
    print(rid)

Attribute search

To perform a search for specific structure or chemical attributes, you can create an AttributeQuery.

from rcsbsearchapi import AttributeQuery

# Construct a query searching for structures from humans
query = AttributeQuery(
    attribute="rcsb_entity_source_organism.scientific_name",
    operator="exact_match",  # Other operators include "contains_phrase", "exists", and more
    value="Homo sapiens"
)

# Execute query and construct a list from results
results = list(query())
print(results)

Refer to the Search Attributes and Chemical Attributes documentation for a full list of attributes and applicable operators.

Alternatively, you can also construct attribute queries with comparative operators using the rcsb_attributes object (which also allows for names to be tab-completed):

from rcsbsearchapi import rcsb_attributes as attrs

# Search for structures from humans
query = attrs.rcsb_entity_source_organism.scientific_name == "Homo sapiens"

# Run query and construct a list from results
results = list(query())
print(results)

Grouping sub-queries

You can combine multiple queries using Python bitwise operators.

from rcsbsearchapi import rcsb_attributes as attrs

# Query for human epidermal growth factor receptor (EGFR) structures (UniProt ID P00533)
#  with investigational or experimental drugs bound
q1 = attrs.rcsb_polymer_entity_container_identifiers.reference_sequence_identifiers.database_accession == "P00533"
q2 = attrs.rcsb_entity_source_organism.scientific_name == "Homo sapiens"
q3 = attrs.drugbank_info.drug_groups == "investigational"
q4 = attrs.drugbank_info.drug_groups == "experimental"

# Structures matching UniProt ID P00533 AND from humans
#  AND (investigational OR experimental drug group)
query = q1 & q2 & (q3 | q4)

# Execute query and print first 10 ids
results = list(query())
print(results[:10])

These examples are in operator syntax. You can also make queries in fluent syntax. Learn more about both syntaxes and implementation details in Constructing and Executing Queries.

Supported Search Services

The list of supported search service types are listed in the table below. For more details on their usage, see Search Service Types.

Search service	QueryType
Full-text	`TextQuery()`
Attribute (structure or chemical)	`AttributeQuery()`
Sequence similarity	`SequenceQuery()`
Sequence motif	`SequenceMotifQuery()`
Structure similarity	`StructSimilarityQuery()`
Structure motif	`StructMotifQuery()`
Chemical similarity	`ChemSimilarityQuery()`

Learn more about available search services on the RCSB PDB Search API docs.

Jupyter Notebooks

A runnable jupyter notebook is available in notebooks/quickstart.ipynb, or can be run online using Google Colab:

An additional Covid-19 related example is in notebooks/covid.ipynb:

Supported Features

The following table lists the status of current and planned features.

Structure and chemical attribute search
- Attribute Comparison operations
- Query set operations
- Attribute contains, in_ (fluent only)
Option to include computed structure models (CSMs) in search
Sequence search
Sequence motif search
Structure similarity search
Structure motif search
Chemical similarity search
Rich results using the Data API

Contributions are welcome for unchecked items!

License

Code is licensed under the BSD 3-clause license. See LICENSE for details.

Citing rcsbsearchapi

Please cite the rcsbsearchapi package by URL:

https://rcsbsearchapi.readthedocs.io

You should also cite the RCSB PDB service this package utilizes:

Yana Rose, Jose M. Duarte, Robert Lowe, Joan Segura, Chunxiao Bi, Charmi Bhikadiya, Li Chen, Alexander S. Rose, Sebastian Bittrich, Stephen K. Burley, John D. Westbrook. RCSB Protein Data Bank: Architectural Advances Towards Integrated Searching and Efficient Access to Macromolecular Structure Data from the PDB Archive, Journal of Molecular Biology, 2020. DOI: 10.1016/j.jmb.2020.11.003

Attributions

The source code for this project was originally written by Spencer Bliven and forked from sbliven/rcsbsearch. We would like to express our tremendous gratitude for his generous efforts in designing such a comprehensive public utility Python package for interacting with the RCSB PDB search API.

Developers

For information about building and developing rcsbsearchapi, see CONTRIBUTING.md

Project details

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
Intended Audience
- Science/Research
License
- OSI Approved :: BSD License
Natural Language
- English
Operating System
- OS Independent
Programming Language
Topic
- Scientific/Engineering :: Bio-Informatics
Typing
- Typed

Release history Release notifications | RSS feed

This version

2.0.1

Mar 26, 2025

2.0.0

Oct 4, 2024

1.6.0

Jun 6, 2024

1.5.1

Feb 19, 2024

1.5.0

Feb 7, 2024

1.4.2

Oct 17, 2023

1.4.1

Sep 15, 2023

1.4.0

Sep 15, 2023

1.3.0

Jul 31, 2023

1.2.0

Jul 20, 2023

1.0.0

Jun 9, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rcsbsearchapi-2.0.1.tar.gz (182.2 kB view details)

Uploaded Mar 26, 2025 Source

File details

Details for the file rcsbsearchapi-2.0.1.tar.gz.

File metadata

Download URL: rcsbsearchapi-2.0.1.tar.gz
Upload date: Mar 26, 2025
Size: 182.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.16

File hashes

Hashes for rcsbsearchapi-2.0.1.tar.gz
Algorithm	Hash digest
SHA256	`50dac1e60f58cbaae93af304ceff1b3ae18611c27477c802a066eb2c79ff32cf`
MD5	`bae3c4243517eb3c90ef6ac6444932b1`
BLAKE2b-256	`e164e5b4009eac83ce59eb60a1e6774a3cfe8b711355c6d48b42a6d326716367`

See more details on using hashes here.

rcsbsearchapi 2.0.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

⛔️ [DEPRECATED] Active at py-rcsb-api

rcsbsearchapi

Quickstart

Quickstart

Installation

Getting Started

Basic Query Construction

Full-text search

Attribute search

Grouping sub-queries

Supported Search Services

Jupyter Notebooks

Supported Features

License

Citing rcsbsearchapi

Attributions

Developers

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes