Skip to main content

This package search for literature on multiple databases and allow uses to filter them using PRISMA method

Project description

PRISMA Search Literature Review Tool

Overview

Automate and document literature reviews using the PRISMA methodology. Input a research topic or keywords, manage PRISMA data, and output publication lists and methodology text for research papers.

Disclaimer:
Review and verify results before including them in research. Automated outputs assist, not replace, expert judgment.
API Usage Notice:
Use APIs responsibly to avoid rate limiting or loss of access.

Features

  • Flexible keyword input: list or comma-separated string
  • Smart keyword handling: derives keywords from research topic if not provided
  • Enhanced output: multiple CSVs for all and selected publications
  • Collects PRISMA-related values (inclusion/exclusion criteria, databases, date ranges)
  • Outputs:
    • PRISMA method data for literature review
    • CSVs: all_publications_found.csv, selected_publications.csv, output_results.csv
    • JSON file with results
    • PRISMA flow diagram in draw.io format (default or user template)
  • Supports multiple databases: PubMed, CrossRef, arXiv, CORE, SemanticScholar
  • Validates configuration files
  • Retries API requests with exponential backoff
  • Modular, extensible design

PRISMA Flow Diagram Template

Default template:
src/literature_search/prisma_flow_diagram.drawio
User template: place prisma_flow_diagram.drawio in output directory.
Filled diagram: prisma_flow_diagram_filled.drawio in output directory.

Configuration

Keywords

Provide keywords as a list or comma-separated string:

{
  "keywords": ["machine learning", "deep learning", "neural networks"]
}

or

{
  "keywords": "machine learning, deep learning, neural networks"
}

If missing, keywords are generated from research_topic with a warning.

Sample Configuration

{
  "research_topic": "fundus image dataset",
  "keywords": "medical-imaging, machine-learning, fundus, retinal, ophthalmology",
  "initial_prisma_values": {
    "inclusion_criteria": ["review", "thesis", "journal", "book"],
    "exclusion_criteria": ["non-english", "conference"],
    "databases": ["PubMed", "CrossRef", "arXiv", "CORE", "SemanticScholar"],
    "date_range": "2015-2025"
  }
}

Installation

Install via pip:

pip install literature_search

Or from wheel:

pip install dist/literature_search-*.whl

Usage

Console Script

Run from anywhere:

literature-search --config sample_input.json --logic OR --page_size 100 --output_dir output

Arguments:

  • --config: Path to config JSON (default: sample_input.json)
  • --logic: Keyword logic ('AND' or 'OR', default: OR)
  • --page_size: Results per database (default: 100)
  • --output_dir: Output directory (default: output)

Programmatic Usage

Import and use in Python:

from literature_search.config_loader import load_config
from literature_search.keywords import get_keywords
from literature_search.api_clients import (
    get_publications_europe_pmc,
    get_publications_crossref,
    get_publications_arxiv,
    get_publications_core,
    get_publications_semanticscholar
)
from literature_search.prisma_logs import output_prisma_results, create_prisma_drawio_diagram

config = load_config('sample_input.json')
keywords = get_keywords(config.get('research_topic', ''))
publications = get_publications_europe_pmc(keywords)
output_prisma_results(publications, criteria_counts, total_records, output_dir='output')
create_prisma_drawio_diagram(criteria_counts, total_records, output_dir='output')

Output Files

  • all_publications_found.csv: All publications with inclusion/exclusion info
  • selected_publications.csv: Publications meeting inclusion criteria
  • output_results.csv: All publications with inclusion status
  • results.json: Complete results
  • prisma_flow_diagram.drawio: PRISMA flow diagram

Requirements

  • Ubuntu 24.04.2 LTS
  • Python 3.10+
  • Python packages: requests, drawpyo, matplotlib, pytest

Contributing

See .github/copilot-instructions.md for coding standards.

Contribution Guide

  1. Fork and create a feature branch.
  2. Follow PEP 8 and PRISMA methodology.
  3. Write clear commit messages and documentation.
  4. Add tests for new features or fixes.
  5. Submit a pull request.

License

GPL-3

Future Work

  • Add more database APIs
  • AI-based abstract inclusion/exclusion

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

literature_search-0.0.1.tar.gz (16.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

literature_search-0.0.1-py3-none-any.whl (16.3 kB view details)

Uploaded Python 3

File details

Details for the file literature_search-0.0.1.tar.gz.

File metadata

  • Download URL: literature_search-0.0.1.tar.gz
  • Upload date:
  • Size: 16.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.1

File hashes

Hashes for literature_search-0.0.1.tar.gz
Algorithm Hash digest
SHA256 e71f5e8596c00b7a7b41928107bbce3c2d30b58967059029ac0584b1d32ce7d2
MD5 05c9c0549b7e804328eecb89dc1ec04f
BLAKE2b-256 17944a866f3cc00054176de0b623e50b31bf9b50d197c6c1b6a57fdc213bc1f6

See more details on using hashes here.

File details

Details for the file literature_search-0.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for literature_search-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 aa380eeadbf734781e6714ea1e84412bdc8480c15cb2646633fff84eaa1d039b
MD5 d086da10e65931dae5e03d51b6735b31
BLAKE2b-256 846e21b7c54213a427eaec2628f31a83c044b3001a8beda10ec6026c8b39f29f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page