Skip to main content

Integrated pipeline and dashboard for tracing conceptual emergence and evolution in semantic space (UDT case study).

Project description

Science of Science: UDT Concept Evolution Pipeline

This repository contains the Python implementation accompanying the bachelor thesis:

Duco Trompert (Universiteit van Amsterdam, Jan 23, 2026)

Science of Science: An Integrated Pipeline for Tracing Conceptual Emergence and Evolution in Semantic Space

The project implements an integrated pipeline for science mapping that links: data collection (OpenAlex) → pre-processing → network & embedding representations → analysis → interactive dashboard.

What it does

  • Collects and caches publication metadata from the OpenAlex API for a target concept (default: "Urban Digital Twin").
  • Builds keyword co-occurrence networks (overall and per-year slices).
  • Builds semantic similarity networks from Word2Vec embeddings trained on titles/abstracts/keywords.
  • (Optional) Builds concept-method bipartite networks using an LLM-based keyword labelling step (served via Ollama).
  • Provides an interactive Dash dashboard with network visualisations (dash-cytoscape) and time series (plotly).

Installation (Linux)

python -m venv .venv
source .venv/bin/activate
python -m pip install --upgrade pip
python -m pip install --upgrade science-of-science-pipeline-udt

Installation (Windows CMD)

python -m venv .venv
.\.venv\Scripts\activate.bat
python -m pip install --upgrade pip
python -m pip install --upgrade science-of-science-pipeline-udt

Run the dashboard

udt-dashboard

Open http://127.0.0.1:8050/ in your browser.

Deactivate the virtual environment (after usage)

deactivate

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

science_of_science_pipeline_udt-0.1.5.tar.gz (2.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file science_of_science_pipeline_udt-0.1.5.tar.gz.

File metadata

File hashes

Hashes for science_of_science_pipeline_udt-0.1.5.tar.gz
Algorithm Hash digest
SHA256 57a0df43faf7dfa9411303c880a70f9b04f9abe2bc7be62ac7157b1e18a801ef
MD5 2d9b5ed17eccd48d94c574d50fcbef6c
BLAKE2b-256 747a77f61ad7ea7866715c07dcb4dcd14c7319ae215724f7b31a08561ffe444d

See more details on using hashes here.

File details

Details for the file science_of_science_pipeline_udt-0.1.5-py3-none-any.whl.

File metadata

File hashes

Hashes for science_of_science_pipeline_udt-0.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 422ae8af2188e4a4503b11cd0acb8f0c5e15e11fe5767f77e254f82b81ab047a
MD5 a4a14046a318a4e323c4d035748af7ce
BLAKE2b-256 7efc94bf50fc5fe906fe06490c7a1ca813aec199debc3e9d520851406927ca99

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page