
Integrated pipeline and dashboard for tracing conceptual emergence and evolution in semantic space (UDT case study).

Science of Science — UDT Concept Evolution Pipeline

This repository contains the Python implementation accompanying the bachelor thesis:

Duco Trompert (Universiteit van Amsterdam, Jan 23, 2026)

Science of Science: An Integrated Pipeline for Tracing Conceptual Emergence and Evolution in Semantic Space

The project implements an integrated science-mapping pipeline with five linked stages: data collection (OpenAlex) → pre-processing → network & embedding representations → analysis → interactive dashboard.
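As a rough illustration of the data collection stage, the sketch below queries the public OpenAlex `/works` endpoint and caches each response page on disk. It is not the package's actual collector: the function names, the `openalex_cache` directory, and the choice of the `search` parameter are assumptions made for this example.

```python
import hashlib
import json
import urllib.parse
import urllib.request
from pathlib import Path

OPENALEX_WORKS_URL = "https://api.openalex.org/works"


def build_works_url(concept, cursor="*", per_page=200):
    """Build an OpenAlex /works URL for a free-text search (hypothetical helper)."""
    params = {"search": concept, "per-page": per_page, "cursor": cursor}
    return OPENALEX_WORKS_URL + "?" + urllib.parse.urlencode(params)


def fetch_page(concept, cursor="*", cache_dir="openalex_cache",
               opener=urllib.request.urlopen):
    """Return one page of results, reusing a cached JSON file when one exists."""
    url = build_works_url(concept, cursor)
    # Assumption: one cache file per request URL, named by the URL's SHA-256.
    cache_file = Path(cache_dir) / (hashlib.sha256(url.encode()).hexdigest() + ".json")
    if cache_file.exists():
        return json.loads(cache_file.read_text(encoding="utf-8"))
    with opener(url) as response:
        payload = json.load(response)
    cache_file.parent.mkdir(parents=True, exist_ok=True)
    cache_file.write_text(json.dumps(payload), encoding="utf-8")
    return payload
```

Calling `fetch_page("Urban Digital Twin")` would return the first page; following pages can be requested by passing the `next_cursor` value from the response's `meta` field as `cursor`. The `opener` argument exists only so the network call can be swapped out in tests.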

What it does

  • Collects and caches publication metadata from the OpenAlex API for a target concept (default: "Urban Digital Twin").
  • Builds keyword co-occurrence networks (overall and per-year slices).
  • Builds semantic similarity networks from Word2Vec embeddings trained on titles/abstracts/keywords.
  • (Optional) Builds concept–method bipartite networks using an LLM-based keyword labelling step (served via Ollama).
  • Provides an interactive Dash dashboard with network visualisations (dash-cytoscape) and time series (plotly).
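The keyword co-occurrence step listed above can be sketched with the standard library alone. This is a minimal illustration, not the package's implementation: the record layout (`year`, `keywords`), the lowercase normalisation, and the sample records are all assumptions made up for the example.

```python
from collections import Counter, defaultdict
from itertools import combinations


def cooccurrence_edges(records):
    """Count how often each unordered keyword pair appears in the same record."""
    edges = Counter()
    for record in records:
        # Normalise and de-duplicate so each pair counts once per publication.
        keywords = sorted({kw.strip().lower() for kw in record["keywords"] if kw.strip()})
        edges.update(combinations(keywords, 2))
    return edges


def cooccurrence_by_year(records):
    """Build one edge counter per publication year (the per-year slices)."""
    by_year = defaultdict(list)
    for record in records:
        by_year[record["year"]].append(record)
    return {year: cooccurrence_edges(group) for year, group in sorted(by_year.items())}


# Made-up sample records, for illustration only.
records = [
    {"year": 2022, "keywords": ["Digital Twin", "Smart City", "GIS"]},
    {"year": 2023, "keywords": ["digital twin", "smart city"]},
    {"year": 2023, "keywords": ["Digital Twin", "BIM"]},
]
overall = cooccurrence_edges(records)
per_year = cooccurrence_by_year(records)
```

Each key in `overall` is an edge between two keywords and each value is its weight, so the counters can be loaded into any graph library or passed to a visualisation layer as weighted edge lists.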

Installation (Linux/macOS)

python -m venv .venv
source .venv/bin/activate
python -m pip install --upgrade pip
pip install science-of-science-pipeline-udt
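To confirm that the install succeeded inside the active virtual environment, a short check with the standard library's `importlib.metadata` may help. The helper name `installed_version` is hypothetical and exists only in this sketch.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(distribution):
    """Return the installed version string, or None if it is not installed."""
    try:
        return version(distribution)
    except PackageNotFoundError:
        return None


# Prints a version string after a successful install, otherwise None.
print(installed_version("science-of-science-pipeline-udt"))
```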

Run the dashboard

udt-dashboard

Open http://127.0.0.1:8050/ in your browser.


Source Distribution

science_of_science_pipeline_udt-0.1.1.tar.gz (2.1 MB)


Hashes for science_of_science_pipeline_udt-0.1.1.tar.gz
Algorithm Hash digest
SHA256 0fd48a2494b3d5a661892ece8d785ddf3b474dc98ce2e5bee52434a4d6d72de3
MD5 5eea5394af4b27c4c4cf850b83552cca
BLAKE2b-256 3f6a5eaf6aeec8ee2814c0808b797962724ee2e863d1c71c839b661249d77d06


Hashes for science_of_science_pipeline_udt-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 6c9e782f6675f5086bbce75442a809fa9c2a2f3e6a0c3e3f8ebaf47ef03b4d40
MD5 872476e8d50266227cbbad708a01e88f
BLAKE2b-256 f8ba414a1103bc6f4639ed088ac49c7e86bdd515d4255fce0db787fc53397ba0
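A downloaded file can be checked against the SHA256 digests listed above with Python's `hashlib`. The sketch below is one way to do it; the function names and the 1 MiB chunk size are assumptions of this example.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Compute the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a file's digest with a published one, ignoring case and whitespace."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, `matches_published_hash("science_of_science_pipeline_udt-0.1.1.tar.gz", "0fd48a2494b3d5a661892ece8d785ddf3b474dc98ce2e5bee52434a4d6d72de3")` should return `True` for an intact copy of the source distribution saved under that name in the current directory.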
