Skip to main content

Integrated pipeline and dashboard for tracing conceptual emergence and evolution in semantic space (UDT case study).

Project description

Science of Science: UDT Concept Evolution Pipeline

This repository contains the Python implementation accompanying the bachelor thesis:

Duco Trompert (Universiteit van Amsterdam, Jan 23, 2026)

Science of Science: An Integrated Pipeline for Tracing Conceptual Emergence and Evolution in Semantic Space

The project implements an integrated pipeline for science mapping that links: data collection (OpenAlex) → pre-processing → network & embedding representations → analysis → interactive dashboard.

What it does

  • Collects and caches publication metadata from the OpenAlex API for a target concept (default: "Urban Digital Twin").
  • Builds keyword co-occurrence networks (overall and per-year slices).
  • Builds semantic similarity networks from Word2Vec embeddings trained on titles/abstracts/keywords.
  • (Optional) Builds concept-method bipartite networks using an LLM-based keyword labelling step (served via Ollama).
  • Provides an interactive Dash dashboard with network visualisations (dash-cytoscape) and time series (plotly).

Installation (Linux/macOS)

python -m venv .venv
source .venv/bin/activate
python -m pip install --upgrade pip
python -m pip install --upgrade science-of-science-pipeline-udt

Installation (Windows CMD)

python -m venv .venv
.\.venv\Scripts\activate.bat
python -m pip install --upgrade pip
python -m pip install --upgrade science-of-science-pipeline-udt

Run the dashboard

udt-dashboard

Open http://127.0.0.1:8050/ in your browser.

Deactivate the virtual environment (after usage)

deactivate

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

science_of_science_pipeline_udt-0.1.4.tar.gz (2.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file science_of_science_pipeline_udt-0.1.4.tar.gz.

File metadata

File hashes

Hashes for science_of_science_pipeline_udt-0.1.4.tar.gz
Algorithm Hash digest
SHA256 c7d0a5e703697e6ca12d919fb7a4a2219d089269052b9639148a7900bbc592c6
MD5 615521e7b957e7154a7d42234b11766f
BLAKE2b-256 841f40f2ba993e72e3c5db6219557d856ec17e265ff5cce8918dc78cd84e5fdd

See more details on using hashes here.

File details

Details for the file science_of_science_pipeline_udt-0.1.4-py3-none-any.whl.

File metadata

File hashes

Hashes for science_of_science_pipeline_udt-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 607fea90af2fe5802944cb759284c6997f6547257706ecff994f7baceaa636b0
MD5 04cb21b459f91b0e06975940f0ffa086
BLAKE2b-256 b54c77b38b2eca098e223b1e3a67969ac14c1a82ccf07bee2965b7ef045e14ef

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page