Integrated pipeline and dashboard for tracing conceptual emergence and evolution in semantic space (UDT case study).

Science of Science: UDT Concept Evolution Pipeline

This repository contains the Python implementation accompanying the bachelor thesis:

Duco Trompert (Universiteit van Amsterdam, Jan 23, 2026)

Science of Science: An Integrated Pipeline for Tracing Conceptual Emergence and Evolution in Semantic Space

The project implements an integrated pipeline for science mapping that links: data collection (OpenAlex) → pre-processing → network & embedding representations → analysis → interactive dashboard.

What it does

  • Collects and caches publication metadata from the OpenAlex API for a target concept (default: "Urban Digital Twin").
  • Builds keyword co-occurrence networks (overall and per-year slices).
  • Builds semantic similarity networks from Word2Vec embeddings trained on titles/abstracts/keywords.
  • (Optional) Builds concept–method bipartite networks using an LLM-based keyword labelling step (served via Ollama).
  • Provides an interactive Dash dashboard with network visualisations (dash-cytoscape) and time series (plotly).
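As an illustration of the keyword co-occurrence step listed above, the following is a minimal sketch using only the Python standard library. It is not the package's own code: the record layout (`year`, `keywords`), the sample data, and the function names `cooccurrence_edges` and `cooccurrence_by_year` are assumptions made for this example. It counts how often two keywords appear on the same publication, once over all records and once per publication year.

```python
from collections import Counter, defaultdict
from itertools import combinations

# Hypothetical records; the real pipeline would derive these from OpenAlex metadata.
records = [
    {"year": 2021, "keywords": ["digital twin", "smart city", "iot"]},
    {"year": 2021, "keywords": ["digital twin", "smart city"]},
    {"year": 2022, "keywords": ["digital twin", "bim", "smart city"]},
]


def cooccurrence_edges(records):
    """Count how often each unordered keyword pair shares a record."""
    edges = Counter()
    for rec in records:
        # Deduplicate and sort so every pair has a single canonical order.
        keywords = sorted(set(rec["keywords"]))
        edges.update(combinations(keywords, 2))
    return edges


def cooccurrence_by_year(records):
    """Build one edge counter per publication year (the per-year slices)."""
    by_year = defaultdict(list)
    for rec in records:
        by_year[rec["year"]].append(rec)
    return {year: cooccurrence_edges(recs) for year, recs in sorted(by_year.items())}


overall = cooccurrence_edges(records)
per_year = cooccurrence_by_year(records)
```

Each key in `overall` is a keyword pair (an edge) and each value is its weight, so the counters can be passed to a graph library or inspected directly, for example with `overall.most_common(5)`.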

Installation (Linux/macOS)

python -m venv .venv
source .venv/bin/activate
python -m pip install --upgrade pip
pip install science-of-science-pipeline-udt

Installation (Windows PowerShell)

py -m venv .venv
.\.venv\Scripts\Activate.ps1
py -m pip install --upgrade pip
pip install science-of-science-pipeline-udt

If PowerShell blocks activation, run:

Set-ExecutionPolicy -ExecutionPolicy RemoteSigned -Scope CurrentUser

Then try activating again.

Run the dashboard

udt-dashboard

Open http://127.0.0.1:8050/ in your browser.

Deactivate the virtual environment (after usage)

deactivate
