Skip to main content

Integrated pipeline and dashboard for tracing conceptual emergence and evolution in semantic space (UDT case study).

Project description

Science of Science: UDT Concept Evolution Pipeline

This repository contains the Python implementation accompanying the bachelor thesis:

Duco Trompert (Universiteit van Amsterdam, Jan 23, 2026)

Science of Science: An Integrated Pipeline for Tracing Conceptual Emergence and Evolution in Semantic Space

The project implements an integrated pipeline for science mapping that links: data collection (OpenAlex) → pre-processing → network & embedding representations → analysis → interactive dashboard.

What it does

  • Collects and caches publication metadata from the OpenAlex API for a target concept (default: "Urban Digital Twin").
  • Builds keyword co-occurrence networks (overall and per-year slices).
  • Builds semantic similarity networks from Word2Vec embeddings trained on titles/abstracts/keywords.
  • (Optional) Builds concept–method bipartite networks using an LLM-based keyword labelling step (served via Ollama).
  • Provides an interactive Dash dashboard with network visualisations (dash-cytoscape) and time series (plotly).

Installation (Linux/macOS)

python -m venv .venv
source .venv/bin/activate
python -m pip install --upgrade pip
pip install science-of-science-pipeline-udt

Run the dashboard

udt-dashboard

Open http://127.0.0.1:8050/ in your browser.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

science_of_science_pipeline_udt-0.1.2.tar.gz (2.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file science_of_science_pipeline_udt-0.1.2.tar.gz.

File metadata

File hashes

Hashes for science_of_science_pipeline_udt-0.1.2.tar.gz
Algorithm Hash digest
SHA256 a417254f1018ee65566a6069dec00ce0a9e6d1d7a0e4d4cb74354a9c1469550c
MD5 f668389e91a33ead3f53ddfd9b6107e8
BLAKE2b-256 de64a56733ceced50aec3f8c4e0d26d2113b3b90e9fb07bb3fee71b88cecac2b

See more details on using hashes here.

File details

Details for the file science_of_science_pipeline_udt-0.1.2-py3-none-any.whl.

File metadata

File hashes

Hashes for science_of_science_pipeline_udt-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 69f5a01aef8cea714fdd05232b98fa7b37047755aa6fb1867b60a8ab06e6b9cc
MD5 3a60aeae630ce54b2528b57e4efd630c
BLAKE2b-256 7b0a5b5b4137aa11e6f7f7854aa311c0ddc9836a521e494acae7610851e8b2b3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page