Skip to main content

Integrated pipeline and dashboard for tracing conceptual emergence and evolution in semantic space (UDT case study).

Project description

Science of Science — UDT Concept Evolution Pipeline

This repository contains the Python implementation accompanying the bachelor thesis:

Duco Trompert (Universiteit van Amsterdam, Jan 23, 2026) Science of Science: An Integrated Pipeline for Tracing Conceptual Emergence and Evolution in Semantic Space

The project implements an integrated pipeline for science mapping that links: data collection (OpenAlex) → pre-processing → network & embedding representations → analysis → interactive dashboard.

What it does

  • Collects and caches publication metadata from the OpenAlex API for a target concept (default: "Urban Digital Twin").
  • Builds keyword co-occurrence networks (overall and per-year slices).
  • Builds semantic similarity networks from Word2Vec embeddings trained on titles/abstracts/keywords.
  • (Optional) Builds concept–method bipartite networks using an LLM-based keyword labelling step (served via Ollama).
  • Provides an interactive Dash dashboard with network visualisations (dash-cytoscape) and time series (plotly).

Installation

python -m venv .venv
source .venv/bin/activate
python -m pip install --upgrade pip
pip install science-of-science-udt-pipeline

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

science_of_science_pipeline_udt-0.1.0.tar.gz (2.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file science_of_science_pipeline_udt-0.1.0.tar.gz.

File metadata

File hashes

Hashes for science_of_science_pipeline_udt-0.1.0.tar.gz
Algorithm Hash digest
SHA256 9739dd5b111fd6e36decd2adb7e6895ac8c3a123759c89ba0ae54d05696983a3
MD5 e0416019eed31f8de2a663f8dbba6759
BLAKE2b-256 24e58846e01cdb8409a46ea162b64edafe80490e8105ad10031311ce7bba7ce0

See more details on using hashes here.

File details

Details for the file science_of_science_pipeline_udt-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for science_of_science_pipeline_udt-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c167ec29398074930a7218c415adc2c983dc51a510c9aa44250e63ac12c8c7b0
MD5 970722b1e9f197f18e4a0b995ec5258b
BLAKE2b-256 527b91a29f5a92961b580af440fb4ba9466810accf70e7e08725facd3295dbe3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page