Professional tools for processing Dutch higher educational data (1CijferHO / ROD)
📋 Overview
[!NOTE] No Python or technical knowledge required! This tool is designed for everyone, regardless of programming experience.
Transform DUO's complex educational data into research-ready insights in minutes, not hours.
🚨 The Problem
DUO data comes as:
- Fixed-width ASCII files with no clear field separation
- Unstructured .txt metadata files
- Hours of manual processing required
- High risk of errors and data loss
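To make the first point concrete, here is a minimal sketch of what reading a fixed-width record involves. The field names, positions, and sample record are hypothetical and for illustration only; real layouts come from the metadata files that accompany each DUO data file, and this is not the tool's own implementation.

```python
from typing import NamedTuple


class Field(NamedTuple):
    name: str   # hypothetical column name
    start: int  # 1-based start position of the field in the record
    width: int  # number of characters the field occupies


def parse_fixed_width(line: str, layout: list[Field]) -> dict[str, str]:
    """Slice one fixed-width record into named, whitespace-trimmed values."""
    record = {}
    for field in layout:
        begin = field.start - 1  # convert the 1-based position to a 0-based index
        record[field.name] = line[begin:begin + field.width].strip()
    return record


# Hypothetical layout and record: there are no separators, only positions.
LAYOUT = [Field("year", 1, 4), Field("institution", 5, 4), Field("program", 9, 6)]
SAMPLE = "202321PB 34401"

parsed = parse_fixed_width(SAMPLE, LAYOUT)
```

A single wrong start position or width shifts every value that follows, which is why doing this by hand is so error-prone.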
✨ The Solution
Automated pipeline that delivers:
- ⚡ Fast - Process gigabyte-sized files in minutes using multiprocessing
- 🎯 Accurate - Automated validation prevents errors
- 🔒 Secure - Auto-anonymization of sensitive data (BSN, etc.)
- 📊 Ready - Clean CSV/Parquet output for immediate analysis
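As an illustration of the anonymization point, the sketch below replaces an identifier with a keyed hash (HMAC-SHA256) from the Python standard library. This README does not state which scheme the tool actually uses, so treat the function name, the key, and the choice of HMAC as assumptions rather than a description of the tool.

```python
import hashlib
import hmac


def pseudonymize(value: str, secret_key: bytes) -> str:
    """Return a keyed SHA-256 digest (HMAC) of an identifier as a hex string."""
    return hmac.new(secret_key, value.encode("utf-8"), hashlib.sha256).hexdigest()


# Hypothetical key and identifier, for illustration only.
key = b"replace-with-a-secret-key"
token_a = pseudonymize("123456782", key)
token_b = pseudonymize("123456782", key)
```

The same input and key always give the same token, so records can still be linked across files without exposing the original value. Keeping the key secret matters: without a key, short numeric identifiers could be recovered by hashing every possible value.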
✨ Features
- Smart Extraction - Auto-finds field positions in messy .txt files
- Intelligent Matching - Links data files with metadata automatically
- Turbo Conversion - Multiprocessing for blazing speed
- Quality Validation - Comprehensive error checking
- Advanced Data Combination - YAML-configured joins with decoder files and reference tables
- Research-Ready Enrichment - 30+ calculated variables for academic analysis
- Switch Analysis - 27+ variables tracking student program switches
- Smart Case Handling - Supports snake_case, camelCase, PascalCase, and original formatting
- File Optimization - 60-80% size reduction via Parquet compression
- Privacy Protection - Cryptographic anonymization
- Zero Setup - `uv` handles all dependencies automatically
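The case-handling feature can be pictured with a small sketch like the one below, which converts a column name between the listed styles. The helper names are hypothetical and the splitting rules are an assumption; the tool's own conversion may treat edge cases such as acronyms differently.

```python
import re


def split_words(name: str) -> list[str]:
    """Split a column name into lowercase words on separators and case changes."""
    # Insert a space at lower-to-upper boundaries, e.g. "studentId" -> "student Id".
    spaced = re.sub(r"(?<=[a-z0-9])(?=[A-Z])", " ", name)
    return [word.lower() for word in re.split(r"[\s_\-]+", spaced) if word]


def to_snake_case(name: str) -> str:
    return "_".join(split_words(name))


def to_pascal_case(name: str) -> str:
    return "".join(word.capitalize() for word in split_words(name))


def to_camel_case(name: str) -> str:
    words = split_words(name)
    if not words:
        return ""
    # First word stays lowercase, the rest are capitalized.
    return words[0] + "".join(word.capitalize() for word in words[1:])
```

For example, `to_snake_case("Student Id")` gives `student_id`, and `to_camel_case("student_id")` gives `studentId`.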
🔧 First Time Setup
[!WARNING] Do not skip these steps if this is your first time using this application. It will not work without them.
[!TIP] Save the repository in a Projects/CEDA folder on your main drive for quick access.
1. Get the Repository
Option A: Clone with Git (or GitHub Desktop)
git clone https://github.com/cedanl/1cijferho.git
cd 1cijferho
Option B: Download ZIP
After downloading, extract the ZIP file and navigate into the extracted folder.
2. Install uv
MacOS & Linux (Terminal)
curl -LsSf https://astral.sh/uv/install.sh | sh
Windows (PowerShell or Windows Terminal)
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"
Close and reopen your terminal after installation.
Verify installation
uv self update
See the installation documentation for details and alternative installation methods.
🚀 Running the Application
Ready to see the magic happen? Your 1CijferHO Tool is just one command away! ✨
First, get to the right spot:
Open a terminal in your 1cijferho folder - it's super easy!
- Windows: Shift + Right-click in folder → Open in Windows Terminal
- Mac: Right-click folder → New Terminal at Folder
- VS Code: Just click Terminal → New Terminal
Or simply navigate there:
cd path/to/1cijferho
Then, launch with a single command:
uv run streamlit run src/main.py
That's it! The app will automatically spring to life in your browser. If you've completed all the steps in the First Time Setup correctly, this is the only command you'll need going forward. 🎉
Pro Tip: Create a shortcut: a `.bat` file (Windows) or a `.sh` script (macOS/Linux).
Pro Tip: Check out our architecture.md for technical details!
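For the shortcut tip, the sketch below shows what such a launcher script could contain by generating its text for either platform. The function name and the example folder path are hypothetical; the launch command is the one given in this README.

```python
LAUNCH_COMMAND = "uv run streamlit run src/main.py"  # command from this README


def shortcut_script(project_dir: str, windows: bool) -> str:
    """Return the text of a launcher script that starts the app from project_dir."""
    if windows:
        # Batch file: switch to the project folder (including the drive), then launch.
        lines = ["@echo off", f'cd /d "{project_dir}"', LAUNCH_COMMAND]
    else:
        # POSIX shell script: stop if the folder is missing, then launch.
        lines = ["#!/bin/sh", f'cd "{project_dir}" || exit 1', LAUNCH_COMMAND]
    return "\n".join(lines) + "\n"


# Hypothetical project location, for illustration only.
bat_text = shortcut_script(r"C:\Projects\CEDA\1cijferho", windows=True)
sh_text = shortcut_script("/home/user/Projects/CEDA/1cijferho", windows=False)
```

Saving `bat_text` as a `.bat` file, or `sh_text` as a `.sh` file made executable with `chmod +x`, gives a shortcut that opens the app without typing the command each time.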
Happy analyzing! ✨📊📝
🛠️ Built With
🤲 Support
If you find this project helpful, please consider:
- ⭐ Starring the repo
- 🐛 Reporting bugs
- 💡 Suggesting features
- 💻 Contributing code
If you encounter any issues or need further assistance, please feel free to open an issue or contact a.sewnandan@hhs.nl or t.iwan@vu.nl.
🙏 Acknowledgements
Special thanks to:
- Ash Sewnandan & Tomer Iwan for setting the foundation with a clean, user-friendly interface and robust architecture.
- CEDA & Npuls for making this project possible by providing valuable resources and support.
🫂 Contributors
Thank you to all the people who have already contributed to 1cijferho.
🚦 License
File details
Details for the file 1cijferho-0.1.0.tar.gz.
File metadata
- Download URL: 1cijferho-0.1.0.tar.gz
- Upload date:
- Size: 45.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.20
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 28931c4dd484363a0a8d19b88f69acea8adf6b5b609954434ed03269fb6515c3 |
| MD5 | b94a229b24a76f314f8bf0901e0941bc |
| BLAKE2b-256 | 48e5fc31af4cd0ab8aacfd72cf1625eca6532bfe6078c0028cf223c493b0f728 |
File details
Details for the file 1cijferho-0.1.0-py3-none-any.whl.
File metadata
- Download URL: 1cijferho-0.1.0-py3-none-any.whl
- Upload date:
- Size: 51.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.20
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | ad97e69d1d4965268bef3222f4eb1ea654f56fa7f12c31c965daeca147959732 |
| MD5 | d15673802e91a3324c135ec3e1782946 |
| BLAKE2b-256 | 68b53c268ff392c66276ee9956eb848acf7d0ecf0223afcb8de3baeead664790 |