12 projects
knowlyr-trainer
PyTorch-based trainer for Agent trajectory datasets — SFT, DPO, GRPO
knowlyr-hub
Agent trajectory pipeline orchestrator - Task -> Sandbox -> Recorder -> Reward -> Export
knowlyr-reward
Process-level rubric-based reward computation for Code Agent trajectories
knowlyr-recorder
Agent trajectory recorder - convert agent framework logs into a standardized trajectory format
knowlyr-sandbox
Code Agent execution sandbox - reproducible Docker environments for isolated code task execution and trajectory replay
knowlyr-core
Shared data models for knowlyr agent toolchain
knowlyr-modelaudit
LLM distillation detection and model fingerprint audit tool - text source detection, model identity verification, and distillation analysis
knowlyr-datacheck
Data quality inspection toolkit - automated validation, anomaly detection, and distribution analysis
knowlyr-datalabel
Lightweight data annotation toolkit - generate standalone HTML labeling interfaces
knowlyr-datasynth
Data synthesis toolkit - batch generate high-quality training data from seed examples using LLMs
knowlyr-datarecipe
AI dataset 'ingredients label' analyzer - reverse-engineer datasets, estimate costs, analyze quality, and generate production workflows
ai-dataset-radar
Competitive intelligence monitoring system for AI training datasets, tracking labs, vendors, and open-source releases