Profile of sophie.nguyenthuthuy

Some features may not work without JavaScript. Please try enabling it if you encounter problems.

2 projects

Last released Jun 23, 2026

Audit the reliability of an LLM-as-judge eval pipeline: agreement, bias, drift, calibration.

Last released Jun 10, 2026

Online Lyapunov-drift monitor for ML retraining loops: alert when the loop trends unstable, before eval metrics show it.

Supported by