Skip to main content

Evaluation and adaption method for the UNICORN Challenge

Project description

🧪 UNICORN Evaluation Toolkit

Welcome to the official evaluation repository for the UNICORN Challenge — a benchmark for foundation models in pathology, radiology, and medical language processing. This repository provides the official evaluation code and a library of adaptors used to turn frozen features into predictions in vision tasks.

PyPI version

🚀 Challenge Overview

The UNICORN Challenge evaluates how well foundation models generalize across multiple modalities with minimal task-specific supervision:

  • 🧠 Language and Vision-Language tasks: your model directly outputs predictions.
  • 👁️ Vision tasks: your model outputs features. These are then converted to predictions using adaptors — lightweight models like k-NN, linear classifiers, or shallow MLPs.

We provide a few built-in adaptors, but you're highly encouraged to propose your own!
We maintain the full list of adaptors available on the Supported Adaptors page.

🧩 Contributing a Custom Adaptor

Have a better idea for how to turn features into predictions?

You’re welcome to contribute a custom adaptor! Here's how:

  1. Add your adaptor to src/unicorn_eval/adaptors/.
  2. Inherit from one of the base adaptor classes in base.py.
  3. Open a pull request with:
    • Your adaptor code
    • A short description
    • A unique name (we’ll include your team name in the adaptor name to ensure you receive credit).

✅ Once accepted, your adaptor becomes selectable at submission time — and your team gets full recognition when it’s used!

💡 Keep in mind: we prioritize originality. If your adaptor is too similar to an existing one, it may not be accepted — so submit early and make it your own!

📦 Adaptors vs. Algorithms: What's the Difference?

In vision tasks, submissions consist of:

  • A feature extractor (your algorithm)
  • An adaptor (used to turn features into predictions)

You can experiment with different adaptors on top of the same algorithm without using up your submission slots.
Want to try a different adaptor? Send us a request by email, we’ll run the new adaptor strategy for you on top of the existing features. Requests should be submitted via email using the provided template (to be shared soon).

In language and vision-language tasks, the algorithm outputs predictions directly, so no adaptor is needed.

Summary

Modality What You Submit Are Adaptors Used? Submission Limit Applies To
Vision Algorithm (feature extractor) + Adaptor ✅ Yes Algorithm only
Language Algorithm (predictive) ❌ No Algorithm
Vision-Language Algorithm (predictive) ❌ No Algorithm

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

unicorn_eval-1.1.0.tar.gz (35.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

unicorn_eval-1.1.0-py3-none-any.whl (43.5 kB view details)

Uploaded Python 3

File details

Details for the file unicorn_eval-1.1.0.tar.gz.

File metadata

  • Download URL: unicorn_eval-1.1.0.tar.gz
  • Upload date:
  • Size: 35.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.12

File hashes

Hashes for unicorn_eval-1.1.0.tar.gz
Algorithm Hash digest
SHA256 ba1115ed4b2df3f1868b7d0e3a1946297c56b380d40242854c5b9ea4f60d7362
MD5 4748799eff357a4aeecdbe9d51460a43
BLAKE2b-256 6299cb1ef8c730488c0ff9e38c3acc2074aa0d728058e5f83d7023fcf02565d0

See more details on using hashes here.

File details

Details for the file unicorn_eval-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: unicorn_eval-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 43.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.12

File hashes

Hashes for unicorn_eval-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c7fe16e1fe629865d9fc063d501a24dc6b3eca792ece9261146dfeaeba8a17c5
MD5 af599c1c3a5fa189649a181575157e58
BLAKE2b-256 4667f6ec7ad6ffa1092292629eee600fc52afb60180b6d31c11b6d4e501b00ad

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page