Skip to main content

Evaluation and adaption method for the UNICORN Challenge

Project description

🧪 UNICORN Evaluation Toolkit

Welcome to the official evaluation repository for the UNICORN Challenge — a benchmark for foundation models in pathology, radiology, and medical language processing. This repository provides the official evaluation code and a library of adaptors used to turn frozen features into predictions in vision tasks.

PyPI version

🚀 Challenge Overview

The UNICORN Challenge evaluates how well foundation models generalize across multiple modalities with minimal task-specific supervision:

  • 🧠 Language and Vision-Language tasks: your model directly outputs predictions.
  • 👁️ Vision tasks: your model outputs features. These are then converted to predictions using adaptors — lightweight models like k-NN, linear classifiers, or shallow MLPs.

We provide a few built-in adaptors, but you're highly encouraged to propose your own!
We maintain the full list of adaptors available on the Supported Adaptors page.

🧩 Contributing a Custom Adaptor

Have a better idea for how to turn features into predictions?

You’re welcome to contribute a custom adaptor! Here's how:

  1. Add your adaptor to src/unicorn_eval/adaptors/.
  2. Inherit from one of the base adaptor classes in base.py.
  3. Open a pull request with:
    • Your adaptor code
    • A short description
    • A unique name (we’ll include your team name in the adaptor name to ensure you receive credit).

✅ Once accepted, your adaptor becomes selectable at submission time — and your team gets full recognition when it’s used!

💡 Keep in mind: we prioritize originality. If your adaptor is too similar to an existing one, it may not be accepted — so submit early and make it your own!

📦 Adaptors vs. Algorithms: What's the Difference?

In vision tasks, submissions consist of:

  • A feature extractor (your algorithm)
  • An adaptor (used to turn features into predictions)

You can experiment with different adaptors on top of the same algorithm without using up your submission slots.
Want to try a different adaptor? Send us a request by email, we’ll run the new adaptor strategy for you on top of the existing features. Requests should be submitted via email using the provided template (to be shared soon).

In language and vision-language tasks, the algorithm outputs predictions directly, so no adaptor is needed.

Summary

Modality What You Submit Are Adaptors Used? Submission Limit Applies To
Vision Algorithm (feature extractor) + Adaptor ✅ Yes Algorithm only
Language Algorithm (predictive) ❌ No Algorithm
Vision-Language Algorithm (predictive) ❌ No Algorithm

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

unicorn_eval-1.1.1.tar.gz (35.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

unicorn_eval-1.1.1-py3-none-any.whl (43.5 kB view details)

Uploaded Python 3

File details

Details for the file unicorn_eval-1.1.1.tar.gz.

File metadata

  • Download URL: unicorn_eval-1.1.1.tar.gz
  • Upload date:
  • Size: 35.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.12

File hashes

Hashes for unicorn_eval-1.1.1.tar.gz
Algorithm Hash digest
SHA256 0e9b0493e99105e8527c8b7796044e57a35a9d288003f51cd2c821026003ffd0
MD5 8d3a433fe2f7b7df0168005b133abef9
BLAKE2b-256 3382af5c225ab0e65ebf37737981eccc701d541a015d7bd2be02bb88066d7e6a

See more details on using hashes here.

File details

Details for the file unicorn_eval-1.1.1-py3-none-any.whl.

File metadata

  • Download URL: unicorn_eval-1.1.1-py3-none-any.whl
  • Upload date:
  • Size: 43.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.12

File hashes

Hashes for unicorn_eval-1.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 dfd98117ba61efdd6f4f1d7728a9dc9e82bad9646093c2c8b82c81a7dfaec548
MD5 662db7c5913c17d7f7c73315b9e58075
BLAKE2b-256 3752bac9d2b307d8af934aac371d05cc7047726a9320db6976fc9be6bc0cf51d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page