Predict 34-dimensional perceived-trait vectors from face images, trained on the One Million Impressions (OMI) dataset.

These details have not been verified by PyPI

Project links

Project description

face-trait-transformer

Predict a 34-dimensional perceived-trait vector from a face image. Trained on the One Million Impressions (OMI) dataset (Peterson et al., 2022, PNAS).

Model weights: kiante/face-trait-transformer on HuggingFace Hub (pulled automatically by from_pretrained).
Methods: docs/methods.md (paper-ready writeup, including head-to-head with Peterson et al. 2022).
Model card: docs/model_card.md (intended use, biases, limitations).

Install

pip install face-trait-transformer[hub,figures]

First run

The very first call to from_pretrained downloads:

~1.2 GB of MLP heads + fine-tune checkpoint from HuggingFace Hub (cached under ~/.cache/huggingface/…).
~1.2 GB for the DINOv2 ViT-G/14 backbone via torch.hub (cached under ~/.cache/torch/hub/checkpoints/).

So expect ~2.5 GB of traffic and 5–15 min on typical home internet on the first call. Subsequent calls are offline.

Use

from face_trait_transformer import TraitPredictor

m = TraitPredictor.from_pretrained("kiante/face-trait-transformer")
row = m.predict("face.jpg")               # pandas.Series: 34 traits, 0–100 scale
row, fig = m.predict_with_figure("face.jpg", "diag.png")
df = m.predict(["a.jpg", "b.jpg"])        # pandas.DataFrame (batched)

Or from the command line:

ftt predict face.jpg --figure diag.png
ftt predict faces_dir/ --out predictions.csv

The first call downloads ~1.2 GB of weights to the HuggingFace / torch.hub caches; subsequent calls are offline.

What the output looks like

A diagnostic panel for one face (stimulus 582 from the OMI test split — observed in black, predicted in red):

per-face diagnostic panel

Performance

Evaluated on the held-out 101-stimulus test split of the OMI paper, with 5,000-sample bootstrap 95 % CIs over stimuli:

Metric	Point estimate	95 % CI
Mean Pearson r (34 attributes)	0.857	0.842 – 0.870
Mean R²	0.738	0.707 – 0.755
Median Pearson r	0.897	—
Attributes significant at p < 0.05	34 / 34	—

Across the 34 attributes, our 10-fold CV R² reaches on average 99 % of the split-half reliability ceiling (mean ceiling R² = 0.770; mean model R² = 0.734 in CV). Per-attribute ceilings and fractions are at training/results/cv_per_attribute.csv.

Per-attribute scatter (observed vs predicted across the test split)

per-attribute observed vs predicted

Comparison to Peterson et al. (2022)

Under their 10-fold cross-validation protocol on the same 1,004 stimuli:

	Peterson 2022 (StyleGAN2 W + L2 regression)	This work (DINOv2 + MLP ensemble + ViT-L fine-tune)
Mean R² across 34 attributes	≈ 0.55	0.734

We improve per-attribute R² on 33 of 34 attributes, with the largest gains on attributes Peterson et al. flagged as hardest (gay +0.50, looks-like-you +0.39, Black +0.33, believes in god +0.31). Full table at training/results/cv_per_attribute.csv; detailed methods at docs/methods.md; Peterson's numbers are read from their Fig. 2.

What they can still do that we can't: their linear head in StyleGAN2 W-space enables direct attribute manipulation of faces; our DINOv2 features aren't invertible through a generator. For prediction accuracy our model clearly wins; for editing, keep theirs.

Repository layout

src/face_trait_transformer/     the pip-installable package
    predictor.py                TraitPredictor (.from_pretrained / .from_bundle)
    model.py                    TraitHead
    features.py                 DINOv2 backbone + preprocessing + extract_cls
    data.py                     label / split helpers
    metrics.py                  per-attribute Pearson r, R², MAE, RMSE
    figures.py                  single-image diagnostic panel
    hub.py                      HuggingFace Hub download
    cli.py                      `ftt predict …` CLI

training/                       reproduce-the-paper pipeline (not installed)
    scripts/                    extract, train, finetune, eval, aggregate_cv, export_bundle, ...
    splits/                     splits.json + cv_splits/ + attribute_stats.npz
    results/                    cv_per_attribute.csv, per_attribute_r_r2.csv
    sweep_configs.json          the 62-config hyperparameter grid
    README.md                   how to reproduce every reported number

docs/
    methods.md                  paper-ready methods
    model_card.md               HF-style model card
    quickstart.md               usage

examples/
    quickstart.py
    batch_inference.py
    aging_validation/           validity check on the GIRAF aging dataset

figures/                        paper figures
tests/                          package smoke tests

Citation

If you use this model, please cite both the underlying OMI dataset and this repository. The model's predictive behavior is entirely derived from the OMI training data; the repository contributes the architecture, training recipe, and distribution.

@article{peterson2022omi,
  title   = {Deep models of superficial face judgments},
  author  = {Peterson, Joshua C and Uddenberg, Stefan and Griffiths, Thomas L
             and Todorov, Alexander and Suchow, Jordan W},
  journal = {Proceedings of the National Academy of Sciences},
  volume  = {119}, number = {17}, pages = {e2115228119}, year = {2022},
  doi     = {10.1073/pnas.2115228119}
}

@misc{fernandez2026facetrait,
  title        = {face-trait-transformer: Predicting 34-dimensional perceived-trait
                  vectors from face images},
  author       = {Fernandez, Kiante},
  year         = {2026},
  howpublished = {GitHub},
  url          = {https://github.com/kiante-fernandez/face-trait-transformer},
}

Ethics & license

These predictions are learned perceptions from human raters — rater stereotypes aggregated into a model, not ground-truth attributes of the people depicted. Frame outputs as "perceived X", not "X", particularly for demographic (asian / black / white / middle-eastern / hispanic / islander / native) and socially-constructed (gay / liberal / privileged / godly / electable / looks-like-you) dimensions. See docs/model_card.md for the full bias/limitations discussion and the examples/aging_validation/ write-up.

Code: MIT (see LICENSE).
Weights: CC BY-NC-SA 4.0, inherited from the OMI dataset (see LICENSE-WEIGHTS). Non-commercial use only; derivatives must be share-alike.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1.0

Apr 21, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

face_trait_transformer-0.1.0.tar.gz (21.6 kB view details)

Uploaded Apr 21, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

face_trait_transformer-0.1.0-py3-none-any.whl (19.7 kB view details)

Uploaded Apr 21, 2026 Python 3

File details

Details for the file face_trait_transformer-0.1.0.tar.gz.

File metadata

Download URL: face_trait_transformer-0.1.0.tar.gz
Upload date: Apr 21, 2026
Size: 21.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.5

File hashes

Hashes for face_trait_transformer-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`096ba0bda1fc3a32faf8d19c4e473d1ad178ca89353d615b5a00f097b46a707d`
MD5	`ab2571703767e24638d769c8d8340a00`
BLAKE2b-256	`b37ed8fdebfa8028f3679ff95c905ded08d31dbc679643b5cc24db032f6de3c2`

See more details on using hashes here.

File details

Details for the file face_trait_transformer-0.1.0-py3-none-any.whl.

File metadata

Download URL: face_trait_transformer-0.1.0-py3-none-any.whl
Upload date: Apr 21, 2026
Size: 19.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.5

File hashes

Hashes for face_trait_transformer-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`04da9783687e8380016996359608ca7a76e25a0bcea872bc56f49e098f905d7f`
MD5	`0c7eb69ea79b3b9518fbb3601bf9bb6a`
BLAKE2b-256	`d167b7c8b0c22ce3a668bafc662b3815341412d3879247281fb840986d52b2b8`

See more details on using hashes here.

face-trait-transformer 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

face-trait-transformer

Install

First run

Use

What the output looks like

Performance

Per-attribute scatter (observed vs predicted across the test split)

Comparison to Peterson et al. (2022)

Repository layout

Citation

Ethics & license

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes