Create and quantify 'archetypes' of your constructs. A dictionary-like method run amok!

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

Archetypes!

This is a library developed to run what might be called a "souped-up dictionary method" for psychological text analysis. Or any kind of text analysis, really.

The core idea behind Archetypes is that you pre-define a set of prototypical sentences that reflect the construct that you are looking to measure in a body of text. Using modern contextual embeddings, then, this library will aggregate your prototypes into an archetypal representation of your construct. Then, you can quantify texts in your corpus for their semantic similarity to your construct(s) of interest.

Note: For the curious: no, this approach not inspired by anything Jungian in nature. In the past, I've said a few things about Jungian archetypes that have inspired scholars to write more than a few frustrated e-mails to me. Apologies to the Jungians.

Installation

This package is easily installable via pip via the following command:

pip install archetyper

Requirements

If you want to run the library without pip installing as shown above, you will need to first install the following packages:

numpy
tqdm
torch
sentence_transformers
nltk

You can try to install these all in one go by running the following command from your terminal/cmd:

pip install numpy tqdm torch sentence_transformers nltk

Examples

I have provided an example notebook in this repo that walks through the basic process of using this library, along with demonstrations of a few important "helper" functions to help you evaluate the statistical/psychometric qualities of your archetypes.

Citation

This method is originally described in this paper:

@inproceedings{varadarajan-etal-2024-archetypes,
    title = "Archetypes and Entropy: Theory-Driven Extraction of Evidence for Suicide Risk",
    author = "Varadarajan, Vasudha  and
      Lahnala, Allison  and
      Ganesan, Adithya V. and
      Dey, Gourab  and
      Mangalik, Siddharth  and
      Bucur, Ana-Maria  and
      Soni, Nikita  and
      Rao, Rajath  and
      Lanning, Kevin  and
      Vallejo, Isabella  and
      Flek, Lucie  and
      Schwartz, H. Andrew  and
      Welch, Charles  and
      Boyd, Ryan L.",
    editor = "Yates, Andrew  and
      Desmet, Bart  and
      Prud{'}hommeaux, Emily  and
      Zirikly, Ayah  and
      Bedrick, Steven  and
      MacAvaney, Sean  and
      Bar, Kfir  and
      Ireland, Molly  and
      Ophir, Yaakov",
    booktitle = "Proceedings of the 9th Workshop on Computational Linguistics and Clinical Psychology (CLPsych 2024)",
    month = mar,
    year = "2024",
    address = "St. Julians, Malta",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.clpsych-1.28",
    pages = "278--291",
    abstract = "Research on psychological risk factors for suicide has developed for decades. However, combining explainable theory with modern data-driven language model approaches is non-trivial. In this study, we propose and evaluate methods for identifying language patterns aligned with theories of suicide risk by combining theory-driven suicidal archetypes with language model-based and relative entropy-based approaches. Archetypes are based on prototypical statements that evince risk of suicidality while relative entropy considers the ratio of how unusual both a risk-familiar and unfamiliar model find the statements. While both approaches independently performed similarly, we find that combining the two significantly improved the performance in the shared task evaluations, yielding our combined system submission with a BERTScore Recall of 0.906. Consistent with the literature, we find that titles are highly informative as suicide risk evidence, despite the brevity. We conclude that a combination of theory- and data-driven methods are needed in the mental health space and can outperform more modern prompt-based methods.",
}

Project details

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

This version

1.2.3

May 23, 2026

1.2.2

Sep 16, 2025

1.2.1

Apr 8, 2025

1.2.0

Mar 11, 2025

1.1.7

Oct 2, 2024

1.1.6

Jun 24, 2024

1.1.5

Jun 24, 2024

1.1.4

Mar 25, 2024

1.1.3

Mar 22, 2024

1.1.2

Mar 14, 2024

1.1.1

Mar 14, 2024

1.1.0

Feb 19, 2024

1.0.2

Feb 16, 2024

1.0.1

Feb 12, 2024

1.0.0

Feb 8, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

archetyper-1.2.3.tar.gz (11.4 kB view details)

Uploaded May 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

archetyper-1.2.3-py3-none-any.whl (11.6 kB view details)

Uploaded May 23, 2026 Python 3

File details

Details for the file archetyper-1.2.3.tar.gz.

File metadata

Download URL: archetyper-1.2.3.tar.gz
Upload date: May 23, 2026
Size: 11.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.10

File hashes

Hashes for archetyper-1.2.3.tar.gz
Algorithm	Hash digest
SHA256	`c0bfd08842f4f8d9a4c248afb33179255a626483f8b803af3ea0041caa81f168`
MD5	`6f68360b50a942be2954a24cce779460`
BLAKE2b-256	`27c64a82dbb284c1cc90c2cd2a8e5287fe22aea2df248ce299bc26f0fff44ede`

See more details on using hashes here.

File details

Details for the file archetyper-1.2.3-py3-none-any.whl.

File metadata

Download URL: archetyper-1.2.3-py3-none-any.whl
Upload date: May 23, 2026
Size: 11.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.10

File hashes

Hashes for archetyper-1.2.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`be04a8f685ad8e3f7741b37066cf1ed8cd49c6b6a5244be24fdcd8c1a5760481`
MD5	`72b46a49d8e6e9f16f3cdc99e7a607c6`
BLAKE2b-256	`2de645dbfd7ad030a9b7353842c8bd320792f4b09b825aa067e49f3a37477fe1`

See more details on using hashes here.

archetyper 1.2.3

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Archetypes!

Installation

Requirements

Examples

Citation

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes