Skip to main content

Analyse data from open card sorting

Project description


A Python package to cluster and visualize data from open card sorting tasks

Project Status: WIP – Initial development is in progress, but there has not yet been a stable, usable release suitable for the public. PyPI GitHub Workflow Status (with event) Read the Docs Python Codecov DOI pyOpenSci All Contributors

Cardsort helps UX researchers quickly analyse data from open card sorting exercises using hierarchical cluster analysis. This task helps to understand how people organize information, and is frequently used to develop information architectures for websites. Click here to learn more about the card sorting method.

More precisely, cardsort helps you to:

  • Create distance matrices using hierarchical cluster analysis
  • Create dendrograms from based on these matrices
  • Extract user-generated category-labels
  • Using data exports from kardsort.com

Table of Contents

Documentation

cardsort.readthedocs.io

Quick start

Installation

$ pip install cardsort

Usage

cardsort can be used to create dendrograms and extract user-generated category-labels:

from cardsort import analysis
import pandas as pd

path = "example-data.csv" # data with columns: card_id, card_label, category_id, category_label, user_id
df = pd.read_csv(path) 

The data used in this example can be found under /docs/example-data.csv.

Create a dendrogram that summarizes user-generated clusters

analysis.create_dendrogram(df)

Output

Dendrogram plot generated from example data

Learn which category labels users gave to clusters

cards = ['Banana', 'Apple']
analysis.get_cluster_labels(df, cards)

Output

  user_id   cluster_label                                              cards
0       2  Healthy snacks                                    [Banana, Apple]
1       3          Snacks     [Sandwich, Croissant, Banana, Mooncake, Apple]
2       4          Fruits                                    [Apple, Banana]
3       5            Food  [Banana, Croissant, Apple, Sandwich, Hot Dog, ...

Interpretation: In this case, the users with IDs 2 and 4 made clusters containing exactly the two cards of interest ('Banana' and 'Apple', as specified in the input variable 'cards'). User 2 labelled this cluster 'Healthy snacks', and user 4 'Fruits'. Users 3 and 5 also clustered these cards together, but they included additional other cards in the same cluster, and labelled the cluster 'Snacks' or 'Food'. User 1 does not appear in the output, because they did not cluster the cards together.

Accepted data

  • This package works with data exports from kardsort.com (Export format 'Casolysis Data (.csv) - Recommended') or other data following the same structure.
  • Columns: card_id, card_label, category_id, category_label, user_id

Advanced usage

See documentation

Project information

License

cardsort is licensed under the of the MIT license.

Contributing

Interested in contributing? Check out the contributing guidelines. Please note that this project is released with a Code of Conduct. By contributing to this project, you agree to abide by its terms.

Citation

If you want to cite cardsort, please use the following DOI: DOI

Credits

cardsort was created with cookiecutter and the py-pkgs-cookiecutter template.

Contributors

Thanks goes to these wonderful people (emoji key):

Katharina Kloppenborg
Katharina Kloppenborg

💻 🤔 📖 🎨 📆 🔬
Bastian Greshake Tzovaras
Bastian Greshake Tzovaras

🔍 💻 🤔 🧑‍🏫
Alex Batisse
Alex Batisse

👀 🤔
Semidán Robaina
Semidán Robaina

👀 🤔
khynder
khynder

👀 🤔

This project follows the all-contributors specification. Contributions of any kind welcome!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cardsort-0.2.45.tar.gz (8.0 kB view details)

Uploaded Source

Built Distribution

cardsort-0.2.45-py3-none-any.whl (8.4 kB view details)

Uploaded Python 3

File details

Details for the file cardsort-0.2.45.tar.gz.

File metadata

  • Download URL: cardsort-0.2.45.tar.gz
  • Upload date:
  • Size: 8.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.8

File hashes

Hashes for cardsort-0.2.45.tar.gz
Algorithm Hash digest
SHA256 859a1a410665112f36aefecf7237328987dfa6b25e10cc0cc3ea8d4b1b13a92e
MD5 c3870fc4901cf2f36cb078cbce9e49aa
BLAKE2b-256 71671f0d8b96d4a494b7821aaece6b99b53540d6f1c2744449c628402af3f3f9

See more details on using hashes here.

File details

Details for the file cardsort-0.2.45-py3-none-any.whl.

File metadata

  • Download URL: cardsort-0.2.45-py3-none-any.whl
  • Upload date:
  • Size: 8.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.8

File hashes

Hashes for cardsort-0.2.45-py3-none-any.whl
Algorithm Hash digest
SHA256 cc7b39653fbf0d8ebfe3cc21e20455fb0464a356ba749f1f85e39df632128f1a
MD5 99b004c4a69a17262aaad943445826f0
BLAKE2b-256 588d852e77e3e715c6b581d73d41eaa5da6d80cfd223a789248f4f77cd970128

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page