Download, summarise and process LimeSurvey data

Project description

LimeSurvey is open-source survey software. Using pandas, the limepy package simplifies a number of tasks when working with LimeSurvey data:

Downloading survey data. This requires that LimeSurvey’s RemoteControl 2 API is enabled, as explained here.
Creating a list of al the questions in the survey, with metadata.
Summarising data, e.g. creating value counts for multi-column items such as multiple-choice questions; calculating averages for number arrays; or creating scores for a ranking question.
Printing answers to open-ended questions.
Printing the answers of an individual respondent.

Note that limepy uses f-strings and therefore requires Python 3.6 or higher.

Use at your own risk and please make sure to check the results.

Installation

$ pip install limepy

How is it different

There are various python packages for managing the LimeSurvey RemoteControl 2 API. While limepy can help you download survey data, the emphasis is on processing and summarising the data.

Examples

Download survey data

You can download survey data with the RemoteControl 2 API (provided the api is enabled in your LimeSurvey installation).

For a one-off download, you can of course do this manually. However, you may want to use the api if you want to write a preliminary report based on the first responses, and then automatically update it as new responses come in.

from pathlib import Path
from limepy import download

csv = download.get_responses(base_url, user_name, password, user_id, sid)
path = Path('../data/responses.csv')
path.write_text(csv)

Create Survey object

A Survey object contains the data and metadata of a survey. To create a Survey object, you need:

A csv containing the survey results. You can download it manually or use the api as described above. Make sure to set heading type to 'Question code' and reponse type to 'Answer codes'. If using the api to download, the file will be delimited with ; rather than ,.
An .lss file containing the survey structure. You can download this manually.

from limepy.wrangle import Survey, Question
import pandas as pd

df = pd.read_csv('../data/responses.csv', sep=';')
with open('../data/structure.lss', encoding="utf8") as f:
    my_structure = f.read()

my_survey = Survey(df, my_structure)

If you wish to remove html tags from the questions, set strip_tags=True.

If you have a multilingual questionnaire, then you can select the language the group names, questions, answers and help texts should be presented in, e.g. language='nl' for Dutch.

Note: if you use a merged dataframe (for example, data from various versions of the same questionnaire), you should reset the index before creating a Survey object.

Get list of questions with metadata

my_survey.question_list

Print results for individual respondent

The respondent method will return a string listing the answers of an individual respondent. You need the respondent’s row index.

my_survey.respondent(26)

Create a readable dataframe

Create a dataframe with full questions as column names and ‘long’ responses as values.

my_survey.readable_df

Create a Question object

A Question object can be used to summarise data. To create a Question oject, you need a Survey object and the question id (find it in the index of the question list).

my_question = Question(my_survey, 3154)

If you want to use a subset of the respondents for your analysis (e.g., exclude respondents that do not meet certain criteria, or drop duplicates), the most practical approach is probably to create a subset first and use that to create your Survey object. However, you can also use a mask if you want to create a Question object for a subset of the respondents.

my_question = Question(my_survey, 3154, mask=pd.notnull(df.iloc[:, 8]))

Summarise answers to a question

For many question types, limepy can summarise the results.

In many cases, this will return a dataframe containing value counts (as well as Percent and Valid Percent).
In case of a Numerical input question, the output will be a dataframe containing the results of the pandas DataFrame describe method.
In case of a Numbers array question, the average will be calculated for each option (but you must specify the method, i.e. 'mean' or 'median').
In case of a Ranking question, the result will be a dataframe with scores calculated for each item.
If no method has been implemented for a question type, a dataframe will be returned which contains the columns associated with the question.

my_question.summary

To show the metadata associated with a question:

my_question.metadata

Compare groups

Limepy currently has no method to compare groups, but you can write a function to do so (the example below may not work with all question types).

def compare(qid, category_variable, how='Valid Percent'):
    """Compare answers for groups based on category variable"""
    summaries = []
    for group in set(df[category_variable]):
        if pd.isnull(group):
            continue
        mask = list(df[category_variable] == group)
        q = Question(my_survey, qid, mask=mask)
        summary = q.summary
        if how in list(summary.columns):
            summary = summary[[how]]
        summary.columns = [group]
        summaries.append(summary)
    return pd.concat(summaries, axis=1)

Write answers to an open-ended question

The write_open_ended method creates a string listing all the answers to the question. Optionally, you can specify a list of indices of columns that contain background information you want included in the output.

my_question.write_open_ended(background_column_indices=[9])

You can also create a folder and store text files containing the answers to all open-ended questions in the survey.

from pathlib import Path

remove = ' _?:/()'

def include(row):
    for string in ['free text', 'comment']:
        if string in row.question_type:
            return True
    if row.other == 'Y':
        return True
    return False

for qid, row in my_survey.question_list.iterrows():
    if include(row):
        question = row.question
        for char in remove:
            question = question.replace(char, ' ')
        question = question[:25]
        path = Path('../data/open_ended') / f'{qid} {question}.md'
        path.write_text(Question(sv, qid).write_open_ended(background_column_indices=[9]))

Create report as html

def add_table(question, question_text=None):
    """Add table summarising question"""

    if not question_text:
        question_text = question.question
    html = f"<div class='tableHeader'>{question_text}</div>\n"
    html += question.summary.to_html() + '\n'
    help_txt = question.metadata['help']
    if help_txt:
        html += f"<div class='tableCaption'>{help_txt}</div>"
    return html


html = """<head>
<title>Title</title>
<link rel="stylesheet" href="styles.css">
<meta charset="utf-8">
</head>
<body>
"""

my_question = Question(my_survey, 44)
html += add_table(my_question)

html += "</body>"

Inspect original data

If you want to inspect the original data for a specific question, for example because you want to process answers to an ‘other’ option, then you can use the question title (you can look up the title using my_survey.question_list.

title = 'G01Q07'
colnames = [c for c in df.columns if title in c]
df[colnames]

Project details

Release history Release notifications | RSS feed

This version

0.1.15

Aug 23, 2025

0.1.14

Feb 9, 2023

0.1.13

Jan 19, 2023

0.1.12

Apr 11, 2022

0.1.11

Apr 8, 2022

0.1.10

Apr 8, 2022

0.1.9

Feb 28, 2022

0.1.8

Oct 15, 2020

0.1.7

Sep 26, 2020

0.1.6

May 19, 2020

0.1.5

Feb 10, 2020

0.1.4

Dec 25, 2019

0.1.3

Apr 16, 2019

0.1.2

Mar 15, 2019

0.1.1

Feb 17, 2019

0.1.0

Jan 8, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

limepy-0.1.15.tar.gz (14.2 kB view details)

Uploaded Aug 23, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

limepy-0.1.15-py3-none-any.whl (11.9 kB view details)

Uploaded Aug 23, 2025 Python 3

File details

Details for the file limepy-0.1.15.tar.gz.

File metadata

Download URL: limepy-0.1.15.tar.gz
Upload date: Aug 23, 2025
Size: 14.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.13

File hashes

Hashes for limepy-0.1.15.tar.gz
Algorithm	Hash digest
SHA256	`b89da56bf6af1461a6c4004ff11fc4e65ff6c1c72a9f7a1d12130e329f4f8813`
MD5	`f9062d8f3c3e9536c5036189e577c1c7`
BLAKE2b-256	`e43a2ba95f92de823419f526f34b2e79a6c89076a8e6a40f5c066abb04054033`

See more details on using hashes here.

File details

Details for the file limepy-0.1.15-py3-none-any.whl.

File metadata

Download URL: limepy-0.1.15-py3-none-any.whl
Upload date: Aug 23, 2025
Size: 11.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.13

File hashes

Hashes for limepy-0.1.15-py3-none-any.whl
Algorithm	Hash digest
SHA256	`50a683805a0b82f605dd80cb192a16b2f9f574455b930c8fde1dc8975126a9d6`
MD5	`cb9ac900c7658489d8ba99eed9eb7e26`
BLAKE2b-256	`31606fc7833fbfc99b16aad5d88c009ca6d2d1c7986a67b32a2c4769bc076d21`

See more details on using hashes here.

limepy 0.1.15

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

Installation

How is it different

Examples

Download survey data

Create Survey object

Get list of questions with metadata

Print results for individual respondent

Create a readable dataframe

Create a Question object

Summarise answers to a question

Compare groups

Write answers to an open-ended question

Create report as html

Inspect original data

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes