A package for analyzing survey data from Deliberative Polling experiments.

Project description

This package is for analyzing survey data from Deliberative Polling experiments. Although designed for Deliberative Polling, this package can be used to analyze any experimental survey data.

The package is designed with a single, specialized function called outputs. This function accepts as input files exclusively in the IBM SPSS Statistics .SAV format. Upon execution, it generates output files in both .xlsx and .docx formats. These output files contain statistical comparisons of all ordinal and nominal variables across all designated treatment groups, time intervals, and statistical weights.

Installation

To install SPSS, go to Software at Stanford if you are a Stanford affiliate. Othwerwise, go to IBM SPSS Software.

To install Python, go to Download Python.

To install DeliberativePolling, run the following in a terminal:

pip install DeliberativePolling

In SPSS

To import data into SPSS, open SPSS and navigate to File and Import Data.

Once the data has been imported into SPSS, you need to provide metadata about the variables in the tab Variable View.

Measures

In the Measure column of Variable View, variables can be classified as Nominal, Ordinal, or Scale.

Nominal

Nominal variables are categorical variables that lack a natural order. For instance, the variable Employment in the Sample.SAV file includes the categories Employed, Unemployed, Student, and Other, which don't follow a specific sequence. While there are exceptions, such as Education Level, which do have an order, it's generally advisable (but not mandatory) to categorize variables containing demographic data as Nominal.

Ordinal

Ordinal variables are categorical variables that have a well-defined order. For example, the variable Question1 in the Sample.SAV file. This variable uses a Likert scale that ranges from 0 to 10, representing a progression from Poorly to Well in response to the question "How well does democracy function?" Typically, it's recommended (but not obligatory) to classify variables with responses that change between time intervals as Ordinal.

Some statisticians indicate non-response to survey questions using high numeric codes like 77, 98, or 99. It's crucial to remove these high numeric codes from ordinal variables before analysis. The outputs function calculates the average of ordinal values, assuming a consistent scale like 0-10, 1-5, or 1-3. Including out-of-scale high values like 99 can significantly distort the calculated mean. To avoid this, replace these numeric codes with blank cells; blank cells will be counted as DK/NA (Don't Know/Not Applicable) and will not affect mean calculations.

Scale

Any variables that don't fit into the Nominal or Ordinal categories should be classified as Scale variables. These can either be continuous or discrete. All variables related to weight should be categorized as Scale.

Essential Variables

In order for outputs to identify the different subjects, experimental groups, and time intervals in the data, the SPSS file must contain three variables: ID, Time, and Group.

ID

The ID variable helps track individual participants in the study. It's like a name tag that stays the same for each person throughout the experiment. This way, you can see how a person's answers change over time. The ID can be a number, an email address, or any other unique identifier.

Group

The Group variable tells you which part of the experiment a participant is in—either the Treatment group that receives the intervention, or the Control group that doesn't. This helps you compare the effects of the treatment.

Time

The Time variable shows when a participant gave their answers. Labels like Pre-Deliberation or T1 are usually used for answers given before the treatment, and Post-Deliberation or T2 for answers given after. This helps you see how responses change over the course of the experiment.

Optional Variables

Weights

By default, the outputs function generates unweighted tables that compare survey data between all experimental groups and time intervals; however, you can introduce weighting by including columns with the word weight in the header, like Weight1 in Sample.SAV. These weight variables must be numeric with their Measure set to Scale.

Ignored

To keep variables in the SPSS file that you don't want included in the outputs function's analysis but might use later, set their Measure to Scale; variables with this setting won't be part of the analysis unless they are designated as weight variables.

Labels

In SPSS, labels help clarify the meaning of variable names and values.

Column Labels

Variable names can't have spaces or punctuation. Descriptive Column Labels can be set in Variable View under the column Label to provide more information about the variables.

Nominal Variables: For nominal variables use concise labels. For example, the variable Education in Sample.SAV has the column label Education Level. Keep these labels short because they will appear in file names like Tables - Ordinal Variables - Treatment at T1 v. T2 (Unweighted) - Education Level.

Ordinal Variables: For ordinal variables you can use fuller more descriptive labels. For example, the variable Question1 in Sample.SAV has the column label How well does democracy function?. These ordinal column labels do not appear in file names, only within cells in the outputted files so length is less of an issue.

Value Labels

When working with SPSS, it's essential to set the Type of both Ordinal and Nominal variables to Numeric in the Variable View. Since the data will be numeric, you'll use value labels to provide meaningful context to these coded numbers.

Numeric Codes: For ordinal variables like Age in Sample.SAV, you'll need to specify what each numeric code (1, 2, 3, and 4) represents. Use the Values column in Variable View to associate each number with a label, such as 1 for 18-30 and 2 for 30-50.

Shared Labels: Some variables might have several numeric codes that mean the same thing. For instance, in the ordinal variable Question1, the codes 0 through 4 are all labeled as Poorly, while 6 through 10 are labeled as Well.

Ensure that all values have labels, otherwise the outputs function will return an error message indicating which values are unlabeled.

Once you've included all essential variables and assigned column and value labels to all nominal and ordinal variables, you can run the outputs function on the SPSS file. If any metadata is missing, the outputs function will return an error and specify what data is lacking.

Columns like Width, Decimals, Missing, Columns, Align, and Role in Variable View can usually be ignored.

In Python

To execute the outputs function, open a terminal with the directory containing the .SAV file. Then, run the following commands:

Python3
from DeliberativePolling import outputs
outputs("your_file.SAV")

Outputs

After running the function, a new folder named Outputs will be created in the directory. This folder will contain all the generated tables and reports in .xlsx format. If these tables and reports are reasonably sized (under 10,000 cells), they will also be exported in .docx format.

Project details

Release history Release notifications | RSS feed

1.4.2

Nov 19, 2023

1.4.1

Nov 19, 2023

1.4.0

Nov 17, 2023

1.3.8

Nov 17, 2023

1.3.7

Oct 24, 2023

1.3.6

Oct 24, 2023

1.3.5

Oct 11, 2023

1.3.4

Oct 11, 2023

1.3.3

Oct 11, 2023

1.3.2

Oct 7, 2023

1.3.1

Oct 7, 2023

1.3.0

Oct 7, 2023

1.2.9

Oct 7, 2023

1.2.8

Oct 7, 2023

1.2.7

Oct 7, 2023

1.2.6

Oct 5, 2023

1.2.5

Oct 5, 2023

1.2.4

Oct 5, 2023

1.2.3

Oct 5, 2023

1.2.2

Oct 5, 2023

1.2.1

Oct 4, 2023

1.2.0

Oct 4, 2023

1.1.8

Oct 4, 2023

1.1.7

Oct 4, 2023

1.1.6

Oct 4, 2023

1.1.5

Oct 4, 2023

1.1.4

Oct 4, 2023

1.1.3

Oct 4, 2023

1.1.2 yanked

Oct 3, 2023

1.1.1 yanked

Oct 2, 2023

1.1.0 yanked

Sep 27, 2023

1.0.9 yanked

Sep 27, 2023

1.0.8 yanked

Sep 27, 2023

1.0.7 yanked

Sep 27, 2023

1.0.6 yanked

Sep 27, 2023

1.0.5 yanked

Sep 27, 2023

1.0.4 yanked

Sep 27, 2023

This version

1.0.3 yanked

Sep 27, 2023

1.0.2

Sep 27, 2023

1.0.1

Sep 27, 2023

1.0.0

Sep 27, 2023

0.2.2

Sep 27, 2023

0.2.1

Sep 27, 2023

0.2.0

Sep 26, 2023

0.1.9

Sep 25, 2023

0.1.8

Sep 23, 2023

0.1.7

Sep 23, 2023

0.1.6

Sep 23, 2023

0.1.5

Sep 23, 2023

0.1.4

Sep 23, 2023

0.1.3 yanked

Sep 23, 2023

0.1.2 yanked

Sep 23, 2023

0.1.0 yanked

Sep 21, 2023

0.0.8 yanked

Sep 21, 2023

0.0.7 yanked

Sep 21, 2023

0.0.6 yanked

Sep 21, 2023

0.0.5 yanked

Sep 21, 2023

0.0.4 yanked

Sep 21, 2023

0.0.3 yanked

Sep 21, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

DeliberativePolling-1.0.3.tar.gz (12.0 kB view details)

Uploaded Sep 27, 2023 Source

Built Distribution

DeliberativePolling-1.0.3-py3-none-any.whl (18.5 kB view details)

Uploaded Sep 27, 2023 Python 3

File details

Details for the file DeliberativePolling-1.0.3.tar.gz.

File metadata

Download URL: DeliberativePolling-1.0.3.tar.gz
Upload date: Sep 27, 2023
Size: 12.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.11.4

File hashes

Hashes for DeliberativePolling-1.0.3.tar.gz
Algorithm	Hash digest
SHA256	`0cfd86a7e599179d24ae021871d95d667e4b835e2cad10b7017cad299d1b6d3d`
MD5	`3e5530464ba3a42fd6f9140076b113c6`
BLAKE2b-256	`cba1b3a0aa1ae71daa5d9b98d25d8717d79412be077de70fe29b73c52f234066`

See more details on using hashes here.

File details

Details for the file DeliberativePolling-1.0.3-py3-none-any.whl.

File metadata

Download URL: DeliberativePolling-1.0.3-py3-none-any.whl
Upload date: Sep 27, 2023
Size: 18.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.11.4

File hashes

Hashes for DeliberativePolling-1.0.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5902de63429246a0d36169b98094ac959859540db2384e004196752299894596`
MD5	`a6a8e90f63bb07085d0847d5756c6250`
BLAKE2b-256	`1c9f6b9466febc55f567f5068d9bc6e881680600668648943831341cdbf29c90`