Skip to main content

Utilities for text processing and statistical analysis from Pew Research Center

Project description

pewanalytics

pewanalytics is a Python package that provides text processing and statistics utilities for computational social science researchers.

Installation

To install, you can use pip:

pip install git+https://github.com/pewresearch/pewanalytics#egg=pewanalytics

Or you can install from source:

git clone https://github.com/pewresearch/pewanalytics.git
cd pewanalytics
python setup.py install

Installation Troubleshooting

Using 64-bit Python

Some of our libraries require the use of 64-bit Python. If you encounter errors during installation that are related to missing libraries, you may be using 32-bit Python. We recommend that you uninstall this version and switch to a 64-bit version instead. On Windows, these will be marked with x86-64; you can find the latest 64-bit versions of Python here.

Installing ssdeep

ssdeep is an optional dependency that can be used by the get_hash function in Pewtils. Installation instructions for various Linux distributions can be found in the library's documentation. The ssdeep Python library is not currently compatible with Windows. Installing ssdeep on Mac OS may involve a few additional steps, detailed below:

  1. Install Homebrew
  2. Install xcode
    xcode-select --install
    
  3. Install system dependencies
    brew install pkg-config libffi libtool automake
    ln -s /usr/local/bin/glibtoolize /usr/local/bin/libtoolize
    
  4. Install ssdeep with an additional flag to build the required libraries
    BUILD_LIB=1 pip install ssdeep
    
  5. If step 4 fails, you may need to redirect your system to the new libraries by setting the following flags:
    export LIBTOOL=`which glibtool`
    export LIBTOOLIZE=`which glibtoolize`
    
    Do this and try step 4 again.
  6. Now you should be able to run the main installation process detailed above.

Documentation

Please refer to the official documentation for information on how to use this package.

Use Policy

In addition to the license, Users must abide by the following conditions:

  • User may not use the Center's logo
  • User may not use the Center's name in any advertising, marketing or promotional materials.
  • User may not use the licensed materials in any manner that implies, suggests, or could otherwise be perceived as attributing a particular policy or lobbying objective or opinion to the Center, or as a Center endorsement of a cause, candidate, issue, party, product, business, organization, religion or viewpoint.

Issues and Pull Requests

This code is provided as-is for use in your own projects. You are free to submit issues and pull requests with any questions or suggestions you may have. We will do our best to respond within a 30-day time period.

Recommended Package Citation

Pew Research Center, 2020, "pewanalytics" Available at: github.com/pewresearch/pewanalytics

Acknowledgements

The following authors contributed to this repository:

  • Patrick van Kessel
  • Regina Widjaya
  • Skye Toor
  • Emma Remy
  • Onyi Lam
  • Brian Broderick
  • Galen Stocking
  • Dennis Quinn

About Pew Research Center

Pew Research Center is a nonpartisan fact tank that informs the public about the issues, attitudes and trends shaping the world. It does not take policy positions. The Center conducts public opinion polling, demographic research, content analysis and other data-driven social science research. It studies U.S. politics and policy; journalism and media; internet, science and technology; religion and public life; Hispanic trends; global attitudes and trends; and U.S. social and demographic trends. All of the Center's reports are available at www.pewresearch.org. Pew Research Center is a subsidiary of The Pew Charitable Trusts, its primary funder.

Contact

For all inquiries, please email info@pewresearch.org. Please be sure to specify your deadline, and we will get back to you as soon as possible. This email account is monitored regularly by Pew Research Center Communications staff.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pewanalytics-1.1.1.tar.gz (53.9 kB view details)

Uploaded Source

Built Distribution

pewanalytics-1.1.1-py3-none-any.whl (56.1 kB view details)

Uploaded Python 3

File details

Details for the file pewanalytics-1.1.1.tar.gz.

File metadata

  • Download URL: pewanalytics-1.1.1.tar.gz
  • Upload date:
  • Size: 53.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/0.0.0 importlib_metadata/4.8.2 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.7

File hashes

Hashes for pewanalytics-1.1.1.tar.gz
Algorithm Hash digest
SHA256 b3ce63a3f00a64f7f83543531aece783da4ca887c47d75e8ad8c2b9065a09d70
MD5 adfc280170f90292061befdb0b776127
BLAKE2b-256 ed82ef421599683b135467fe4001783c45564c60218109810370e5744450e605

See more details on using hashes here.

File details

Details for the file pewanalytics-1.1.1-py3-none-any.whl.

File metadata

  • Download URL: pewanalytics-1.1.1-py3-none-any.whl
  • Upload date:
  • Size: 56.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/0.0.0 importlib_metadata/4.8.2 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.7

File hashes

Hashes for pewanalytics-1.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 12f9f289449101a72c9f66990fe73186343bb1e7bf343ff834ee459fcde697f8
MD5 5ad6f25ce67d115c992a97517341a065
BLAKE2b-256 4acc6e1de56707b14e3ab6ebdc1d32e983068a3e361a77321517e5d7f0761894

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page