Skip to main content

Sketch Grammar Explorer (Sketch Engine API wrapper)

Project description

Sketch Grammar Explorer

PyPI Latest Release PyPI - Python Versions Package Status License DOI

Introduction

Sketch Grammar Explorer is an API wrapper for Sketch Engine, a corpus management software useful for linguistic research. The goal is to build a flexible scaffold for any kind of programmatic work with Sketch Engine and NoSketch Engine.

UPDATE

SGEX 0.7.0+ is another redesign of the package meant to facilitate enhancements. The workflow is improved and it's streamlined for adapting to SkE's updated API schema. Old methods are deprecated and unavailable in new releases.

Installation

Clone SGEX or install it with pip install sgex (main dependencies are pandas pyyaml aiohttp aiofiles).

Get a Sketch Engine API key. Be sure to reference SkE's documentation and schema:

wget https://www.sketchengine.eu/apidoc/openapi.yaml -O .openapi.yaml

Getting started

A quick intro on the API (examples use a local NoSketch Engine server).

Most things are identical for SkE's main server, apart from using credentials and more call types being available. SGEX currently uses the Bonito API, with URLs ending in /bonito/run.cgi, not newer paths like /search/corp_info.

Package modules

  • job: the primary module - makes requests and manipulates data
  • call: classes and methods for API call types
  • query: functions to generate/manipulate CQL queries
  • util: utility functions

The Job class

Calls are made with the job module, which can also be run as a script. The Job class has a few options:

from sgex.job import Job

j = Job(
	# define API calls
	infile: str | list | None = None,
	params: str | list | None = None,
	# set server info
	server: str = "local",
	default_servers: dict = default_servers,
	# supply credentials
	api_key: str | None = None,
	username: str | None = None,
	# manage caching
	cache_dir: str = "data",
	clear_cache: bool = False,
	# run asynchronous requests
	thread: bool = False,
	# control request throttling
	wait_dict: dict = wait_dict,
	# make a dry run
	dry_run: bool = False,
	# change verbosity
	verbose: bool = False,
	)

j.run()

See the code repository for full documentation.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sgex-0.7.1.tar.gz (31.5 kB view details)

Uploaded Source

Built Distribution

sgex-0.7.1-py3-none-any.whl (19.3 kB view details)

Uploaded Python 3

File details

Details for the file sgex-0.7.1.tar.gz.

File metadata

  • Download URL: sgex-0.7.1.tar.gz
  • Upload date:
  • Size: 31.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.5

File hashes

Hashes for sgex-0.7.1.tar.gz
Algorithm Hash digest
SHA256 5df84d552dcd6e9d2cec89300cc79295201565da7e084c4d593a228541f5de06
MD5 12a220520f8ed0abea061ddcfc081bcd
BLAKE2b-256 e4efabfcda279d2650f18df55469e7164c1ae00ac494aaec6ef2eae4dab059ff

See more details on using hashes here.

File details

Details for the file sgex-0.7.1-py3-none-any.whl.

File metadata

  • Download URL: sgex-0.7.1-py3-none-any.whl
  • Upload date:
  • Size: 19.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.5

File hashes

Hashes for sgex-0.7.1-py3-none-any.whl
Algorithm Hash digest
SHA256 72222dca6743154347c90d2feb507b828ab26c74d00780a41759a2241a2a9c8b
MD5 09943649dd74701e84f434c0004c04af
BLAKE2b-256 4c77cce88d24ff306623f3b1f7ba7e77ef697b128e408b58723090522c0980ed

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page