Skip to main content

Unofficial Octoparse API client.

Project description

Octoparse

Unofficial Octoparse API client in python

Now with support for Advanced API and China as well

Installation:

use pip to install:

pip install octoparse

Example usage:

from octoparse import Octoparse

# initialize api client
# it will try to log in & ask for credentials if required
octo = Octoparse()

# if using advanced API:
octo = Octoparse(advanced_api=True)

# if using from China:
octo = Octoparse(china=True)

# List all task groups
groups = octo.list_all_task_groups()

# List all tasks in a group
tasks = octo.list_all_tasks_in_group(group_id='xxxx-ssdsd-1212')

# Export the not exported data
data = octo.get_not_exported_data(task_id='abcd-1234-djfsd-dfdf', size=100)

# Update data status
resp = octo.update_data_status(task_id='abcd-1234-djfsd-dfdf')

# get data for a task with task id: 'abcd-1234-djfsd-dfdf'
data = octo.get_task_data(task_id='abcd-1234-djfsd-dfdf')

# get task data as a pandas.DataFrame for a task with task id: 'abcd-1234-djfsd-dfdf'
df = octo.get_task_data_df(task_id='abcd-1234-djfsd-dfdf')

# clear data for a task with task id: 'abcd-1234-djfsd-dfdf'
octo.clear_task_data(task_id='abcd-1234-djfsd-dfdf')

Following are supported for Advanced API

# Get Tasks' status
task_list = ['abcd-1234-djfsd-dfdf', 'ab23-5677-djfsd-dfdf']
resp = octo.get_task_status(task_list)

# Get Task's parameter
resp = octo.get_task_param(task_id='abcd-1234-djfsd-dfdf', name='loopAction1.Url')

# Update Task's parameter
resp = octo.update_task_param(task_id='abcd-1234-djfsd-dfdf', name='loopAction1.Url', value='http://xyz.abc')

# Add new URLs/text to an existing loop
resp = octo.add_url_text_to_loop(task_id='abcd-1234-djfsd-dfdf', name='loopAction1.Url', value='http://xyz.abc')

# Start running task
resp = octo.start_task(task_id='abcd-1234-djfsd-dfdf')

# Stop running task
resp = octo.stop_task(task_id='abcd-1234-djfsd-dfdf')

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

octoparse-1.3.0.tar.gz (5.9 kB view details)

Uploaded Source

Built Distribution

octoparse-1.3.0-py3-none-any.whl (10.2 kB view details)

Uploaded Python 3

File details

Details for the file octoparse-1.3.0.tar.gz.

File metadata

  • Download URL: octoparse-1.3.0.tar.gz
  • Upload date:
  • Size: 5.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.23.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.51.0 CPython/3.8.3

File hashes

Hashes for octoparse-1.3.0.tar.gz
Algorithm Hash digest
SHA256 ce20c05f05c5ce1508ca3b230fbb5b7f5c1ca456811d2ffafe9e4100137704a5
MD5 7388a50b9bbfa07a22ad3d3ddc8caf29
BLAKE2b-256 eda1650b3698f6a511b447ff82d87c05001877cbf230b76d1e6c3264b80a66f5

See more details on using hashes here.

File details

Details for the file octoparse-1.3.0-py3-none-any.whl.

File metadata

  • Download URL: octoparse-1.3.0-py3-none-any.whl
  • Upload date:
  • Size: 10.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.23.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.51.0 CPython/3.8.3

File hashes

Hashes for octoparse-1.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c77d6eed3e686ea9f19b4dc9342c98c69472515fc60b3ce695024f0dea6b3b8b
MD5 57c1d3242e8b419c80b655047a53ac3e
BLAKE2b-256 5308de19b3f94a5692f43aab6e25965ba69d98a5a8abb639b4efc3061bf255fb

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page