A package to simplify usage of the DeepCrawl GraphQL

These details have not been verified by PyPI

Project links

Homepage

Project description

DeepCrawl GraphQL Wrapper 1.0.0 documentation

Welcome to DeepCrawl GraphQL Wrapper’s documentation!¶

pip install deepcrawl_graphql

Authentication¶

To authenticate with the DeepCrawl API you have to first create a connection.

Connection¶

class deepcrawl_graphql.api.DeepCrawlConnection(user_key_id=None, secret=None, token=None)¶

DeepCrawlConnection class

Creates a connection instance used for sending GraphQL queries to DeepCrawl.

>>> from deepcrawl_graphql.api import DeepCrawlConnection

>>> DeepCrawlConnection("user_key_id", "secret")
DeepCrawlConnection instance

>>> DeepCrawlConnection(token="token")
DeepCrawlConnection instance

Parameters:

user_key_id (int or str) – user key, used together with secret for authentication.
secret (str) – secret key, used together with user_key_id for authentication.
token (str) – authentication token, if used ignores the user_key_id and secret

run_query(query, variables=None, var=None, **kwargs)¶ Runs a query.

>>> conn = DeepCrawlConnection("user_key_id", "secret")

There are 3 possible ways you can run a query

Using a query string. You can use the DeepCrawl explorer to construct the string https://graph-docs.deepcrawl.com/graphql/explorer

>>> query = 'query MyQuery {version}'
>>> conn.run_query(query)
{'version': '1.91.1-next.0-en'}

Using the gql package to construct a dynamic query.

>>> from gql.dsl import DSLQuery
>>> query = DSLQuery(conn.ds.Query.me.select(conn.ds.User.id, conn.ds.User.username))
>>> conn.run_query(query)
{
 'me': {
 'id': 'id',
 'username': 'email@example.com'
 }
}

>>> from gql.dsl import DSLVariableDefinitions
>>> from gql.dsl import DSLQuery
>>> var = DSLVariableDefinitions()
>>> query = conn.ds.Query.me.select(
 conn.ds.User.accounts.args(first=var.first, after=var.after)
 .select(
 conn.ds.AccountConnection.nodes.select(
 conn.ds.Account.id,
 conn.ds.Account.name,
 )
 )
 .select(
 conn.ds.AccountConnection.pageInfo.select(
 conn.ds.PageInfo.startCursor,
 conn.ds.PageInfo.endCursor,
 conn.ds.PageInfo.hasNextPage,
 conn.ds.PageInfo.hasPreviousPage,
 )
 )
 )
>>> conn.run_query(query, variables={"first": 1, "after": "MQ"}, var=var)
{
 "me": {
 "accounts": {
 "nodes": [
 {
 "id": "id",
 "name": "name"
 }
 ],
 "pageInfo": {
 "startCursor": "Mg",
 "endCursor": "Mg",
 "hasNextPage": false,
 "hasPreviousPage": true
 }
 }
 }
}
# For more information about constructing queries with dsl
# see https://gql.readthedocs.io/en/stable/advanced/dsl_module.html

Import a query from the deepcrawl_graphql package and use it’s prebuild queries.

>>> from deepcrawl_graphql.me.me import MeQuery
>>> me_query = MeQuery(conn)
>>> me_query.select_me()
>>> conn.run_query(me_query)
{
 'me': {
 'id': 'id',
 'username': 'email@example.com',
 'email': 'email@example.com',
 'firstName': 'FirstName',
 'lastName': 'LastName',
 'createdAt': '2019-10-27T17:11:17.000Z',
 'updatedAt': '2022-01-15T10:10:38.000Z',
 'jobTitle': None,
 'overallLimitLevelsMax': 1000,
 'overallLimitPagesMax': 10000000,
 'ssoClientId': None,
 'termsAgreed': True,
 'rawID': 'id',
 'permissions': []
 }
}

Parameters:

query (str or Query) – query object
variables (dict or None) – variables to use in the query. run_query(… {“first”: 1, “after”: “MQ”})
var (DSLVariableDefinitions) – gql DSLVariableDefinitions instance to be used in the query. Works tougether with variables
kwargs – variables to use in the query. If variables is not used and the user prefers to send variables as function arguments run_query(… first=1, after=”MQ”)

Pagination¶

There are some optional arguments which can be used while building the query: first, last, after, before

first - Number of records to fetch from start
last - Number of records to fetch from end
after - Fetch after cursor
before - Fetch before cursor

>>> conn = DeepCrawlConnection("user_key_id", "secret")

By default first 100 or less objects are retrieved

>>> account_query = AccountQuery(conn, "74910")
>>> account_query.select_projects()
>>> conn.run_query(account_query)

{
    "getAccount": {
        "projects": {
            "nodes": [
                {"id": "id-1", "name": "name-1", ...},
                {"id": "id-2", "name": "name-2", ...},
                {"id": "id-3", "name": "name-3", ...},
                {"id": "id-4", "name": "name-4", ...},
            ],
            "pageInfo": {
                "startCursor": "MQ",
                "endCursor": "NA",
                "hasNextPage": False,
                "hasPreviousPage": False
            }
        }
    }
}

By using the first argument you can choose how many object to retrieve per page.

>>> conn.run_query(account_query, variables={"first": 2})
OR
>>> conn.run_query(account_query, first=2)

{
    "me": {
        "accounts": {
            "nodes": [
                {"id": "id-1", "name": "name-1", ...},
                {"id": "id-2", "name": "name-2", ...},
            ],
            "pageInfo": {
                "startCursor": "MQ",
                "endCursor": "Mg",
                "hasNextPage": True,
                "hasPreviousPage": False
            }
        }
    }
}

By using the first argument combined with the after argument you can choose how many object to retrieve per page after the selected cursor.

>>> conn.run_query(account_query, variables={"first": 2, "after": "Mg"})
OR
>>> conn.run_query(account_query, first=2, after="Mg")

{
    "me": {
        "accounts": {
            "nodes": [
                {"id": "id-3", "name": "name-3", ...},
                {"id": "id-4", "name": "name-4", ...},
            ],
            "pageInfo": {
                "startCursor": "Mw",
                "endCursor": "NA",
                "hasNextPage": False,
                "hasPreviousPage": True
            }
        }
    }
}

By using the first argument combined with the before argument you can choose how many object to retrieve per page before the selected cursor.

>>> conn.run_query(account_query, variables={"first": 2, "before": "Mg"})
OR
>>> conn.run_query(account_query, first=2, before="Mg")

{
    "me": {
        "accounts": {
            "nodes": [
                {"id": "id-1", "name": "name-1", ...},
                {"id": "id-2", "name": "name-2", ...},
            ],
            "pageInfo": {
                "startCursor": "MQ",
                "endCursor": "Mg",
                "hasNextPage": True,
                "hasPreviousPage": False
            }
        }
    }
}

Me¶

MeQuery¶

class deepcrawl_graphql.me.me.MeQuery(conn: DeepCrawlConnection)¶

MeQuery class

Creates a me query instance. “Me” being the authenticated user. The instance will be passed to the run_query method in order to execute the query.

>>> from deepcrawl_graphql.me.me import MeQuery

>>> me_query = MeQuery(conn)
>>> me_query.select_me()
>>> me_query.select_accounts()
>>> me = conn.run_query(me_query)

Parameters: conn (DeepCrawlConnection) – Connection.

select_me()¶ Selects user fields.

select_accounts()¶ Selects users accounts.

Accounts¶

AccountQuery¶

class deepcrawl_graphql.accounts.account.AccountQuery(conn: DeepCrawlConnection, account_id)¶

AccountQuery class

Creates an accout query instance. The instance will be passed to the run_query method in order to execute the query.

>>> from deepcrawl_graphql.accounts.account import AccountQuery

>>> account_query = AccountQuery(conn, "id")
>>> account_query.select_account()
>>> account_query.select_settings()
>>> account_query.select_callback_headers()
>>> account_query.select_feature_flags()
>>> account_query.select_locations()
>>> account_query.select_package()
>>> account_query.select_subscription()
>>> account_query.select_projects()
>>> account_query.select_project("project_id")
>>> account = conn.run_query(account_query)

Parameters:

conn (DeepCrawlConnection) – Connection.
account_id (int or str) – account id.

select_account()¶ Selects account fields.

select_settings()¶ Selects account accountSettings.

select_callback_headers()¶ Selects account apiCallbackHeaders.

select_feature_flags()¶ Selects account featureFlags.

select_locations()¶ Selects account locations.

select_package()¶ Selects account primaryAccountPackage.

select_subscription(include_addons=False, integration_type=None)¶ Selects account subscription.

Parameters:

include_addons (bool) – If true includes the addons available.
integration_type (str) – Selects an addon by integration type

select_projects()¶ Selects account projects.

select_project(project_id)¶ Selects account project by id.

Parameters: project_id (bool) – Project id.

Projects¶

ProjectQuery¶

class deepcrawl_graphql.projects.project.ProjectQuery(conn: DeepCrawlConnection, project_id)¶

ProjectQuery class

Creates a project query instance. The instance will be passed to the run_query method in order to execute the query.

>>> from deepcrawl_graphql.projects.project import ProjectQuery

>>> project_query = ProjectQuery(conn, "project_id")
>>> project_query.select_project()
>>> project_query.select_sitemaps()
>>> project_query.select_advanced_crawl_rate()
>>> project_query.select_majestic_configuration()
>>> project_query.select_location()
>>> project_query.select_google_search_configuration()
>>> project_query.select_custom_extraction_settings()
>>> project_query.select_account()
>>> project_query.select_crawls()
>>> project = conn.run_query(project_query)

Parameters:

conn (DeepCrawlConnection) – Connection.
project_id (int or str) – project id.

select_project()¶ Selects project fields.

select_sitemaps()¶ Selects project sitemaps.

select_advanced_crawl_rate()¶ Selects project maximumCrawlRateAdvanced.

select_majestic_configuration()¶ Selects project majesticConfiguration.

select_location()¶ Selects project location.

select_last_finished_crawl()¶ Selects project lastFinishedCrawl.

Not implemented yet.

select_google_search_configuration()¶ Selects project googleSearchConsoleConfiguration.

select_google_analytics_project_view()¶ Selects project googleAnalyticsProjectView.

Not implemented yet.

select_custom_extraction_settings()¶ Selects project customExtractions.

select_account()¶ Selects project account.

select_crawls()¶ Selects project crawls.

Crawls¶

CrawlQuery¶

class deepcrawl_graphql.crawls.crawl.CrawlQuery(conn: DeepCrawlConnection, crawl_id)¶

CrawlQuery class

Creates a crawl query instance. The instance will be passed to the run_query method in order to execute the query.

>>> from deepcrawl_graphql.crawls.crawl import CrawlQuery

>>> crawl_query = CrawlQuery(conn, "crawl_id")
>>> crawl_query.select_crawl()
>>> crawl_query.select_parquet_files("datasource_name")
>>> crawl_query.select_compared_to()
>>> crawl = conn.run_query(crawl_query)

Parameters:

conn (DeepCrawlConnection) – Connection.
crawl_id (int or str) – crawl id.

select_crawl()¶ Selects crawl fields.

select_parquet_files(datasource_name)¶ Selects crawl parquetFiles.

Parameters: datasource_name (str) – Datasource name.

select_crawl_type_counts(crawl_types, segment_id=None)¶ Selects crawl fields.

Not implemented yet.

Parameters:

crawl_types (str) – Crawl type.
segment_id (int or str) – Segment id.

select_crawl_settings()¶ Selects crawl crawlSetting.

select_compared_to()¶ Selects crawl comparedTo.

select_reports()¶ Selects crawl reports.

Reports¶

ReportQuery¶

class deepcrawl_graphql.reports.report.ReportQuery(conn: DeepCrawlConnection, crawl_id, report_tamplate_code, report_type_code, segment_id=None)¶

ReportQuery class

Creates a report query instance. The instance will be passed to the run_query method in order to execute the query.

>>> from deepcrawl_graphql.reports.report import ReportQuery

>>> report_query = ReportQuery(conn, "crawl_id", "report_tamplate_code", "report_type_code")
>>> report_query.select_report()
>>> report_query.select_datasource()
>>> report_query.select_type()
>>> report_query.select_trend()
>>> report_query.select_segment()
>>> report_query.select_report_template()
>>> conn.run_query(report_query)

Parameters:

conn (DeepCrawlConnection) – Connection.
crawl_id (int or str) – crawl id.
report_tamplate_code (str) – report template code.
report_type_code (str) – report type code.
segment_id (int or str) – segment id.

select_report()¶ Selects report fields.

select_raw_trends()¶ Selects report rawTrends.

select_datasource()¶ Selects report datasources.

select_type()¶ Selects report type.

select_trend()¶ Selects report trend.

select_segment()¶ Selects report segment.

select_report_template()¶ Selects report reportTemplate.

Indices and tables¶

Index
Module Index
Search Page

DeepCrawl GraphQL Wrapper

Navigation

Pagination
MeQuery
AccountQuery
ProjectQuery
CrawlQuery
ReportQuery

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.2.13

Feb 7, 2024

0.2.12

Jan 16, 2024

0.2.11

Jan 16, 2024

0.2.10

Dec 7, 2023

0.2.9

Dec 7, 2023

0.2.8

Dec 6, 2023

0.2.7

Dec 5, 2023

0.2.6

Sep 13, 2023

0.2.5

Aug 27, 2023

0.2.4

Aug 15, 2023

0.2.3

Oct 30, 2022

0.2.2

Sep 17, 2022

0.2.1

Aug 11, 2022

0.2.0

Jul 18, 2022

This version

0.1.4

Jul 10, 2022

0.1.3

Jul 9, 2022

0.1.2

Jun 28, 2022

0.1.1

Jun 28, 2022

0.1.0

Jun 27, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deepcrawl_graphql-0.1.4.tar.gz (56.9 kB view details)

Uploaded Jul 10, 2022 Source

File details

Details for the file deepcrawl_graphql-0.1.4.tar.gz.

File metadata

Download URL: deepcrawl_graphql-0.1.4.tar.gz
Upload date: Jul 10, 2022
Size: 56.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.4.2 importlib_metadata/4.11.4 pkginfo/1.8.3 requests/2.28.0 requests-toolbelt/0.9.1 tqdm/4.64.0 CPython/3.8.13

File hashes

Hashes for deepcrawl_graphql-0.1.4.tar.gz
Algorithm	Hash digest
SHA256	`0f80a76c9d2d7194922acb203672ddfdef611685a15288db3509b0000e45ca1c`
MD5	`b072844063e8a41c48c24eac05125e45`
BLAKE2b-256	`9f38be534d8dc90f6dd02def86cdc5a4a7040dfdede950379f9c057b3ff0be5c`