bulk-support

Python interface to the Salesforce.com Bulk API.

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 5 - Production/Stable
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Natural Language
- English
Programming Language

Project description

Salesforce Bulk

Python client library for accessing the asynchronous Salesforce.com Bulk API.

Installation

pip install bulk-support

Authentication

To access the Bulk API you need to authenticate a user into Salesforce. The easiest way to do this is just to supply username, password and security_token. This library will use the simple-salesforce package to handle password based authentication.

from bulk_support import SalesforceBulk

bulk = SalesforceBulk(username=username, password=password, security_token=security_token)
...

Alternatively if you run have access to a session ID and instance_url you can use those directly:

from urlparse import urlparse
from bulk_support import SalesforceBulk

bulk = SalesforceBulk(sessionId=sessionId, host=urlparse(instance_url).hostname)
...

Operations

The basic sequence for driving the Bulk API is:

Create a new job
Add one or more batches to the job
Close the job
Wait for each batch to finish

Bulk Query

bulk.create_query_job(object_name, contentType='JSON')

Using API v45.0 or higher, you can also use the queryAll operation:

bulk.create_queryall_job(object_name, contentType='JSON')

Example

import json
from bulk_support.util import IteratorBytesIO

job = bulk.create_query_job("Contact", contentType='JSON')
batch = bulk.query(job, "select Id,LastName from Contact")
bulk.close_job(job)
while not bulk.is_batch_done(batch):
    sleep(10)

for result in bulk.get_all_results_for_query_batch(batch):
    result = json.load(IteratorBytesIO(result))
    for row in result:
        print row # dictionary rows

Same example but for CSV:

import unicodecsv

job = bulk.create_query_job("Contact", contentType='CSV')
batch = bulk.query(job, "select Id,LastName from Contact")
bulk.close_job(job)
while not bulk.is_batch_done(batch):
    sleep(10)

for result in bulk.get_all_results_for_query_batch(batch):
    reader = unicodecsv.DictReader(result, encoding='utf-8')
    for row in reader:
        print(row) # dictionary rows

Note that while CSV is the default for historical reasons, JSON should be prefered since CSV has some drawbacks including its handling of NULL vs empty string.

PK Chunk Header

If you are querying a large number of records you probably want to turn on PK Chunking:

bulk.create_query_job(object_name, contentType='CSV', pk_chunking=True)

That will use the default setting for chunk size. You can use a different chunk size by providing a number of records per chunk:

bulk.create_query_job(object_name, contentType='CSV', pk_chunking=100000)

Additionally if you want to do something more sophisticated you can provide a header value:

bulk.create_query_job(object_name, contentType='CSV', pk_chunking='chunkSize=50000; startRow=00130000000xEftMGH')

Bulk Insert, Update, Delete

All Bulk upload operations work the same. You set the operation when you create the job. Then you submit one or more documents that specify records with columns to insert/update/delete. When deleting you should only submit the Id for each record.

For efficiency you should use the post_batch method to post each batch of data. (Note that a batch can have a maximum 10,000 records and be 1GB in size.) You pass a generator or iterator into this function and it will stream data via POST to Salesforce. For help sending CSV formatted data you can use the salesforce_bulk.CsvDictsAdapter class. It takes an iterator returning dictionaries and returns an iterator which produces CSV data.

Full example:

from bulk_support import CsvDictsAdapter

job = bulk.create_insert_job("Account", contentType='CSV')
accounts = [dict(Name="Account%d" % idx) for idx in xrange(5)]
csv_iter = CsvDictsAdapter(iter(accounts))
batch = bulk.post_batch(job, csv_iter)
bulk.wait_for_batch(job, batch)
bulk.close_job(job)
print("Done. Accounts uploaded.")

Concurrency mode

When creating the job, pass concurrency='Serial' or concurrency='Parallel' to set the concurrency mode for the job.

Project details

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 5 - Production/Stable
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Natural Language
- English
Programming Language

Release history Release notifications | RSS feed

This version

0.3

May 30, 2019

0.2

May 30, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bulk_support-0.3.tar.gz (11.9 kB view details)

Uploaded May 30, 2019 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

bulk_support-0.3-py2.py3-none-any.whl (10.3 kB view details)

Uploaded May 30, 2019 Python 2Python 3

File details

Details for the file bulk_support-0.3.tar.gz.

File metadata

Download URL: bulk_support-0.3.tar.gz
Upload date: May 30, 2019
Size: 11.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.19.1 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.1

File hashes

Hashes for bulk_support-0.3.tar.gz
Algorithm	Hash digest
SHA256	`8c8cf2d25445a830987d90549fc7ffe148b4f676c43af7e71e62ff76a559ec89`
MD5	`ae6fca0e2fa9837b2edd31c8119d625f`
BLAKE2b-256	`26dd6f12a97a4a03abe53b0cc42b7028d7a6f5e318c44347d8e10b7de0e2d9da`

See more details on using hashes here.

File details

Details for the file bulk_support-0.3-py2.py3-none-any.whl.

File metadata

Download URL: bulk_support-0.3-py2.py3-none-any.whl
Upload date: May 30, 2019
Size: 10.3 kB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.19.1 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.1

File hashes

Hashes for bulk_support-0.3-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`f6c8d37f63779ce994c027ace6c2fbe71c1e3f9dffc3a22a6f97f3ecf08792ed`
MD5	`cda65d33cd72e3cfd8beaffc9ef2ebb7`
BLAKE2b-256	`a29a17c020eba779e69273af914a7e12b9c19e01af68d3e7022c3758f88d856c`

See more details on using hashes here.

bulk-support 0.3

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Salesforce Bulk

Installation

Authentication

Operations

Bulk Query

PK Chunk Header

Bulk Insert, Update, Delete

Concurrency mode

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes