A framework for writing Airbyte Connectors.
Project description
Connector Development Kit (Python)
The Airbyte Python CDK is a framework for rapidly developing production-grade Airbyte connectors.The CDK currently offers helpers specific for creating Airbyte source connectors for:
- HTTP APIs (REST APIs, GraphQL, etc..)
- Generic Python sources (anything not covered by the above)
The CDK provides an improved developer experience by providing basic implementation structure and abstracting away low-level glue boilerplate.
This document is a general introduction to the CDK. Readers should have basic familiarity with the Airbyte Specification before proceeding.
Setup
Prerequisites
Poetry
Before you can start working on this project, you will need to have Poetry installed on your system. Please follow the instructions below to install Poetry:
- Open your terminal or command prompt.
- Install Poetry using the recommended installation method:
curl -sSL https://install.python-poetry.org | POETRY_VERSION=1.5.1 python3 -
Alternatively, you can use pip
to install Poetry:
pip install --user poetry
- After the installation is complete, close and reopen your terminal to ensure the newly installed
poetry
command is available in your system's PATH.
For more detailed instructions and alternative installation methods, please refer to the official Poetry documentation: https://python-poetry.org/docs/#installation
Concepts & Documentation
See the concepts docs for a tour through what the API offers.
Example Connectors
HTTP Connectors:
Simple Python connectors using the bare-bones Source
abstraction:
Contributing
First time setup
Install the project dependencies and development tools:
poetry install --all-extras
Installing all extras is required to run the full suite of unit tests.
Iteration
- Iterate on the CDK code locally
- Run tests via
poetry run poe unit-test-with-cov
, orpython -m pytest -s unit_tests
if you want to pass pytest options. - Run
poetry run poe check-local
to lint all code, type-check modified code, and run unit tests with coverage in one command.
To see all available scripts, run poetry run poe
.
Autogenerated files
If the iteration you are working on includes changes to the models or the connector generator, you might want to regenerate them. In order to do that, you can run:
poetry run poe build
This will generate the code generator docker image and the component manifest files based on the schemas and templates.
Testing
All tests are located in the unit_tests
directory. Run poetry run poe unit-test-with-cov
to run them. This also presents a test coverage report. For faster iteration with no coverage report and more options, python -m pytest -s unit_tests
is a good place to start.
Building and testing a connector with your local CDK
When developing a new feature in the CDK, you may find it helpful to run a connector that uses that new feature. You can test this in one of two ways:
- Running a connector locally
- Building and running a source via Docker
Installing your local CDK into a local Python connector
In order to get a local Python connector running your local CDK, do the following.
First, make sure you have your connector's virtual environment active:
# from the `airbyte/airbyte-integrations/connectors/<connector-directory>` directory
source .venv/bin/activate
# if you haven't installed dependencies for your connector already
pip install -e .
Then, navigate to the CDK and install it in editable mode:
cd ../../../airbyte-cdk/python
pip install -e .
You should see that pip
has uninstalled the version of airbyte-cdk
defined by your connector's setup.py
and installed your local CDK. Any changes you make will be immediately reflected in your editor, so long as your editor's interpreter is set to your connector's virtual environment.
Building a Python connector in Docker with your local CDK installed
Pre-requisite: Install the airbyte-ci
CLI
You can build your connector image with the local CDK using
# from the airbytehq/airbyte base directory
airbyte-ci connectors --use-local-cdk --name=<CONNECTOR> build
Note that the local CDK is injected at build time, so if you make changes, you will have to run the build command again to see them reflected.
Running Connector Acceptance Tests for a single connector in Docker with your local CDK installed
Pre-requisite: Install the airbyte-ci
CLI
To run acceptance tests for a single connectors using the local CDK, from the connector directory, run
airbyte-ci connectors --use-local-cdk --name=<CONNECTOR> test
When you don't have access to the API
There may be a time when you do not have access to the API (either because you don't have the credentials, network access, etc...) You will probably still want to do end-to-end testing at least once. In order to do so, you can emulate the server you would be reaching using a server stubbing tool.
For example, using mockserver, you can set up an expectation file like this:
{
"httpRequest": {
"method": "GET",
"path": "/data"
},
"httpResponse": {
"body": "{\"data\": [{\"record_key\": 1}, {\"record_key\": 2}]}"
}
}
Assuming this file has been created at secrets/mock_server_config/expectations.json
, running the following command will allow to match any requests on path /data
to return the response defined in the expectation file:
docker run -d --rm -v $(pwd)/secrets/mock_server_config:/config -p 8113:8113 --env MOCKSERVER_LOG_LEVEL=TRACE --env MOCKSERVER_SERVER_PORT=8113 --env MOCKSERVER_WATCH_INITIALIZATION_JSON=true --env MOCKSERVER_PERSISTED_EXPECTATIONS_PATH=/config/expectations.json --env MOCKSERVER_INITIALIZATION_JSON_PATH=/config/expectations.json mockserver/mockserver:5.15.0
HTTP requests to localhost:8113/data
should now return the body defined in the expectations file. To test this, the implementer either has to change the code which defines the base URL for Python source or update the url_base
from low-code. With the Connector Builder running in docker, you will have to use domain host.docker.internal
instead of localhost
as the requests are executed within docker.
Publishing a new version to PyPi
- Open a PR
- Once it is approved and merged, an Airbyte member must run the
Publish CDK Manually
workflow from master usingrelease-type=major|manor|patch
and setting the changelog message.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for airbyte_cdk-0.77.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6dffbe0c4b3454a5cdd20525b4f1e9cfef2e80c005b6b30473fc5bf6f75af64e |
|
MD5 | d45fa1c919cd8f8f11bc9ee7377a4579 |
|
BLAKE2b-256 | b0b4490e3f9ac2dbdb485a6e76ea4397d4746dad08c5de1f6fdecb57ed16cf18 |