Skip to main content

ingestr is a command-line application that ingests data from various sources and stores them in any database.

Project description

Copy data from any source to any destination without any code


ingestr is a command-line app that allows you to ingest data from any source into any destination using simple command-line flags, no code necessary.

  • โœจ copy data from your database into any destination
  • โž• incremental loading: append, merge or delete+insert
  • ๐Ÿ single-command installation

ingestr takes away the complexity of managing any backend or writing any code for ingesting data, simply run the command and watch the data land on its destination.

MongoDB to Postgres benchmark

Installation

You can install ingestr using the install script:

curl -LsSf https://getbruin.com/install/ingestr | sh

Alternatively, you can install it with pip:

pip install ingestr

The pip package can also be used from Python. Install the SDK extra for Python data ingestion:

pip install 'ingestr[sdk]'

Python rows, generators, and DataFrames are sent to the bundled ingestr binary as Arrow IPC streams by default:

import ingestr

ingestr.ingest(
    [{"id": 1, "name": "Ada"}, {"id": 2, "name": "Grace"}],
    dest_uri="duckdb:///tmp/warehouse.duckdb",
    dest_table="main.people",
)

DataFrames and yielded data use the same Arrow stream transport:

ingestr.ingest(df, dest_uri="duckdb:///tmp/warehouse.duckdb", dest_table="main.events")

def events():
    yield [{"id": 1, "event": "signup"}]
    yield [{"id": 2, "event": "purchase"}]

ingestr.ingest(events, dest_uri="postgresql://...", dest_table="public.events")

For push-style code, omit the data argument and use ingest as a context manager. The context value accepts the same shapes as ingestr.ingest(data, ...):

with ingestr.ingest(dest_uri="postgresql://...", dest_table="public.events") as ingest:
    for response in client.list_events():
        ingest(response["items"])

For very large already-materialized data, use the existing mmap Arrow IPC file transport:

ingestr.ingest(df, dest_uri="duckdb:///tmp/warehouse.duckdb", dest_table="main.events", transport="mmap")

For full CLI pass-through, use ingestr.run(["ingest", "--source-uri", "...", "--dest-uri", "...", "--source-table", "..."]), or ingestr.run_cli(...) for keyword arguments that map to CLI flags.

Quickstart

ingestr ingest \
    --source-uri 'postgresql://admin:admin@localhost:8837/web?sslmode=disable' \
    --source-table 'public.some_data' \
    --dest-uri 'bigquery://<your-project-name>?credentials_path=/path/to/service/account.json' \
    --dest-table 'ingestr.some_data'

That's it.

This command:

  • gets the table public.some_data from the Postgres instance.
  • uploads this data to your BigQuery warehouse under the schema ingestr and table some_data.

Documentation

You can see the full documentation here.

Community

Join our Slack community here.

Contributing

Pull requests are welcome. However, please open an issue first to discuss what you would like to change. We maybe able to offer you help and feedback regarding any changes you would like to make.

[!NOTE] After cloning ingestr make sure to run make setup to install githooks.

Supported sources & destinations

Source Destination
Databases
AWS Athena โœ… โœ…
Apache Iceberg - โœ…
AWS Redshift โœ… โœ…
Cassandra โœ… โœ…
ClickHouse โœ… โœ…
Couchbase โœ… -
CrateDB โœ… โœ…
Databricks โœ… โœ…
DuckDB โœ… โœ…
DynamoDB โœ… โœ…
Elasticsearch โœ… โœ…
Google BigQuery โœ… โœ…
GCP Spanner โœ… -
IBM Db2 โœ… -
InfluxDB โœ… -
Kafka โœ… -
Local CSV file โœ… โœ…
MaxCompute โœ… โœ…
Microsoft Fabric โœ… โœ…
Microsoft OneLake - โœ…
Microsoft SQL Server โœ… โœ…
MongoDB โœ… โœ…
MotherDuck โœ… โœ…
MySQL โœ… โœ…
Oracle โœ… -
PlanetScale โœ… โœ…
Postgres โœ… โœ…
RabbitMQ โœ… -
SAP Hana โœ… -
Snowflake โœ… โœ…
Socrata โœ… -
SQLite โœ… โœ…
StarRocks โœ… โœ…
Synapse - โœ…
Trino โœ… โœ…
Platforms
Adjust โœ… -
Airtable โœ… -
Allium โœ… -
Amazon Kinesis โœ… -
Anthropic โœ… -
API-Football โœ… -
AppsFlyer โœ… -
Apple Ads โœ… -
Apple App Store โœ… -
Applovin โœ… -
Applovin Max โœ… -
Asana โœ… -
Attio โœ… -
Azure Data Lake Storage Gen2 โœ… โœ…
BallDontLie FIFA โœ… -
Braze โœ… -
Bruin โœ… -
Chess.com โœ… -
ClickUp โœ… -
Cursor โœ… -
Docebo โœ… -
Dune โœ… -
Facebook Ads โœ… -
Fireflies โœ… -
Fluxx โœ… -
football-data.org โœ… -
Frankfurter โœ… -
Freshdesk โœ… -
FundraiseUp โœ… -
G2 โœ… -
GitHub โœ… -
GitLab โœ… -
Google Ads โœ… -
Google Analytics โœ… -
Google Cloud Storage (GCS) โœ… โœ…
Google Sheets โœ… -
Gorgias โœ… -
Granola โœ… -
Hostaway โœ… -
HubSpot โœ… -
Indeed โœ… -
Intercom โœ… -
Internet Society Pulse โœ… -
Jira โœ… -
JobTread โœ… -
Klaviyo โœ… -
Linear โœ… -
LinkedIn Ads โœ… -
Mailchimp โœ… -
Mixpanel โœ… -
Monday โœ… -
Notion โœ… -
Paddle โœ… -
Personio โœ… -
PhantomBuster โœ… -
Pinterest โœ… -
Pipedrive โœ… -
Plus Vibe AI โœ… -
PostHog โœ… -
Primer โœ… -
QuickBooks โœ… -
Reddit Ads โœ… -
RevenueCat โœ… -
S3 โœ… โœ…
Salesforce โœ… -
SFTP โœ… -
SendGrid โœ… -
Shopify โœ… -
Slack โœ… -
Smartsheet โœ… -
Snapchat Ads โœ… -
Solidgate โœ… -
Square โœ… -
Stripe โœ… -
SurveyMonkey โœ… -
TikTok Ads โœ… -
Trustpilot โœ… -
Twilio โœ… -
Wise โœ… -
Zendesk โœ… -
Zoom โœ… -

Feel free to create an issue if you'd like to see support for another source or destination.

License

ingestr is source-available under the Functional Source License 1.1, with Apache 2.0 as the future license. You can use ingestr freely for internal production use, development, testing, education, research, and professional services. You cannot use ingestr to offer a competing commercial ingestion, ELT, connector, or managed data pipeline product/service.

Each version becomes Apache 2.0 two years after release.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

ingestr-1.0.64-py3-none-win_amd64.whl (77.1 MB view details)

Uploaded Python 3Windows x86-64

ingestr-1.0.64-py3-none-manylinux2014_x86_64.whl (76.8 MB view details)

Uploaded Python 3

ingestr-1.0.64-py3-none-manylinux2014_aarch64.whl (68.6 MB view details)

Uploaded Python 3

ingestr-1.0.64-py3-none-macosx_11_0_arm64.whl (70.0 MB view details)

Uploaded Python 3macOS 11.0+ ARM64

ingestr-1.0.64-py3-none-macosx_10_15_x86_64.whl (76.4 MB view details)

Uploaded Python 3macOS 10.15+ x86-64

File details

Details for the file ingestr-1.0.64-py3-none-win_amd64.whl.

File metadata

  • Download URL: ingestr-1.0.64-py3-none-win_amd64.whl
  • Upload date:
  • Size: 77.1 MB
  • Tags: Python 3, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for ingestr-1.0.64-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 dd839a139cc3c62703017bb73e1c1f131de52b89265412519ea662b92b46e6f5
MD5 f8f9fd9c0bb2147433ae43eecd788296
BLAKE2b-256 7f7ff3fc11782b1e20c2c44822d65d0b2f7cb177d7592c65b9e6647fa96f58a9

See more details on using hashes here.

File details

Details for the file ingestr-1.0.64-py3-none-manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for ingestr-1.0.64-py3-none-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 770dadea964f51591e0616a621f7e7440871798161b413a8f6dae009dda6c827
MD5 b49f31266ceeb1b4d17530312a578c01
BLAKE2b-256 1ccd5e9bb282c808b4d5fd6a335f2c2809d965f9ddbc56d1a7b2ccaf02c33c4b

See more details on using hashes here.

File details

Details for the file ingestr-1.0.64-py3-none-manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for ingestr-1.0.64-py3-none-manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 6f8c0d7f55883af84ff5a15a60766b9d6e501d90c0f3e90b3f09d4e9d6373d69
MD5 b9c6845c85105d733cbad9500e46d8a1
BLAKE2b-256 b60cea69d8192e72d995fc232ff33e96349eaf0de01834d9fa345cf2056149be

See more details on using hashes here.

File details

Details for the file ingestr-1.0.64-py3-none-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for ingestr-1.0.64-py3-none-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 76d8367a04b82f3991b07a7efe5ab3d86ca1eed3f22563d40c8ee6c830a87d18
MD5 3896aa24edb54b3e139b59c59ce4bce1
BLAKE2b-256 f9306e286762fb8a340368b6688b01c9d0a39c8d0e1a3269c5caef5861196c07

See more details on using hashes here.

File details

Details for the file ingestr-1.0.64-py3-none-macosx_10_15_x86_64.whl.

File metadata

File hashes

Hashes for ingestr-1.0.64-py3-none-macosx_10_15_x86_64.whl
Algorithm Hash digest
SHA256 3a652d6bb3f0c985142a3e246a92fe5bd9bca2436f9a30db533bb5e2baedac8c
MD5 fa5ec15774ee81677f228b5006d96003
BLAKE2b-256 25ab445a3de2e3938ce4d40d3357686fb37e27766f7445f9a9a8b4d84aa2cefb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page