Skip to main content

All connectors that are part of the Unstract platform

Project description

Unstract Connectors

This is Unstract's python package which helps connect to a number of different filesystems and databases.

Filesystems

Filesystems are supported with the help of fsspec libraries that provide a uniform interface to these connectors.

The following filesystems are supported

  • Google Drive
  • S3/Minio
  • Unstract Cloud Storage
  • Box
  • Dropbox (issues exist around file discovery/listing)
  • HTTP(S)

Databases

The following databases are supported

  • Snowflake
  • PostgreSQL
  • MySQL
  • MSSQL
  • Redshift
  • MariaDB
  • BigQuery

Installation

Local Development

To get started with local development,

  • Create and source a virtual environment if you haven't already following these steps.
  • If you're using Mac, install the below library needed for PyMSSQL
brew install pkg-config freetds
  • Install the required dependencies with
pdm install

Environment variables

If the GCSHelper is used, the following environment variables need to be set

  • GOOGLE_SERVICE_ACCOUNT : The service account JSON to perform authentication with Google Cloud Storage account.
  • GOOGLE_PROJECT_ID : The project ID associated with the Google Cloud Storage account.

Running tests

TODO: Use a test framework and document way to run tests

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

unstract_connectors-0.0.3.tar.gz (18.5 kB view hashes)

Uploaded Source

Built Distribution

unstract_connectors-0.0.3-py3-none-any.whl (37.1 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page