Skip to main content

This package converts jsonl file generated by twarc2 to sql database in an opnionated way.

Project description

twarc2sql

https://img.shields.io/pypi/v/twarc2sql.svg Documentation Status Tests

This package converts jsonl file generated by twarc2 to sql database in an opnionated way.

Features

  • This package converts jsonl file generated by twarc2 to a postgres sql database in an opnionated way.

  • It creates a database with multiple tables that can be found in the documentation & models.py file.

Installation

You can install twarc2sql using pip:

$ pip install twarc2sql

Usage

import twarc2sql

twarc2sql.connect_to_db_and_upload(
    "folderpath/to/jsonl/file",
    "jsonl_file",
    "twarc_task_type",
    "env_file_with_db_information",
)

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

History

0.1.0 (2023-03-23)

  • First release on PyPI.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

twarc2sql-1.0.0.tar.gz (487.3 kB view details)

Uploaded Source

Built Distribution

twarc2sql-1.0.0-py2.py3-none-any.whl (16.7 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file twarc2sql-1.0.0.tar.gz.

File metadata

  • Download URL: twarc2sql-1.0.0.tar.gz
  • Upload date:
  • Size: 487.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.3

File hashes

Hashes for twarc2sql-1.0.0.tar.gz
Algorithm Hash digest
SHA256 5d260e6d0c97e641b12de5bc49aaad767dab4ee55f0e97b68c03e13b9cd6441d
MD5 b5376d84eb093b96426156558dcb9f38
BLAKE2b-256 21b86ddae0f40c5a6b5a2596807c8edd11125601c266dd134240e5ac6d0799d1

See more details on using hashes here.

File details

Details for the file twarc2sql-1.0.0-py2.py3-none-any.whl.

File metadata

  • Download URL: twarc2sql-1.0.0-py2.py3-none-any.whl
  • Upload date:
  • Size: 16.7 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.3

File hashes

Hashes for twarc2sql-1.0.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 618d0dda70381539137362de75fe27edcf69eb6cd51e1007ec71936df4c80160
MD5 04baa1fee832faa1f7430646ee3db6ca
BLAKE2b-256 56d3fa5e53207e2d30672fc016badb95f7575f3bd06291a55af74a310a3bb3ea

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page