Skip to main content

This package converts jsonl file generated by twarc2 to sql database in an opnionated way.

Project description

twarc2sql

https://img.shields.io/pypi/v/twarc2sql.svg Documentation Status Tests

This package converts jsonl file generated by twarc2 to sql database in an opnionated way.

Features

  • This package converts jsonl file generated by twarc2 to a postgres sql database in an opnionated way.

  • It creates a database with multiple tables that can be found in the documentation & models.py file.

Installation

You can install twarc2sql using pip:

$ pip install twarc2sql

Usage

import twarc2sql

twarc2sql.connect_to_db_and_upload(
    "folderpath/to/jsonl/file",
    "jsonl_file",
    "twarc_task_type",
    "env_file_with_db_information",
)

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

History

0.1.0 (2023-03-23)

  • First release on PyPI.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

twarc2sql-1.0.0.tar.gz (487.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

twarc2sql-1.0.0-py2.py3-none-any.whl (16.7 kB view details)

Uploaded Python 2Python 3

File details

Details for the file twarc2sql-1.0.0.tar.gz.

File metadata

  • Download URL: twarc2sql-1.0.0.tar.gz
  • Upload date:
  • Size: 487.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.3

File hashes

Hashes for twarc2sql-1.0.0.tar.gz
Algorithm Hash digest
SHA256 5d260e6d0c97e641b12de5bc49aaad767dab4ee55f0e97b68c03e13b9cd6441d
MD5 b5376d84eb093b96426156558dcb9f38
BLAKE2b-256 21b86ddae0f40c5a6b5a2596807c8edd11125601c266dd134240e5ac6d0799d1

See more details on using hashes here.

File details

Details for the file twarc2sql-1.0.0-py2.py3-none-any.whl.

File metadata

  • Download URL: twarc2sql-1.0.0-py2.py3-none-any.whl
  • Upload date:
  • Size: 16.7 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.3

File hashes

Hashes for twarc2sql-1.0.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 618d0dda70381539137362de75fe27edcf69eb6cd51e1007ec71936df4c80160
MD5 04baa1fee832faa1f7430646ee3db6ca
BLAKE2b-256 56d3fa5e53207e2d30672fc016badb95f7575f3bd06291a55af74a310a3bb3ea

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page