Skip to main content

This package converts jsonl file generated by twarc2 to sql database in an opnionated way.

Project description

twarc2sql

https://img.shields.io/pypi/v/twarc2sql.svg Documentation Status Tests

This package converts jsonl file generated by twarc2 to sql database in an opnionated way.

Features

  • This package converts jsonl file generated by twarc2 to a postgres sql database in an opnionated way.

  • It creates a database with multiple tables that can be found in the documentation & models.py file.

Installation

You can install twarc2sql using pip:

$ pip install twarc2sql

Usage

import twarc2sql

twarc2sql.connect_to_db_and_upload(
    "folderpath/to/jsonl/file",
    "jsonl_file",
    "twarc_task_type",
    "env_file_with_db_information",
)

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

History

0.1.0 (2023-03-23)

  • First release on PyPI.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

twarc2sql-0.2.2.tar.gz (485.3 kB view details)

Uploaded Source

Built Distribution

twarc2sql-0.2.2-py2.py3-none-any.whl (14.6 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file twarc2sql-0.2.2.tar.gz.

File metadata

  • Download URL: twarc2sql-0.2.2.tar.gz
  • Upload date:
  • Size: 485.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.3

File hashes

Hashes for twarc2sql-0.2.2.tar.gz
Algorithm Hash digest
SHA256 902aa3fa2bad70d37b046abdff140288a2e38e63ad463d44470a2fd1c35e80cf
MD5 5511233ef140585e855de2641dde1ef9
BLAKE2b-256 987bf573cec5c5a3889a2d7b0754492a8919745426f7d36f01035339982467a9

See more details on using hashes here.

File details

Details for the file twarc2sql-0.2.2-py2.py3-none-any.whl.

File metadata

  • Download URL: twarc2sql-0.2.2-py2.py3-none-any.whl
  • Upload date:
  • Size: 14.6 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.3

File hashes

Hashes for twarc2sql-0.2.2-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 8e76ff230227102f4ca13af44df25886b870c5921006109dc4ccaef1b1b9679a
MD5 2ed746d3ea0f17d01c53175461fa0f4d
BLAKE2b-256 4b42dfdde4caea3ef7497682182c415edcb642d654002f07cb1e73495193cd11

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page