Skip to main content

Commands to download datasets from the Open Data portal of the Kingdom of Saudi Arabia (KSA).

Project description

Open Data - Kingdom of Saudi Arabia

This repository contains scripts to download datasets from the Open Data portal of the Kingdom of Saudi Arabia (KSA). The main script downloads all datasets for a given organization ID and saves them locally.

Directory Structure

.
├── README.md
├── download_all_org.py
├── opendata (optional - parameter in file)
├── requirements.txt
├── open_ksa
│   ├── download_file.py
│   ├── organizations.py
│   ├── get_dataset_resources.py
│   └── get_org_resources.py
└── system-drawing.excalidraw

Functions Overview

  • organizations(): Get the organization information, including the option to write to a target file as a CSV or JSON.
  • get_org_resources(org_id): Retrieves the organization name, organization ID, and dataset IDs for the specified organization.
  • get_dataset_resources(dataset_ids, allowed_exts=['csv', 'xlsx', 'xls'], output_dir='opendata/org_resources', verbose=False): Downloads all data resources for the specified dataset IDs.
  • download_file(session, url, headers, file_path): Downloads a file from the specified URL using the provided session and headers.

Process Flow

graph TD
    subgraph Initialization
        A[Start] --> B[Create SSL Adapter]
        B --> C[Setup Session]
    end

    subgraph Resource Extraction
        C --> D[Extract Organization ID]
        D --> E[Extract Dataset IDs]
    end

    subgraph Directory Setup
        E --> F[Create Directory for Organization]
    end

    subgraph Data Download
        F --> G[Download Dataset Resources]
        G --> H[Save Data Locally]
    end

    H --> I[End]

Usage

To run the script with the dependencies, first install the virtualenv:

python -m venv venv

pip install -r requirements.txt

Then you'll be able to run the python primary python script successfully:

python download_all_org.py

NOTE: For a different organization, you need to update the parameter in the file for the org_id parameter in the function

Release Plan / To DO

  • Create a set of functions to cover the entire API, including:
    • Create a function to get the list of organizations
    • Create a function to get the list of datasets for an organization
    • Create a function to get the list of resources for a dataset
    • Create a function to download a resource
    • Create a function to check the status of a download
  • Create a set of unit tests for the functions
  • Create a set of examples for the functions
  • Create a set of documentation for the functions
  • Move the repository to a PyPi library

Contribution

The contribution process is as follows:

  1. Clone the repository and create a new branch
  2. Make your changes, following the coding style guidelines
  3. Create a pull request with a detailed description of your changes
  4. Wait for your pull request to be reviewed and approved
  5. Once approved, your changes will be merged and available in the main branch

When contributing to this repository, please first discuss the change you wish to make via issue, email, or any other method with the owners of this repository before making a change.

Please note we have a code of conduct, please follow it in all your interactions with the project.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

open_ksa-0.1.4.tar.gz (25.9 kB view details)

Uploaded Source

Built Distribution

open_ksa-0.1.4-py3-none-any.whl (13.4 kB view details)

Uploaded Python 3

File details

Details for the file open_ksa-0.1.4.tar.gz.

File metadata

  • Download URL: open_ksa-0.1.4.tar.gz
  • Upload date:
  • Size: 25.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for open_ksa-0.1.4.tar.gz
Algorithm Hash digest
SHA256 730189ed30f1c2e70921e13eee338a3bad4cc29de47ff831a619e17a7a3a0c27
MD5 4fb0100932379f991cb61c39285cf555
BLAKE2b-256 144c6ad3059dc8acf8376e3cbe0f265e2a3e43ab7a01a082eb4f8ebef1b9a2ae

See more details on using hashes here.

File details

Details for the file open_ksa-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: open_ksa-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 13.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for open_ksa-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 8b84b146ff4a12349571439cce54fe2ac5bff4db825815406a2554edee0b8b06
MD5 be21bd369f24b0f9b8c2d5846829b49b
BLAKE2b-256 e503d2e1861456d42abfaf1ebee357cc35dd884a994d4b0a72428876f7d8b8f5

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page