Skip to main content

Commands to download datasets from the Open Data portal of the Kingdom of Saudi Arabia (KSA).

Project description

Open Data - Kingdom of Saudi Arabia

This repository contains scripts to download datasets from the Open Data portal of the Kingdom of Saudi Arabia (KSA). The main script downloads all datasets for a given organization ID and saves them locally.

Directory Structure

.
├── README.md
├── download_all_org.py
├── opendata (optional - parameter in file)
├── requirements.txt
├── open_ksa
│   ├── download_file.py
│   ├── organizations.py
│   ├── get_dataset_resources.py
│   └── get_org_resources.py
└── system-drawing.excalidraw

Functions Overview

  • organizations(): Get the organization information, including the option to write to a target file as a CSV or JSON.
  • get_org_resources(org_id): Retrieves the organization name, organization ID, and dataset IDs for the specified organization.
  • get_dataset_resources(dataset_ids, allowed_exts=['csv', 'xlsx', 'xls'], output_dir='opendata/org_resources', verbose=False): Downloads all data resources for the specified dataset IDs.
  • download_file(session, url, headers, file_path): Downloads a file from the specified URL using the provided session and headers.

Process Flow

graph TD
    subgraph Initialization
        A[Start] --> B[Create SSL Adapter]
        B --> C[Setup Session]
    end

    subgraph Resource Extraction
        C --> D[Extract Organization ID]
        D --> E[Extract Dataset IDs]
    end

    subgraph Directory Setup
        E --> F[Create Directory for Organization]
    end

    subgraph Data Download
        F --> G[Download Dataset Resources]
        G --> H[Save Data Locally]
    end

    H --> I[End]

Usage

To run the script with the dependencies, first install the virtualenv:

python -m venv venv

pip install -r requirements.txt

Then you'll be able to run the python primary python script successfully:

python download_all_org.py

NOTE: For a different organization, you need to update the parameter in the file for the org_id parameter in the function

Release Plan / To DO

  • Create a set of functions to cover the entire API, including:
    • Create a function to get the list of organizations
    • Create a function to get the list of datasets for an organization
    • Create a function to get the list of resources for a dataset
    • Create a function to download a resource
    • Create a function to check the status of a download
  • Create a set of unit tests for the functions
  • Create a set of examples for the functions
  • Create a set of documentation for the functions
  • Move the repository to a PyPi library

Contribution

The contribution process is as follows:

  1. Clone the repository and create a new branch
  2. Make your changes, following the coding style guidelines
  3. Create a pull request with a detailed description of your changes
  4. Wait for your pull request to be reviewed and approved
  5. Once approved, your changes will be merged and available in the main branch

When contributing to this repository, please first discuss the change you wish to make via issue, email, or any other method with the owners of this repository before making a change.

Please note we have a code of conduct, please follow it in all your interactions with the project.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

open_ksa-0.1.3.tar.gz (25.9 kB view details)

Uploaded Source

Built Distribution

open_ksa-0.1.3-py3-none-any.whl (13.4 kB view details)

Uploaded Python 3

File details

Details for the file open_ksa-0.1.3.tar.gz.

File metadata

  • Download URL: open_ksa-0.1.3.tar.gz
  • Upload date:
  • Size: 25.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for open_ksa-0.1.3.tar.gz
Algorithm Hash digest
SHA256 8fb609d6d758b38ddefa0e7265b02f91f6bb571870e01a96c59a551b58a18340
MD5 c1b6224984995a80f55dec7f1be7d4ef
BLAKE2b-256 53f1836d018da0d68a31208e817f7e18f9f4ed6ef057874aa6f938518df75b74

See more details on using hashes here.

File details

Details for the file open_ksa-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: open_ksa-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 13.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for open_ksa-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 913ac50a5b8ab28108e18cc163fba7867111fae242af419a858593b09ed9534e
MD5 15d4b631a124e12b85405be69b4360ce
BLAKE2b-256 f3497904a4afae3c33323b0b2fdb20e6b08a6fadb24625f42a1584256d79c840

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page