Skip to main content

Commands to download datasets from the Open Data portal of the Kingdom of Saudi Arabia (KSA).

Project description

Open Data - Kingdom of Saudi Arabia

This repository contains scripts to download datasets from the Open Data portal of the Kingdom of Saudi Arabia (KSA). The main script downloads all datasets for a given organization ID and saves them locally.

Directory Structure

.
├── README.md
├── download_all_org.py
├── opendata (optional - parameter in file)
├── requirements.txt
├── open_ksa
│   ├── download_file.py
│   ├── organizations.py
│   ├── get_dataset_resources.py
│   └── get_org_resources.py
└── system-drawing.excalidraw

Functions Overview

  • organizations(): Get the organization information, including the option to write to a target file as a CSV or JSON.
  • get_org_resources(org_id): Retrieves the organization name, organization ID, and dataset IDs for the specified organization.
  • get_dataset_resources(dataset_ids, allowed_exts=['csv', 'xlsx', 'xls'], output_dir='opendata/org_resources', verbose=False): Downloads all data resources for the specified dataset IDs.
  • download_file(session, url, headers, file_path): Downloads a file from the specified URL using the provided session and headers.

Process Flow

graph TD
    subgraph Initialization
        A[Start] --> B[Create SSL Adapter]
        B --> C[Setup Session]
    end

    subgraph Resource Extraction
        C --> D[Extract Organization ID]
        D --> E[Extract Dataset IDs]
    end

    subgraph Directory Setup
        E --> F[Create Directory for Organization]
    end

    subgraph Data Download
        F --> G[Download Dataset Resources]
        G --> H[Save Data Locally]
    end

    H --> I[End]

Usage

To run the script with the dependencies, first install the virtualenv:

python -m venv venv

pip install -r requirements.txt

Then you'll be able to run the python primary python script successfully:

python download_all_org.py

NOTE: For a different organization, you need to update the parameter in the file for the org_id parameter in the function

Release Plan / To DO

  • Create a set of functions to cover the entire API, including:
    • Create a function to get the list of organizations
    • Create a function to get the list of datasets for an organization
    • Create a function to get the list of resources for a dataset
    • Create a function to download a resource
    • Create a function to check the status of a download
  • Create a set of unit tests for the functions
  • Create a set of examples for the functions
  • Create a set of documentation for the functions
  • Move the repository to a PyPi library

Contribution

The contribution process is as follows:

  1. Clone the repository and create a new branch
  2. Make your changes, following the coding style guidelines
  3. Create a pull request with a detailed description of your changes
  4. Wait for your pull request to be reviewed and approved
  5. Once approved, your changes will be merged and available in the main branch

When contributing to this repository, please first discuss the change you wish to make via issue, email, or any other method with the owners of this repository before making a change.

Please note we have a code of conduct, please follow it in all your interactions with the project.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

open_ksa-0.1.2.tar.gz (25.6 kB view details)

Uploaded Source

Built Distribution

open_ksa-0.1.2-py3-none-any.whl (13.1 kB view details)

Uploaded Python 3

File details

Details for the file open_ksa-0.1.2.tar.gz.

File metadata

  • Download URL: open_ksa-0.1.2.tar.gz
  • Upload date:
  • Size: 25.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for open_ksa-0.1.2.tar.gz
Algorithm Hash digest
SHA256 78c2e55707de0c7d4468ed0f4363f58b5e6e044ad8af2a28b1fd919343c507a4
MD5 b559ce260c7eeb2a48c29107684d6bf6
BLAKE2b-256 38b62bcd68b4efcc3e29c3b7ee16647fe41f2300059d162af6bff99a007e82ed

See more details on using hashes here.

File details

Details for the file open_ksa-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: open_ksa-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 13.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for open_ksa-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 783603a79462071a4827f54e98f5e4c195fb6a92281c40f1ef20c0b1d0e02f7a
MD5 227b5014e6879908c794b0678436a5d5
BLAKE2b-256 abec4945046637cbc1b39c06e85fd0d17e1b93a10d869e54b0ebd742803e15b6

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page