Skip to main content

A client for scraping and parsing the Periodic Review Secretariat's web page for Guananamo detainees.

Project description

>From their website: “The Periodic Review Secretariat (PRS) develops and administers the periodic review process for eligible Guantanamo Bay detainees, including provision of personal representatives to detainees.”

Usage

The PRB engages in three different forms of review for a Guantanamo detainee”s documents: An initial review, a file review and a full review. Technically, a fourth type, the subsequent full review, is available. To date, no subsequent full reviews have been posted.

Initial review

initial_review --csv > initial_review.csv
initial_review --json > initial_review.json
initial_review --tsv > initial_review.tsv

File review

file_review --csv > file_review.csv
file_review --json > file_review.json
file_review --tsv > file_review.tsv

Full review

full_review --csv > full_review.csv
full_review --json > full_review.json
full_review --tsv > full_review.tsv

Schema

Returns a row or an object for every document. Each document contains the document-specific fields like type_name, type_id, and url as well as the detainee-specific fields like name and isn. Builds a unique id for each document from isn-type_id-hearing_or_review_date.

[
    {
        "review_type": "full-review",
        "review_url": "http://www.prs.mil/Review-Information/Initial-Review/",
        "hearing_or_review_date":"2014-11-05",
        "denial":null,
        "name":"Abdel Malik Ahmed Abdel Wahab Al Rahabi",
        "type_id":"1",
        "url":"http:\/\/www.prs.mil\/Portals\/60\/Documents\/ISN037\/141105_U_ISN037_GOVERNMENT'S_UNCLASSIFIED_SUMMARY_PUBLIC.pdf",
        "type_name":"Government's Unclassified Summary",
        "id":"037-initial-review-1-2014-11-05",
        "isn":"037",
        "denied":false,
        "notification_date":"2014-08-26",
        "final_determination_date":"2014-12-05"
    },
    {
        "review_type": "full-review",
        "review_url": "http://www.prs.mil/Review-Information/Initial-Review/",
        "hearing_or_review_date":"2014-11-05",
        "denial":null,
        "name":"Abdel Malik Ahmed Abdel Wahab Al Rahabi",
        "type_id":"2",
        "url":"http:\/\/www.prs.mil\/Portals\/60\/Documents\/ISN037\/141105_U_ISN037_PR_STATEMENT_PRB.pdf",
        "type_name":"Opening Statements of Detainee's Representatives",
        "id":"037-initial-review-2-2014-11-05",
        "isn":"037",
        "denied":false,
        "notification_date":"2014-08-26",
        "final_determination_date":"2014-12-05"
    },
    {
        "review_type": "full-review",
        "review_url": "http://www.prs.mil/Review-Information/Initial-Review/",
        "hearing_or_review_date":"2014-11-05",
        "denial":null,
        "name":"Abdel Malik Ahmed Abdel Wahab Al Rahabi",
        "type_id":"3",
        "url":"http:\/\/www.prs.mil\/Portals\/60\/Documents\/ISN037\/141216_U_ISN037_DETAINEE_WRITTEN_SUBMISSION_PUBLIC.pdf",
        "type_name":"Detainee's Written Submission",
        "id":"037-initial-review-3-2014-11-05",
        "isn":"037",
        "denied":false,
        "notification_date":"2014-08-26",
        "final_determination_date":"2014-12-05"
    },
    {
        "review_type": "full-review",
        "review_url": "http://www.prs.mil/Review-Information/Initial-Review/",
        "hearing_or_review_date":"2014-11-05",
        "denial":null,
        "name":"Abdel Malik Ahmed Abdel Wahab Al Rahabi",
        "type_id":"4",
        "url":"http:\/\/www.prs.mil\/LinkClick.aspx?fileticket=RFOMdQD69k4%3d&tabid=8447&portalid=60&mid=20067",
        "type_name":"Transcript of Public Session",
        "id":"037-initial-review-4-2014-11-05",
        "isn":"037",
        "denied":false,
        "notification_date":"2014-08-26",
        "final_determination_date":"2014-12-05"
    },
    {
        "review_type": "full-review",
        "review_url": "http://www.prs.mil/Review-Information/Initial-Review/",
        "hearing_or_review_date":"2014-11-05",
        "denial":null,
        "name":"Abdel Malik Ahmed Abdel Wahab Al Rahabi",
        "type_id":"5",
        "url":"http:\/\/www.prs.mil\/Portals\/60\/Documents\/ISN037\/141105_U_ISN037_TRANSCRIPT_OF_DETAINEE_SESSION_PUBLIC.pdf",
        "type_name":"Transcript of Detainee Session",
        "id":"037-initial-review-5-2014-11-05",
        "isn":"037",
        "denied":false,
        "notification_date":"2014-08-26",
        "final_determination_date":"2014-12-05"
    },
    {
        "review_type": "full-review",
        "review_url": "http://www.prs.mil/Review-Information/Initial-Review/",
        "hearing_or_review_date":"2014-11-05",
        "denial":null,
        "name":"Abdel Malik Ahmed Abdel Wahab Al Rahabi",
        "type_id":"6",
        "url":"http:\/\/www.prs.mil\/LinkClick.aspx?fileticket=s0XT-7qYc94%3d&tabid=8447&portalid=60&mid=20067",
        "type_name":"Unclassified Summary of Final Determination",
        "id":"037-initial-review-6-2014-11-05",
        "isn":"037",
        "denied":false,
        "notification_date":"2014-08-26",
        "final_determination_date":"2014-12-05"
    }
]

Output

The scrapers can return CSV, JSON or TSV. The default if no options are passed is CSV.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nyt-prb-scraper-0.0.10.tar.gz (4.1 kB view details)

Uploaded Source

Built Distribution

nyt_prb_scraper-0.0.10-py3-none-any.whl (6.6 kB view details)

Uploaded Python 3

File details

Details for the file nyt-prb-scraper-0.0.10.tar.gz.

File metadata

File hashes

Hashes for nyt-prb-scraper-0.0.10.tar.gz
Algorithm Hash digest
SHA256 94f9dcb065a2b48533bf4846750329e7d08bc4895cb3f4a9856d7c51956c4145
MD5 20dcca0da7e8130872fe7f691581bfcd
BLAKE2b-256 1386de45ddda8e209a1a0924d6b5ab52817df421cae5371620899d4c01e41b4a

See more details on using hashes here.

File details

Details for the file nyt_prb_scraper-0.0.10-py3-none-any.whl.

File metadata

File hashes

Hashes for nyt_prb_scraper-0.0.10-py3-none-any.whl
Algorithm Hash digest
SHA256 6dd198d58affd2f198dcc1c7ac472aec4842c8e2462f72b6f4e9b835f455e10f
MD5 40a3cff8691a78f4dac4e22b491be51b
BLAKE2b-256 51dd61050859fd34ad70f1aaa97bb63c65f8438456839fe1b329d2f49ed496a8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page