A client for scraping and parsing the Periodic Review Secretariat's web page for Guananamo detainees.
Project description
>From their website: “The Periodic Review Secretariat (PRS) develops and administers the periodic review process for eligible Guantanamo Bay detainees, including provision of personal representatives to detainees.”
Usage
The PRB engages in three different forms of review for a Guantanamo detainee”s documents: An initial review, a file review and a full review. Technically, a fourth type, the subsequent full review, is available. To date, no subsequent full reviews have been posted.
Initial review
initial_review --csv > initial_review.csv
initial_review --json > initial_review.json
initial_review --tsv > initial_review.tsv
File review
file_review --csv > file_review.csv
file_review --json > file_review.json
file_review --tsv > file_review.tsv
Full review
full_review --csv > full_review.csv
full_review --json > full_review.json
full_review --tsv > full_review.tsv
Schema
Returns a row or an object for every document. Each document contains the document-specific fields like type_name, type_id, and url as well as the detainee-specific fields like name and isn. Builds a unique id for each document from isn-type_id-hearing_or_review_date.
[
{
"review_type": "full-review",
"review_url": "http://www.prs.mil/Review-Information/Initial-Review/",
"hearing_or_review_date":"2014-11-05",
"denial":null,
"name":"Abdel Malik Ahmed Abdel Wahab Al Rahabi",
"type_id":"1",
"url":"http:\/\/www.prs.mil\/Portals\/60\/Documents\/ISN037\/141105_U_ISN037_GOVERNMENT'S_UNCLASSIFIED_SUMMARY_PUBLIC.pdf",
"type_name":"Government's Unclassified Summary",
"id":"037-initial-review-1-2014-11-05",
"isn":"037",
"denied":false,
"notification_date":"2014-08-26",
"final_determination_date":"2014-12-05"
},
{
"review_type": "full-review",
"review_url": "http://www.prs.mil/Review-Information/Initial-Review/",
"hearing_or_review_date":"2014-11-05",
"denial":null,
"name":"Abdel Malik Ahmed Abdel Wahab Al Rahabi",
"type_id":"2",
"url":"http:\/\/www.prs.mil\/Portals\/60\/Documents\/ISN037\/141105_U_ISN037_PR_STATEMENT_PRB.pdf",
"type_name":"Opening Statements of Detainee's Representatives",
"id":"037-initial-review-2-2014-11-05",
"isn":"037",
"denied":false,
"notification_date":"2014-08-26",
"final_determination_date":"2014-12-05"
},
{
"review_type": "full-review",
"review_url": "http://www.prs.mil/Review-Information/Initial-Review/",
"hearing_or_review_date":"2014-11-05",
"denial":null,
"name":"Abdel Malik Ahmed Abdel Wahab Al Rahabi",
"type_id":"3",
"url":"http:\/\/www.prs.mil\/Portals\/60\/Documents\/ISN037\/141216_U_ISN037_DETAINEE_WRITTEN_SUBMISSION_PUBLIC.pdf",
"type_name":"Detainee's Written Submission",
"id":"037-initial-review-3-2014-11-05",
"isn":"037",
"denied":false,
"notification_date":"2014-08-26",
"final_determination_date":"2014-12-05"
},
{
"review_type": "full-review",
"review_url": "http://www.prs.mil/Review-Information/Initial-Review/",
"hearing_or_review_date":"2014-11-05",
"denial":null,
"name":"Abdel Malik Ahmed Abdel Wahab Al Rahabi",
"type_id":"4",
"url":"http:\/\/www.prs.mil\/LinkClick.aspx?fileticket=RFOMdQD69k4%3d&tabid=8447&portalid=60&mid=20067",
"type_name":"Transcript of Public Session",
"id":"037-initial-review-4-2014-11-05",
"isn":"037",
"denied":false,
"notification_date":"2014-08-26",
"final_determination_date":"2014-12-05"
},
{
"review_type": "full-review",
"review_url": "http://www.prs.mil/Review-Information/Initial-Review/",
"hearing_or_review_date":"2014-11-05",
"denial":null,
"name":"Abdel Malik Ahmed Abdel Wahab Al Rahabi",
"type_id":"5",
"url":"http:\/\/www.prs.mil\/Portals\/60\/Documents\/ISN037\/141105_U_ISN037_TRANSCRIPT_OF_DETAINEE_SESSION_PUBLIC.pdf",
"type_name":"Transcript of Detainee Session",
"id":"037-initial-review-5-2014-11-05",
"isn":"037",
"denied":false,
"notification_date":"2014-08-26",
"final_determination_date":"2014-12-05"
},
{
"review_type": "full-review",
"review_url": "http://www.prs.mil/Review-Information/Initial-Review/",
"hearing_or_review_date":"2014-11-05",
"denial":null,
"name":"Abdel Malik Ahmed Abdel Wahab Al Rahabi",
"type_id":"6",
"url":"http:\/\/www.prs.mil\/LinkClick.aspx?fileticket=s0XT-7qYc94%3d&tabid=8447&portalid=60&mid=20067",
"type_name":"Unclassified Summary of Final Determination",
"id":"037-initial-review-6-2014-11-05",
"isn":"037",
"denied":false,
"notification_date":"2014-08-26",
"final_determination_date":"2014-12-05"
}
]
Output
The scrapers can return CSV, JSON or TSV. The default if no options are passed is CSV.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file nyt-prb-scraper-0.0.10.tar.gz
.
File metadata
- Download URL: nyt-prb-scraper-0.0.10.tar.gz
- Upload date:
- Size: 4.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 94f9dcb065a2b48533bf4846750329e7d08bc4895cb3f4a9856d7c51956c4145 |
|
MD5 | 20dcca0da7e8130872fe7f691581bfcd |
|
BLAKE2b-256 | 1386de45ddda8e209a1a0924d6b5ab52817df421cae5371620899d4c01e41b4a |
File details
Details for the file nyt_prb_scraper-0.0.10-py3-none-any.whl
.
File metadata
- Download URL: nyt_prb_scraper-0.0.10-py3-none-any.whl
- Upload date:
- Size: 6.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6dd198d58affd2f198dcc1c7ac472aec4842c8e2462f72b6f4e9b835f455e10f |
|
MD5 | 40a3cff8691a78f4dac4e22b491be51b |
|
BLAKE2b-256 | 51dd61050859fd34ad70f1aaa97bb63c65f8438456839fe1b329d2f49ed496a8 |