Source: National Highway Traffic Safety Administration(NHTSA) and U.S. Department of Transportation.
1. Download the package from pypi or launchpad:
2. Unzip it:
tar -xzvf datahub.gov.dot.nhtsa.recall-0.2dev.tar.gz
2a. You can also use easy_install to downlaod the package:
easy_install -xmaxd . datahub.gov.dot.nhtsa.recall unzip datahub.gov.dot.nhtsa.recall
3. Go into the folder and read what needs to be setup:
cd cd datahub.gov.dot.nhtsa.recall/datahub/gov/dot/nhtsa/recall/ #Read Readme.txt.
4. After you have installed any necessary packages or programs, and setup your database you are ready to start the process:
5. When you start this script you should see results like:
Crawling Data Done Crawling Parsing Data Done Parsing Data Loading Data Done Loading Data
6. Now, this package has downloaded the recall files, unzipped it, parsed it and loaded it to a database you specified in load.py in about 5 minutes depending on your speed. Try comparing that to weeks to figure out where, how and what structure this data has.
[Sample]Here is a website that uses this package to load new recalls every month.<http://lucasmanual.com/recall>
Feedback is always welcomed. If you don’t agree with data structure, or would like to make a improvements please send a patch and we will add it in.
TODO: Figure out how to actually get changelog content.
Changelog content for this version goes here.