Skip to main content

Crawling all GEO metadata.

Project description

geo-spider

crawl all GEO metadata, features:

  1. crawl platforms

  2. crawl samples

  3. crawl series

  4. incremental crawling

  5. installation

  6. output file format

  7. platforms

  8. samples

  9. series

installation

pip install geo-spider

output file format

geo-spider saves files in jsonlines form, Refer to this site for details.

platforms

platforms denovo crawling

geo-spider platforms -o platforms.jl

platforms incremental crawling

If you have a crawled platforms jsonlines file:

geo-spider platforms -cf platforms.jl -o new-platforms.jl

If you have multiple platforms jsonlines files:

geo-spider platforms -cd platforms -o new-platforms.jl

samples

samples denovo crawling

samples incremental crawling

series

series denovo crawling

series incremental crawling

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

geo-spider-0.0.1.tar.gz (4.1 kB view hashes)

Uploaded Source

Built Distributions

geo_spider-0.0.1-py3.7.egg (6.2 kB view hashes)

Uploaded Source

geo_spider-0.0.1-py3-none-any.whl (3.6 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page