Skip to main content

A crawler for extracting Maven coordinates

Project description

Maven Crawler

This is a tool for crawling Maven repositories and gathering Maven coordinates. It can be used for research and education purposes.

Installation

Requirements

Quick install

pip install mvncrawler

Usage

mvncrawler --p ./maven/ --q q_items.txt  --t "fasten.mvn.pkg" --c 5 --l 10

It extracts 10 Maven coordinates.

  • Use --help option to see the description of each arguments.
  • If you do not have a Kafka server on your machine, add --no-kafka option to the tool for saving Maven coordinates in a file.
  • You can remove --l 10 option to extract Maven coordinates without a limit.

Output format

Extracted Maven coordinates are converted to a JSON-compatible string as shown and described below:

{"groupId": "com.yahoo.vespa", "artifactId": "zookeeper-server-common", "version": "7.171.10", "date": "1580860140", "url": "https://repo1.maven.org/maven2/com/yahoo/vespa/zookeeper-server-common/7.171.10/zookeeper-server-common-7.171.10.pom"}
  • groupId: The specified groupID in a POM file.
  • artifactId: The specified artifactID in a POM file.
  • version: The version of a Maven package as specified in its POM file.
  • date: The release date of a Maven package in Unix epoch format.
  • url: The URL of a POM file on the Maven server.

Disclaimer

We are NOT responsible for any damage or the misuse of this tool.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mvncrawler-0.1.0.tar.gz (6.4 kB view details)

Uploaded Source

Built Distribution

mvncrawler-0.1.0-py3-none-any.whl (12.1 kB view details)

Uploaded Python 3

File details

Details for the file mvncrawler-0.1.0.tar.gz.

File metadata

  • Download URL: mvncrawler-0.1.0.tar.gz
  • Upload date:
  • Size: 6.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.39.0 CPython/3.7.4

File hashes

Hashes for mvncrawler-0.1.0.tar.gz
Algorithm Hash digest
SHA256 d99dd61dff51521d48b57f4467f75842a4eaf0f1c32ac4f3a814fb76e75d0db8
MD5 d4ac5098def48083745a6e1afc5739c7
BLAKE2b-256 bbb8601132a377ed952b1af674bd68735c13c815f3f5dda7bdc0ccf17afd5fe4

See more details on using hashes here.

File details

Details for the file mvncrawler-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: mvncrawler-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 12.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.39.0 CPython/3.7.4

File hashes

Hashes for mvncrawler-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 899738ec36c17d55e876076e2e29864c220c53241fec1d2c4838674ef2e8c197
MD5 587ee35cecc0b1a8ac197a510220b328
BLAKE2b-256 0a5f1185872b299765bf9ad76cde98ce09463d265ad38025b0357bfb4275d894

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page