A crawler for extracting Maven coordinates
Project description
Maven Crawler
This is a tool for crawling Maven repositories and gathering Maven coordinates. It can be used for research and education purposes.
Installation
Requirements
- Python 3.5 or newer
- Apache Kafka (optional)
Quick install
pip install mvncrawler
Usage
mvncrawler --p ./maven/ --q q_items.txt --t "fasten.mvn.pkg" --c 5 --l 10
It extracts 10 Maven coordinates.
- Use
--help
option to see the description of each arguments. - If you do not have a Kafka server on your machine, add
--no-kafka
option to the tool for saving Maven coordinates in a file. - You can remove
--l 10
option to extract Maven coordinates without a limit.
Output format
Extracted Maven coordinates are converted to a JSON-compatible string as shown and described below:
{"groupId": "com.yahoo.vespa", "artifactId": "zookeeper-server-common", "version": "7.171.10", "date": "1580860140", "url": "https://repo1.maven.org/maven2/com/yahoo/vespa/zookeeper-server-common/7.171.10/zookeeper-server-common-7.171.10.pom"}
groupId
: The specified groupID in a POM file.artifactId
: The specified artifactID in a POM file.version
: The version of a Maven package as specified in its POM file.date
: The release date of a Maven package in Unix epoch format.url
: The URL of a POM file on the Maven server.
Disclaimer
We are NOT responsible for any damage or the misuse of this tool.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
mvncrawler-0.1.0.tar.gz
(6.4 kB
view details)
Built Distribution
File details
Details for the file mvncrawler-0.1.0.tar.gz
.
File metadata
- Download URL: mvncrawler-0.1.0.tar.gz
- Upload date:
- Size: 6.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.39.0 CPython/3.7.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d99dd61dff51521d48b57f4467f75842a4eaf0f1c32ac4f3a814fb76e75d0db8 |
|
MD5 | d4ac5098def48083745a6e1afc5739c7 |
|
BLAKE2b-256 | bbb8601132a377ed952b1af674bd68735c13c815f3f5dda7bdc0ccf17afd5fe4 |
File details
Details for the file mvncrawler-0.1.0-py3-none-any.whl
.
File metadata
- Download URL: mvncrawler-0.1.0-py3-none-any.whl
- Upload date:
- Size: 12.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.39.0 CPython/3.7.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 899738ec36c17d55e876076e2e29864c220c53241fec1d2c4838674ef2e8c197 |
|
MD5 | 587ee35cecc0b1a8ac197a510220b328 |
|
BLAKE2b-256 | 0a5f1185872b299765bf9ad76cde98ce09463d265ad38025b0357bfb4275d894 |