A crawler for extracting Maven coordinates
Maven Crawler
This is a tool for crawling Maven repositories and gathering Maven coordinates. It can be used for research and educational purposes.
Installation
Requirements
- Python 3.5 or newer
- Apache Kafka (optional)
Quick install
pip install mvncrawler
Usage
mvncrawler --p ./maven/ --q q_items.txt --t "fasten.mvn.pkg" --c 5 --l 10
The command above extracts 10 Maven coordinates.
- Use the --help option to see the description of each argument.
- If you do not have a Kafka server on your machine, add the --no-kafka option to save the Maven coordinates in a file instead.
- You can remove the --l 10 option to extract Maven coordinates without a limit.
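When the crawler is driven from a script, the command line can be assembled programmatically. The following is a minimal sketch, not part of mvncrawler itself: `build_mvncrawler_command` is a hypothetical helper, and its parameter names simply mirror the flags shown above, since their exact meaning is only documented by --help. It also assumes that `--t` may still be passed together with `--no-kafka`.

```python
import shlex


def build_mvncrawler_command(p, q, t, c, limit=None, no_kafka=False):
    """Assemble an argv list for the mvncrawler CLI (hypothetical helper).

    The parameters p, q, t and c map one-to-one to the --p, --q, --t and --c
    flags from the usage example; run `mvncrawler --help` for their meaning.
    """
    cmd = ["mvncrawler", "--p", str(p), "--q", str(q), "--t", str(t), "--c", str(c)]
    if limit is not None:
        # Omitting --l extracts coordinates without a limit.
        cmd += ["--l", str(limit)]
    if no_kafka:
        # Save coordinates to a file instead of sending them to Kafka.
        cmd.append("--no-kafka")
    return cmd


cmd = build_mvncrawler_command(
    "./maven/", "q_items.txt", "fasten.mvn.pkg", 5, limit=10, no_kafka=True
)
# Print a shell-safe version of the command for inspection.
print(" ".join(shlex.quote(part) for part in cmd))
```

The resulting list can be handed to `subprocess.run(cmd, check=True)` once mvncrawler is installed.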
Output format
Extracted Maven coordinates are converted to a JSON-compatible string as shown and described below:
{"groupId": "com.yahoo.vespa", "artifactId": "zookeeper-server-common", "version": "7.171.10", "date": "1580860140", "url": "https://repo1.maven.org/maven2/com/yahoo/vespa/zookeeper-server-common/7.171.10/zookeeper-server-common-7.171.10.pom"}
- groupId: The groupId specified in the POM file.
- artifactId: The artifactId specified in the POM file.
- version: The version of the Maven package as specified in its POM file.
- date: The release date of the Maven package in Unix epoch format.
- url: The URL of the POM file on the Maven server.
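A record in this format can be consumed with the Python standard library alone. The sketch below parses the sample record shown above, converts the `date` field to a UTC datetime, and rebuilds the POM URL from the coordinates using the standard Maven repository layout. The names `parse_record` and `pom_url` are hypothetical helpers and not part of mvncrawler, and the `released` key is added here purely for illustration.

```python
import json
from datetime import datetime, timezone

MAVEN_CENTRAL = "https://repo1.maven.org/maven2/"


def parse_record(line):
    """Parse one JSON record and attach the release date as a UTC datetime."""
    rec = json.loads(line)
    # "date" is a Unix epoch timestamp stored as a string.
    rec["released"] = datetime.fromtimestamp(int(rec["date"]), tz=timezone.utc)
    return rec


def pom_url(group_id, artifact_id, version, base=MAVEN_CENTRAL):
    """Build a POM URL following the standard Maven repository layout."""
    file_name = "{0}-{1}.pom".format(artifact_id, version)
    path = "/".join([group_id.replace(".", "/"), artifact_id, version, file_name])
    return base + path


sample = (
    '{"groupId": "com.yahoo.vespa", "artifactId": "zookeeper-server-common", '
    '"version": "7.171.10", "date": "1580860140", "url": '
    '"https://repo1.maven.org/maven2/com/yahoo/vespa/zookeeper-server-common/'
    '7.171.10/zookeeper-server-common-7.171.10.pom"}'
)

record = parse_record(sample)
print(record["released"].isoformat())
print(pom_url(record["groupId"], record["artifactId"], record["version"]))
```

For this sample, the rebuilt URL is identical to the `url` field, which makes the helper a simple consistency check on crawled records.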
Disclaimer
We are NOT responsible for any damage caused by this tool or for any misuse of it.
File details
Details for the file mvncrawler-0.1.0.tar.gz.
File metadata
- Download URL: mvncrawler-0.1.0.tar.gz
- Upload date:
- Size: 6.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.39.0 CPython/3.7.4
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | d99dd61dff51521d48b57f4467f75842a4eaf0f1c32ac4f3a814fb76e75d0db8 |
| MD5 | d4ac5098def48083745a6e1afc5739c7 |
| BLAKE2b-256 | bbb8601132a377ed952b1af674bd68735c13c815f3f5dda7bdc0ccf17afd5fe4 |
File details
Details for the file mvncrawler-0.1.0-py3-none-any.whl.
File metadata
- Download URL: mvncrawler-0.1.0-py3-none-any.whl
- Upload date:
- Size: 12.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.39.0 CPython/3.7.4
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 899738ec36c17d55e876076e2e29864c220c53241fec1d2c4838674ef2e8c197 |
| MD5 | 587ee35cecc0b1a8ac197a510220b328 |
| BLAKE2b-256 | 0a5f1185872b299765bf9ad76cde98ce09463d265ad38025b0357bfb4275d894 |