Skip to main content

Language Crawler

Project description

langcrawler

Actions Status PyPI License

langcrawler is a language crawler written in Python.

See Lang Fetcher as a language fetcher from langcrawler.

Requirement

  • PostgreSQL >= 12.4
  • Python >= 3.8
  • Redis >= 6.0

Installation

Ubuntu

apt update
apt install -y python3-dev python3-pip python3-setuptools
pip install langcrawler

Windows

# Install Python, Microsoft Visual C++ and Windows SDK
pip install langcrawler

Updating

pip install langcrawler --upgrade

Running

langcrawler \
  --pg-address 127.0.0.1:5432 \
  --pg-login postgres/postgres \
  --redis-address 127.0.0.1:6379 \
  --redis-pass redis \
  --repo-count 1 \
  --repo-host gerrit,github,gitlab \
  --repo-lang go,javascript,php,python,rust,typescript

Usage

usage: crawler.py [-h] [--pg-address PG_ADDRESS] [--pg-login PG_LOGIN]
                  [--redis-address REDIS_ADDRESS] [--redis-pass REDIS_PASS]
                  [--repo-count REPO_COUNT] [--repo-host REPO_HOST]
                  [--repo-lang REPO_LANG] [-v]

Language Crawler

optional arguments:
  -h, --help            show this help message and exit
  --pg-address PG_ADDRESS
                        postgres address (host:port)
  --pg-login PG_LOGIN   postgres login (user/pass)
  --redis-address REDIS_ADDRESS
                        redis address (host:port)
  --redis-pass REDIS_PASS
                        redis pass
  --repo-count REPO_COUNT
                        repository count
  --repo-host REPO_HOST
                        repository host
  --repo-lang REPO_LANG
                        repository language
  -v, --version         show program's version number and exit

Design

design

License

Project License can be found here.

Reference

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

langcrawler-0.0.4.tar.gz (11.6 kB view details)

Uploaded Source

Built Distribution

langcrawler-0.0.4-py2.py3-none-any.whl (16.9 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file langcrawler-0.0.4.tar.gz.

File metadata

  • Download URL: langcrawler-0.0.4.tar.gz
  • Upload date:
  • Size: 11.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.50.0 CPython/3.6.9

File hashes

Hashes for langcrawler-0.0.4.tar.gz
Algorithm Hash digest
SHA256 c70476d723f84739b84fd8bec2c3fadad0077c2889dbbeb870961e657409bdab
MD5 34224a2b94bcc303fd6792980458348c
BLAKE2b-256 cc4c34cd72717d5a066ffcfbdc8f05970d861d1596ecc3279608dd447746c608

See more details on using hashes here.

File details

Details for the file langcrawler-0.0.4-py2.py3-none-any.whl.

File metadata

  • Download URL: langcrawler-0.0.4-py2.py3-none-any.whl
  • Upload date:
  • Size: 16.9 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.50.0 CPython/3.6.9

File hashes

Hashes for langcrawler-0.0.4-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 0fb4547acb835f42b3c6fe5eb0573e1e03901b02ba715750616c030aec8e2fca
MD5 d8c37457a15ffaf67b82679fa95cd124
BLAKE2b-256 f87eb3dd26d76d51dc70b8abf31741b6b6af9bc79e62da6a5d8d1a7d3882ddce

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page