Skip to main content

A small package that contains commonly used codes while scraping

Project description

Scrap Utils

This is small package that contains some code regularly repeated when scraping.

To install

pip install scrap-utils

Sample code

import scrap_utils as su

response = su.requests_get("https://python.org")
len(response.text)

It has the following functions:

load_json(filepath, encoding=None, errors=None, parse_float=None,
	parse_int=None, parse_constant=None)

dump_json(data, filepath, encoding=None, errors=None, indent=4, skipkeys=False,
	ensure_ascii=False, separators=None, sort_keys=False)

to_csv(dataset, filepath, mode="a", encoding=None, errors=None, newline='',
	header=True, dialect='excel', **fmtparams)

requests_get(url, trials=0, sleep_time=30, max_try=math.inf, **requests_kwargs)

requests_post(url, trials=0, sleep_time=30, max_try=math.inf, **requests_kwargs)

To-do list I'm considering:

  • remove print statements
  • add unittest
  • soup_get()
  • driver_get()
  • start_firefox()
  • read_csv()

Feel free to add your contribution here

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scrap-utils-0.0.2.tar.gz (3.5 kB view details)

Uploaded Source

Built Distribution

scrap_utils-0.0.2-py3-none-any.whl (3.9 kB view details)

Uploaded Python 3

File details

Details for the file scrap-utils-0.0.2.tar.gz.

File metadata

  • Download URL: scrap-utils-0.0.2.tar.gz
  • Upload date:
  • Size: 3.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.58.0 CPython/3.9.1

File hashes

Hashes for scrap-utils-0.0.2.tar.gz
Algorithm Hash digest
SHA256 c7de35ab09292d19966c35e57099124489fc09be9580ab52cccae8e16b8b4c41
MD5 8e4a5a70181dcc7d682aaeb448ed4d43
BLAKE2b-256 98a0f76b97f323bc4644a114d1f2866bb3d816935a3353e065afedea06b61902

See more details on using hashes here.

File details

Details for the file scrap_utils-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: scrap_utils-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 3.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.58.0 CPython/3.9.1

File hashes

Hashes for scrap_utils-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 df812d0576a405229b60f10e21d9d9c54c0d732ce676dd9896388e68e79ad907
MD5 a41c6f68a7134cba2371f6d4199c43cb
BLAKE2b-256 606411f9ac926aa43ae783fd5de3e0fa4e79d76c97d9ec1c956b7e8a64a167c6

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page