A small package that contains commonly used codes while scraping
Project description
Scrap Utils
This is small package that contains some code regularly repeated when scraping.
To install
pip install scrap-utils
Sample code
import scrap_utils as su
response = su.requests_get("https://python.org")
len(response.text)
It has the following functions:
load_json(filepath, encoding=None, errors=None, parse_float=None,
parse_int=None, parse_constant=None)
dump_json(data, filepath, encoding=None, errors=None, indent=4, skipkeys=False,
ensure_ascii=False, separators=None, sort_keys=False)
to_csv(dataset, filepath, mode="a", encoding=None, errors=None, newline='',
header=True, dialect='excel', **fmtparams)
requests_get(url, trials=0, sleep_time=30, max_try=math.inf, **requests_kwargs)
requests_post(url, trials=0, sleep_time=30, max_try=math.inf, **requests_kwargs)
To-do list I'm considering:
- remove print statements
- add unittest
- soup_get()
- driver_get()
- start_firefox()
- read_csv()
Feel free to add your contribution here
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
scrap-utils-0.0.2.tar.gz
(3.5 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file scrap-utils-0.0.2.tar.gz.
File metadata
- Download URL: scrap-utils-0.0.2.tar.gz
- Upload date:
- Size: 3.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.58.0 CPython/3.9.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c7de35ab09292d19966c35e57099124489fc09be9580ab52cccae8e16b8b4c41
|
|
| MD5 |
8e4a5a70181dcc7d682aaeb448ed4d43
|
|
| BLAKE2b-256 |
98a0f76b97f323bc4644a114d1f2866bb3d816935a3353e065afedea06b61902
|
File details
Details for the file scrap_utils-0.0.2-py3-none-any.whl.
File metadata
- Download URL: scrap_utils-0.0.2-py3-none-any.whl
- Upload date:
- Size: 3.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.58.0 CPython/3.9.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
df812d0576a405229b60f10e21d9d9c54c0d732ce676dd9896388e68e79ad907
|
|
| MD5 |
a41c6f68a7134cba2371f6d4199c43cb
|
|
| BLAKE2b-256 |
606411f9ac926aa43ae783fd5de3e0fa4e79d76c97d9ec1c956b7e8a64a167c6
|