
Async wrapper for requests / aiohttp, plus some Python crawler toolkits. Lets synchronous code enjoy the performance of asynchronous programming. Read more: https://github.com/ClericPy/torequests.


torequests v4.8.15


Briefly speaking: a requests / aiohttp wrapper for asynchronous-programming beginners, designed to cut down the amount of code.

To install:

pip install torequests -U

requirements:

| requests
| futures # python2
| aiohttp >= 3.0.5 # python3
| uvloop  # python3
| jsonpath_rw_ext
| lxml
| cssselect
| objectpath

optional:

| psutil
| fuzzywuzzy
| python-Levenshtein
| pyperclip

Features

Inspired by tomorrow; makes async coding brief & smooth, compatible with win32 / Python 2 & 3.

  • convert any function into async mode with concurrent.futures
  • wrap the requests module with futures
  • simplify aiohttp, making it requests-like
  • some crawler toolkits

Getting started

1. Async, threads - make functions asynchronous

from torequests import threads, Async
import time


@threads(5)
def test1(n):
    time.sleep(n)
    return 'test1 ok'


def test2(n):
    time.sleep(n)
    return 'test2 ok'


start = int(time.time())
# async_test2 now behaves like the @threads-decorated test1
async_test2 = Async(test2)
future = test1(1)
# the future runs in a non-blocking thread pool
print(future, ', %s s passed' % (int(time.time() - start)))
# accessing future.x blocks the main thread and returns future.result()
print(future.x, ', %s s passed' % (int(time.time() - start)))
# output:
# <NewFuture at 0x34b1d30 state=running> , 0 s passed
# test1 ok , 1 s passed

2. tPool - thread pool for async-requests

from torequests.main import tPool
from torequests.logs import print_info

req = tPool()
test_url = 'http://p.3.cn'
ss = [
    req.get(
        test_url,
        retry=2,
        callback=lambda x: (len(x.content), print_info(len(x.content))))
    for i in range(3)
]
# wait for all requests to finish (equivalent to collecting [i.x for i in ss])
req.x
# i.cx is the callback result of each request
ss = [i.cx for i in ss]
print_info(ss)

# [2019-04-01 00:19:07] temp_code.py(10): 612
# [2019-04-01 00:19:07] temp_code.py(10): 612
# [2019-04-01 00:19:07] temp_code.py(10): 612
# [2019-04-01 00:19:07] temp_code.py(16): [(612, None), (612, None), (612, None)]

2.1 Test the performance (win32 + Python 3.7).

from torequests import tPool
import time

start_time = time.time()
trequests = tPool()
list1 = [
    trequests.get('http://127.0.0.1:5000/test/%s' % num) for num in range(5000)
]
# If a request fails, i.x may return False by default,
# or you can override the fail_return arg.
list2 = [i.x.text if i.x else 'fail' for i in list1]
end_time = time.time()
print(list2[:5], '\n5000 requests time cost: %s s' % (end_time - start_time))
# output:
# ['test ok 0', 'test ok 1', 'test ok 2', 'test ok 3', 'test ok 4'] 
# 5000 requests time cost: 5.906721591949463 s

3. Requests - aiohttp wrapper

from torequests.dummy import Requests
from torequests.logs import print_info
# frequencies throttles per-host load: (2, 2) means roughly at most
# 2 concurrent requests to 'p.3.cn' per 2-second interval
trequests = Requests(frequencies={'p.3.cn': (2, 2)})
ss = [
    trequests.get(
        'http://p.3.cn', retry=1, timeout=5,
        callback=lambda x: (len(x.content), print_info(trequests.frequencies)))
    for i in range(4)
]
trequests.x
ss = [i.cx for i in ss]
print_info(ss)

# [2019-04-01 00:16:35] temp_code.py(7): {'p.3.cn': Frequency(sem=<1/2>, interval=2)}
# [2019-04-01 00:16:35] temp_code.py(7): {'p.3.cn': Frequency(sem=<0/2>, interval=2)}
# [2019-04-01 00:16:37] temp_code.py(7): {'p.3.cn': Frequency(sem=<2/2>, interval=2)}
# [2019-04-01 00:16:37] temp_code.py(7): {'p.3.cn': Frequency(sem=<2/2>, interval=2)}
# [2019-04-01 00:16:37] temp_code.py(12): [<NewResponse [200]>, <NewResponse [200]>, <NewResponse [200]>, <NewResponse [200]>]

3.1 On win32 + Python 3.7, 5000 requests cost about 3.9 s; uvloop may make this much faster.

from torequests.dummy import Requests
import time

start_time = time.time()
trequests = Requests()
list1 = [
    trequests.get('http://127.0.0.1:5000/test/%s' % num) for num in range(5000)
]
# If a request fails, i.x may return False by default,
# or you can override the fail_return arg.
list2 = [i.x.text if i.x else 'fail' for i in list1]
end_time = time.time()
print(list2[:5], '\n5000 requests time cost:%s s' % (end_time - start_time))
# output:
# win32, without uvloop;
# ['test ok 0', 'test ok 1', 'test ok 2', 'test ok 3', 'test ok 4'] 
# 5000 requests time cost:3.909820079803467 s

3.2 Using torequests.dummy.Requests in an async environment.

import asyncio

from responder import API
from torequests.dummy import Requests

loop = asyncio.get_event_loop()
api = API()


@api.route('/')
async def index(req, resp):
    # awaiting returns the response, or a FailureException on failure
    r = await api.req.get('http://p.3.cn', timeout=(1, 1))
    print(r)
    if r:
        # truthy only for a good response (status_code between 200 and 299)
        resp.text = 'ok' if 'Welcome to nginx!' in r.text else 'bad'
    else:
        resp.text = 'fail'


if __name__ == "__main__":
    api.req = Requests(loop=loop)
    api.run(port=5000, loop=loop)

3.3 Mock server source code

from gevent.monkey import patch_all
patch_all()

import bottle

app = bottle.Bottle()


@app.get('/test/<num>')
def test(num):
    # return the same body format shown in the outputs above
    return 'test ok %s' % num


app.run(server='gevent', port=5000)

4. utils: some useful crawler toolkits (a short usage sketch follows this list)

ClipboardWatcher: watch the clipboard for changes.
Counts: a counter that increments each time it is called.
Null: returns itself when called and is always falsy.
Regex: a regex mapper for string -> regex -> object.
Saver: a simple object-persistence toolkit based on pickle/json.
Timer: a timing tool.
UA: some common User-Agent strings for crawlers.
curlparse: translate a curl command string into a dict of request arguments.
md5: str(obj) -> md5_string.
print_mem: show the process memory usage with psutil (only for laziness).
ptime: %Y-%m-%d %H:%M:%S -> timestamp.
ttime: timestamp -> %Y-%m-%d %H:%M:%S.
slice_by_size: slice a sequence into chunks of a given size, returned as a generator.
slice_into_pieces: slice a sequence into n pieces, returned as a generator.
timeago: render a number of seconds as a human-readable string.
unique: deduplicate a sequence.
find_one: JavaScript-like regex matching; pick a match by index (like [0], [1]).
...
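
A minimal usage sketch for a few of these helpers, assuming they are importable from torequests.utils (exact signatures and output formats may vary between versions):

from torequests.utils import curlparse, md5, timeago, ttime, unique

# translate a curl command string into request keyword arguments
args = curlparse("curl 'http://p.3.cn' -H 'Accept: */*'")
print(args)  # a dict along the lines of {'url': 'http://p.3.cn', 'headers': {...}, ...}

# md5 of str(obj)
print(md5('hello'))  # the md5 hex digest of 'hello'

# timestamp <-> '%Y-%m-%d %H:%M:%S'
print(ttime(1554048000))  # e.g. '2019-04-01 00:00:00' (timezone dependent)

# a number of seconds rendered as a human-readable duration
print(timeago(3661))

# deduplicate a sequence
print(list(unique([1, 2, 2, 3, 1])))  # [1, 2, 3]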

Documentation

Document & Usage

License

MIT license

Benchmarks

to be continued......
