Cross-language UserAgent classifier library, Python implementation
The Python implementation of Project Woothee, a set of multi-language user-agent string parsers.
https://github.com/woothee/woothee
Installation
$ pip install woothee
Usage
Parsing user-agent
import woothee
woothee.parse("Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.1; Trident/4.0)")
# => {'name': 'Internet Explorer', 'category': 'pc', 'os': 'Windows 7', 'version': '8.0', 'vendor': 'Microsoft', 'os_version': 'NT 6.1'}
Parses a user-agent string and returns an object with the keys name, category, os, version, vendor and os_version.
For an unknown user-agent (or one that could only be partially parsed), fields of the result object may have the value ‘UNKNOWN’.
category
label for the user terminal type, one of ‘pc’, ‘smartphone’, ‘mobilephone’, ‘appliance’, ‘crawler’ or ‘misc’ (or ‘UNKNOWN’)
name
the name of browser, like ‘Internet Explorer’, ‘Firefox’, ‘GoogleBot’
version
version string, like ‘8.0’ for IE, ‘9.0.1’ for Firefox, ‘0.2.149.27’ for Chrome, and so on
os
operating system name, e.g. ‘Windows 7’, ‘Mac OSX’, ‘iPhone’, ‘iPad’, ‘Android’
For category ‘mobilephone’, this field indicates the cellular phone carrier instead.
vendor
optional field, shows browser vendor
os_version
optional field, shows version of operating systems
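As a minimal sketch of how the fields above might be consumed, the helpers below filter out ‘UNKNOWN’ values and build a short label from a result object. The names UNKNOWN, known_fields, describe and describe_user_agent are hypothetical and not part of woothee; only woothee.parse comes from the library.

```python
# Placeholder value that, per the description above, may appear in results.
UNKNOWN = "UNKNOWN"


def known_fields(result):
    """Return only the entries whose value is not the UNKNOWN placeholder."""
    return {key: value for key, value in result.items() if value != UNKNOWN}


def describe(result):
    """Build a short label such as 'Internet Explorer 8.0 on Windows 7'."""
    # .get() is used because vendor and os_version are documented as optional;
    # treating the other keys the same way is a defensive assumption.
    name = result.get("name", UNKNOWN)
    version = result.get("version", UNKNOWN)
    os_name = result.get("os", UNKNOWN)

    parts = [name if name != UNKNOWN else "unknown browser"]
    if version != UNKNOWN:
        parts.append(version)
    label = " ".join(parts)
    if os_name != UNKNOWN:
        label += " on " + os_name
    return label


def describe_user_agent(user_agent):
    """Parse a user-agent string with woothee and describe the result."""
    import woothee  # imported lazily so the helpers above work on plain dicts

    return describe(woothee.parse(user_agent))
```

The helpers take the result object rather than the raw string, so they can also be applied to results that were parsed earlier and stored.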
Finding crawlers quickly (most, but not all)
woothee.is_crawler('Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.1; Trident/4.0)')
# => False
Checks whether the user-agent’s category is ‘crawler’ using a quick, lightweight method. Less common crawlers are not tested by this method. To check for crawlers strictly, use woothee.parse(str)['category'] == 'crawler'.
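The two checks can be combined: try the fast method first and fall back to the full parse only when it reports no crawler. The sketch below assumes that a True result from woothee.is_crawler never needs a second opinion, which the text above suggests but does not state outright. The function is_crawler_strict and its parse and fast_check parameters are hypothetical; the parameters exist only so the logic can be exercised with stand-in functions.

```python
def is_crawler_strict(user_agent, parse=None, fast_check=None):
    """Return True if the user-agent is classified as a crawler.

    The fast check runs first; the full parse is the fallback that
    catches the less common crawlers the fast check does not test.
    """
    if parse is None or fast_check is None:
        import woothee  # only needed when no stand-ins are supplied

        parse = parse or woothee.parse
        fast_check = fast_check or woothee.is_crawler

    # Assumption: a positive fast check is trusted without re-parsing.
    if fast_check(user_agent):
        return True

    # Strict check, as documented: compare the parsed category.
    return parse(user_agent).get("category") == "crawler"
```

With woothee installed, is_crawler_strict(ua) uses the library functions directly. If the assumption about the fast check does not hold for your data, drop the early return and rely on the parsed category alone.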
License
Copyright 2012- TAGOMORI Satoshi (tagomoris)
Licensed under the Apache License, Version 2.0 (the “License”); you may not use this file except in compliance with the License. You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an “AS IS” BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
History
1.10.1(Aug 8, 2019)
1.10.0(Apr 14, 2019)
1.8.0(Jul 5, 2018)
#13 Add support for Yandex Browser. Thanks to hhatto.
1.7.0(May 7, 2017)
Add support for WebView on Android.
Add support for curl.
Add support for trendictionbot crawler.
Add support for Yeti 1.1.
Compatible with Python 3.6.
Caution: We’ll drop Python 2.6 and Python 3.2 support in the next version.
1.5.0(Aug 16, 2016)
#11 Support BingPreview. Thanks to taise.
1.4.0(May 17, 2016)
Add support for Vivaldi
1.3.0(Jan 7, 2016)
Add support for Firefox for iOS
1.2.0(Aug 16, 2015)
Add support for Twitterbot
Add support for webviews of mobile devices
Add support for Windows 10 and Edge browser
Add support for BlackBerry10
1.1.0(Mar 1, 2015)
#9 Test blank cases. Thanks to yuya-takeyama.
1.0.0(Jan 20, 2015)
First release