Syntactic patterns of HTTP user-agents used by bots / robots / crawlers / scrapers / spiders.
crawler-user-agents
This repository contains a list of HTTP user-agents used by robots, crawlers, and spiders, in a single JSON file.
- NPM package: https://www.npmjs.com/package/crawler-user-agents
- Go package: https://pkg.go.dev/github.com/monperrus/crawler-user-agents
- PyPi package: https://pypi.org/project/crawler-user-agents/
Each pattern is a regular expression. It should work out-of-the-box with your favorite regex library.
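For instance, here is a minimal sketch of matching the patterns with Python's built-in `re` module. The two entries below are illustrative samples in the same shape as `crawler-user-agents.json` (the real file contains many hundreds of patterns):

```python
import json
import re

# Two illustrative entries in the same shape as crawler-user-agents.json.
crawlers = json.loads("""
[
  {"pattern": "Googlebot", "url": "http://www.google.com/bot.html"},
  {"pattern": "bingbot", "url": "http://www.bing.com/bingbot.htm"}
]
""")

def is_crawler(user_agent):
    # re.search is used because the pattern may occur anywhere in the UA string.
    return any(re.search(entry["pattern"], user_agent) for entry in crawlers)

print(is_crawler("Mozilla/5.0 (compatible; bingbot/2.0)"))           # True
print(is_crawler("Mozilla/5.0 (X11; Linux x86_64) Firefox/115.0"))   # False
```

In practice you would load the full JSON file (or use one of the packages below) instead of the inline sample.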
Sponsor
💼 Using crawler-user-agents in a commercial product? This package is free to use, but it takes real time to maintain and expand. If it's providing value (and it probably is), please consider sponsoring at the commercial tier.
It keeps the project alive and actively maintained. Your company can afford it. 🙏
Install
Direct download
Download the crawler-user-agents.json file from this repository directly.
Javascript
crawler-user-agents is deployed on npmjs.com: https://www.npmjs.com/package/crawler-user-agents
To install it with npm or yarn:

```shell
npm install --save crawler-user-agents
# OR
yarn add crawler-user-agents
```
In Node.js, you can require the package to get an array of crawler user agents.
```javascript
const crawlers = require('crawler-user-agents');
console.log(crawlers);
```
Python
Install with `pip install crawler-user-agents`.
Then:
```python
import crawleruseragents

if crawleruseragents.is_crawler("Googlebot/"):
    # do something
    ...
```
or:
```python
import crawleruseragents

indices = crawleruseragents.matching_crawlers("bingbot/2.0")
print("crawlers' indices:", indices)
print(
    "crawler's URL:",
    crawleruseragents.CRAWLER_USER_AGENTS_DATA[indices[0]]["url"],
)
```
Note that matching_crawlers is much slower than is_crawler when the given User-Agent actually matches one or more crawlers, because it must try every pattern instead of stopping at the first match.
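The performance difference can be illustrated with a hand-rolled sketch (not the package's actual implementation): a boolean check can short-circuit at the first matching pattern, while collecting all matching indices requires scanning the whole list.

```python
import re

# Illustrative patterns only; the real list in crawler-user-agents.json is much longer.
patterns = [re.compile(p) for p in ["Googlebot", "bingbot", "bot"]]

def is_crawler(user_agent):
    # Short-circuits at the first match.
    return any(p.search(user_agent) for p in patterns)

def matching_crawlers(user_agent):
    # Must scan the whole list to return every matching index.
    return [i for i, p in enumerate(patterns) if p.search(user_agent)]

ua = "Mozilla/5.0 (compatible; bingbot/2.0)"
print(is_crawler(ua))          # True ("bingbot" matches, scan stops there)
print(matching_crawlers(ua))   # [1, 2] ("bingbot" and "bot" both match)
```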
Go
Go: use this package. It provides the global variable Crawlers (kept in sync with crawler-user-agents.json) and the functions IsCrawler and MatchingCrawlers.
Example Go program:

```go
package main

import (
	"fmt"

	agents "github.com/monperrus/crawler-user-agents"
)

func main() {
	userAgent := "Mozilla/5.0 (compatible; Discordbot/2.0; +https://discordapp.com)"

	isCrawler := agents.IsCrawler(userAgent)
	fmt.Println("isCrawler:", isCrawler)

	indices := agents.MatchingCrawlers(userAgent)
	fmt.Println("crawlers' indices:", indices)
	fmt.Println("crawler's URL:", agents.Crawlers[indices[0]].URL)
}
```
Output:

```
isCrawler: true
crawlers' indices: [237]
crawler's URL: https://discordapp.com
```
Contributing
I do welcome additions contributed as pull requests.
The pull requests should:
- contain a single addition
- specify a discriminating syntactic fragment (for example "totobot", not the full "Mozilla/5 totobot v20131212.alpha1")
- contain the pattern (a generic regular expression), the discovery date (year/month/day), and the official URL of the robot
- result in a valid JSON file (don't forget the comma between items)
Example:
```json
{
  "pattern": "rogerbot",
  "addition_date": "2014/02/28",
  "url": "http://moz.com/help/pro/what-is-rogerbot-",
  "instances": ["rogerbot/2.3 example UA"],
  "tags": ["seo"]
}
```
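Before opening a pull request, a new entry can be sanity-checked with a short Python sketch (an illustration, not part of the project's tooling; field names follow the example above):

```python
import json
import re
from datetime import datetime

entry_text = """
{
  "pattern": "rogerbot",
  "addition_date": "2014/02/28",
  "url": "http://moz.com/help/pro/what-is-rogerbot-",
  "instances": ["rogerbot/2.3 example UA"],
  "tags": ["seo"]
}
"""

entry = json.loads(entry_text)                          # must be valid JSON
re.compile(entry["pattern"])                            # pattern must be a valid regex
datetime.strptime(entry["addition_date"], "%Y/%m/%d")   # date in year/month/day form

# Every listed instance should actually match the pattern.
for instance in entry["instances"]:
    assert re.search(entry["pattern"], instance)

print("entry looks valid")
```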
License
The list is under an MIT License. Versions prior to Nov 7, 2016 were under a CC-SA license.
Related work
There are a few wrapper libraries that use this data to detect bots:
- Voight-Kampff (Ruby)
- isbot (Ruby)
- crawlers (Clojure)
- isBot (Node.JS)
Other systems for spotting robots, crawlers, and spiders that you may want to consider are:
- Crawler-Detect (PHP)
- BrowserDetector (PHP)
- browscap (JSON files)
File details

Details for the file crawler_user_agents-1.43.0-py3-none-any.whl:

- Size: 58.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

| Algorithm | Hash digest |
|---|---|
| SHA256 | ae8a1e6fb0b4041015cfc719ceb2da1e891f76585837559ac66c6d5ec68b1ffa |
| MD5 | afa4fb32f16a40e077d6c7499d4b8318 |
| BLAKE2b-256 | 085ad5892519cac8bb6c8761184b241b564f8c67451c1eaac294f671a7f9d890 |
Provenance

The following attestation bundle was made for crawler_user_agents-1.43.0-py3-none-any.whl:

- Publisher: pypi-publish.yml on monperrus/crawler-user-agents
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: crawler_user_agents-1.43.0-py3-none-any.whl
- Subject digest: ae8a1e6fb0b4041015cfc719ceb2da1e891f76585837559ac66c6d5ec68b1ffa
- Sigstore transparency entry: 1338602770
- Permalink: monperrus/crawler-user-agents@d832f8d29cd0345a32858254ea06e6a90ea548ae
- Branch / Tag: refs/heads/master
- Owner: https://github.com/monperrus
- Access: public
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pypi-publish.yml@d832f8d29cd0345a32858254ea06e6a90ea548ae
- Trigger Event: workflow_run