
A package for removing tracking parameters from URLs. It can automatically update its filtering rules from AdGuard.


URL Cleaner


Introduction

A package for removing tracking parameters from URLs. This package supports:

  • Automatic updates of filtering rules from AdGuard.
  • Custom filtering rules.
  • Filtering rules specific to a host and pathname.
  • Hundreds of ready-to-use filtering rules.
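
To illustrate what filtering specific to a host and pathname can mean, here is a minimal sketch using only the standard library. The rule tables, their layout, and the helper name `params_to_remove` are hypothetical and are not taken from this package; its real rule format may differ.

```python
from urllib.parse import urlsplit

# Hypothetical rule tables for illustration only; not the package's rule format.
GLOBAL_PARAMS = {"utm_source", "utm_medium"}
HOST_PATH_RULES = {
    # (host, path prefix) -> parameters to drop for matching URLs
    ("baijiahao.baidu.com", "/s"): {"wfr", "for"},
    ("www.bilibili.com", "/video/"): {"spm_id_from"},
}


def params_to_remove(url):
    """Return the global parameters plus those of every matching host/path rule."""
    parts = urlsplit(url)
    selected = set(GLOBAL_PARAMS)
    for (host, prefix), params in HOST_PATH_RULES.items():
        if parts.hostname == host and parts.path.startswith(prefix):
            selected |= params
    return selected


print(sorted(params_to_remove("https://www.bilibili.com/video/BV158411b7ki/")))
```

Keying rules on host and path prefix keeps a short, site-specific name such as `for` from being stripped on unrelated sites where it may carry real meaning.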

Inspired by ClearUrl and URL Bot. Thanks to their authors for their efforts!

Rules from:

  1. AdguardFilters
  2. url_bot

Examples

Original: https://baijiahao.baidu.com/s?id=1748839822649920321&wfr=spider&for=pc
Cleaned: https://baijiahao.baidu.com/s?id=1748839822649920321

Original: https://mp.weixin.qq.com/s?__biz=MjM5OTExMjYwMA==&mid=2670081058&idx=6&sn=1ad7112020c2a4104d67ca542ab14444&chksm=bc12eed58b6567c30c78123a9e8901241512642305dabae4fa1f52357f5ce0ac7a85554#rd
Cleaned: https://mp.weixin.qq.com/s?__biz=MjM5OTExMjYwMA%3D%3D&mid=2670081058&idx=6&sn=1ad7112020c2a4104d67ca542ab14444#rd

Original: https://www.bilibili.com/video/BV158411b7ki/?spm_id_from=333234107.tianma.1-2-2.click
Cleaned: https://www.bilibili.com/video/BV158411b7ki/
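
The transformations in the examples above can be approximated with the standard library alone. The sketch below is not this package's implementation. The helper name `strip_params` and the explicit set of parameter names are assumptions for illustration. It also shows one plausible reason why `==` appears as `%3D%3D` in the second example: the query string is parsed and then re-encoded.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def strip_params(url, tracking):
    """Drop the query parameters named in `tracking`, keeping order and fragment."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key not in tracking
    ]
    # urlencode percent-encodes reserved characters such as "=" inside values.
    return urlunsplit(parts._replace(query=urlencode(kept)))


url = "https://baijiahao.baidu.com/s?id=1748839822649920321&wfr=spider&for=pc"
print(strip_params(url, {"wfr", "for"}))
# https://baijiahao.baidu.com/s?id=1748839822649920321
```

When every parameter is removed, `urlunsplit` omits the trailing `?`, which matches the third example.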

Usage

Install

pip install url-cleaner

Clean URLs

from url_cleaner import UrlCleaner

# Create a cleaner and strip the tracking parameters from a URL
c = UrlCleaner()
url = "https://baijiahao.baidu.com/s?id=1748839822649920321&wfr=spider&for=pc"
cleaned = c.clean(url)
print(cleaned)

https://baijiahao.baidu.com/s?id=1748839822649920321

Update rules

from url_cleaner import UrlCleaner

c = UrlCleaner()
# Refresh the filtering rules
c.ruler.update_rules()
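
AdGuard filter lists express parameter removal with the `$removeparam` modifier. How this package parses the downloaded rules is not described here, so the following is only a sketch of how the simplest rule forms could be read. The function name `parse_removeparam` and the sample rule lines are made up for illustration, and regular-expression values are deliberately skipped.

```python
def parse_removeparam(line):
    """Return (host or None, param) for a simple `$removeparam=` rule, else None."""
    line = line.strip()
    if not line or line.startswith("!"):  # blank line or filter-list comment
        return None
    pattern, sep, modifiers = line.partition("$")
    if not sep:
        return None  # no modifiers at all
    param = None
    for modifier in modifiers.split(","):
        if modifier.startswith("removeparam="):
            param = modifier[len("removeparam="):]
    if not param or param.startswith("/"):
        return None  # not a removeparam rule, or a regex value (not handled)
    host = None
    if pattern.startswith("||"):
        # "||example.com^" restricts the rule to that domain
        host = pattern[2:].split("^")[0].split("/")[0] or None
    return (host, param)


# Made-up sample lines in AdGuard syntax
for rule in ["! a comment", "$removeparam=utm_source", "||bilibili.com^$removeparam=spm_id_from"]:
    print(parse_removeparam(rule))
```

A rule without a domain pattern applies globally, while a `||domain^` prefix scopes it to one site, which corresponds to host-specific filtering.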
