Skip to main content

Python MediaWiki Bot Framework

Project description

GitHub CI AppVeyor Build Status Code coverage Maintainability Python Top language Pywikibot release wheel Total downloads Monthly downloads Last commit

Pywikibot

The Pywikibot framework is a Python library that interfaces with the MediaWiki API version 1.27 or higher.

Also included are various general function scripts that can be adapted for different tasks.

For further information about the library excluding scripts see the full code documentation.

Quick start

pip install requests
git clone https://gerrit.wikimedia.org/r/pywikibot/core.git
cd core
git submodule update --init
python pwb.py script_name

Or to install using PyPI (excluding scripts)

pip install -U setuptools
pip install pywikibot
pwb <scriptname>

Our installation guide has more details for advanced usage.

Basic Usage

If you wish to write your own script it’s very easy to get started:

import pywikibot
site = pywikibot.Site('en', 'wikipedia')  # The site we want to run our bot on
page = pywikibot.Page(site, 'Wikipedia:Sandbox')
page.text = page.text.replace('foo', 'bar')
page.save('Replacing "foo" with "bar"')  # Saves the page

Wikibase Usage

Wikibase is a flexible knowledge base software that drives Wikidata. A sample pywikibot script for getting data from Wikibase:

import pywikibot
site = pywikibot.Site('wikipedia:en')
repo = site.data_repository()  # the Wikibase repository for given site
page = repo.page_from_repository('Q91')  # create a local page for the given item
item = pywikibot.ItemPage(repo, 'Q91')  # a repository item
data = item.get()  # get all item data from repository for this item

Script example

Pywikibot provides bot classes to develop your own script easily:

import pywikibot
from pywikibot import pagegenerators
from pywikibot.bot import ExistingPageBot

class MyBot(ExistingPageBot):

    update_options = {
        'text': 'This is a test text',
        'summary': 'Bot: a bot test edit with Pywikibot.'
    }

    def treat_page(self):
        """Load the given page, do some changes, and save it."""
        text = self.current_page.text
        text += '\n' + self.opt.text
        self.put_current(text, summary=self.opt.summary)

def main():
    """Parse command line arguments and invoke bot."""
    options = {}
    gen_factory = pagegenerators.GeneratorFactory()
    # Option parsing
    local_args = pywikibot.handle_args(args)  # global options
    local_args = gen_factory.handle_args(local_args)  # generators options
    for arg in local_args:
        opt, sep, value = arg.partition(':')
        if opt in ('-summary', '-text'):
            options[opt[1:]] = value
    MyBot(generator=gen_factory.getCombinedGenerator(), **options).run()

if __name == '__main__':
    main()

For more documentation on Pywikibot see our docs.

Roadmap

Current release

  • Add support for gpewiki (T335989)

  • family.WikibaseFamilyand family.DefaultWikibaseFamilywere added to familymodule

  • Remove incorrect time normalization in page.Claim(T338748, T325860, T57755)

  • Add support for other types of diffs in Site.compare()

  • Improvements for textlib.extract_sectionsfunction (T338748)

  • Backport itertools.batched() from Python 3.12 which replaces tools.itertools.itergroup

  • Upcast page types in pagegenerators.RecentChangesPageGenerator(T340450)

  • Enable FilePage.download()to download thumbnails (T247095)

  • Refactor tools.compute_file_hashand use hashlib.file_digest with Python 3.11

  • Url ends with curly bracket in textlib.compileLinkR(T338029)

  • Allows spaces in environment variables for editor.TextEditor(T102465, T323078)

  • Add textlib.get_regexespublic function (T336144)

  • Return ‘https’ scheme with family.Family.protocol(T326046)

  • Use build instead of setuptools.setup() to build the distribution

  • Raise ConnectionError on requests.ReadTimeout in comms.http.error_handling_callback

  • Raise exceptions.ServerErroron requests.ReadTimeout in comms.http.error_handling_callback

  • Do not evaluate pywikibot.Sitewith dict.pop() as default value (T335720)

  • L10N updates

  • family.Familyclass was rewritten. obsolete.setter was removed, family.Family.interwiki_replacementsreturns an invariant mapping, family.Family.interwiki_removalsreturns a frozenset. closed_wikis, removed_wikis and code_aliases are family.Familyclass attributes. (T334834)

Deprecations

  • 8.2.0: normalize parameter of WbTime.toTimestrand WbTime.toWikibasewill be removed

  • 8.1.0: Dependency of exceptions.NoSiteLinkErrorfrom exceptions.NoPageErrorwill be removed

  • 8.1.0: exceptions.Server414Error is deprecated in favour of exceptions.Client414Error

  • 8.0.0: Timestamp.clone()method is deprecated in favour of Timestamp.replace() method.

  • 8.0.0: family.Family.maximum_GET_lengthmethod is deprecated in favour of config.maximum_GET_length(T325957)

  • 8.0.0: addOnly parameter of textlib.replaceLanguageLinksand textlib.replaceCategoryLinksare deprecated in favour of add_only

  • 8.0.0: textlib.TimeStripperregex attributes ptimeR, ptimeznR, pyearR, pmonthR, pdayR are deprecated in favour of patterns attribute which is a textlib.TimeStripperPatterns.

  • 8.0.0: textlib.TimeStripper``groups`` attribute is deprecated in favour of textlib.TIMEGROUPS

  • 8.0.0: LoginManager.get_login_tokenwas replaced by login.ClientLoginManager.site.tokens['login']

  • 8.0.0: data.api.LoginManager() is deprecated in favour of login.ClientLoginManager

  • 8.0.0: APISite.messages()method is deprecated in favour of userinfo[‘messages’]

  • 8.0.0: Page.editTime()method is deprecated and should be replaced by Page.latest_revision.timestamp

  • 7.7.0: tools.threadingclasses should no longer imported from tools

  • 7.6.0: tools.itertoolsdatatypes should no longer imported from tools

  • 7.6.0: tools.collectionsdatatypes should no longer imported from tools

  • 7.5.0: textlib.tzoneFixedOffset class will be removed in favour of time.TZoneFixedOffset

  • 7.4.0: FilePage.usingPages() was renamed to using_pages()

  • 7.2.0: tb parameter of exception()function was renamed to exc_info

  • 7.2.0: XMLDumpOldPageGenerator is deprecated in favour of a content parameter of XMLDumpPageGenerator(T306134)

  • 7.2.0: RedirectPageBot and NoRedirectPageBot bot classes are deprecated in favour of use_redirectsattribute

  • 7.2.0: tools.formatter.color_formatis deprecated and will be removed

  • 7.1.0: Unused get_redirect parameter of Page.getOldVersion()will be removed

  • 7.0.0: User.isBlocked() method is renamed to is_blocked for consistency

  • 7.0.0: A boolean watch parameter in Page.save() is deprecated and will be desupported

  • 7.0.0: baserevid parameter of editSource(), editQualifier(), removeClaims(), removeSources(), remove_qualifiers() DataSite methods will be removed

  • 7.0.0: Values of APISite.allpages() parameter filterredir other than True, False and None are deprecated

  • 7.0.0: The i18n identifier ‘cosmetic_changes-append’ will be removed in favour of ‘pywikibot-cosmetic-changes’

  • 6.5.0: OutputOption.output() method will be removed in favour of OutputOption.out property

  • 6.5.0: Infinite rotating file handler with logfilecount of -1 is deprecated

  • 6.4.0: ‘allow_duplicates’ parameter of tools.itertools.intersect_generatorsas positional argument is deprecated, use keyword argument instead

  • 6.4.0: ‘iterables’ of tools.itertools.intersect_generatorsgiven as a list or tuple is deprecated, either use consecutive iterables or use ‘*’ to unpack

  • 6.2.0: outputter of OutputProxyOption without out property is deprecated

  • 6.2.0: ContextOption.output_range() and HighlightContextOption.output_range() are deprecated

  • 6.2.0: Error messages with ‘%’ style is deprecated in favour for str.format() style

  • 6.2.0: page.url2unicode() function is deprecated in favour of tools.chars.url2string()

  • 6.2.0: Throttle.multiplydelay attribute is deprecated

  • 6.2.0: SequenceOutputter.format_list() is deprecated in favour of ‘out’ property

  • 6.0.0: config.register_family_file() is deprecated

  • 5.5.0: APISite.redirectRegex() will be removed in favour of APISite.redirect_regex()

Release history

See https://github.com/wikimedia/pywikibot/blob/stable/HISTORY.rst

Contributing

Our code is maintained on Wikimedia’s Gerrit installation, learn how to get started.

Code of Conduct

The development of this software is covered by a Code of Conduct.

Project details


Release history Release notifications | RSS feed

This version

8.2.0

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pywikibot-8.2.0.tar.gz (604.1 kB view details)

Uploaded Source

Built Distribution

pywikibot-8.2.0-py3-none-any.whl (703.9 kB view details)

Uploaded Python 3

File details

Details for the file pywikibot-8.2.0.tar.gz.

File metadata

  • Download URL: pywikibot-8.2.0.tar.gz
  • Upload date:
  • Size: 604.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.0

File hashes

Hashes for pywikibot-8.2.0.tar.gz
Algorithm Hash digest
SHA256 d6d0cd748ee5534ffecf8ebdd831949b795935cf31127f313b1a115e44de87f2
MD5 b66be8e54c69a44b3faf25641fecb28d
BLAKE2b-256 e339e33be38997644bbf64ecf3caece86166835b63c941f106ba1eeecaae9f7e

See more details on using hashes here.

File details

Details for the file pywikibot-8.2.0-py3-none-any.whl.

File metadata

  • Download URL: pywikibot-8.2.0-py3-none-any.whl
  • Upload date:
  • Size: 703.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.0

File hashes

Hashes for pywikibot-8.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 193a1b56e6ffc44f981b9a87ccfe022b7084e8d0f626635990bd9dc712c07033
MD5 1d56f220b69c7f9500767fb4e60226c5
BLAKE2b-256 4ec2d381251be1021138a96f37de35e0e9d9fbd8868ef303270039f6c37ee0e7

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page