Python MediaWiki Bot Framework
Project description
Pywikibot
The Pywikibot framework is a Python library that interfaces with the MediaWiki API version 1.27 or higher.
Also included are various general function scripts that can be adapted for different tasks.
For further information about the library excluding scripts see the full code documentation.
Quick start
git clone https://gerrit.wikimedia.org/r/pywikibot/core.git
cd core
git submodule update --init
pip install -r requirements.txt
python pwb.py <script_name>
Or to install using PyPI (excluding scripts)
pip install pywikibot
pwb <scriptname>
Our installation guide has more details for advanced usage.
Basic Usage
If you wish to write your own script it’s very easy to get started:
import pywikibot
site = pywikibot.Site('en', 'wikipedia') # The site we want to run our bot on
page = pywikibot.Page(site, 'Wikipedia:Sandbox')
page.text = page.text.replace('foo', 'bar')
page.save('Replacing "foo" with "bar"') # Saves the page
Wikibase Usage
Wikibase is a flexible knowledge base software that drives Wikidata. A sample pywikibot script for getting data from Wikibase:
import pywikibot
site = pywikibot.Site('wikipedia:en')
repo = site.data_repository() # the Wikibase repository for given site
page = repo.page_from_repository('Q91') # create a local page for the given item
item = pywikibot.ItemPage(repo, 'Q91') # a repository item
data = item.get() # get all item data from repository for this item
Script example
Pywikibot provides bot classes to develop your own script easily:
import pywikibot
from pywikibot import pagegenerators
from pywikibot.bot import ExistingPageBot
class MyBot(ExistingPageBot):
update_options = {
'text': 'This is a test text',
'summary': 'Bot: a bot test edit with Pywikibot.'
}
def treat_page(self):
"""Load the given page, do some changes, and save it."""
text = self.current_page.text
text += '\n' + self.opt.text
self.put_current(text, summary=self.opt.summary)
def main():
"""Parse command line arguments and invoke bot."""
options = {}
gen_factory = pagegenerators.GeneratorFactory()
# Option parsing
local_args = pywikibot.handle_args(args) # global options
local_args = gen_factory.handle_args(local_args) # generators options
for arg in local_args:
opt, sep, value = arg.partition(':')
if opt in ('-summary', '-text'):
options[opt[1:]] = value
MyBot(generator=gen_factory.getCombinedGenerator(), **options).run()
if __name == '__main__':
main()
For more documentation on Pywikibot see our docs.
Roadmap
Current Release Changes
Add support for btmwiki to Pywikibot (T368069)
Include image repository extensions in site.APISite.file_extensions
Ignore ValueErrordurig upcast of FilePagedue to invalid file extension (T367777)
Add pagegenerators.SupersetPageGeneratorpagegenerator (T367684)
No longer wait in data.api.Request._http_requestfor ImportError and NameError
Replace requests.utils.urlparse with urllib.parse.urlparse in comms.http.get_authentication(T367649)
Show an appropiate message if requests_oauthlib package is required but missing (T353387)
Retry DBUnexpectedError in data.api.Request._internal_api_error(T367383)
Duplicated entries found in pywikibotwere removed
Pass None instead of an empty string as expiry argument in site.APISite.protect()(T367176)
Fix keyword argument in Page.undelete()when calling site.APISite.undelete()(T367037)
Check whether BaseBot.generatormethod
Add namespaces parameter to Page.templates()and Page.itertemplates()and require keyword arguments; only use TEMPLATE namespace for meth:Page.isDisambig()<page.BasePage.isDisambig> (T365199)
Drop pheetools support for proofreadpagewhich is no longer available upstreams (T366036)
Raise exceptions.SectionErrorif a section does not exists on a page (T107141)
Retry api request on ServerError (T364275, T364393)
i18n updates
Current Deprecations
9.2.0: Imports of loggingfunctions from botmodule is deprecated and will be desupported
9.2.0: total argument in -logevents pagegenerators option is deprecated; use -limit instead (T128981)
9.0.0: The content parameter of proofreadpage.IndexPage.page_genis deprecated and will be ignored (T358635)
9.0.0: userinterfaces.transliteration.transliterator was renamed to Transliterator
9.0.0: next parameter of userinterfaces.transliteration.transliterator.transliteratewas renamed to succ
9.0.0: type parameter of site.APISite.protectedpages() was renamed to protect_type
9.0.0: all parameter of site.APISite.namespace()was renamed to all_ns
9.0.0: filter parameter of date.dhwas renamed to filter_func
9.0.0: dict parameter of data.api.OptionSetwas renamed to data
9.0.0: pywikibot.version.get_toolforge_hostname() is deprecated without replacement
9.0.0: allrevisions parameter of xmlreader.XmpDumpis deprecated, use revisions instead (T340804)
9.0.0: iteritems method of data.api.Requestwill be removed in favour of items
9.0.0: SequenceOutputter.output() is deprecated in favour of tools.formatter.SequenceOutputter.out property
9.0.0: nullcontext context manager and SimpleQueue queue of backportsare derecated
8.4.0: modules_only_mode parameter of data.api.ParamInfo, its paraminfo_keys class attribute and its preloaded_modules property will be removed
8.4.0: dropdelay and releasepid attributes of throttle.Throttlewill be removed in favour of expiry class attribute
8.2.0: tools.itertools.itergroupwill be removed in favour of backports.batched
8.2.0: normalize parameter of WbTime.toTimestrand WbTime.toWikibasewill be removed
8.1.0: Dependency of exceptions.NoSiteLinkErrorfrom exceptions.NoPageErrorwill be removed
8.1.0: exceptions.Server414Error is deprecated in favour of exceptions.Client414Error
8.0.0: Timestamp.clone()method is deprecated in favour of Timestamp.replace() method.
8.0.0: family.Family.maximum_GET_lengthmethod is deprecated in favour of config.maximum_GET_length(T325957)
8.0.0: addOnly parameter of textlib.replaceLanguageLinksand textlib.replaceCategoryLinksare deprecated in favour of add_only
8.0.0: textlib.TimeStripperregex attributes ptimeR, ptimeznR, pyearR, pmonthR, pdayR are deprecated in favour of patterns attribute which is a textlib.TimeStripperPatterns.
8.0.0: textlib.TimeStripper``groups`` attribute is deprecated in favour of textlib.TIMEGROUPS
8.0.0: LoginManager.get_login_tokenwas replaced by login.ClientLoginManager.site.tokens['login']
8.0.0: data.api.LoginManager() is deprecated in favour of login.ClientLoginManager
8.0.0: APISite.messages()method is deprecated in favour of userinfo[‘messages’]
8.0.0: Page.editTime()method is deprecated and should be replaced by Page.latest_revision.timestamp
Pending removal in Pywikibot 10
9.1.0: version.svn_rev_infoand version.getversion_svnwill be removed. SVN is no longer supported. (T362484)
7.7.0: tools.threadingclasses should no longer imported from tools
7.6.0: tools.itertoolsdatatypes should no longer imported from tools
7.6.0: tools.collectionsdatatypes should no longer imported from tools
7.5.0: textlib.tzoneFixedOffset class will be removed in favour of time.TZoneFixedOffset
7.4.0: FilePage.usingPages() was renamed to using_pages()
7.3.0: Old color escape sequences like \03{color} is deprecated in favour of new color format like <<color>>
7.3.0: linktrail method of family.Familyis deprecated; use APISite.linktrail() instead
7.2.0: Positional arguments decoder, layer and newline for loggingfunctions where dropped; keyword arguments must be used instead.
7.2.0: tb parameter of exception()function was renamed to exc_info
7.2.0: XMLDumpOldPageGenerator is deprecated in favour of a content parameter of XMLDumpPageGenerator(T306134)
7.2.0: RedirectPageBot and NoRedirectPageBot bot classes are deprecated in favour of use_redirectsattribute
7.2.0: tools.formatter.color_formatis deprecated and will be removed
7.1.0: Unused get_redirect parameter of Page.getOldVersion()will be removed
7.0.0: User.isBlocked() method is renamed to is_blocked for consistency
7.0.0: A boolean watch parameter in Page.save() is deprecated and will be desupported
7.0.0: baserevid parameter of editSource(), editQualifier(), removeClaims(), removeSources(), remove_qualifiers() DataSite methods will be removed
7.0.0: Values of APISite.allpages() parameter filterredir other than True, False and None are deprecated
7.0.0: The i18n identifier ‘cosmetic_changes-append’ will be removed in favour of ‘pywikibot-cosmetic-changes’
Release history
See https://github.com/wikimedia/pywikibot/blob/stable/HISTORY.rst
Contributing
Our code is maintained on Wikimedia’s Gerrit installation, learn how to get started.
Code of Conduct
The development of this software is covered by a Code of Conduct.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file pywikibot-9.2.0.tar.gz
.
File metadata
- Download URL: pywikibot-9.2.0.tar.gz
- Upload date:
- Size: 618.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.11.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3df116778b108207a9e03fdfbdea71eb2f41144fcc9247d8797a4b7a8417aa18 |
|
MD5 | b7b07d45b682d33cf0e0520fee792711 |
|
BLAKE2b-256 | c9732ea529c9d60ad1b5c82a90a553afc31cafb6cbec63d4caff13b73742006a |
File details
Details for the file pywikibot-9.2.0-py3-none-any.whl
.
File metadata
- Download URL: pywikibot-9.2.0-py3-none-any.whl
- Upload date:
- Size: 721.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.11.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | fd659c3111eebfb3704cddc4e2dcb7afac15f418a8c4cdd4ef680d7658237e60 |
|
MD5 | 4067206786dc3405822067a5c77fe009 |
|
BLAKE2b-256 | c9fe00f9ebd7348f5fc6f5e86cf3a3cd6ccc5cfb92aeea8c6ea9bfeee7469344 |