Import XML data into wagtail to create pages and content
Project description
Wagtail WordPress Import
A package for Wagtail CMS to import WordPress blog content from an XML file into Wagtail.
Requirements
- Wagtail CMS Installed with initial setup
- WordPress XML export of all content in a single file
- The WordPress website will need to be live and available for importing of assets such as images and documents.
Compatibility
The package has been developed and tested with:
- Wagtail: 2.14 and 2.15
- Django: 3.1 and 3.2
- Postgres and SQLite Databases
Initial app and package setup
- Setup a Wagtail site using your preferred method or follow the official documentation to get started.
- Install this package with pip install -e "git+https://github.com/torchbox/wagtail-wordpress-import.git#egg=wagtail-wordpress-import" or using any method you prefer.
- Place your XML files somewhere on your disk. The file can have any name you choose.
- Create a
log
folder in the root of your site. The import script will need to write report files to this folder, you may need to set the permissions on the folder. - Add
"wagtail_wordpress_import"
to your INSTALLED_APPS config in your settings.py file.
Site URL for importing images and documents
Add a setting of WAGTAIL_WORDPRESS_IMPORTER_SOURCE_DOMAIN
in your sites settings and set it to the the URL of the website that the images and documents will be imported from.
First steps to configure your Wagtail app
The import can be run on an existing or new site but you will need to perform some setup on your page models.
We recommend your page model inherits from the provided WPImportedPageMixin
from wagtail_wordpress_import.models import WPImportedPageMixin
class PostPage(WPImportedPageMixin, Page):
...
You will need to run python manage.py makemigrations
and python manage.py migrate
to add the fields to your page model.
It's intended that these fields are temporary for while importing, and can be removed once the content has been imported. view source
A full example of the suggested page model class
from wagtail.admin.edit_handlers import (
FieldPanel,
ObjectList,
StreamFieldPanel,
TabbedInterface,
)
from wagtail.core.fields import StreamField
from wagtail.core.models import Page
from wagtail_wordpress_import.blocks import WPImportStreamBlocks
from wagtail_wordpress_import.models import WPImportedPageMixin
class PostPage(WPImportedPageMixin, Page):
body = StreamField(WPImportStreamBlocks)
content_panels = Page.content_panels + [
StreamFieldPanel("body"),
]
edit_handler = TabbedInterface(
[
ObjectList(content_panels, heading="Content"),
ObjectList(Page.promote_panels, heading="Promote"),
ObjectList(Page.settings_panels, heading="Settings", classname="settings"),
ObjectList(WPImportedPageMixin.wordpress_panels, heading="Debug"),
]
)
def import_wordpress_data(self, data):
# Wagtail page model fields
self.title = data["title"]
self.slug = data["slug"]
self.first_published_at = data["first_published_at"]
self.last_published_at = data["last_published_at"]
self.latest_revision_created_at = data["latest_revision_created_at"]
self.search_description = data["search_description"]
# debug fields
self.wp_post_id = data["wp_post_id"]
self.wp_post_type = data["wp_post_type"]
self.wp_link = data["wp_link"]
self.wp_raw_content = data["wp_raw_content"]
self.wp_block_json = data["wp_block_json"]
self.wp_processed_content = data["wp_processed_content"]
self.wp_normalized_styles = data["wp_normalized_styles"]
# own model fields
self.body = data["body"]
Running the import command
The most basic command would be:
python manage.py import_xml path/to/xml/file.xml parent_page_id
parent_page_id
is the ID of the page in your Wagtail site where WordPress pages will be imported as children. You can find this in the Wagtail admin URL when editing the page e.g. for http://www.domain.com/admin/pages/3/edit/
the ID is 3.
Running this command will import all WordPress 'post' and 'page' types to the 'PostPage' model in an app called 'pages'
Optional command arguments
-m
can be used to specify the Wagtail Page model to use for all imported pages. The default isPostPage
when it's not specified.-a
can be used to specify the Wagtail App where you have created your Wagtail Page model. The default ispages
when it's not specified.-t
can be used to limit the WordPress page types to be imported. You can pass in a comma-separated string of page types or just a single page type. The default ispage,post
if not specified.-s
can be used to specify the status of pages you want to import. You can pass in a comma-separated string of statuses or just a single status. The default ispublish,draft
if not specified.
Import process flow
While the import process is simple to run with the options above there's a lot that happens during the import.
By changing the import configuration, you can customise the import process without modifying wagtail-wordpress-import
package files. To understand what is possible, and how you can change the behaviour at different stages, you can read an overview of the import process.
Module documentation
- Block Builder
- Categories Import
- Import Hooks
- Prefilters
- WordPress Block Shortcodes
- WordPress Inline Shortcodes
- Yoast Import
Developer Tooling
We have included some developer commands to help you with importing large datasets and analyzing the data.
Further Usage Examples
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file wagtail-wordpress-import-0.1.3.tar.gz
.
File metadata
- Download URL: wagtail-wordpress-import-0.1.3.tar.gz
- Upload date:
- Size: 65.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.7.1 importlib_metadata/4.8.2 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.8.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8eaae007cc415b6c36168d439ce454626340c5874e04461f25a0056fecff97fb |
|
MD5 | 7fe4e66105dc190e69cb4847ea29fd45 |
|
BLAKE2b-256 | 9fe52ede8bd9111eb94fdc556370413d79a0cec8958dd16439c6e0a4c7daea29 |
File details
Details for the file wagtail_wordpress_import-0.1.3-py3-none-any.whl
.
File metadata
- Download URL: wagtail_wordpress_import-0.1.3-py3-none-any.whl
- Upload date:
- Size: 87.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.7.1 importlib_metadata/4.8.2 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.8.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4aa426a201a774978969d5157675215bc68cb1dfa6bf52e3b73eaf83c9ca396f |
|
MD5 | b8447f5374e1995d46f4d0fb216f5bb3 |
|
BLAKE2b-256 | 33bb23fdfbaf041643ba666ed7d00d943b8d47ead7d03e46445ce32eb7869e75 |