Skip to main content

Clean up the HTML formatting problems introduced by pasting content from MSWord into Plone's RichText fields.

Project description

Introduction

This product cleans up the HTML formatting problems that are introduced by pasting content from MSWord into Plone’s RichText fields.

Every time an object is created or edited, the HTML in its RichText fields will be sanitized.

The HTML sanitizing feature is turned on by default for all Archetype objects, but can be turned off on a per object basis by checking a box in the ‘settings’ fieldset of the default edit view.

Implementation:

This product provides an event subscriber for all BaseContent Archetypes objects that will clean up the HTML of all the RichText fields for each object.

The cleaning and sanitizing of the HTML code is mainly done by using the lxml library: http://codespeak.net/lxml/lxmlhtml.html by means of the htmllaundry package, written by Wichert Akkerman.

Installation:

This Product does not have to be installed via quick_installer or the plone control panel.

Just add it to your buildout or install via easy_install.

Changelog

1.2.8 (2010-12-21)

  • Use Wichert Akkerman’s htmllaundry utilities for code sanitizing. (jcbrand)

1.2.7 (2010-02-17)

  • Only create a new version if at_edit would not create one anyway (thomasw)

1.2.6 (2009-12-13)

  • Don’t force target=”_blank” on links. (thomasw)

1.2.5 (2009-12-07)

  • Set add_nofollow to False, since it seems to confuse lxmk.html parser (thomasw).

1.2.4 (2009-11-25)

  • Added LinguaPlone’s generateMethods magic, so that the languageIndependent field gets propagated to all translations when the canonical is edited (thomasw)

1.2.3 (2009-11-19)

  • don’t strip ‘h1’ and ‘h2’ (jcbrand)

1.2.2 (2009-11-16)

  • added a more flexible detection mechanism - encoding (pilz)

1.2.1 (2009-11-03)

  • I was too stupid to make a successful release, here we go again (thomasw)

1.2 (2009-11-03)

  • Bugfix in event-handler: don’t fail if cleanWordPastedText field isn’t present (thomasw)

1.1 - 2009-10-27

  • Add a new sanitize method and new helper methods. Thanks to Wichert.

  • Enable the cleaner by default. (jcbrand)

1.0 - 2009-10-23

  • Initial release (jcbrand)

Contributors:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

slc.cleanwordpastedtext-1.2.8.tar.gz (17.8 kB view details)

Uploaded Source

File details

Details for the file slc.cleanwordpastedtext-1.2.8.tar.gz.

File metadata

File hashes

Hashes for slc.cleanwordpastedtext-1.2.8.tar.gz
Algorithm Hash digest
SHA256 cc1f92b424c1e6e20e1d7591f9731c94e0c7270bf0bdc04e1c5e3ae6b3f33336
MD5 5ab89f735e822552648f68d1d828da8e
BLAKE2b-256 5fff260b125a14abe32da1cd3938a8954fba072a19169df38ab0d3946f318312

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page