Flattens XML/HTML documents

Project Description

Flatten documents

HTML

xflat turns this HTML

 <html>
    <head>
        <title> Example title </title>
        <meta charset='utf-8'>
    </head>
<body>
    <h1 class="main">Example heading </h1>
    <p id="example-body"  class="class-1 class-2"> Example body, with a class and id
        <br>
        line break and some text
    </p>
    <div>
        <ol>
            <li> List item 1 </li>
            <li> List item 2 </li>
        </ol>
    </div>
    </body>
<!--[if lt IE 7]><style type="text/css">body{behavior:url("/w/static/1.26wmf9/skins/Vector/csshover.min.htc")}</style><![endif]-->
</html>

into

html
html/head
html/head/title Example title
html/head/meta          charset="utf-8"
html/body
html/body/h1    Example heading     class="main"
html/body/p Example body, with a class and id       class="class-1 class-2" id="example-body"
html/body/p/br      line break and some text
html/body/div
html/body/div/ol
html/body/div/ol/li List item 1
html/body/div/ol/li List item 2
html/!comment   [if lt IE 7]><style type="text/css">body{behavior:url("/w/static/1.26wmf9/skins/Vector/csshover.min.htc")}</style><![endif]

Format

Each tag is written on its own line as TAB-separated values, in this order:

  • Path and name - html/body/h1
  • Text content - Example heading
  • Footer content (text that follows this tag's closing tag, at the same level inside the parent)
  • Attributes, e.g. id, class
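
The four fields above can be illustrated with a minimal sketch built on Python's standard xml.etree.ElementTree. This is not xflat's own code or API: the flatten function is a hypothetical helper, it only handles well-formed XML (so the br tag is written as self-closing), and the attribute rendering (sorted name="value" pairs separated by spaces) is an assumption based on the sample output.

```python
import xml.etree.ElementTree as ET


def flatten(element, parent_path=""):
    """Yield one TAB-separated line per element, depth first.

    Hypothetical helper for illustration; not part of xflat.
    """
    path = f"{parent_path}/{element.tag}" if parent_path else element.tag
    # Text content: text between the opening tag and the first child.
    text = (element.text or "").strip()
    # Footer content: text after the closing tag, inside the parent.
    tail = (element.tail or "").strip()
    # Assumption: attributes sorted by name and rendered as name="value".
    attrs = " ".join(f'{k}="{v}"' for k, v in sorted(element.attrib.items()))
    yield "\t".join([path, text, tail, attrs])
    for child in element:
        yield from flatten(child, path)


doc = ET.fromstring(
    '<body><h1 class="main">Example heading </h1>'
    '<p id="example-body">Example body<br/>line break and some text</p></body>'
)
lines = list(flatten(doc))
for line in lines:
    print(line)
```

In this sketch the text "line break and some text" lands in the footer field of body/p/br rather than in the text field of body/p, which mirrors the html/body/p/br line in the sample output.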

Release history

  • 0.0.3 (this version)
  • 0.0.2
  • 0.0.1

Download files

  • Filename - xflat-0.0.3.tar.gz (3.8 kB)
  • File type - Source
  • Python version - None
  • Upload date - Sep 28, 2015
