Skip to main content

Package to convert xml to Pandas dataframe (flattens each and every xml element to dataframe column)

Project description

XML to Pandas Dataframe

Flattens out nested xml to individual columns in dataframe

Sample input.xml
<bookstore>
<book category="COOKING">
    <title lang="en">Everyday Italian</title>
    <author>Giada De Laurentiis</author>
    <year>2005</year>
    <price>30.00</price>
    <values>
      <value id="300">1</value>
      <value id="100">5</value>
      <value id="200">2</value>
    </values>
  </book>
  <book category="CHILDREN">
    <title lang="en" language="hello">Harry Potter</title>
    <author>J K. Rowling</author>
    <cricketers>
        <cricketer1>Praveen</cricketer1>
        <cricketer2>Pathan</cricketer2>
        <country>india</country>
        <country>india1</country>
      </cricketers>
    <year>2005</year>
    <price>29.99</price>
    <values>
      <value></value>
      <value></value>
    </values>
  </book>
  <book category="WEB">
    <title lang="en">Learning XML</title>
    <author>Erik T. Ray</author>
    <year>2003</year>
    <price>39.95</price>
  </book>
</bookstore>

df = xml_to_df.convert_xml_to_df("input.xml")

df.head()

Output dataframe
book_category book_title_lang book_title book_author book_year book_price book_values_value_id book_values_value book_title_language book_cricketers_cricketer1 book_cricketers_cricketer2 book_cricketers_country
COOKING en Everyday Italian Giada De Laurentiis 2005 30.00 [300, 100, 200] [1, 5, 2] NaN NaN NaN NaN
CHILDREN en Harry Potter J K. Rowling 2005 29.99 NaN NaN hello Praveen Pathan [india, india1]
WEB en Learning XML Erik T. Ray 2003 39.95 NaN NaN NaN NaN NaN NaN

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

xml_to_df-0.0.6.tar.gz (2.6 kB view details)

Uploaded Source

Built Distribution

xml_to_df-0.0.6-py3-none-any.whl (3.5 kB view details)

Uploaded Python 3

File details

Details for the file xml_to_df-0.0.6.tar.gz.

File metadata

  • Download URL: xml_to_df-0.0.6.tar.gz
  • Upload date:
  • Size: 2.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4

File hashes

Hashes for xml_to_df-0.0.6.tar.gz
Algorithm Hash digest
SHA256 226879c8ab85d10aca1cf1423e426f7279fbdb729c3b1743fba7165d8522be67
MD5 c0ab3d17439b60a21f701fb737243923
BLAKE2b-256 89dcdc20385db5dba15b1b7858e352e0a8221f2327934baa6721260bfe0c4fde

See more details on using hashes here.

File details

Details for the file xml_to_df-0.0.6-py3-none-any.whl.

File metadata

  • Download URL: xml_to_df-0.0.6-py3-none-any.whl
  • Upload date:
  • Size: 3.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4

File hashes

Hashes for xml_to_df-0.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 b079bca6ef42d3e5d2606c8f5494648011fdea1419b5dd91bcd0b671b98f329c
MD5 a3151640a50a7abb70854ca36b56c02f
BLAKE2b-256 ffe4dfecaafb54cb8a0c27e73bad8a9912c0318976e5ddad02bf8d0e1bf6e52c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page