Skip to main content

This package extracts xml metadata from FITS output to a csv or xlsx file

Project description

FITSxtractor - The metadata extractor

With FITSxtractor, you can extract metadata from XML files (generated by FITS) and save them in a .cvs or .xlsx file by calling the program from the command line.

The code is made for the XML tree structure that the Harvard tool 'FITS' (File Information Tool Set) outputs. Pay a visit to their Website for more information about their project and have a look at the Documentation regarding the output format.

This Project has been brought to life with the help of the AfZ (Archive of Contemporary History) at ETH Zürich.

Overview

The FITS metadata extractor was written to be implemented as a tool for the simplifaction of digital long term archiving workflows.

A key aspect in archiving is keeping track of the most usefull metadata while not overshooting with information. The extractor uses the vast extensive output, that FITS provides and saves only the following metadata:

  • identification
    • identity
      • externalIdentifier
    • mimetype
    • format
  • fileinfo
    • filepath
    • md5checksum
    • size
  • filestatus
    • well-formed
    • well-formed status

More metadata will be added to the program if needed. By creating a table from the metadata batch, the tool serves as a key component for an interface that keeps track of the data. Furthermore the user is able to work with the metadata by using MS Excel or other programs. This enables more in depth analysis of the recieved data.

Guide

The following shows how to get and use FITSxtractor.

Installation

$ pip install FITSxtractor

Concider that you might have to add the installation folder to your PATH.

If you would rather like to customize the code to your needs, grab a stable version under "Releases". All the files are extensively commented as well, in order to make the files more user firendly.

Usage

When in a terminal specify:

$ FITSxtractor path_to_dir --output path_to_outputfile

The programm takes a directory, which is populated with at least one (!) FITS XML output file and takes an output file location as a positional argument.

Accepted output formats are files ending with a .csv or .xlsx extension. If none are given, the program defaults to .xlsx.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

FITSxtractor-0.9.2.tar.gz (6.9 kB view details)

Uploaded Source

Built Distribution

FITSxtractor-0.9.2-py3-none-any.whl (8.0 kB view details)

Uploaded Python 3

File details

Details for the file FITSxtractor-0.9.2.tar.gz.

File metadata

  • Download URL: FITSxtractor-0.9.2.tar.gz
  • Upload date:
  • Size: 6.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.6.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.9.6

File hashes

Hashes for FITSxtractor-0.9.2.tar.gz
Algorithm Hash digest
SHA256 bd1ff2ec963736d387687d50745077f9728c3be87481869835eb037b21da08b7
MD5 e97e7163a107da2ea5512cf08f22d923
BLAKE2b-256 4f189489a89adcc93bdb501be6844303e76685863836a48531f49957f5142c92

See more details on using hashes here.

File details

Details for the file FITSxtractor-0.9.2-py3-none-any.whl.

File metadata

  • Download URL: FITSxtractor-0.9.2-py3-none-any.whl
  • Upload date:
  • Size: 8.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.6.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.9.6

File hashes

Hashes for FITSxtractor-0.9.2-py3-none-any.whl
Algorithm Hash digest
SHA256 86529866255633dbe6347a24cfd930aacdfb3210fd4e827f27bc7d7aaeefcc5e
MD5 6b2fc05013a032c2986373d01e679e86
BLAKE2b-256 e0cb8cddc222b3df5db5e5d84cc42fa6aab02d4a3bf79fb644f7998d8e773549

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page