Skip to main content

Singer.io target for extracting data

Project description

target-mako

This is a Singer target that reads JSON-formatted data following the Singer spec.

This target uses Mako templating engine to generate output files containing the stream data.

This target

  • loads the configured templates using Mako templating language
  • write the configured header (if any) into each output files (Header uses the first record values)
  • receive the streamed records
  • for each record, write one block rendered from the template (using the record data) in each output file
  • write the configured footer (if any) into each output files (Footer uses the last record values)

Build status codecov Python 3.7 Python 3.8 Python 3.9

Sources on Github

Package on PyPI

Accessing data in templates:

  • The record data can be used in the template with (Null/None values will be safely displayed as empty string)

      ${record.attribute.sub-attribute}
    
  • The schema data can be used in the template with (Null/None values will be safely displayed as empty string)

      ${schema.attribute.sub-attribute}
    
  • Some Internal data are provided by the target module:

    • record index is counting records starting from 0

        ${record.record_index}
      
    • Record number is counting records starting from 1

        ${record.record_number}
      

Formatting functions

Some formatting function exists to help create your templates

  • Protect against Null (Python None) values

First parameter is the data to format (example: record.attribute.sub-attribute)

    ${functions['null_safe'](record.attribute.sub-attribute)}
  • Specify a fixed size for strings

First parameter is the data to format (example: record.attribute.sub-attribute)

Second parameter is the required size (integer)

    ${functions['fixed_size'](record.attribute.sub-attribute, 27)}
  • Specify a fixed size for strings (align left)

First parameter is the data to format (example: record.attribute.sub-attribute)

Second parameter is the required size (integer)

    ${functions['lfixed'](record.attribute.sub-attribute, 27)}
  • Specify a fixed size for strings (align rigth)

First parameter is the data to format (example: record.attribute.sub-attribute)

Second parameter is the required size (integer)

    ${functions['rfixed'](record.attribute.sub-attribute, 27)}
  • Specify a fixed size for numbers (by adding 0 before the passed value)

First parameter is the data to format (example: record.attribute.sub-attribute)

Second parameter is the required size (integer)

    ${functions['nfixed '](record.attribute.sub-attribute, 27)}
  • JSON Date formatting

First parameter is the data to format (must be a JSON date YYYY-mm-dd)

Second parameter is the output format in Python date-time format

    ${functions['format_json_date'](record.attribute.sub-attribute,'%d%m%Y')}
  • Date formatting

First parameter is the data to format

Second parameter is the input format in Python date-time format

Third parameter is the output format in Python date-time format

    ${functions['format_date'](record.attribute.sub-attribute, '%d/%m/%Y','%d%m%Y')}
  • Lower letters

First parameter is the data to format

    ${functions['lower'](record.attribute.sub-attribute)}
  • Upper letters

First parameter is the data to format

    ${functions['upper'](record.attribute.sub-attribute)}

Config file description:

Here is a sample config file:

   {
     "disable_collection": true,
     "cache_template_dir": "/temp/mako_modules",
     "template_dir": "../default-templates",
     "output_dir": "../default-output",
     "template_list": [
     ],
     "stream_configs": {
       "sample": {
         "template_dir": "../templates",
         "output_dir": "../output",
         "template_list": [
           {
             "header_template_name": "csv/sample_header.template.csv",
             "data_template_name": "csv/sample.template.csv",
             "footer_template_name": "",
             "output_file_name": "sample.csv",
             "output_file_encoding": "utf8",
             "output_file_EOL": "\r\n"
           },
           {
             "header_template_name": "json/sample_header.template.json",
             "data_template_name": "json/sample.template.json",
             "footer_template_name": "json/sample_footer.template.json",
             "output_file_name": "sample{record_index}.json",
             "output_file_encoding": "utf8",
             "output_file_EOL": "\r\n",
             "one_file_per_record": true
           },
       }
     }
   }

First part is "global configuration":

  • "disable_collection" optional, to disable sending usage statistic to Singer platform.
  • "cache_template_dir" path to directory that will be used to cache templates content for Mako engine.

Second part is default configuration for all streams:

  • "template_dir" : path to the directory that contains the template files.
  • "output_dir" : path to the directory that where the output files will be created.
  • "template_list" : list of template for file generation. Each element of the list contains:
    • "header_template_name" : file name of the template file for header (can be empty).It is used once per stream at the beginning.
    • "data_template_name" : file name of the template file for record. It is used once per record in the stream.
    • "footer_template_name" : file name of the template file for footer (can be empty). It is used once at the end of the stream.
    • "output_file_name" : the name of the generated file (can contain variables in case of multiple file generation)
    • "output_file_encoding" : Optionnal the generated file encoding. Default value is "utf8". For possible value please refer to https://docs.python.org/3/library/io.html?highlight=newline#io.TextIOWrapper
    • "output_file_EOL" : Optionnal the generated file End Of Line. Default Value is "\r\n". For possible values please refer to https://docs.python.org/3/library/io.html?highlight=newline#io.TextIOWrapper
    • "one_file_per_record" : boolean, if true, the target will generate one file per record in the stream, else it will generate one file containing all records (repeating the "data_template_name").

Third part is stream specific configuration:

expected structure is:

   "stream_configs": {
       <stream-id1> : {},
       <stream-id2> : {}
   }

All values from second part (Default values) can be overridden for each stream.


Copyright © 2020 elebail

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

target-mako-1.2.0.tar.gz (24.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

target_mako-1.2.0-py3-none-any.whl (23.9 kB view details)

Uploaded Python 3

File details

Details for the file target-mako-1.2.0.tar.gz.

File metadata

  • Download URL: target-mako-1.2.0.tar.gz
  • Upload date:
  • Size: 24.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.7.1

File hashes

Hashes for target-mako-1.2.0.tar.gz
Algorithm Hash digest
SHA256 dcab7dbfb63a35d3bde8a43d75c98dfd948dc76b3ca877f3cd43e0f7da677f17
MD5 c4d2b02ba06eaa46b491acc272ec72fd
BLAKE2b-256 8752664d79851221509fb828387053b2e4adc235df727a11ddfb4d2e4ca6d6b3

See more details on using hashes here.

File details

Details for the file target_mako-1.2.0-py3-none-any.whl.

File metadata

  • Download URL: target_mako-1.2.0-py3-none-any.whl
  • Upload date:
  • Size: 23.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.7.1

File hashes

Hashes for target_mako-1.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 2dec4b549acd334f8c4a86a311be06787791e4492a0c8cbff2d9f969486a2731
MD5 14ce735138bd68a84443c0a96a671760
BLAKE2b-256 ecae3f5029def5f0958cd98c477d774af36643c36323281472497b5ecdfc0c3d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page