Skip to main content

Convert sas7bdat and xport files into other formats

Project description

sas7bdat-converter: Convert sas7bdat files into other formats

Tests Status pre-commit.ci status Coverage PyPI version PyPI - Python Version

Converts proprietary sas7bdat and/or xport files from SAS into formats such as csv, json, and Excel useable by other programs. Currently supported conversiaions are csv, Excel (xlsx format), json, Pandas DataFrame, Parquet and XML.

Conversions can be done on either a single file, an entire directory, or a batch of specified files.

Install

pip install sas7bdat-converter

If you would like to be able to convert to Excel files you will need to install with the extra Excel dependency.

pip install sas7bdat-converter[excel]

If you would like to be able to convert to Parquet files you will need to install with the extra parquet dependency.

pip install sas7bdat-converter[parquet]

If you would like to use Conda, it includes both the extras required to convert to Excel & Parquet files.

conda install -c conda-forge sas7bdat-converter

Usage

In all cases either sas7bdat or xport files can be converted. Examples below all use the .sas7bdat extension, xport files with a .xpt extension will also work.

  • batch_to_csv(file_dicts) - Convert multiple sas7bdat files into csv files at once.

    • file_dicts = A list containing a dictionary for each file to convert. The dictionary is required to contain 'sas7bdat_file' containing the path and name for the sas7bdat file, and 'export_file' containing the path and name for the csv files. The csv file extension should be .csv. File paths can be sent as either strings or Path objects.
    • continue_on_error = If set to true processing of files in a batch will continue if there is a file conversion error instead of raising an exception. Default = False

    Example

    import sas7bdat_converter
    
    file_dicts = [
      {
        'sas7bdat_file': '/path/to/sas7bdat/files/example_1.sas7bdat',
        'export_file': '/path/to/new/files/example_1.csv',
      },
      {
        'sas7bdat_file': '/path/to/sas7bdat/files/example_2.sas7bdat',
        'export_file': '/path/to/new/files/example_2.csv',
      },
    ]
    sas7bdat_converter.batch_to_csv(file_dicts)
    

    Note: Example uses Mac/Linux type file paths. For Windows use paths like c:\path\to\sas7bdat\files\example_1.sas7bdat.

  • batch_to_excel(file_dicts) - Convert multiple sas7bdat files into Excel files at once.

    • file_dicts = A list containing a dictionary for each file to convert. The dictionary is required to contain 'sas7bdat_file' containing the path and name for the sas7bdat file, and 'export_file' containing the path and name for the excel files. The Excel file extension should be .xlsx. File paths can be sent as either strings or Path objects.
    • continue_on_error = If set to true processing of files in a batch will continue if there is a file conversion error instead of raising an exception. Default = False

    Example

    import sas7bdat_converter
    
    file_dicts = [
      {
        'sas7bdat_file': '/path/to/sas7bdat/files/example_1.sas7bdat',
        'export_file': '/path/to/new/files/example_1.xlsx',
      },
      {
        'sas7bdat_file': '/path/to/sas7bdat/files/example_2.sas7bdat',
        'export_file': '/path/to/new/files/example_2.xlsx',
      },
    ]
    sas7bdat_converter.batch_to_excel(file_dicts)
    

    Note: Example uses Mac/Linux type file paths. For Windows use paths like c:\path\to\sas7bdat\files\example_1.sas7bdat.

  • batch_to_parquet(file_dicts) - Convert multiple sas7bdat files into Parquet files at once.

    • file_dicts = A list containing a dictionary for each file to convert. The dictionary is required to contain 'sas7bdat_file' containing the path and name for the sas7bdat file, and 'export_file' containing the path and name for the perquet files. The Perquet file extension should be .perquet. File paths can be sent as either strings or Path objects.
    • continue_on_error = If set to true processing of files in a batch will continue if there is a file conversion error instead of raising an exception. Default = False

    Example

    import sas7bdat_converter
    
    file_dicts = [
      {
        'sas7bdat_file': '/path/to/sas7bdat/files/example_1.sas7bdat',
        'export_file': '/path/to/new/files/example_1.parquet',
      },
      {
        'sas7bdat_file': '/path/to/sas7bdat/files/example_2.sas7bdat',
        'export_file': '/path/to/new/files/example_2.parquet',
      },
    ]
    sas7bdat_converter.batch_to_parquet(file_dicts)
    

    Note: Example uses Mac/Linux type file paths. For Windows use paths like c:\path\to\sas7bdat\files\example_1.sas7bdat.

  • batch_to_json(file_dicts) - Convert multiple sas7bdat files into json files at once.

    • file_dicts = A list containing a dictionary for each file to convert. The dictionary is required to contain 'sas7bdat_file' containing the path and name for the sas7bdat file, and 'export_file' containing the path and name for the json files. The json file extension should be .json. File paths can be sent as either strings or Path objects.
    • continue_on_error = If set to true processing of files in a batch will continue if there is a file conversion error instead of raising an exception. Default = False

    Example

    import sas7bdat_converter
    
    file_dicts = [
      {
        'sas7bdat_file': '/path/to/sas7bdat/files/example_1.sas7bdat',
        'export_file': '/path/to/new/files/example_1.json',
      },
      {
        'sas7bdat_file': '/path/to/sas7bdat/files/example_2.sas7bdat',
        'export_file': '/path/to/new/files/example_2.json',
      },
    ]
    sas7bdat_converter.batch_to_json(file_dicts)
    

    Note: Example uses Mac/Linux type file paths. For Windows use paths like c:\path\to\sas7bdat\files\example_1.sas7bdat.

  • batch_to_xml(file_dicts) - Convert multiple sas7bdat files into XML files at once.

    • file_dicts = A list containing a dictionary for each file to convert. The dictionary is required to contain 'sas7bdat_file' containing the path and name for the sas7bdat file, and 'export_file' containing the path and name for the xml files. The XML file extension should be .xml. File paths can be sent as either strings or Path objects.
    • continue_on_error = If set to true processing of files in a batch will continue if there is a file conversion error instead of raising an exception. Default = False

    Example

    import sas7bdat_converter
    
    file_dicts = [
      {
        'sas7bdat_file': '/path/to/sas7bdat/files/example_1.sas7bdat',
        'export_file': '/path/to/new/files/example_1.xml',
      },
      {
        'sas7bdat_file': '/path/to/sas7bdat/files/example_2.sas7bdat',
        'export_file': '/path/to/new/files/example_2.xml',
      },
    ]
    sas7bdat_converter.batch_to_xml(file_dicts)
    

    Note: Example uses Mac/Linux type file paths. For Windows use paths like c:\path\to\sas7bdat\files\example_1.sas7bdat.

  • dir_to_csv(dir_path, export_path=None) - Convert all sas7bdat files in a directory into csv files at once. File paths can be sent as either strings or Path objects.

    • dir_path = The dictionary that contains the sas7bdat file to convert.
    • export_path = Optional path for the converted files. If no path is supplied the new files will be put into the dir_path directory with the sas7bdat files. File paths can be sent as either strings or Path objects. Default = None
    • continue_on_error = If set to true processing of files in a batch will continue if there is a file conversion error instead of raising an exception. Default = False

    Example

    import sas7bdat_converter
    
    # Option 1: put the converted files in the same directory as the sas7bdat files
    sas7bdat_converter.dir_to_csv('/path/to/sas7bdat/files')
    
    # Option 2: put the converted fiels in a diffferent directory
    sas7bdat_converter.dir_to_csv('/path/to/sas7bdat/files', 'path/for/new/files')
    

    Note: Example uses Mac/Linux type file paths. For Windows use paths like c:\path\to\sas7bdat\files.

  • dir_to_excel(dir_path, export_path=None) - Convert all sas7bdat files in a directory into Excel files at once. File paths can be sent as either strings or Path objects.

    • dir_path = The dictionary that contains the sas7bdat file to convert.
    • export_path = Optional path for the converted files. If no path is supplied the new files will be put into the dir_path directory with the sas7bdat files Default = None
    • continue_on_error = If set to true processing of files in a batch will continue if there is a file conversion error instead of raising an exception. Default = False

    Example

    import sas7bdat_converter
    
    # Option 1: put the converted files in the same directory as the sas7bdat files
    sas7bdat_converter.dir_to_excel('/path/to/sas7bdat/files')
    
    # Option 2: put the converted fiels in a diffferent directory
    sas7bdat_converter.dir_to_excel('/path/to/sas7bdat/files', 'path/for/new/files')
    

    Note: Example uses Mac/Linux type file paths. For Windows use paths like c:\path\to\sas7bdat\files.

  • dir_to_json(dir_path, export_path=None) - Convert all sas7bdat files in a directory into json files at once. File paths can be sent as either strings or Path objects.

    • dir_path = The dictionary that contains the sas7bdat file to convert.
    • export_path = Optional path for the converted files. If no path is supplied the new files will be put into the dir_path directory with the sas7bdat files. Default = None
    • continue_on_error = If set to true processing of files in a batch will continue if there is a file conversion error instead of raising an exception. Default = False

    Example

    import sas7bdat_converter
    
    # Option 1: put the converted files in the same directory as the sas7bdat files
    sas7bdat_converter.dir_to_json('/path/to/sas7bdat/files')
    
    # Option 2: put the converted fiels in a diffferent directory
    sas7bdat_converter.dir_to_json('/path/to/sas7bdat/files', 'path/for/new/files')
    

    Note: Example uses Mac/Linux type file paths. For Windows use paths like c:\path\to\sas7bdat\files.

  • dir_to_xml(dir_path, export_path=None) - Convert all sas7bdat files in a directory into XML files at once. File paths can be sent as either strings or Path objects.

    • dir_path = The dictionary that contains the sas7bdat file to convert.
    • export_path = Optional path for the converted files. If no path is supplied the new files will be put into the dir_path directory with the sas7bdat files. Default = None
    • continue_on_error = If set to true processing of files in a batch will continue if there is a file conversion error instead of raising an exception. Default = False

    Example

    import sas7bdat_converter
    
    # Option 1: put the converted files in the same directory as the sas7bdat files
    sas7bdat_converter.dir_to_xml('/path/to/sas7bdat/files')
    
    # Option 2: put the converted fiels in a diffferent directory
    sas7bdat_converter.dir_to_xml('/path/to/sas7bdat/files', 'path/for/new/files')
    

    Note: Example uses Mac/Linux type file paths. For Windows use paths like c:\path\to\sas7bdat\files.

  • to_csv(sas7bdat_file, export_file) - convert a sas7bdat file into a csv file. File path can be sent as either a string or Path objects.

    • sas7bdat_file = the path and name for sas7bdat file to convert.
    • export_file = the path and name for the csv file. The csv file extension should be .csv.

    Example

    import sas7bdat_converter
    
    sas7bdat_converter.to_csv('/path/to/sas7bdat/file/example.sas7bdat', 'path/to/new/file/example.csv')
    

    Note: Example uses Mac/Linux type file paths. For Windows use paths like c:\path\to\sas7bdat\files\example.sas7bdat.

  • to_dataframe(sas7bdat_file) - Convert a sas7bdat file into a Pandas DataFrame. File path can be sent as either a string or Path objects.

    • sas7bdat_file = The path and name for sas7bdat file to convert.

    Example

    import sas7bdat_converter
    
    sas7bdat_converter.to_dataframe('/path/to/sas7bdat/file/example.sas7bdat')
    

    Note: Example uses Mac/Linux type file paths. For Windows use paths like c:\path\to\sas7bdat\files\example_1.sas7bdat.

  • to_excel(sas7bdat_file, export_file) - convert a sas7bdat file into a Excel file. File path can be sent as either a string or Path objects.

    • sas7bdat_file = the path and name for sas7bdat file to convert.
    • export_file = the path and name for the Excel file. The Excel file extension should be .xlsx.

    Example

    import sas7bdat_converter
    
    sas7bdat_converter.to_excel('/path/to/sas7bdat/file/example.sas7bdat',
    'path/to/new/file/example.xlsx')
    

    Note: Example uses Mac/Linux type file paths. For Windows use paths like c:\path\to\sas7bdat\files\example.sas7bdat.

  • to_parquet(sas7bdat_file, export_file) - convert a sas7bdat file into a Parquet file. File path can be sent as either a string or Path objects.

    • sas7bdat_file = the path and name for sas7bdat file to convert.
    • export_file = the path and name for the Parquet file. The Parquet file extension should be .parquet.

    Example

    import sas7bdat_converter
    
    sas7bdat_converter.to_parquet('/path/to/sas7bdat/file/example.sas7bdat',
    'path/to/new/file/example.parquet')
    

    Note: Example uses Mac/Linux type file paths. For Windows use paths like c:\path\to\sas7bdat\files\example.sas7bdat.

  • to_json(sas7bdat_file, export_file) - convert a sas7bdat file into a json file. File path can be sent as either a string or Path objects.

    • sas7bdat_file = the path and name for sas7bdat file to convert.
    • export_file = the path and name for the json file. the json file extension should be .json.

    Example

    import sas7bdat_converter
    
    sas7bdat_converter.to_json('/path/to/sas7bdat/file/example.sas7bdat', 'path/to/new/file/example.json')
    

    Note: Example uses Mac/Linux type file paths. For Windows use paths like c:\path\to\sas7bdat\files\example.sas7bdat.

  • to_xml(sas7bdat_file, export_file, root_node='root', first_node='item') - convert a sas7bdat file into a XML file. File path can be sent as either a string or Path objects.

    • sas7bdat_file = the path and name for sas7bdat file to convert.
    • export_file = the path and name for the XML file. The XML file extension should be .xlm.
    • root_node = The name to uses for the top level node. If no name is supplied "root" will be used.
    • first_node = The name to use for the first node under root. If no name is supplied "item" will be used.

    Example

    import sas7bdat_converter
    
    sas7bdat_converter.to_xml('/path/to/sas7bdat/file/example.sas7bdat', 'path/to/new/file/example.xml')
    

    Note: Example uses Mac/Linux type file paths. For Windows use paths like c:\path\to\sas7bdat\files\example.sas7bdat.

Contributing

If you are interested in contributing to this project please see our contributing guide

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sas7bdat_converter-2.0.1.tar.gz (9.1 kB view details)

Uploaded Source

Built Distribution

sas7bdat_converter-2.0.1-py3-none-any.whl (7.9 kB view details)

Uploaded Python 3

File details

Details for the file sas7bdat_converter-2.0.1.tar.gz.

File metadata

  • Download URL: sas7bdat_converter-2.0.1.tar.gz
  • Upload date:
  • Size: 9.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.11.6 Linux/6.2.0-1015-azure

File hashes

Hashes for sas7bdat_converter-2.0.1.tar.gz
Algorithm Hash digest
SHA256 3936327f185ac92e61288ed0ba2f6887cad8a2c406f2f7d53b818e3386c8adfa
MD5 521b6c2c08d630fad8af4d0139cfb048
BLAKE2b-256 a68ed54ce30c63cabed7e6460ddddb61cb2498b260c3a2f994a44d09218548c6

See more details on using hashes here.

File details

Details for the file sas7bdat_converter-2.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for sas7bdat_converter-2.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 c2cf46253c0cf7e16cf4a83e3ad15dfde5ba8aee8f98536273a5e062934aa7ee
MD5 cd0e183053dc7ffb2df313364e6c66ef
BLAKE2b-256 732e1e44b74df4ae406962819a6bfa98bdaf7886dc2d226f734abedc1cc4cfa8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page