Export

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 5 - Production/Stable
Intended Audience
- Developers
Natural Language
- English
Programming Language

Project description

Export CSV To Influx

Export CSV To Influx: Process CSV data, and export the data to influx db

Install

Use the pip to install the library. Then the binary export_csv_to_influx is ready.

pip install ExportCsvToInflux

Features

[Highlight :star2::tada::heart_eyes:] Allow to use binary export_csv_to_influx to run exporter
[Highlight :star2::tada::heart_eyes:] Allow to check dozens of csv files in a folder
[Highlight :star2::tada::heart_eyes::confetti_ball::four_leaf_clover::balloon:] Auto convert csv data to int/float/string in Influx
[Highlight :star2::tada::heart_eyes:] Allow to match or filter the data by using string or regex.
[Highlight :star2::tada::heart_eyes:] Allow to count, and generate count measurement
Allow to limit string length in Influx
Allow to judge the csv has new data or not
Allow to use the latest file modify time as time column
Auto Create database if not exist
Allow to drop database before inserting data
Allow to drop measurements before inserting data

Command Arguments

You could use export_csv_to_influx -h to see the help guide.

-c, --csv: Input CSV file path, or the folder path. Mandatory

-db, --dbname: InfluxDB Database name. Mandatory

-m, --measurement: Measurement name. Mandatory

-fc, --field_columns: List of csv columns to use as fields, separated by comma. Mandatory

-d, --delimiter: CSV delimiter. Default: ','.

-lt, --lineterminator: CSV lineterminator. Default: '\n'.

-s, --server: InfluxDB Server address. Default: localhost:8086.

-u, --user: InfluxDB User name. Default: admin

-p, --password: InfluxDB Password. Default: admin

-t, --time_column: Timestamp column name. Default column name: timestamp.

If no timestamp column, the timestamp is set to the last file modify time for whole csv rows.

Note: Also support the pure timestamp, like: 1517587275. Auto detected.

-tf, --time_format: Timestamp format. Default: '%Y-%m-%d %H:%M:%S' e.g.: 1970-01-01 00:00:00.

-tz, --time_zone: Timezone of supplied data. Default: UTC.

-tc, --tag_columns: List of csv columns to use as tags, separated by comma. Default: None

-b, --batch_size: Batch size when inserting data to influx. Default: 500.

-lslc, --limit_string_length_columns: Limit string length column, separated by comma. Default: None.

-ls, --limit_length: Limit length. Default: 20.

-dd, --drop_database: Drop database before inserting data. Default: False.

-dm, --drop_measurement: Drop measurement before inserting data. Default: False.

-mc, --match_columns: Match the data you want to get for certain columns, separated by comma. Match Rule: All matches, then match. Default: None.

-mbs, --match_by_string: Match by string, separated by comma. Default: None.

-mbr, --match_by_regex: Match by regex, separated by comma. Default: None.

-fic, --filter_columns: Filter the data you want to filter for certain columns, separated by comma. Filter Rule: Any one filter success, the filter. Default: None.

-fibs, --filter_by_string: Filter by string, separated by comma. Default: None.

-fibr, --filter_by_regex: Filter by regex, separated by comma. Default: None.

-ecm, --enable_count_measurement: Enable count measurement. Default: False.

-fi, --force_insert_even_csv_no_update: Force insert data to influx, even csv no update. Default: False.

-fsc, --force_string_columns: Force columns as string type, seperated as comma. Default: None

-fintc, --force_int_columns: Force columns as int type, seperated as comma. Default: None

-ffc, --force_float_columns: Force columns as float type, seperated as comma. Default: None

Note:

You could pass * to --field_columns to match all the fields: --field_columns=*, --field_columns '*'

CSV data won't insert into influx again if no update. Use to force insert: --force_insert_even_csv_no_update=True, --force_insert_even_csv_no_update True

If some csv cells have no value, auto fill the influx db based on column data type: int: -999, float: -999.0, string: -

Programmatically

Also, we could run the exporter programmatically.

from ExportCsvToInflux import ExporterObject

exporter = ExporterObject()
exporter.export_csv_to_influx(...)

# You could get the export_csv_to_influx parameter details by:
print(exporter.export_csv_to_influx.__doc__)

Sample

Here is the demo.csv.

timestamp,url,response_time
2019-07-11 02:04:05,https://jmeter.apache.org/,1.434
2019-07-11 02:04:06,https://jmeter.apache.org/,2.434
2019-07-11 02:04:07,https://jmeter.apache.org/,1.200
2019-07-11 02:04:08,https://jmeter.apache.org/,1.675
2019-07-11 02:04:09,https://jmeter.apache.org/,2.265
2019-07-11 02:04:10,https://sample-demo.org/,1.430
2019-07-12 08:54:13,https://sample-show.org/,1.300
2019-07-12 14:06:00,https://sample-7.org/,1.289
2019-07-12 18:45:34,https://sample-8.org/,2.876

Command to export whole data into influx:

export_csv_to_influx \
--csv demo.csv \
--dbname demo \
--measurement demo \
--tag_columns url \
--field_columns response_time \
--user admin \
--password admin \
--force_insert_even_csv_no_update True \
--server 127.0.0.1:8086

Command to export whole data into influx, but: drop database

export_csv_to_influx \
--csv demo.csv \
--dbname demo \
--measurement demo \
--tag_columns url \
--field_columns response_time \
--user admin \
--password admin \
--server 127.0.0.1:8086 \
--force_insert_even_csv_no_update True \
--drop_database=True

Command to export part of data: timestamp matches 2019-07-12 and url matches sample-\d+

export_csv_to_influx \
--csv demo.csv \
--dbname demo \
--measurement demo \
--tag_columns url \
--field_columns response_time \
--user admin \
--password admin \
--server 127.0.0.1:8086 \
--drop_database=True \
--force_insert_even_csv_no_update True \
--match_columns=timestamp,url \
--match_by_reg='2019-07-12,sample-\d+'

Filter part of data, and the export into influx: url filter sample

export_csv_to_influx \
--csv demo.csv \
--dbname demo \
--measurement demo \
--tag_columns url \
--field_columns response_time \
--user admin \
--password admin \
--server 127.0.0.1:8086 \
--drop_database True \
--force_insert_even_csv_no_update True \
--filter_columns timestamp,url \
--filter_by_reg 'sample'

Enable count measurement. A new measurement named: demo.count generated, with match: timestamp matches 2019-07-12 and url matches sample-\d+

export_csv_to_influx \
--csv demo.csv \
--dbname demo \
--measurement demo \
--tag_columns url \
--field_columns response_time \
--user admin \
--password admin \
--server 127.0.0.1:8086 \
--drop_database True \
--force_insert_even_csv_no_update True \
--match_columns timestamp,url \
--match_by_reg '2019-07-12,sample-\d+' \
--enable_count_measurement True

The count measurement is:

select * from "demo.count"

name: demo.count
time                match_timestamp match_url total
----                --------------- --------- -----
1562957134000000000 3               2         9

Special Thanks

The lib is inspired by: https://github.com/fabio-miranda/csv-to-influxdb

For more info, please refer to the https://github.com/Bugazelle/export-csv-to-influx

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 5 - Production/Stable
Intended Audience
- Developers
Natural Language
- English
Programming Language

Release history Release notifications | RSS feed

0.2.2

Mar 14, 2022

0.2.1

Mar 13, 2022

0.1.25

Oct 7, 2020

This version

0.1.24

Mar 18, 2020

0.1.23

Feb 20, 2020

0.1.22

Feb 15, 2020

0.1.21

Feb 14, 2020

0.1.20

Nov 23, 2019

0.1.19

Oct 15, 2019

0.1.18

Aug 29, 2019

0.1.17

Aug 10, 2019

0.1.16

Jul 27, 2019

0.1.15

Jul 19, 2019

0.1.14

Jul 19, 2019

0.1.13

Jul 19, 2019

0.1.11

Jul 17, 2019

0.1.10

Jul 14, 2019

0.1.9

Jul 14, 2019

0.1.8

Jul 12, 2019

0.1.7

Jul 12, 2019

0.1.6

Jul 12, 2019

0.1.5

Jul 12, 2019

0.1.4

Jul 12, 2019

0.1.3

Jul 12, 2019

0.1.2

Jul 12, 2019

0.1.1

Jul 12, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ExportCsvToInflux-0.1.24.tar.gz (18.2 kB view hashes)

Uploaded Mar 18, 2020 Source

Hashes for ExportCsvToInflux-0.1.24.tar.gz

Hashes for ExportCsvToInflux-0.1.24.tar.gz
Algorithm	Hash digest
SHA256	`ccf77f89f01c9edf00248435da0a236be96f48131b9f321894bc2b3a98de2893`
MD5	`b762d11d065edbf16693c6689ff1ab9e`
BLAKE2b-256	`8e190ffc546e23d484236877cf4f041ffbbe246de0489a93aa17493a6dad55fd`