Connor's Lightning Parser Lib (LPL)

Connor's Lightning Parser Lib (LPL) is an extremely powerful analysis utility with a simplistic front-end in mind.

The analyzer can process millions of LYLOUT datapoints in minutes. It uses a SQL database back-end for initial filtering, then replaces computationally expensive distance calculations between points with optimized work-arounds that omit square-root and trig functions. Its back-end also parses most data as indexes (list[int], list[list[int]], etc.) instead of the entire data itself, and it uses multi-processing when necessary to accelerate processing.
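As a rough illustration of the square-root-free idea, a distance threshold can be checked by comparing squared values. This is a minimal sketch, not the library's internal code: the function name `within_distance_sq` and the sample points are assumptions made up for the example.

```python
def within_distance_sq(p, q, max_dist):
    """Return True if points p and q (x, y, z in meters) are within max_dist.

    The squared distance is compared against the squared threshold,
    so no square root is ever taken.
    """
    dx = p[0] - q[0]
    dy = p[1] - q[1]
    dz = p[2] - q[2]
    return dx * dx + dy * dy + dz * dz <= max_dist * max_dist


# Hypothetical sample points (meters).
points = [
    (0.0, 0.0, 0.0),
    (3000.0, 4000.0, 0.0),    # 5 km from the first point
    (30000.0, 40000.0, 0.0),  # 50 km from the first point
]

# Keep indices into `points` rather than copies of the points themselves.
near_first = [
    i for i in range(1, len(points))
    if within_distance_sq(points[0], points[i], 30000)
]
print(near_first)  # [1]
```

Because both sides of the comparison are non-negative, the squared test gives the same answer as comparing the true distance, while the result stays a small list of integers that can index back into the full dataset.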

[!NOTE] Stitching is done on a temporal and delta-magnitude basis. scipy's cKDTree, despite being powerful and fast, does not include temporal thresholds and therefore would not accurately stitch lightning points in a way that respects the lightning strike. As a result, processing time grows from about 4 minutes for 3 million points to about 2 hours for 10 million points. I will likely switch to C or C++ down the line to improve processing time. For now, this is the most optimized Python library I can make for temporal-distance lightning stitching.

Together, these methods keep computation times fast given the immense scale and size of the data itself.

[Figure: most_pts]

[Figure: most_pts_stitched]

[!NOTE] The gray, un-stitched dots in the figure are points that do not meet the criteria to be part of the lightning strike, as determined by the stitching parameters. By comparing stitched and non-stitched points in the visualization, you can verify the validity of the strike stitching algorithm.

This library extracts LYLOUT data, stores it in a lightning database, and processes millions of datapoints in that database at a reasonably fast, optimized speed. The project is meant to be a framework that applications can build on to parse and analyze the data in their own way.

Assuming the following specs (tested on a laptop with Ubuntu 22.04):

64 GB DDR4 RAM
RTX 2060 Mobile (6GB)
Intel i7-10750H (6 Cores -> 12 Threads)
Python 3.12.3 (Regular version/Not conda)

Three million datapoints should take roughly 4 minutes to process (excluding generating plots). Running the same exact parameters again would take 18-20 seconds due to caching.

Start

Getting Started

Install project in your environment: pip install lightning-parser-lib
Create a main.py and paste the boilerplate sample code:

####################################################################################
#
# About: A top-down view of what is going on
#
####################################################################################
"""
This program processes LYLOUT data files, such as "LYLOUT_20220712_pol.exported.dat"

1. It first reads through all the files, and then puts all of the points into an 
SQLite database

2. Then, with user-specified filters, the user extracts a pandas DataFrame 
(DataFrame "events") from the SQLite database that meets all of the 
filter criteria. 

3. Afterwards, with user-specified parameters, the lightning_bucketer processes 
all of the "events" data to return a list of lightning strikes, where each 
lightning strike is simply a list of indices into the "events" DataFrame
(a list of lists).

4. You can use the events with the lightning strikes data to plot data or analyze 
the data. Examples in the code and comments below show how to do so.
"""
####################################################################################
print("Starting up. Importing...")
import lightning_parser_lib.config_and_parser as config_and_parser
from lightning_parser_lib.number_crunchers.toolbox import tprint
import lightning_parser_lib.number_crunchers.toolbox as toolbox

import time
import datetime
import pandas as pd

# what percent of the total number of cores to be utilized. 
# Set to 0.0 to use only one core
CPU_PCT = 0.9 

lightning_configuration = config_and_parser.LightningConfig(
    num_cores = toolbox.cpu_pct_to_cores(CPU_PCT),
    lightning_data_folder = "lylout_files",
    data_extension = ".dat",
    cache_dir ="cache_dir",
    csv_dir = "strikes_csv_files",
    export_dir = "export",
    strike_dir = "strikes",
    strike_stitchings_dir = "strike_stitchings"
)

EXPORT_AS_CSV = True 
EXPORT_GENERAL_STATS = True
EXPORT_ALL_STRIKES = False
EXPORT_ALL_STRIKES_STITCHINGS = False

config_and_parser.lightning_bucketer.USE_CACHE = True

def main():

    # This parses data from "lylout_files" directory and stashes it in a database
    config_and_parser.cache_and_parse(config=lightning_configuration)

    # Column/Header descriptions:
    # 'time_unix'    -> float   Seconds (Unix timestamp, UTC)
    # 'lat'          -> float   Degrees (WGS84 latitude)
    # 'lon'          -> float   Degrees (WGS84 longitude)
    # 'alt'          -> float   Meters (Altitude above sea level)
    # 'reduced_chi2' -> float   Reduced chi-square goodness-of-fit metric
    # 'num_stations' -> int     Count (Number of contributing stations)
    # 'power_db'     -> float   Decibels (dBW) (Power of the detected event in decibel-watts)
    # 'power'        -> float   Watts (Linear power, converted from power_db using 10^(power_db / 10))
    # 'mask'         -> str     Hexadecimal bitmask (Indicates contributing stations)
    # 'stations'     -> str     Comma-separated string (Decoded station names from the mask)
    # 'x'            -> float   Meters (ECEF X-coordinate in WGS84)
    # 'y'            -> float   Meters (ECEF Y-coordinate in WGS84)
    # 'z'            -> float   Meters (ECEF Z-coordinate in WGS84)
    # 'file_name'    -> str     The name of the file that contains the point information

    # Mark process start time
    process_start_time = time.time()

    ####################################################################################
    # Filter params for extracting data points from the SQLite database
    ####################################################################################
    start_time = datetime.datetime(2020, 4, 29, 13, 0, tzinfo=datetime.timezone.utc).timestamp()  # Timestamp converts to unix (float)
    end_time = datetime.datetime(2020, 4, 29, 14, 59, tzinfo=datetime.timezone.utc).timestamp()  # Timestamp converts to unix (float)

    # Build the filter list, starting with the time_unix boundaries.
    # See "Column/Header descriptions" above for additional
    # columns to filter on
    filters = [
        ("time_unix", ">=", start_time),  # In unix
        ("time_unix", "<=", end_time),  # In unix
        ("reduced_chi2", "<", 5.0,),  # The chi^2 (reliability index) value to accept the data
        ("num_stations", ">=", 5),  # Number of stations that have visibly seen the strike
        ("alt", "<=", 24000),  # alt is in meters. Therefore 24 km = 24000 m
        ("alt", ">", 0),  # Above ground
        ("power_db", ">", -4),  # In dBW
        ("power_db", "<", 50),  # In dBW
    ]
    events: pd.DataFrame = config_and_parser.get_events(filters, config=lightning_configuration)
    tprint("Events:", events)

    ####################################################################################
    # Identifying the lightning strikes
    ####################################################################################

    # Additional parameters that determine "What points make up a single lightning strike"
    # They are explicitly defined
    params = {
        # Creating an initial lightning strike
        "max_lightning_dist": 30000,  # Max distance between two points to determine it being involved in the same strike
        "max_lightning_speed": 1.4e8,  # Max speed between two points in m/s (essentially dx/dt)
        "min_lightning_speed": 0,  # Min speed between two points in m/s (essentially dx/dt)
        "min_lightning_points": 100,  # The minimum number of points to pass the system as a "lightning strike"
        "max_lightning_time_threshold": 0.3,  # Max number of seconds between points 
        "max_lightning_duration": 30, # Max seconds that define an entire lightning strike. This is essentially a "time window" for all of the points to fill the region that determines a "lightning strike"

        # Combining intercepting lightning strike data filtering
        "combine_strikes_with_intercepting_times": True, # Set to true to ensure that strikes with intercepting times get combined. 
        "intercepting_times_extension_buffer": 0.6, # Number of seconds of additional overlap to allow an additional strike to be involved
        "intercepting_times_extension_max_distance": 100000, # The max distance between the start point of one lightning strike and at least one from the entirety of another lightning strike's points
    }
    bucketed_strikes_indices, bucketed_lightning_correlations = config_and_parser.bucket_dataframe_lightnings(events, config=lightning_configuration, params=params)

    # Example: To get a Pandas DataFrame of the first strike in the list, you do:
    # ```
    # first_strikes = events.iloc[bucketed_strikes_indices[0]]
    # ```
    #
    # Example 2: Iterating through all lightning strikes:
    # ```
    # for i in range(len(bucketed_strikes_indices)):
    #   sub_strike = events.iloc[bucketed_strikes_indices[i]]
    #   # Process the dataframe however you please of the designated lightning strike
    # ```

    process_time = time.time() - process_start_time
    tprint(f"Process time: {process_time:.2f} seconds.")
    config_and_parser.display_stats(events, bucketed_strikes_indices)

    ####################################################################################
    # Plotting and exporting
    ####################################################################################

    # Only export plot data with more than n datapoints
    MAX_N_PTS = 1000
    bucketed_strikes_indices, bucketed_lightning_correlations = config_and_parser.limit_to_n_points(bucketed_strikes_indices, bucketed_lightning_correlations, MAX_N_PTS)

    if EXPORT_AS_CSV:
        config_and_parser.export_as_csv(bucketed_strikes_indices, events, config=lightning_configuration) 

    if EXPORT_GENERAL_STATS:
        config_and_parser.export_general_stats(bucketed_strikes_indices, bucketed_lightning_correlations, events, config=lightning_configuration)

    if EXPORT_ALL_STRIKES:
        config_and_parser.export_all_strikes(bucketed_strikes_indices, events, config=lightning_configuration)

    if EXPORT_ALL_STRIKES_STITCHINGS:
        config_and_parser.export_strike_stitchings(bucketed_lightning_correlations, events, config=lightning_configuration)

    tprint("Finished generating plots")

if __name__ == '__main__':
    main()

Run in terminal: python main.py
Drag and drop your LYLOUT text files into the lylout_files directory.

[Figure: lylout]

[!NOTE] Some LYLOUT files are uploaded compressed without a file extension that says so. Make sure that every LYLOUT file can be read as a text file. If one cannot, it is likely compressed, whether or not its name shows it. Try renaming the file to add the ".gz" extension and decompressing it. If that is not successful, try ".zip" instead.
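Instead of guessing from the file name, the leading bytes of a file can be inspected. The sketch below uses only the Python standard library and is not part of this project: gzip streams begin with the bytes 0x1f 0x8b, and `zipfile.is_zipfile` recognizes ZIP archives. The helper name `sniff_compression` and the demo file names are assumptions for illustration.

```python
import gzip
import os
import tempfile
import zipfile


def sniff_compression(path):
    """Return 'gzip', 'zip', or None depending on the file's actual contents."""
    with open(path, "rb") as handle:
        if handle.read(2) == b"\x1f\x8b":  # gzip magic number
            return "gzip"
    if zipfile.is_zipfile(path):
        return "zip"
    return None


# Demo with throwaway files that all carry a misleading ".dat" name.
with tempfile.TemporaryDirectory() as tmp:
    plain_path = os.path.join(tmp, "plain.dat")
    gz_path = os.path.join(tmp, "gzipped.dat")
    zip_path = os.path.join(tmp, "zipped.dat")

    with open(plain_path, "w") as handle:
        handle.write("example text\n")
    with gzip.open(gz_path, "wb") as handle:
        handle.write(b"example text\n")
    with zipfile.ZipFile(zip_path, "w") as archive:
        archive.writestr("inner.dat", "example text\n")

    results = tuple(sniff_compression(p) for p in (plain_path, gz_path, zip_path))

print(results)  # (None, 'gzip', 'zip')
```

A file that reports `None` here may still not be valid LYLOUT text, so opening it in a text editor remains a useful final check.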

[!NOTE] When data is added to "lylout_files", everything gets hashed and recorded into "lylout_db.db". This ".db" file is a SQLite database that stores all historical lightning data. If the database becomes too large, you can simply delete the "lylout_db.db" file.
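The exact hashing scheme the library uses is not described here. As a hedged sketch of the general idea, content hashes can be used to skip files that have already been ingested; the function names `file_sha256` and `select_new_files`, the choice of SHA-256, and the in-memory `seen` set are all assumptions for illustration.

```python
import hashlib
import os
import tempfile


def file_sha256(path, chunk_size=1 << 20):
    """Hash a file in chunks so large files never need to fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def select_new_files(paths, seen_hashes):
    """Return the paths whose contents were not seen before, recording their hashes."""
    fresh = []
    for path in paths:
        file_hash = file_sha256(path)
        if file_hash not in seen_hashes:
            seen_hashes.add(file_hash)
            fresh.append(path)
    return fresh


# Demo: two files with identical contents count as one.
with tempfile.TemporaryDirectory() as tmp:
    first = os.path.join(tmp, "a.dat")
    copy = os.path.join(tmp, "a_copy.dat")
    for path in (first, copy):
        with open(path, "w") as handle:
            handle.write("same LYLOUT contents\n")

    seen = set()
    first_pass = [os.path.basename(p) for p in select_new_files([first, copy], seen)]
    second_pass = select_new_files([first, copy], seen)

print(first_pass)   # ['a.dat']
print(second_pass)  # []
```

Hashing the contents rather than the file name means a renamed or re-downloaded copy of the same file is not parsed twice.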

Modify the filters in "main.py":

start_time = datetime.datetime(2020, 4, 29, 0, 0, tzinfo=datetime.timezone.utc).timestamp()  # Timestamp converts to unix (float)
end_time = datetime.datetime(2020, 4, 29, 23, 59, tzinfo=datetime.timezone.utc).timestamp()  # Timestamp converts to unix (float)


# Build the filter list, starting with the time_unix boundaries.
# See "Column/Header descriptions" in main.py for additional
# columns to filter on
filters = [
    ("time_unix", ">=", start_time),  # In unix
    ("time_unix", "<=", end_time),  # In unix
    ("reduced_chi2", "<", 5.0,),  # The chi^2 (reliability index) value to accept the data
    ("num_stations", ">=", 5),  # Number of stations that have visibly seen the strike
    ("alt", "<=", 24000),  # alt is in meters. Therefore 24 km = 24000 m
    ("alt", ">", 0),  # Above ground
    ("power_db", ">", -4),  # In dBW
    ("power_db", "<", 50),  # In dBW
]
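To show how (column, operator, value) tuples like these could map onto a database query, here is a minimal sketch using Python's built-in sqlite3 module. It is not the library's actual query builder: the `build_where` helper, the `events` table name, and the three-column schema are assumptions for the example.

```python
import sqlite3

ALLOWED_OPS = {"=", "!=", "<", "<=", ">", ">="}


def build_where(filters):
    """Turn (column, op, value) tuples into a parameterized WHERE clause."""
    clauses, values = [], []
    for column, op, value in filters:
        if op not in ALLOWED_OPS:
            raise ValueError(f"unsupported operator: {op!r}")
        if not column.isidentifier():
            raise ValueError(f"suspicious column name: {column!r}")
        clauses.append(f"{column} {op} ?")  # the value is bound, never interpolated
        values.append(value)
    return " AND ".join(clauses), values


# Hypothetical miniature table standing in for the real database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE events (time_unix REAL, alt REAL, reduced_chi2 REAL)")
conn.executemany(
    "INSERT INTO events VALUES (?, ?, ?)",
    [
        (100.0, 5000.0, 1.2),   # passes every filter
        (101.0, 30000.0, 0.8),  # too high
        (102.0, 8000.0, 7.5),   # poor chi^2
    ],
)

filters = [("alt", "<=", 24000), ("alt", ">", 0), ("reduced_chi2", "<", 5.0)]
where, values = build_where(filters)
rows = conn.execute(
    f"SELECT time_unix FROM events WHERE {where} ORDER BY time_unix", values
).fetchall()
conn.close()

print(where)  # alt <= ? AND alt > ? AND reduced_chi2 < ?
print(rows)   # [(100.0,)]
```

Binding values with `?` placeholders and whitelisting operators keeps user-supplied filters from being spliced directly into the SQL text.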

Modify the parameters in "main.py":

# Additional parameters that determine "What points make up a single lightning strike"
# They are explicitly defined
params = {
    # Creating an initial lightning strike
    "max_lightning_dist": 30000,  # Max distance between two points to determine it being involved in the same strike
    "max_lightning_speed": 1.4e8,  # Max speed between two points in m/s (essentially dx/dt)
    "min_lightning_speed": 0,  # Min speed between two points in m/s (essentially dx/dt)
    "min_lightning_points": 100,  # The minimum number of points to pass the system as a "lightning strike"
    "max_lightning_time_threshold": 0.3,  # Max number of seconds between points 
    "max_lightning_duration": 30, # Max seconds that define an entire lightning strike. This is essentially a "time window" for all of the points to fill the region that determines a "lightning strike"

    # Combining intercepting lightning strike data filtering
    "combine_strikes_with_intercepting_times": True, # Set to true to ensure that strikes with intercepting times get combined. 
    "intercepting_times_extension_buffer": 0.6, # Number of seconds of additional overlap to allow an additional strike to be involved
    "intercepting_times_extension_max_distance": 100000, # The max distance between the start point of one lightning strike and at least one from the entirety of another lightning strike's points
}
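One possible reading of how the first few parameters interact is sketched below: two points may belong to the same strike only if their time gap, separation, and implied speed all fall within the limits. This is an illustrative interpretation, not the library's stitching algorithm; the `can_link` helper and the (time_unix, x, y, z) tuple layout are assumptions.

```python
def can_link(a, b, params):
    """Decide whether points a and b, given as (time_unix, x, y, z), could be stitched."""
    dt = abs(b[0] - a[0])
    if dt > params["max_lightning_time_threshold"]:
        return False  # too far apart in time

    dist_sq = sum((b[i] - a[i]) ** 2 for i in (1, 2, 3))
    if dist_sq > params["max_lightning_dist"] ** 2:
        return False  # too far apart in space

    # speed = distance / dt, rewritten as distance^2 vs (speed * dt)^2
    # so that neither a division nor a square root is needed.
    min_reach = params["min_lightning_speed"] * dt
    max_reach = params["max_lightning_speed"] * dt
    return min_reach ** 2 <= dist_sq <= max_reach ** 2


demo_params = {
    "max_lightning_dist": 30000,
    "max_lightning_speed": 1.4e8,
    "min_lightning_speed": 0,
    "max_lightning_time_threshold": 0.3,
}

origin = (0.0, 0.0, 0.0, 0.0)
plausible = (0.001, 10000.0, 0.0, 0.0)  # 10 km in 1 ms -> 1e7 m/s
too_fast = (0.00001, 10000.0, 0.0, 0.0)  # 10 km in 10 us -> 1e9 m/s
too_late = (0.5, 100.0, 0.0, 0.0)        # 0.5 s gap exceeds the 0.3 s threshold

checks = [can_link(origin, p, demo_params) for p in (plausible, too_fast, too_late)]
print(checks)  # [True, False, False]
```

Loosening "max_lightning_time_threshold" or "max_lightning_dist" admits more candidate links, so strikes tend to grow larger and processing takes longer.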

Run with python main.py again and observe the images in their respective directories
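Since each strike is returned as a list of indices into the events table, per-strike analysis comes down to index lookups. The sketch below uses a plain list of dicts as a stand-in for the "events" DataFrame so that it runs without pandas; with the real DataFrame the lookup would be `events.iloc[indices]`, as shown in the comments of main.py. The sample values and the `summarize` helper are assumptions for illustration, and the power conversion follows the 10^(power_db / 10) formula from the column descriptions.

```python
# Stand-in for the "events" DataFrame: one dict per detected point.
events = [
    {"time_unix": 10.0, "alt": 5000.0, "power_db": 10.0},
    {"time_unix": 10.25, "alt": 6000.0, "power_db": 20.0},
    {"time_unix": 10.5, "alt": 7000.0, "power_db": 15.0},
    {"time_unix": 55.0, "alt": 4000.0, "power_db": 0.0},
    {"time_unix": 55.25, "alt": 4500.0, "power_db": 10.0},
]

# Stand-in for bucketed_strikes_indices: a list of lists of row indices.
strikes = [[0, 1, 2], [3, 4]]


def summarize(events, indices):
    """Compute a few per-strike statistics from the rows named by `indices`."""
    rows = [events[i] for i in indices]
    times = [row["time_unix"] for row in rows]
    peak_db = max(row["power_db"] for row in rows)
    return {
        "num_points": len(rows),
        "duration_s": max(times) - min(times),
        "max_alt_m": max(row["alt"] for row in rows),
        "peak_power_w": 10 ** (peak_db / 10),  # dBW -> watts
    }


summaries = [summarize(events, indices) for indices in strikes]
for summary in summaries:
    print(summary)
```

Keeping strikes as index lists means the same events table can back many different groupings without the point data ever being duplicated.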

Useful Functions

Run in background: python main.py > output.log 2>&1 & disown
List all files in directory './' and sizes: du -h --max-depth=1 ./ | sort -hr

[!NOTE] With Python 3.12 onwards, you may need to run everything through the virtual environment's own interpreter and pip, as in the following commands:

./.venv/bin/python main.py

./.venv/bin/python main.py > output.log 2>&1 & disown

./.venv/bin/pip install -r requirements.txt

./.venv/bin/pip show setuptools

./.venv/bin/python3 -m build

Building from source

.venv/bin/python -m build
.venv/bin/python3 -m twine upload --repository lightning_parser_lib dist/*


lightning-parser-lib 0.1.35
