Bro Analysis Tools
Bro Analysis Tools (BAT)
The BAT Python package supports the processing and analysis of Bro data with Pandas, scikit-learn, and Spark.
Recent Improvements (Fall 2019):
- Better Docs (https://supercowpowers.github.io/bat/)
- Faster, smaller Pandas DataFrames for large log files (see Large Dataframes)
BroCon 2017 Presentation
Data Analysis, Machine Learning, Bro, and You! (Video)
Why BAT?
Bro already has a flexible, powerful scripting language, so why should I use BAT?
Offloading: Complex tasks such as statistics, state machines, and machine learning should be offloaded from Bro so that Bro can focus on the efficient processing of high-volume network traffic.
Data Analysis: We have a large set of support classes that help bridge from raw Bro data to packages like Pandas, scikit-learn, and Spark. We also have example notebooks that show step-by-step how to get from here to there.
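To make the "raw Bro data" side of that bridge concrete, here is a minimal sketch of reading a tab-separated Bro ASCII log into Python dictionaries and computing a simple statistic outside of Bro. It uses only the standard library and is not BAT's own API: the function `read_bro_log`, the `_CONVERTERS` table, and the `SAMPLE_LOG` data are hypothetical names and values made up for illustration, and the type handling covers only a few common Bro field types.

```python
import io
from datetime import datetime, timezone

# Converters for a few common Bro field types (illustrative, not exhaustive).
# Any type not listed here is kept as a plain string.
_CONVERTERS = {
    "time": lambda v: datetime.fromtimestamp(float(v), tz=timezone.utc),
    "count": int,
    "int": int,
    "port": int,
    "double": float,
    "interval": float,
    "bool": lambda v: v == "T",
}


def read_bro_log(lines, unset_field="-", empty_field="(empty)"):
    """Yield one dict per record from the lines of a tab-separated Bro log.

    Hypothetical helper for illustration; this is not part of BAT.
    """
    fields, types = [], []
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue
        if line.startswith("#"):
            # Header lines such as "#fields<TAB>ts<TAB>uid" describe the columns.
            key, *values = line.split("\t")
            if key == "#fields":
                fields = values
            elif key == "#types":
                types = values
            continue
        # Fall back to strings if the log carried no "#types" header.
        kinds = types or ["string"] * len(fields)
        row = {}
        for name, kind, value in zip(fields, kinds, line.split("\t")):
            if value in (unset_field, empty_field):
                row[name] = None
            else:
                row[name] = _CONVERTERS.get(kind, str)(value)
        yield row


# Made-up sample data in the Bro ASCII log layout.
SAMPLE_LOG = (
    "#separator \\x09\n"
    "#path\tconn\n"
    "#fields\tts\tid.orig_h\tid.resp_p\tproto\tduration\torig_bytes\n"
    "#types\ttime\taddr\tport\tenum\tinterval\tcount\n"
    "1500000000.000000\t192.168.1.10\t53\tudp\t0.25\t64\n"
    "1500000001.500000\t192.168.1.11\t443\ttcp\t-\t-\n"
    "#close\t2017-07-14-02-40-05\n"
)

rows = list(read_bro_log(io.StringIO(SAMPLE_LOG)))

# A simple offloaded statistic: total bytes sent by originators.
total_bytes = sum(r["orig_bytes"] or 0 for r in rows)
```

A list of dictionaries like `rows` can be handed to `pandas.DataFrame(rows)` for further analysis. BAT's support classes are meant to handle this step, along with much larger files, so the sketch only illustrates the shape of the problem.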
Getting Started
How-To Analysis Notebooks
- Bro to Scikit-Learn
- Bro to Matplotlib
- Bro to Parquet to Spark
- Bro to Kafka to Spark
- Clustering: Picking K (or not)
- Anomaly Detection Exploration
- Risky Domains Stats and Deployment
Install
$ pip install bat
Documentation
https://supercowpowers.github.io/bat/
Thanks
- The DummyEncoder is inspired by Tom Augspurger's great PyData Chicago 2016 Talk