CSV file reader with header management, fieldnames, delimiters and comment skipping
Project description
ofunctions
Collection of useful python functions
ofunctions is a set of various recurrent functions amongst
- bisection: bisection algorithm for any function with any number of arguments, works LtoR and RtoL
- checksums: various SHA256 tools for checking and creating checksum files
- csv: CSV file reader with various enhancements over generic reader
- delayed_keyboardinterrupt: just a nifty tool to catch CTRL+C signals
- file_utils: file handling functions of which
- get_paths_recursive: Walks a path for directories / files, can deal with permission errors, has include / exclude lists with wildcard support...
- check_path_access: Checks whether a path is writable, with fallback for read test, and splits path until it finds which part denies permissions
- check_file_timestamp_delta: Check a time delta (seconds, minutes, hours...) against file ctime, mtime or atime
- hide_file: Hides/unhides files under windows & linux
- get_writable_temp_dir: Returns a temporary dir in which we are allowed to write
- get_writable_random_file: Returns a filename of a not-yet existing file we can write into
- json_sanitize: make sure json does not contain unsupported chars, yes I look at you Windows eventlog
- logger_utils: basic no brain console + file log creation
- mailer: A class to deal with email sending, regardless of ssl/tls protocols, in batch or as single mail, with attachments
- network: various tools like ping, internet check, MTU probing, public IP discovery, network interface IO counters
- platform: nothing special here, just check what arch we are running on
- process: simple kill-them-all function to terminate subprocesses
- random: basic random string & password generator
- service_control: control Windows / Linux service start / stop / status
- string_handling: remove accents / special chars from strings
- threading: threading decorator for functions, also contains a function call anti-flood system
ofunctions is compatible with Python 2.7 and 3.5+ and is tested on both Linux and Windows. There are still two subpackages that will only work with Python 3.5+
- delayed_keyboardinterrupt (signal handling is different in Python 2.7)
- threading (we don't have concurrent_futures in python 2.7, so the @threaded decorator will indeed work, but can't return a result)
Setup
pip install ofunctions.<subpackage>
bisection Usage
ofunctions.bisection is a dichotomy algorithm that can be used for all kind of bisections, mathematical operations, kernel bisections... Let's imagine we have a function foo that takes argument x. x might be between 0 and 999, and for a given value of x above 712, foo(x) returns "gotcha". In order to find at which x value foo(x) becomes "gotcha", we could run foo(x) for every possible value of x until the result becomes what we expect. The above solution works, but takes time (up to 1000 foo(x) runs). We can achieve the same result in max 10 steps by checking foo(x) where x will be the middle of all possible values. Looking at the result from that middle value, we'll know if the expected result should be a lower or higher value of x. We can repeat this action until we'll get the precise result.
Now let's code the above example in less abstract:
def foo(x):
# We'll need to find value 712 te quickest way possible
if x >= 712:
return "gotcha"
return False
from ofunctions.bisection import bisect
value = bisect(foo, range(0, 1000), expected_result="gotcha")
print('Value is %s' % value)
The above concept can be adapted in order to compute ethernet MTU values or whatever other values need to be calculated. See ofunctions.network code for MTU probing example.
checksums Usage
csv Usage
delayed_keyboardinterrupt Usage
The DelayedKeyboardInterrupt class allows to intercept a CTRL+C call in order to finish atomic operations without interruption. Easy to use, we use a pythonic syntax as follows:
Setup:
pip install ofunctions.mailer
Usage:
with DelayedKeyboardInterrupt():
<your code that should not be interrupted>
file_utils Usage
ofuntions.file_utils is a collection of tools to handle:
- listing of paths
Setup
pip install ofunctions.file_utils
Most interesting function in file utils is get_paths_recursive(), which yields a list of directories and/or files corresponding to a pattern.
Example:
from ofunctions.file_utils import get_paths_recursive
paths = get_paths_recursive("/", exclude_dirs=True, ext_include_list=".txt")
for path in paths:
print(path)
get_paths_recursive
also can execute a function when an error is encountered, such as checking permissions or even fix them.
Example:
from ofunctions.file_utils import get_paths_recursive, check_path_access
paths = get_paths_recursive("/", exclude_dirs=True, ext_include_list=".txt", fn_on_perm_error=check_path_access)
for path in paths:
print(path)
On every permission error, check_path_access will be launched, and will check read/write permissions and log them.
json_sanitize Usage
json_sanitize will remove any control characters from json content (0x00-0x1F and 0x7F-0x9F) of which some are usually non printable and non visible. This is especially useful when dealing with various log files (ex: windows event logs) that need to be passed as json. It will also remove dots from value names, since those are prohibited in json standard.
Setup:
pip install ofunctions.json_sanitize
Usage:
my_json = {'some.name': 'some\tvalue'}
my_santized_json = json_sanitize(my_json)
my_santized_json will contain {'somename': 'somevalue'}
logger_utils Usage
ofunctions.logger_utils is an easy implementation of logger which promises to always work, regardless of encoding issues. Easy usage:
from ofunctions.logger_utils import logger_get_logger
logger = logger_get_logger(log_file='/path/to/log/file')
logger_utils will automatically try to open a temp log file if given log_file is not writable.
You can also disable console output with console=False
, enable debug_mode with debug=True
(or later with logger.setLevel(logging.DEBUG)
).
Also allows to inject more LOGGER formatter objects, eg:
logger = logger_get_logger(formatter_insert="%(processName)s")
logger_utils also allows to know what was the worst loglevel that has been called in your program:
from ofunctions.logger_utils import logger_get_logger, get_worst_logger_level
logger = logger_get_logger()
logger.error("Oh no !")
print("worst log level was :", get_worst_logger_level()) # 10-50, 10 = debug, 50 = critical
mailer Usage
ofunctions.mailer is a simple mailing class and a rfc822 email validation function.
Setup:
pip install ofunctions.mailer
Quick usage:
from ofunctions.mailer import Mailer
mailer = Mailer() # Uses localhost:25
mailer.send_email(subject='test', sender_mail='me@example.com', recipient_mails='them@example.com', body='some body just told me')
SmartRelay usage:
from ofunctions.mailer import Mailer
mailer = Mailer(smtp_server='mail.example.com', smtp_port=587, security='tls', smtp_user='me', smtp_password='secure_p@$$w0rd_lol')
mailer.send_email(subject='test', sender_mail='me@example.com', recipient_mails='them@example.com ; another_recipient@example.com', body='some body just told me')
Bulk mailer usage:
from ofunctions.mailer import Mailer
recipients = ['me@example.com', 'them@example.com', 'anyone@example.com', 'malformed_address_at_example.com']
mailer = Mailer(smtp_server='mail.example.com', smtp_port=465, security='ssl', debug=True, verify_certificates=False)
# split_mails=True will send one email per recipient
# split_mails=False will send one email for all recipients, which will be limited to the number of recipients the destination SMTP server allows
mailer.send_email(subject='test', sender_mail='me@example.com', recipient_mails=recipients, body='some body just told me', split_mails=True)
Attachment usage:
from ofunctions.mailer import Mailer
mailer = Mailer() # Uses localhost:25
# attachment can be a binary blob or a file path
# filename is optional, and will rename a binary blob to something more meaningful
mailer.send_email(subject='test', sender_mail='me@example.com', recipient_mails='them@example.com', body='some body just told me', attachment=attachment, filename='My Attachment File.txt')
misc Usage
Misc is a collection of somehow useful functions.
fn_name
Get the caller function name of current context.
print(fn_name()) will show current caller function name
print(fn_name(2)) will show parent of current caller function name
BytesConverter
BytesConverter is that little tool that you want when handling bits and byte units. Internally, BytesConverter always represents data an int number of bytes. BytesConverter will return a float or a str if human output is requested.
Example (output is shown as comment):
from ofunctions.misc import BytesConverter
print(BytesConverter("64 KB")) # 64000.0
print(BytesConverter("64 KiB")) # 65536.0
print(BytesConverter("64 Kb")) # 8000.0
print(BytesConverter("64 KiB")) # 65536.0
value = BytesConverter("20MB")
print(value.human) # 20.0 MB
print(value.human_iec_bytes) # 19.1 MiB
print(value.human_bits) # 160.0 Mb
print(value.human_iec_bits) # 152.6 Mib
print(BytesConverter(1234)) # 1234.0
print(BytesConverter(1234).bits) # 9872.0
print(BytesConverter(1234).kbytes) # 1.2
print(BytesConverter(1234).human) # 1.2 KB
print(BytesConverter(65535).kbytes) # 64.0
print(BytesConverter(9000000).mbytes) # 8.6
print(BytesConverter("4MB")) # 4000000.0
print(BytesConverter("4MiB")) # 4194304.0
print(BytesConverter("9600 Kb").mbytes) # 1.1
Arithmetics: BytesConverter objects can be added just as other mathematic types:
print(BytesConverter("50 MB") + BytesConverter("8192 Kb")) # 51024000.0
print(BytesConverter(BytesConverter("50 MB") + BytesConverter("8192 Kb")).human) # 51.0 MB
fn_name()
fn_name()
is a quick way to find out what the parent function name is.
Example:
from ofunctions.misc import fn_name
def test_a():
def test_b():
print(fn_name()) # prints "test_b"
print(fn_name(1)) # prinrs "test_a"
network Usage
ofunctions.network is a collection of various tools making network diag / mapping easier.
Setup:
pip install ofunctions.network
get_public_ip()
Easy way to find Public IPv4 or IPv6 using multiple online services
from ofunctions.netowrk import get_public_ip
print("My IP is", get_public_ip())
print("My IPv4 is", get_public_ip(ip_version=4))
print("My IPv6 is", get_public_ip(ip_version=6))
IOCounters
IOCounters is a class that will log instant sent/received bytes as well as total sent/received bytes. Once an instance is created, logging begins as a thread. You may specify which interfaces to track at which resolution. If none is given, all interfaces are tracked every second.
Example of IO counters for network interfaces:
counter = IOCounters()
while True:
print(counter.interfaces['eth0'].recv_bytes, counter.interfaces['eth0'].recv_bytes_total)
time.sleep(1)
counter = IOCounters(['Ethernet Connection 2', 'Wi-Fi'], resolution=2)
while True:
print(counter.interfaces['Ethernet Connection 2'].sent_bytes)
time.sleep(1)
platform Usage
process Usage
kill_childs()
kill_childs
allows to walk a process and kill all it's children, with various options:
- pid(int): If none given, current process pid is used
- itself (bool): Shall we also kill current process (defaults to False)
- children (bool): Shall we kill current process' childs (defaults to True)
- verbose (bool): Log more actions (defaults to False)
- grace_period (int): Period before we consider hard killing a process. Defaults to 1 second
- fast_kill (bool): Kill children using threads, in order to parallelize grace_period and kill faster
kill_childs
will first try to send SIGTERM to the process, and if not successful in grace_period, it will send SIGKILL.
Works well on both Windows and Linux, and has fallback mecanisms to make sure process tree gets properly killed.
Example:
from ofunctions.process import kill_childs, get_process_by_name
process = get_process_by_name("notepad.exe")
result = kill_childs(process[0].pid)
get_process_by_name
As said in the title, takes a process name and returns it's process handle. Example:
from ofunctions.process import get_process_by_name
print(get_process_by_name("bash"))
# Prints a list of all the process handles for bash processes
get_absolute_path
Searches for absolute path of an executable in PATH variables. Example:
from ofunctions.process import get_absolute_path
print(get_absolute_path("bash"))
# prints /usr/bin/bash
random Usage
service_control Usage
string_handling Usage
threading Usage
@threaded
threading comes with a couple of decorators that allow to modify functions.
In order to thread a function, you can simply apply the @threaded
decorator like below.
Once you call the function, it will automatically be threaded, and you get to keep your execution flow. You can then execute whatever you want, or wait for it's result:
from ofunctions.threading import threaded, wait_for_threaded_result
@threaded
def my_nice_function():
# Do some nice stuff
return result
def main():
# Some stuff
thread = my_nice_function()
# Some other stuff being executed while my_nice_function runs in a thread
# now let's wait for my function result
result = wait_for_threaded_result(thread)
There's a special argument in order to bypass the decorator called __no_threads
, which can be used like the following example.
This allows manual threading bypass without having to change the code
from ofunctions.threading import threaded, wait_for_threaded_result
@threaded
def my_nice_function():
# Do some nice stuff
return result
def main():
# Some stuff
thread = my_nice_function(__no_threads=True)
# In
result = wait_for_threaded_result(thread)
Also please note that Python 2.7 can't give you a result, so the function will be threaded, but without any possible return codes.
@no_flood
There are situations where some code can call multiple times the same function (on a trigger for example), but you don't want that function to run multiple times in a short time span.
That's a situation where we should handle function call antiflooding.
Example:
# Adding @no_flood(5) only allows one execution of my_function per 5 seconds
@no_flood(5)
def my_function():
print("Hey, it's me !")
# Will run my_function() only once
for _ in range(0, 20):
my_function()
Multiple executions of a functions are permitted as long as they're called with different arguments.
The @no_flood
decorator can be setup to prevent any multiple function execution in a given timespan, regarless of it's arguments:
@no_flood(5, multiple_instances_diff_args=False)
@def my_function(var):
print("Hey, it's me: {}".format(var))
# Will run my_function() only once
for i in range(0, 20):
my_function(i)
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file ofunctions.csv-1.0.2.tar.gz
.
File metadata
- Download URL: ofunctions.csv-1.0.2.tar.gz
- Upload date:
- Size: 24.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 217468dd1726d17e033df16d7c9596986d3b79cc8c82467e517494f5cf5c7737 |
|
MD5 | 1842cf8ec433aac504bf3d22e45eeedd |
|
BLAKE2b-256 | 875f8dc20ee07670dcdfff0bc582a4f6c693973c7c5ce60c2667c8ea35f2810c |
File details
Details for the file ofunctions.csv-1.0.2-py3-none-any.whl
.
File metadata
- Download URL: ofunctions.csv-1.0.2-py3-none-any.whl
- Upload date:
- Size: 10.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | cf4e3eae54da58f397af49250cfc4ca69190eb1c4b49622b2672452da00d4ab6 |
|
MD5 | 78b8b84e132d8d231d990af42a9c9ec3 |
|
BLAKE2b-256 | bf12d619552071cb8db36ba3eba9a753951ed7ccb899aa7ee94a49695a0b49c2 |