iman

Python package for daily Tasks

These details have not been verified by PyPI

Development Status
- 3 - Alpha
Intended Audience
- Education
Programming Language
- Python :: 3

Project description

Overview

iman is a comprehensive Python package offering a wide array of utilities for audio processing, file manipulation, machine learning, system operations, web utilities, and more. It provides tools for tasks such as audio feature extraction, voice activity detection, file I/O, system monitoring, and integration with frameworks like PyTorch and TensorFlow. The package is organized into multiple submodules, each designed for specific functionalities, as detailed below.

Installation

Install iman via pip:

pip install iman

Ensure dependencies like numpy, torch, tensorflow, speechbrain, librosa, matplotlib, pandas, and external tools like ffmpeg, ffprobe, and WinRAR are installed. Some functions require pre-trained models or specific paths (e.g., model files, ffmpeg_path).

Usage

Below are examples of key functionalities from the iman package. For detailed function signatures and parameters, refer to the sections below or use the built-in help system:

Example: Audio Processing

from iman import Audio

# Read a WAV file
data, sr = Audio.Read("audio.wav", sr=16000, start_from=0, dur=None, mono=True, ffmpeg_path="c:\\ffmpeg.exe", ffprobe_path="c:\\ffprobe.exe")

# Resample and write audio
resampled = Audio.Resample(data, fs=sr, sr=8000)
Audio.Write("output.wav", resampled, fs=8000)

Example: File Operations

from iman import *

# Get files matching a pattern
files = gf("*.txt")

# Write a dictionary to a file
my_dict = {"key1": "value1", "key2": "value2"}
Write_Dic(my_dict, "output.txt")

Example: VAD with Segmenter

from iman.sad_torch_mfcc import Segmenter

seg = Segmenter(batch_size=32, vad_type="vad", sr=8000, model_path="c:\\sad_model_pytorch.pth", tq=1, ffmpeg_path="c:\\ffmpeg.exe", complete_output=False, device="cuda", input_type="file")
isig, wav, mfcc = seg("audio.wav")

Modules and Functions

The iman package is organized into several submodules, each with specific functions. Below is a complete list of modules and their functions as provided.

plt: Matplotlib plotting library.
now(): Get current time.
F: Format floating-point number.
D: Format integer number.
Write_List(MyList, Filename): Write a list to a text file.
Write_Dic(MyDic, Filename): Write a dictionary to a text file.
Read(Filename): Read a text file.
Read_Lines(Filename): Read a text file line by line and return a list.
Write(_str, Filename): Write a string to a text file.
gf(pattern): Get files in a directory matching a pattern.
gfa(directory_pattern, ext="*.*"): Get files in a directory and subdirectories.
ReadE(Filename): Read Excel files.
PM(dir): Create a directory.
PB(fname): Get basename of a file.
PN(fname): Get filename without path.
PE(fname): Get file extension.
PD(fname): Get directory of a file.
PS(fname): Get file size.
PJ(segments): Join path segments.
clear(): Clear command-line interface.
os: Python os module.
np: NumPy module.
RI(start_int, end_int, count=1): Generate random integers.
RF(start_float, end_float, count=1): Generate random floats.
RS(Arr): Shuffle an array.
LJ(job_file_name): Load job file (details not specified).
SJ(value, job_file_name): Save job file (details not specified).
LN(np_file_name): Load NumPy file (details not specified).
SN(arr, np_file_name): Save NumPy array to file.
cmd(command, redirect=True): Run a command in CMD.
PX(fname): Check existence of a file.
RC(Arr, size=1): Random choice from an array.
onehot(data, nb_classes): Convert data to one-hot encoding.
exe(pyfile): Convert Python file to executable (requires PyInstaller).
FWL(wavfolder, sr): Get total audio length in a folder.
norm(vector): Normalize a vector (vector/magnitude(vector)).
delete(pattern): Delete files matching a pattern.
rename(fname, fout): Rename a file.
separate(pattern, folout): Separate vocal from music.
dll(fname): Create a .pyd file from a Python file.
get_hard_serial(): Get hardware serial number.
mute_mic(): Toggle microphone on/off.
PA(fname): Get absolute path of a file.

iman.Audio

Read(filename, sr, start_from, dur, mono, ffmpeg_path, ffprobe_path): Read WAV, ALAW, MP3, and other audio formats.
Resample(data, fs, sr): Resample audio data.
Write(filename, data, fs): Write audio data to a file.
frame(y): Frame audio data (details not specified).
split(y): Split audio data (details not specified).
ReadT(filename, sr, mono=True): Read and resample WAV file with torchaudio.
VAD(y, top_db=40, frame_length=200, hop_length=80): Voice activity detection.
compress(fname_pattern, sr=16000, ext='mp3', mono=True, ffmpeg_path='c:\\ffmpeg.exe', ofolder=None, worker=4): Compress audio files.
clip_value(wav): Return clipping percentage in an audio file.
WriteS(filename, data, fs): Convert and write audio to stereo.

iman.info

get(): Get information about CPU and GPU (requires torch).
cpu(): Get CPU percentage usage.
gpu(): Get GPU memory usage.
memory(): Get RAM usage in GB.
plot(fname="log.txt", delay=1): Plot system metrics from a log file.

iman.metrics

EER(lab, score): Compute Equal Error Rate.
cosine_distance(v1, v2): Compute cosine distance between two vectors.
roc(lab, score): Compute ROC curve.
wer(ref, hyp): Compute Word Error Rate.
cer(ref, hyp): Compute Character Error Rate.
wer_list(ref_list, hyp_list): Compute WER for lists.
cer_list(ref_list, hyp_list): Compute CER for lists.
DER(ref_list, res_list, file_dur=-1, sr=8000): Compute Detection Error Rate.

iman.tsne

plot(fea, label): Plot t-SNE visualization of features.

iman.xvector

xvec, lda_xvec, gender = get(filename, model(model_path, model_name, model_speaker_num)): Extract x-vectors for speaker recognition.

iman.web

change_wallpaper(): Change system wallpaper.
dl(url): Download a file from a URL.
links(url, filter_text=None): Extract links from a URL.
imgs(url, filter_text=None): Extract images from a URL.

iman.matlab

np2mat(param, mat_file_name): Convert NumPy array to MATLAB file.
dic2mat(param, mat_file_name): Convert dictionary to MATLAB file.
mat2dic(mat_file_name): Convert MATLAB file to dictionary.

iman.Features

mfcc_fea, mspec, log_energy = mfcc.SB.Get(wav, sample_rate): Compute MFCC with SpeechBrain (input must be read with torchaudio).
mfcc.SB.Normal(MFCC): Mean-variance normalization of MFCC with SpeechBrain.
mfcc_fea, log_energy = mfcc.LS.Get(wav, sample_rate, le=False): Compute MFCC with Librosa (input is NumPy array).
mfcc.LS.Normal(MFCC, win_len=150): Mean-variance normalization (local, 150 frames left and right).

iman.AUG

Add_Noise(data, noise, snr): Add noise to audio data.
Add_Reverb(data, rir): Add reverberation to audio data.
Add_NoiseT(data, noise, snr): Add noise using torchaudio.
Add_ReverbT(data, rir): Add reverberation using torchaudio.
mp3(fname, fout, sr_out, ratio, ffmpeg_path='c:\\ffmpeg.exe'): Convert to MP3.
speed(fname, fout, ratio, ffmpeg_path='c:\\ffmpeg.exe'): Change audio speed.
volume(fname, fout, ratio, ffmpeg_path='c:\\ffmpeg.exe'): Adjust audio volume.

iman.sad_torch_mfcc | iman.sad_tf

Initializer (PyTorch):

seg = Segmenter(batch_size, vad_type=['sad'|'vad'], sr=8000, model_path="c:\\sad_model_pytorch.pth", tq=1, ffmpeg_path='c:\\ffmpeg.exe', complete_output=False, device='cuda', input_type='file')

Initializer (TensorFlow):

seg = Segmenter(batch_size, vad_type=['sad'|'vad'], sr=16000, model_path="c:\\keras_speech_music_noise_cnn.hdf5", gender_path="c:\\keras_male_female_cnn.hdf5", ffmpeg_path='c:\\ffmpeg.exe', detect_gender=False, complete_output=False, device='cuda', input_type='file')

isig, wav, mfcc = seg(fname): Process audio file (MFCC output only in PyTorch model).
nmfcc = filter_fea(isig, mfcc, sr, max_time): Filter features (PyTorch only).
mfcc = MVN(mfcc): Mean-variance normalization (PyTorch only).
isig = filter_output(isig, max_silence, ignore_small_speech_segments, max_speech_len, split_speech_bigger_than): Filter output when complete_output=False.
seg2aud(isig, filename): Convert segments to audio.
seg2json(isig): Convert segments to JSON.
seg2Gender_Info(isig): Extract gender information from segments.
seg2Info(isig): Extract segment information.
wav_speech, wav_noise = filter_sig(isig, wav, sr): Get speech and noise parts (when complete_output=False).

sad_tf.segmentero:

from sad_tf.segmentero import Segmenter  # Use ONNX models (requires onnxruntime)

iman.sad_torch_mfcc_speaker

Initializer:

seg = Segmenter(batch_size, vad_type=['sad'|'vad'], sr=8000, model_path="c:\\sad_model_pytorch.pth", max_time=120, tq=1, ffmpeg_path='c:\\ffmpeg.exe', device='cuda', pad=False)

mfcc, len(sec) = seg(fname): Process audio file, MFCC padded to max_time if pad=True.

iman.sad_tf_mlp_speaker

Initializer:

seg = Segmenter(batch_size, vad_type=['sad'|'vad'], sr=8000, model_path="sad_tf_mlp.h5", max_time=120, tq=1, ffmpeg_path='c:\\ffmpeg.exe', device='cuda', pad=False)

mfcc, len(sec) = seg(fname): Process audio file, MFCC padded to max_time if pad=True.

iman.Report

Initializer:
```
r = Report.rep(log_dir=None)
```
WS(_type, _name, value, itr): Add scalar to TensorBoard.
WT(_type, _name, _str, itr): Add text to TensorBoard.
WG(pytorch_model, example_input): Add graph to TensorBoard.
WI(_type, _name, images, itr): Add image to TensorBoard.

iman.par

Parallel Processing:

if __name__ == '__main__':
    res = par.par(files, func, worker=4, args=[])  # func defined as: def func(fname, _args): ...

iman.Image

Image.convert(fname_pattern, ext='jpg', ofolder=None, w=-1, h=-1, level=100, worker=4, ffmpeg_path='c:\\ffmpeg.exe'): Convert images to specified format.
Image.resize(fname_pattern, ext='jpg', ofolder=None, w=2, h=2, worker=4, ffmpeg_path='c:\\ffmpeg.exe'): Resize images to 1/w and 1/h.

iman.Boors

Boors.get(sahm): Get stock information.

iman.Text

Initializer:

norm = Text.normal("c:\\Replace_List.txt")

norm.rep(str): Replace text based on normalization rules.
norm.from_file(filename, file_out=None): Normalize text from a file.

iman.num2fa

words(number): Convert number to Persian words.

iman.Rar

rar(fname, out="", rar_path=r"C:\\Program Files\\WinRAR\\winrar.exe"): Create RAR archive.
zip(fname, out="", rar_path=r"C:\\Program Files\\WinRAR\\winrar.exe"): Create ZIP archive.
unrar(fname, out="", rar_path=r"C:\\Program Files\\WinRAR\\winrar.exe"): Extract RAR archive.
unzip(fname, out="", rar_path=r"C:\\Program Files\\WinRAR\\winrar.exe"): Extract ZIP archive.

iman.Enhance

Enhance.Dereverb(pattern, out_fol, sr=16000, batchsize=16, device="cuda", model_path=r"C:\\UVR-DeEcho-DeReverb.pth"): Dereverberate audio files.
Enhance.Denoise(pattern, out_fol, sr=16000, batchsize=16, device="cuda", model_path=r"C:\\UVR-DeNoise-Lite.pth"): Denoise audio files.

iman.tf

flops(model): Get FLOPs of a TensorFlow model.
param(model): Get parameter count of a TensorFlow model.
paramp(model): Get parameter count and print model layers.
gpu(): Return True if GPU is available.
gpun(): Return number of GPUs.
limit(): Limit GPU memory allocation for TensorFlow models.

iman.torch

param(model): Get parameter and trainable count of a PyTorch model.
paramp(model): Get parameter count and print model layers.
layers(model): Get layers of a PyTorch model.
gpu(): Return True if GPU is available.
gpun(): Return number of GPUs.

iman.yt

dl(url): Download a YouTube video.
list_formats(url): List available formats for a YouTube link.

iman.svad

segments, wav = svad(filename, sampling_rate=16000, min_speech_duration_ms=250, max_speech_duration_s=float('inf'), min_silence_duration_ms=100): Run fast speech activity detection and return speech segments.

Dependencies

The iman package requires the following:

Python Packages: numpy, torch, tensorflow, speechbrain, librosa, matplotlib, pandas, onnxruntime (for ONNX models).
External Tools: ffmpeg, ffprobe, WinRAR (for RAR/ZIP operations).
Optional: Pre-trained models (e.g., for VAD, x-vector, dereverberation) specified in function arguments.

Check the package’s requirements.txt for specific versions.

Documentation

For detailed usage, refer to the source code or use the built-in help system:

from iman import examples
examples.help("Audio")  # Get help on the Audio module

Contributing

Contributions are welcome! Submit bug reports, feature requests, or pull requests via the project’s GitHub repository (if available). Follow contribution guidelines and include tests for new features.

License

iman is licensed under the MIT License (assumed). See the LICENSE file for details.

Contact

For support, contact the maintainers via the project’s GitHub page or email (if provided).

Project details

These details have not been verified by PyPI

Development Status
- 3 - Alpha
Intended Audience
- Education
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

This version

2.0.3

Oct 1, 2025

1.0.28

Aug 27, 2025

1.0.27

Jun 10, 2025

1.0.26

Jan 20, 2025

1.0.25

Jan 20, 2025

1.0.24

Dec 17, 2024

1.0.23

Nov 27, 2024

1.0.22

Jan 28, 2024

1.0.21

Jan 27, 2024

1.0.20

Jan 27, 2024

1.0.19

Jan 14, 2024

1.0.18

Jan 14, 2024

1.0.17

Jan 14, 2024

1.0.16

Jan 14, 2024

1.0.15

Jan 8, 2024

1.0.12

Jan 3, 2024

1.0.11

Dec 12, 2023

1.0.10

Nov 20, 2023

1.0.9

Nov 6, 2023

1.0.8

Oct 31, 2023

1.0.7

Oct 28, 2023

1.0.6

Oct 8, 2023

1.0.5

Sep 10, 2023

1.0.4

Sep 4, 2023

1.0.3

Sep 4, 2023

1.0.2

Sep 2, 2023

1.0.1

Sep 2, 2023

1.0.0

Aug 27, 2023

0.0.99

Aug 8, 2023

0.0.98

Aug 7, 2023

0.0.97

Aug 6, 2023

0.0.96

Aug 5, 2023

0.0.95

Aug 5, 2023

0.0.94

Jul 29, 2023

0.0.93

Jul 26, 2023

0.0.92

Jul 26, 2023

0.0.91

Jul 23, 2023

0.0.90

Jul 23, 2023

0.0.89

Jul 22, 2023

0.0.88

Jul 19, 2023

0.0.87

Jul 19, 2023

0.0.86

Jul 17, 2023

0.0.85

Jul 17, 2023

0.0.84

Jul 16, 2023

0.0.83

Jul 12, 2023

0.0.82

Jul 11, 2023

0.0.81

Jul 5, 2023

0.0.80

Jul 5, 2023

0.0.79

Jul 5, 2023

0.0.78

Jun 10, 2023

0.0.77

Feb 5, 2023

0.0.76

Feb 5, 2023

0.0.75

Feb 1, 2023

0.0.74

Jan 31, 2023

0.0.73

Jan 29, 2023

0.0.72

Jan 18, 2023

0.0.71

Jan 8, 2023

0.0.70

Dec 26, 2022

0.0.68

Dec 25, 2022

0.0.67

Dec 25, 2022

0.0.65

Dec 25, 2022

0.0.64

Dec 21, 2022

0.0.63

Dec 21, 2022

0.0.62

Dec 21, 2022

0.0.61

Dec 20, 2022

0.0.60

Dec 17, 2022

0.0.59

Dec 17, 2022

0.0.58

Dec 17, 2022

0.0.57

Dec 17, 2022

0.0.56

Dec 17, 2022

0.0.55

Dec 14, 2022

0.0.54

Dec 14, 2022

0.0.53

Dec 14, 2022

0.0.52

Dec 14, 2022

0.0.51

Dec 12, 2022

0.0.50

Dec 12, 2022

0.0.49

Dec 12, 2022

0.0.48

Dec 12, 2022

0.0.47

Dec 11, 2022

0.0.46

Dec 11, 2022

0.0.45

Dec 10, 2022

0.0.44

Dec 5, 2022

0.0.43

Dec 5, 2022

0.0.42

Dec 5, 2022

0.0.41

Dec 4, 2022

0.0.40

Dec 3, 2022

0.0.39

Nov 30, 2022

0.0.38

Nov 30, 2022

0.0.37

Nov 29, 2022

0.0.36

Nov 29, 2022

0.0.35

Nov 29, 2022

0.0.34

Nov 28, 2022

0.0.33

Nov 26, 2022

0.0.32

Nov 22, 2022

0.0.31

Nov 20, 2022

0.0.30

Nov 20, 2022

0.0.29

Nov 13, 2022

0.0.28

Nov 13, 2022

0.0.27

Nov 12, 2022

0.0.26

Nov 9, 2022

0.0.25

Nov 9, 2022

0.0.24

Nov 8, 2022

0.0.23

Nov 8, 2022

0.0.22

Nov 8, 2022

0.0.21

Nov 8, 2022

0.0.20

Nov 8, 2022

0.0.19

Nov 8, 2022

0.0.18

Nov 8, 2022

0.0.17

Nov 8, 2022

0.0.16

Nov 8, 2022

0.0.15

Nov 6, 2022

0.0.14

Nov 6, 2022

0.0.13

Nov 6, 2022

0.0.12

Nov 6, 2022

0.0.11

Oct 26, 2022

0.0.10

Oct 26, 2022

0.0.9

Oct 26, 2022

0.0.8

Oct 26, 2022

0.0.7

Oct 26, 2022

0.0.6

Oct 25, 2022

0.0.5

Oct 24, 2022

0.0.4

Oct 24, 2022

0.0.3

Oct 24, 2022

0.0.2

Oct 24, 2022

0.0.1

Oct 24, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

iman-2.0.3.tar.gz (2.1 MB view details)

Uploaded Oct 1, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

iman-2.0.3-py3-none-any.whl (2.2 MB view details)

Uploaded Oct 1, 2025 Python 3

File details

Details for the file iman-2.0.3.tar.gz.

File metadata

Download URL: iman-2.0.3.tar.gz
Upload date: Oct 1, 2025
Size: 2.1 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.7.0

File hashes

Hashes for iman-2.0.3.tar.gz
Algorithm	Hash digest
SHA256	`d13b99efefb97d6e66e24373dac9a5f126f6eb94025e62fa0f2eb23bfafac586`
MD5	`584d784146ebff1eaa5a2a6e8929e553`
BLAKE2b-256	`0ee9a918b682a14b24992183b23aa2284342c535f88565ae31cd2968f56cc980`

See more details on using hashes here.

File details

Details for the file iman-2.0.3-py3-none-any.whl.

File metadata

Download URL: iman-2.0.3-py3-none-any.whl
Upload date: Oct 1, 2025
Size: 2.2 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.7.0

File hashes

Hashes for iman-2.0.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3c4d22822121662135d4781cda528be4b9d343eb1823245e14bdb7f011b6d9c8`
MD5	`bd42febe889577a3f1922ebcfe74197a`
BLAKE2b-256	`8d542d6980af54147231ea15805684c936c4f664fde10c9d2db03783e5746042`

See more details on using hashes here.

iman 2.0.3

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Overview

Installation

Usage

Modules and Functions

iman

iman.Audio

iman.info

iman.metrics

iman.tsne

iman.xvector

iman.web

iman.matlab

iman.Features

iman.AUG

iman.sad_torch_mfcc | iman.sad_tf

iman.sad_torch_mfcc_speaker

iman.sad_tf_mlp_speaker

iman.Report

iman.par

iman.Image

iman.Boors

iman.Text

iman.num2fa

iman.Rar

iman.Enhance

iman.tf

iman.torch

iman.yt

iman.svad

Dependencies

Documentation

Contributing

License

Contact

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes