Skip to main content

Community detection via Louvain/Leiden + Genetic Algorithm

Project description

TAU: Parallel Genetic Clustering with Louvain and Leiden

PyPI Build License: MIT Python 3.8+

TAU is a high-performance Python package for modularity-based community detection using a hybrid of genetic algorithms and graph clustering methods (Louvain and Leiden). It is built for scalability, parallelism, and usability across research and applied machine learning settings.


Features

  • Genetic Initialization: Enhances clustering quality by optimizing initial conditions with evolutionary search.
  • Pluggable Clustering Engines: Supports Louvain and Leiden algorithms.
  • Parallel Execution: Uses Python's multiprocessing/threading for speed-up across CPUs.
  • Flexible Graph Input: Read from adjacency lists, edge lists, CSVs, or pandas DataFrames.
  • CLI and API Access: Run from the terminal or call programmatically in Python.
  • Reproducible Runs: Optional random seed parameter for repeatable experiments.

Installation

From PyPI

pip install community_TAU

From Source

git clone https://github.com/HillelCharbit/community_TAU.git
cd community_TAU
pip install .

Quick Start

Command-Line Interface

python -m community_TAU --graph data/example.graph --size 80 --workers 4 --max_generations 300

Command-Line Arguments

Argument Description Default
--graph Path to graph file (adjacency list, edge list, etc.) Required
--size Population size for genetic algorithm 60
--workers Number of parallel workers All available cores
--max_generations Maximum number of generations 500
--seed Random seed (optional) None

Python API Example

from community_TAU import community_TAU
import networkx as nx

G = nx.read_edgelist("example.graph")
result = community_TAU(G, size=100, workers=8, max_generations=400)

# Access results
print(result.partition)
print(result.modularity)

Input Formats Supported

  • Adjacency list (.graph)
  • Edge list (.csv, .txt)
  • Adjacency matrix (CSV or DataFrame)

Conversion tools are available in the graph_loader module.


Output

  • Final partition (dictionary of node → community)
  • Final modularity score
  • Optional generation logs and fitness history

Testing

To run unit tests:

pytest tests/

Test coverage includes input parsing, genetic algorithm logic, clustering evaluation, and parallel execution.


Contributing

We welcome contributions! To get started:

  1. Fork this repository
  2. Create a feature branch (git checkout -b feature/my-feature)
  3. Make your changes and commit (git commit -am 'Add new feature')
  4. Push to the branch (git push origin feature/my-feature)
  5. Open a pull request

Please review the contributing guidelines before submitting changes.


License

This project is licensed under the MIT License.


Acknowledgments

  • Louvain and Leiden algorithms based on work by Blondel et al. and Traag et al.
  • Graph handling inspired by NetworkX and igraph interfaces.

Project Status

TAU is actively maintained and under continuous development. Feedback and issues are welcome on the GitHub issue tracker.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tau_community_detection-0.3.23.tar.gz (11.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tau_community_detection-0.3.23-py3-none-any.whl (9.0 kB view details)

Uploaded Python 3

File details

Details for the file tau_community_detection-0.3.23.tar.gz.

File metadata

File hashes

Hashes for tau_community_detection-0.3.23.tar.gz
Algorithm Hash digest
SHA256 d254a67337e876327df6fd5494d9ab2015206b221e8c28ad6c2526751112daf8
MD5 eec343f0028a89f3510349723b96077b
BLAKE2b-256 de0762d459838f1a0b7451f8d714e02407b8a95984abc64ac9035b7c30aae37a

See more details on using hashes here.

File details

Details for the file tau_community_detection-0.3.23-py3-none-any.whl.

File metadata

File hashes

Hashes for tau_community_detection-0.3.23-py3-none-any.whl
Algorithm Hash digest
SHA256 4a7a310678fc21076a7464aa9b307e4ab460f22f0640fa34060d8ccd18d53be7
MD5 d35ed5967a0830e89e07fb8b2eac3653
BLAKE2b-256 57028f7455f5b633f8cf78dae50a727827c697a6c786d295220e09866f057507

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page