Skip to main content

Community detection via Louvain/Leiden + Genetic Algorithm

Project description

TAU: Parallel Genetic Clustering with Louvain and Leiden

PyPI Build License: MIT Python 3.8+

TAU is a high-performance Python package for modularity-based community detection using a hybrid of genetic algorithms and graph clustering methods (Louvain and Leiden). It is built for scalability, parallelism, and usability across research and applied machine learning settings.


Features

  • Genetic Initialization: Enhances clustering quality by optimizing initial conditions with evolutionary search.
  • Pluggable Clustering Engines: Supports Louvain and Leiden algorithms.
  • Parallel Execution: Uses Python's multiprocessing/threading for speed-up across CPUs.
  • Flexible Graph Input: Read from adjacency lists, edge lists, CSVs, or pandas DataFrames.
  • CLI and API Access: Run from the terminal or call programmatically in Python.
  • Reproducible Runs: Optional random seed parameter for repeatable experiments.

Installation

From PyPI

pip install community_TAU

From Source

git clone https://github.com/HillelCharbit/community_TAU.git
cd community_TAU
pip install .

Quick Start

Command-Line Interface

python -m community_TAU --graph data/example.graph --size 80 --workers 4 --max_generations 300

Command-Line Arguments

Argument Description Default
--graph Path to graph file (adjacency list, edge list, etc.) Required
--size Population size for genetic algorithm 60
--workers Number of parallel workers All available cores
--max_generations Maximum number of generations 500
--seed Random seed (optional) None

Python API Example

from community_TAU import community_TAU
import networkx as nx

G = nx.read_edgelist("example.graph")
result = community_TAU(G, size=100, workers=8, max_generations=400)

# Access results
print(result.partition)
print(result.modularity)

Input Formats Supported

  • Adjacency list (.graph)
  • Edge list (.csv, .txt)
  • Adjacency matrix (CSV or DataFrame)

Conversion tools are available in the graph_loader module.


Output

  • Final partition (dictionary of node → community)
  • Final modularity score
  • Optional generation logs and fitness history

Testing

To run unit tests:

pytest tests/

Test coverage includes input parsing, genetic algorithm logic, clustering evaluation, and parallel execution.


Contributing

We welcome contributions! To get started:

  1. Fork this repository
  2. Create a feature branch (git checkout -b feature/my-feature)
  3. Make your changes and commit (git commit -am 'Add new feature')
  4. Push to the branch (git push origin feature/my-feature)
  5. Open a pull request

Please review the contributing guidelines before submitting changes.


License

This project is licensed under the MIT License.


Acknowledgments

  • Louvain and Leiden algorithms based on work by Blondel et al. and Traag et al.
  • Graph handling inspired by NetworkX and igraph interfaces.

Project Status

TAU is actively maintained and under continuous development. Feedback and issues are welcome on the GitHub issue tracker.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tau_community_detection-0.3.10.tar.gz (10.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tau_community_detection-0.3.10-py3-none-any.whl (8.9 kB view details)

Uploaded Python 3

File details

Details for the file tau_community_detection-0.3.10.tar.gz.

File metadata

File hashes

Hashes for tau_community_detection-0.3.10.tar.gz
Algorithm Hash digest
SHA256 7efc3d49dbdb36a02dc943943bc05b06567a63e5aefb1533ecb0cbaee25c8aa6
MD5 238687633d89e67d3c9c98a7fec976a7
BLAKE2b-256 87978b917a3eb719d53940b5929d44ba28db16342d79ff17faacbf851e693493

See more details on using hashes here.

File details

Details for the file tau_community_detection-0.3.10-py3-none-any.whl.

File metadata

File hashes

Hashes for tau_community_detection-0.3.10-py3-none-any.whl
Algorithm Hash digest
SHA256 3cba3d9fb6a1e92f4dafbf1ab79abf777f112292b02833925f2bc764db8c6092
MD5 7d4ed002f9a4aa541938b2120c48728f
BLAKE2b-256 55eb903bc134751ea126c5d10c7f0fd05a4f5dec0bad1d53475cb2bd7558219d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page