Skip to main content

Community detection via Louvain/Leiden + Genetic Algorithm

Project description

TAU: Parallel Genetic Clustering with Louvain and Leiden

PyPI Build License: MIT Python 3.8+

TAU is a high-performance Python package for modularity-based community detection using a hybrid of genetic algorithms and graph clustering methods (Louvain and Leiden). It is built for scalability, parallelism, and usability across research and applied machine learning settings.


Features

  • Genetic Initialization: Enhances clustering quality by optimizing initial conditions with evolutionary search.
  • Pluggable Clustering Engines: Supports Louvain and Leiden algorithms.
  • Parallel Execution: Uses Python's multiprocessing/threading for speed-up across CPUs.
  • Flexible Graph Input: Read from adjacency lists, edge lists, CSVs, or pandas DataFrames.
  • CLI and API Access: Run from the terminal or call programmatically in Python.
  • Reproducible Runs: Optional random seed parameter for repeatable experiments.

Installation

From PyPI

pip install community_TAU

From Source

git clone https://github.com/HillelCharbit/community_TAU.git
cd community_TAU
pip install .

Quick Start

Command-Line Interface

python -m community_TAU --graph data/example.graph --size 80 --workers 4 --max_generations 300

Command-Line Arguments

Argument Description Default
--graph Path to graph file (adjacency list, edge list, etc.) Required
--size Population size for genetic algorithm 60
--workers Number of parallel workers All available cores
--max_generations Maximum number of generations 500
--seed Random seed (optional) None

Python API Example

from community_TAU import community_TAU
import networkx as nx

G = nx.read_edgelist("example.graph")
result = community_TAU(G, size=100, workers=8, max_generations=400)

# Access results
print(result.partition)
print(result.modularity)

Input Formats Supported

  • Adjacency list (.graph)
  • Edge list (.csv, .txt)
  • Adjacency matrix (CSV or DataFrame)

Conversion tools are available in the graph_loader module.


Output

  • Final partition (dictionary of node → community)
  • Final modularity score
  • Optional generation logs and fitness history

Testing

To run unit tests:

pytest tests/

Test coverage includes input parsing, genetic algorithm logic, clustering evaluation, and parallel execution.


Contributing

We welcome contributions! To get started:

  1. Fork this repository
  2. Create a feature branch (git checkout -b feature/my-feature)
  3. Make your changes and commit (git commit -am 'Add new feature')
  4. Push to the branch (git push origin feature/my-feature)
  5. Open a pull request

Please review the contributing guidelines before submitting changes.


License

This project is licensed under the MIT License.


Acknowledgments

  • Louvain and Leiden algorithms based on work by Blondel et al. and Traag et al.
  • Graph handling inspired by NetworkX and igraph interfaces.

Project Status

TAU is actively maintained and under continuous development. Feedback and issues are welcome on the GitHub issue tracker.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tau_community_detection-0.3.22.tar.gz (11.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tau_community_detection-0.3.22-py3-none-any.whl (9.0 kB view details)

Uploaded Python 3

File details

Details for the file tau_community_detection-0.3.22.tar.gz.

File metadata

File hashes

Hashes for tau_community_detection-0.3.22.tar.gz
Algorithm Hash digest
SHA256 f30e79a430d12ff379d9f36bdc63b48df92afeaab02d6ae47ac77995aa40fd38
MD5 3b4a437c300f4af4340e277cb7d99f62
BLAKE2b-256 fa8344e22a23bdb578bf1007adcbabbb4fee5e9916c3ac7dd18f3dae5d1f34e4

See more details on using hashes here.

File details

Details for the file tau_community_detection-0.3.22-py3-none-any.whl.

File metadata

File hashes

Hashes for tau_community_detection-0.3.22-py3-none-any.whl
Algorithm Hash digest
SHA256 1ae4471d02e24e209f1073320dd1e59c8df46c3e68e8ecfe8094c71987fc465c
MD5 6f3bec3395344d4dc86e26026eaf8ad5
BLAKE2b-256 b693387c6086d92ed99400a33b293676da267c8658f19bcee721d1f35b2f7d2d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page