Estimating the Minimum Dominating Set with a sublogarithmic approximation ratio for undirected graphs encoded in DIMACS format.
Baldor: Minimum Dominating Set Solver
This work builds upon New Insights and Developments on the Dominating Set Problem.
Overview of the Minimum Dominating Set (MDS)
Definition:
A dominating set in a graph $G = (V, E)$ is a subset $D \subseteq V$ such that every vertex not in $D$ is adjacent to at least one vertex in $D$. The minimum dominating set (MDS) is the smallest possible dominating set in terms of the number of vertices.
Key Concepts:
- Graph Representation:
  - $V$: Set of vertices.
  - $E$: Set of edges connecting the vertices.
- Dominating Set:
  - A set $D$ where for every vertex $v \in V$, either $v \in D$ or $v$ is adjacent to some vertex in $D$.
- Minimum Dominating Set:
  - The dominating set with the smallest cardinality (i.e., the fewest vertices).
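The definition above translates directly into a membership check. The following is a minimal sketch, assuming the graph is stored as a dictionary that maps each vertex to the set of its neighbours; the function name `is_dominating_set` and this representation are illustrative choices, not part of Baldor's API.

```python
def is_dominating_set(adj, candidate):
    """Return True if every vertex is in `candidate` or has a neighbour in it."""
    chosen = set(candidate)
    # A vertex is dominated when it is chosen or shares an edge with a chosen vertex.
    return all(v in chosen or bool(chosen & adj[v]) for v in adj)


# The 5-vertex example from the Problem Statement: edges 1-3, 1-5, 2-4, 3-5.
adj = {1: {3, 5}, 2: {4}, 3: {1, 5}, 4: {2}, 5: {1, 3}}

print(is_dominating_set(adj, {1, 2}))  # True
print(is_dominating_set(adj, {1}))     # False: vertices 2 and 4 are not dominated
```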
Applications:
- Network Design: Ensuring coverage in wireless sensor networks.
- Social Networks: Identifying influential nodes.
- Game Theory: Strategies in certain types of games.
- Biology: Modeling protein-protein interaction networks.
Computational Complexity:
- NP-Hard: Finding the minimum dominating set is computationally intensive for large graphs.
- Approximation Algorithms: Used to find near-optimal solutions in polynomial time.
Algorithms:
- Greedy Algorithm:
  - Iteratively selects the vertex that covers the most uncovered vertices.
  - Provides a logarithmic approximation ratio.
- Integer Linear Programming (ILP):
  - Formulates the problem as an optimization problem.
  - Solvable using ILP solvers for exact solutions, though computationally expensive.
- Heuristics and Metaheuristics:
  - Genetic algorithms, simulated annealing, etc., for large-scale problems.
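The greedy strategy listed above can be sketched in a few lines. This is a generic textbook version, not Baldor's algorithm; it assumes an adjacency dictionary with sortable vertex labels, and the name `greedy_dominating_set` is hypothetical.

```python
def greedy_dominating_set(adj):
    """Greedy heuristic: repeatedly pick the vertex that dominates the most
    still-undominated vertices (ties go to the smallest label)."""
    undominated = set(adj)
    chosen = set()
    while undominated:
        # The closed neighbourhood {v} | adj[v] is what v would dominate.
        best = max(sorted(adj), key=lambda v: len(({v} | adj[v]) & undominated))
        chosen.add(best)
        undominated -= {best} | adj[best]
    return chosen


adj = {1: {3, 5}, 2: {4}, 3: {1, 5}, 4: {2}, 5: {1, 3}}
print(sorted(greedy_dominating_set(adj)))  # [1, 2]
```

Each round dominates at least one new vertex, so the loop terminates after at most $n$ rounds.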
Challenges:
- Scalability: Exact algorithms are infeasible for very large graphs.
- Dynamic Graphs: Maintaining a minimum dominating set in graphs that change over time.
Research Directions:
- Parallel Algorithms: Leveraging multi-core processors and distributed computing.
- Machine Learning: Using learning-based approaches to predict dominating sets.
- Hybrid Methods: Combining exact and heuristic methods for better performance.
Conclusion:
The minimum dominating set problem is a fundamental issue in graph theory with wide-ranging applications. While it is computationally challenging, various algorithms and heuristics provide practical solutions for different scenarios. Ongoing research continues to improve the efficiency and applicability of these methods.
Problem Statement
Input: A Boolean Adjacency Matrix $M$.
Answer: Find a Minimum Dominating Set.
Example Instance: 5 x 5 matrix
| | c1 | c2 | c3 | c4 | c5 |
|---|---|---|---|---|---|
| r1 | 0 | 0 | 1 | 0 | 1 |
| r2 | 0 | 0 | 0 | 1 | 0 |
| r3 | 1 | 0 | 0 | 0 | 1 |
| r4 | 0 | 1 | 0 | 0 | 0 |
| r5 | 1 | 0 | 1 | 0 | 0 |
The input for an undirected graph is typically provided in DIMACS format. The adjacency matrix above is stored in a text file as follows:
p edge 5 4
e 1 3
e 1 5
e 2 4
e 3 5
This is the DIMACS encoding of the 5 x 5 matrix above. The problem line p edge 5 4 declares 5 vertices and 4 edges. Each edge $(v, w)$ appears exactly once in the input file and is not repeated as $(w, v)$. Every edge appears in the form
e W V
where the fields W and V specify the endpoints of the edge while the lower-case character e signifies that this is an edge descriptor line.
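To make the encoding concrete, here is a small sketch that converts a symmetric Boolean matrix into DIMACS text and reads it back into an adjacency dictionary. The helpers `matrix_to_dimacs` and `parse_dimacs` are hypothetical names written for this illustration; they are not Baldor's own reader, and they handle only the p, e, and c (comment) lines of the format.

```python
def matrix_to_dimacs(matrix):
    """Encode a symmetric 0/1 matrix as DIMACS text, one line per edge."""
    n = len(matrix)
    # Scan only the upper triangle so each edge (v, w) is emitted once.
    edges = [(i + 1, j + 1) for i in range(n) for j in range(i + 1, n) if matrix[i][j]]
    lines = [f"p edge {n} {len(edges)}"]
    lines += [f"e {u} {v}" for u, v in edges]
    return "\n".join(lines)


def parse_dimacs(text):
    """Parse DIMACS text into {vertex: set_of_neighbours} with 1-based labels."""
    adj = {}
    for line in text.splitlines():
        fields = line.split()
        if not fields or fields[0] == "c":  # skip blank and comment lines
            continue
        if fields[0] == "p":                # problem line: p edge <n> <m>
            adj = {v: set() for v in range(1, int(fields[2]) + 1)}
        elif fields[0] == "e":              # edge line: e <u> <v>
            u, v = int(fields[1]), int(fields[2])
            adj.setdefault(u, set()).add(v)
            adj.setdefault(v, set()).add(u)
    return adj


matrix = [
    [0, 0, 1, 0, 1],
    [0, 0, 0, 1, 0],
    [1, 0, 0, 0, 1],
    [0, 1, 0, 0, 0],
    [1, 0, 1, 0, 0],
]
text = matrix_to_dimacs(matrix)
print(text)
print(parse_dimacs(text))
```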
Example Solution:
Dominating Set Found 1, 2: Nodes 1 and 2 constitute an optimal solution.
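For a graph this small, optimality can be confirmed by exhaustive search. The sketch below tries subsets in order of increasing size and returns the first one that dominates every vertex. It is an illustrative exponential-time check under the assumption of an adjacency-dictionary input; it is not the code behind Baldor's brute-force option, and `brute_force_mds` is a hypothetical name.

```python
from itertools import combinations


def brute_force_mds(adj):
    """Return a minimum dominating set by trying subsets of increasing size."""
    vertices = sorted(adj)
    for size in range(len(vertices) + 1):
        for subset in combinations(vertices, size):
            chosen = set(subset)
            if all(v in chosen or chosen & adj[v] for v in vertices):
                return chosen  # first hit has the smallest possible size
    # Unreachable: the full vertex set always dominates the graph.


adj = {1: {3, 5}, 2: {4}, 3: {1, 5}, 4: {2}, 5: {1, 3}}
print(sorted(brute_force_mds(adj)))  # [1, 2]
```

No single vertex dominates both the triangle 1-3-5 and the edge 2-4, so two vertices are necessary here.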
Approximate Dominating Set Algorithm Analysis
Overview
This algorithm computes an approximate Dominating Set for an undirected graph in polynomial time. It leverages edge covers and dominating sets on trees to achieve a sublogarithmic approximation ratio. The implementation is carried out using the NetworkX library in Python.
Runtime Analysis
The runtime complexity of the algorithm is analyzed as follows:
- Removing isolated nodes: $O(n)$, where $n$ is the number of nodes.
- Finding minimum edge cover: $O(n^3)$, using the Edmonds-Gallai decomposition.
- Creating subgraph: $O(m)$, where $m$ is the number of edges in the minimum edge cover.
- Finding connected components: $O(n + m)$.
- For each connected component:
  - Creating subgraph: $O(n_i + m_i)$, where $n_i$ and $m_i$ are the number of nodes and edges in the component.
  - Finding the Dominating Set of the subgraph (each component is a tree): $O(n_i)$.
- Remove redundant nodes: $O(n \cdot m)$.
The dominant factor in the runtime is the computation of the minimum edge cover, which has a cubic time complexity. Thus, the overall time complexity of the algorithm is $O(n^3)$.
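The first four steps of this analysis can be sketched with NetworkX. This is a reading of the step list above, not the project's source code: the function name `edge_cover_components` is hypothetical, and only documented NetworkX calls (`isolates`, `min_edge_cover`, `edge_subgraph`, `connected_components`) are used.

```python
import networkx as nx


def edge_cover_components(graph):
    """Drop isolated nodes, compute a minimum edge cover, and return the
    connected components of the subgraph formed by the cover's edges."""
    work = graph.copy()
    work.remove_nodes_from(list(nx.isolates(work)))
    if work.number_of_nodes() == 0:
        return []
    cover = nx.min_edge_cover(work)        # set of (u, v) tuples
    forest = work.edge_subgraph(cover)     # keeps only the cover's edges
    return [forest.subgraph(c).copy() for c in nx.connected_components(forest)]


G = nx.Graph([(1, 3), (1, 5), (2, 4), (3, 5)])
parts = edge_cover_components(G)
print(sorted(sorted(p.nodes) for p in parts))  # [[1, 3, 5], [2, 4]]
print(all(nx.is_tree(p) for p in parts))       # True
```

A minimum edge cover contains no cycle, because removing a cycle edge would leave every node covered by a smaller set. That is why each component can be handled as a tree.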
Correctness
The correctness of the algorithm is grounded in the following principles:
- It correctly handles edge cases, such as empty graphs or graphs with no edges.
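- Isolated nodes are removed before the edge cover is computed, since they have no incident edges. Note that, by the definition above, an isolated node can only be dominated by itself.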
- The minimum edge cover ensures that every remaining node is incident to at least one selected edge.
- Finding the minimum Dominating Set in a tree is solvable in polynomial time.
- Each connected component is processed independently, ensuring correctness for disconnected graphs.
- Redundant nodes in the Dominating Set are removed at the conclusion of the algorithm.
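The tree case mentioned in the list can be solved exactly in linear time with a classic leaf-to-root greedy rule: visit vertices from the deepest upward and, whenever a vertex is still undominated, select its parent. The sketch below is a generic textbook routine, assuming a connected tree given as an adjacency dictionary; `tree_dominating_set` is a hypothetical name and this is not claimed to be the routine Baldor uses.

```python
from collections import deque


def tree_dominating_set(adj, root):
    """Minimum dominating set of a tree via the leaf-to-root greedy rule."""
    # Breadth-first search records each vertex's parent and a top-down order.
    parent = {root: None}
    order = []
    queue = deque([root])
    while queue:
        v = queue.popleft()
        order.append(v)
        for w in adj[v]:
            if w not in parent:
                parent[w] = v
                queue.append(w)

    chosen, dominated = set(), set()
    # Reverse BFS order visits every child before its parent.
    for v in reversed(order):
        if v not in dominated:
            pick = parent[v] if parent[v] is not None else v
            chosen.add(pick)
            dominated.add(pick)
            dominated |= adj[pick]
    return chosen


path = {1: {2}, 2: {1, 3}, 3: {2, 4}, 4: {3, 5}, 5: {4}}
print(sorted(tree_dominating_set(path, root=1)))  # [1, 4]
```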
While this algorithm does not guarantee an optimal solution, it provides an approximation with a sublogarithmic ratio, which is theoretically sound for the Dominating Set problem.
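The final pruning step named in the list above can be sketched as a single pass that drops any node whose removal still leaves every vertex dominated. This is an illustrative version that assumes its input is already a dominating set and that vertex labels are sortable; the name `prune_redundant` is hypothetical.

```python
def prune_redundant(adj, dominating_set):
    """Remove nodes that are not needed to keep the set dominating."""
    chosen = set(dominating_set)
    for v in sorted(dominating_set):
        trial = chosen - {v}
        # Keep the smaller set only if every vertex is still dominated.
        if all(u in trial or trial & adj[u] for u in adj):
            chosen = trial
    return chosen


adj = {1: {3, 5}, 2: {4}, 3: {1, 5}, 4: {2}, 5: {1, 3}}
print(sorted(prune_redundant(adj, {1, 2, 3})))  # [2, 3]
```

The result is minimal with respect to inclusion, which is weaker than minimum: no single node can be dropped, but a smaller dominating set may still exist in general.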
Compile and Environment
Prerequisites
- Python ≥ 3.10
Installation
pip install baldor
Execution
- Clone the repository:
  git clone https://github.com/frankvegadelgado/baldor.git
  cd baldor
- Run the script:
  solve -i ./benchmarks/testMatrix1
  This uses the solve command provided by Baldor's library to process the Boolean adjacency matrix baldor\benchmarks\testMatrix1. The file testMatrix1 represents the example described herein. We also support .xz, .lzma, .bz2, and .bzip2 compressed text files.
  Example Output:
  testMatrix1: Dominating Set Found 1, 2
  This indicates nodes 1, 2 form a Dominating Set.
Dominating Set Size
Use the -c flag to count the nodes in the Dominating Set:
solve -i ./benchmarks/testMatrix2 -c
Output:
testMatrix2: Dominating Set Size 2
Command Options
Display help and options:
solve -h
Output:
usage: solve [-h] -i INPUTFILE [-a] [-b] [-c] [-v] [-l] [--version]
Estimating the Minimum Dominating Set with a sublogarithmic approximation ratio encoded for undirected graph in DIMACS format.
options:
-h, --help show this help message and exit
-i INPUTFILE, --inputFile INPUTFILE
input file path
-a, --approximation enable comparison with another polynomial-time approximation approach within a logarithmic factor
-b, --bruteForce enable comparison with the exponential-time brute-force approach
-c, --count calculate the size of the Dominating Set
-v, --verbose anable verbose output
-l, --log enable file logging
--version show program's version number and exit
Batch Execution
Batch execution allows you to solve multiple graphs within a directory consecutively.
To view available command-line options for the batch_solve command, use the following in your terminal or command prompt:
batch_solve -h
This will display the following help information:
usage: batch_solve [-h] -i INPUTDIRECTORY [-a] [-b] [-c] [-v] [-l] [--version]
Estimating the Minimum Dominating Set with a sublogarithmic approximation ratio for all undirected graphs encoded in DIMACS format and stored in a directory.
options:
-h, --help show this help message and exit
-i INPUTDIRECTORY, --inputDirectory INPUTDIRECTORY
Input directory path
-a, --approximation enable comparison with another polynomial-time approximation approach within a logarithmic factor
-b, --bruteForce enable comparison with the exponential-time brute-force approach
-c, --count calculate the size of the Dominating Set
-v, --verbose anable verbose output
-l, --log enable file logging
--version show program's version number and exit
Testing Application
A command-line utility named test_solve is provided for evaluating the algorithm on randomly generated, large sparse matrices. It supports the following options:
usage: test_solve [-h] -d DIMENSION [-n NUM_TESTS] [-s SPARSITY] [-a] [-b] [-c] [-w] [-v] [-l] [--version]
The Baldor Testing Application using randomly generated, large sparse matrices.
options:
-h, --help show this help message and exit
-d DIMENSION, --dimension DIMENSION
an integer specifying the dimensions of the square matrices
-n NUM_TESTS, --num_tests NUM_TESTS
an integer specifying the number of tests to run
-s SPARSITY, --sparsity SPARSITY
sparsity of the matrices (0.0 for dense, close to 1.0 for very sparse)
-a, --approximation enable comparison with another polynomial-time approximation approach within a logarithmic factor
-b, --bruteForce enable comparison with the exponential-time brute-force approach
-c, --count calculate the size of the Dominating Set
-w, --write write the generated random matrix to a file in the current directory
-v, --verbose anable verbose output
-l, --log enable file logging
--version show program's version number and exit
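As a rough picture of the kind of input such a test uses, the sketch below builds a random symmetric Boolean matrix. It assumes that each possible edge is kept with probability 1 - sparsity, which matches the help text (0.0 dense, close to 1.0 very sparse) but is only a guess at how test_solve generates its matrices; the function name and the `seed` parameter are hypothetical.

```python
import random


def random_adjacency_matrix(dimension, sparsity, seed=None):
    """Random symmetric 0/1 matrix with a zero diagonal.

    Assumption: each off-diagonal pair becomes an edge with probability
    1 - sparsity.
    """
    rng = random.Random(seed)
    matrix = [[0] * dimension for _ in range(dimension)]
    for i in range(dimension):
        for j in range(i + 1, dimension):
            if rng.random() >= sparsity:
                matrix[i][j] = matrix[j][i] = 1  # mirror to keep symmetry
    return matrix


sample = random_adjacency_matrix(6, sparsity=0.7, seed=42)
for row in sample:
    print(row)
```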
Code
- Python implementation by Frank Vega.
Complexity
- We present a polynomial-time algorithm achieving a sublogarithmic approximation factor for MDS, providing strong evidence that P = NP by efficiently solving a computationally hard problem with near-optimal solutions.
License
- MIT License.