Compute the Approximate Vertex Cover for undirected graph encoded in DIMACS format.
Project description
Hvala: Approximate Vertex Cover Solver
This work builds upon Disproving the Unique Games Conjecture.
The Minimum Vertex Cover Problem
The Minimum Vertex Cover (MVC) problem is a classic optimization problem in computer science and graph theory. It involves finding the smallest set of vertices in a graph that covers all edges, meaning at least one endpoint of every edge is included in the set.
Formal Definition
Given an undirected graph $G = (V, E)$, a vertex cover is a subset $V' \subseteq V$ such that for every edge $(u, v) \in E$, at least one of $u$ or $v$ belongs to $V'$. The MVC problem seeks the vertex cover with the smallest cardinality.
Importance and Applications
- Theoretical Significance: MVC is a well-known NP-hard problem, central to complexity theory.
- Practical Applications:
- Network Security: Identifying critical nodes to disrupt connections.
- Bioinformatics: Analyzing gene regulatory networks.
- Wireless Sensor Networks: Optimizing sensor coverage.
Related Problems
- Maximum Independent Set: The complement of a vertex cover.
- Set Cover Problem: A generalization of MVC.
Problem Statement
Input: A Boolean Adjacency Matrix $M$.
Answer: Find a Minimum Vertex Cover.
Example Instance: 5 x 5 matrix
| c1 | c2 | c3 | c4 | c5 | |
|---|---|---|---|---|---|
| r1 | 0 | 0 | 1 | 0 | 1 |
| r2 | 0 | 0 | 0 | 1 | 0 |
| r3 | 1 | 0 | 0 | 0 | 1 |
| r4 | 0 | 1 | 0 | 0 | 0 |
| r5 | 1 | 0 | 1 | 0 | 0 |
The input for undirected graph is typically provided in DIMACS format. In this way, the previous adjacency matrix is represented in a text file using the following string representation:
p edge 5 4
e 1 3
e 1 5
e 2 4
e 3 5
This represents a 5x5 matrix in DIMACS format such that each edge $(v,w)$ appears exactly once in the input file and is not repeated as $(w,v)$. In this format, every edge appears in the form of
e W V
where the fields W and V specify the endpoints of the edge while the lower-case character e signifies that this is an edge descriptor line.
Example Solution:
Vertex Cover Found 1, 2, 3: Nodes 1, 2, and 3 constitute an optimal solution.
Vertex Cover via Degree Reduction Algorithm
Algorithm Overview
The Vertex Cover via Degree Reduction Algorithm is a polynomial-time approximation algorithm that finds near-optimal vertex covers by transforming the input graph into a simpler structure where optimal solutions can be computed efficiently.
Core Approach
- Preprocessing: Remove self-loops and isolated vertices from the input graph
- Component Decomposition: Process each connected component independently
- Degree Reduction: Transform each component using a novel reduction technique:
- Replace each vertex
uof degreekwithkauxiliary vertices - Connect each auxiliary vertex to one of
u's original neighbors - Assign weight
1/kto each auxiliary vertex - Resulting graph has maximum degree ≤ 1 (paths and cycles only)
- Replace each vertex
- Optimal Solving: Apply two different greedy algorithms on the reduced graph:
- Minimum weighted dominating set algorithm
- Minimum weighted vertex cover algorithm
- Solution Selection: Choose the better of the two solutions
- Extraction: Map auxiliary vertices back to original vertices
Key Innovation
The algorithm's strength lies in its dual-approach strategy: by solving both dominating set and vertex cover problems optimally on the degree-1 reduced graph and selecting the better solution, it consistently outperforms single-approach algorithms.
Performance Guarantees
Approximation Ratio
- Theoretical Bound:
< 2(strict inequality) - Practical Performance: Often significantly better than 2, approaching optimal for many graph classes
- Comparison: Outperforms classical algorithms like the standard edge-based 2-approximation
Time Complexity
- Overall Runtime:
O(|V| + |E|)- linear time - Space Complexity:
O(|V| + |E|)for storing the reduced graph
Complexity Breakdown
| Phase | Time Complexity | Description |
|---|---|---|
| Preprocessing | `O( | V |
| Component Finding | `O( | V |
| Graph Reduction | `O( | E |
| Optimal Solving | `O( | V |
| Solution Extraction | `O( | V |
Advantages
✅ Superior Approximation: Achieves approximation ratio < 2 (better than classical algorithms)
✅ Optimal Time Complexity: Linear time O(|V| + |E|) - matches the best possible for graph problems
✅ Practical Efficiency: Often produces near-optimal solutions in real-world instances
✅ Theoretical Rigor: Formal proofs guarantee correctness and performance bounds
✅ Robust Design: Handles all graph types including disconnected graphs and edge cases
Use Cases
The algorithm is particularly effective for:
- Large sparse graphs where linear time complexity is crucial
- Graphs with moderate vertex degrees where the reduction preserves structure well
- Applications requiring proven approximation guarantees with practical efficiency
- Real-time systems where predictable linear performance is essential
Implementation Notes
The algorithm requires:
- NetworkX for graph operations
- Custom greedy solvers for minimum weighted dominating set and vertex cover on degree-1 graphs
- Efficient data structures for mapping between original and auxiliary vertices
The dual-solution approach (trying both dominating set and vertex cover) is essential for achieving the < 2 approximation ratio and should not be omitted in implementations.
Compile and Environment
Prerequisites
- Python ≥ 3.12
Installation
pip install hvala
Execution
-
Clone the repository:
git clone https://github.com/frankvegadelgado/hvala.git cd hvala
-
Run the script:
idemo -i ./benchmarks/testMatrix1
utilizing the
idemocommand provided by Hvala's Library to execute the Boolean adjacency matrixhvala\benchmarks\testMatrix1. The filetestMatrix1represents the example described herein. We also support.xz,.lzma,.bz2, and.bzip2compressed text files.Example Output:
testMatrix1: Vertex Cover Found 1, 2, 3This indicates nodes
1, 2, 3form a vertex cover.
Vertex Cover Size
Use the -c flag to count the nodes in the vertex cover:
idemo -i ./benchmarks/testMatrix2 -c
Output:
testMatrix2: Vertex Cover Size 5
Command Options
Display help and options:
idemo -h
Output:
usage: idemo [-h] -i INPUTFILE [-a] [-b] [-c] [-v] [-l] [--version]
Compute the Approximate Vertex Cover for undirected graph encoded in DIMACS format.
options:
-h, --help show this help message and exit
-i INPUTFILE, --inputFile INPUTFILE
input file path
-a, --approximation enable comparison with a polynomial-time approximation approach within a factor of at most 2
-b, --bruteForce enable comparison with the exponential-time brute-force approach
-c, --count calculate the size of the vertex cover
-v, --verbose anable verbose output
-l, --log enable file logging
--version show program's version number and exit
Batch Execution
Batch execution allows you to solve multiple graphs within a directory consecutively.
To view available command-line options for the batch_idemo command, use the following in your terminal or command prompt:
batch_idemo -h
This will display the following help information:
usage: batch_idemo [-h] -i INPUTDIRECTORY [-a] [-b] [-c] [-v] [-l] [--version]
Compute the Approximate Vertex Cover for all undirected graphs encoded in DIMACS format and stored in a directory.
options:
-h, --help show this help message and exit
-i INPUTDIRECTORY, --inputDirectory INPUTDIRECTORY
Input directory path
-a, --approximation enable comparison with a polynomial-time approximation approach within a factor of at most 2
-b, --bruteForce enable comparison with the exponential-time brute-force approach
-c, --count calculate the size of the vertex cover
-v, --verbose anable verbose output
-l, --log enable file logging
--version show program's version number and exit
Testing Application
A command-line utility named test_idemo is provided for evaluating the Algorithm using randomly generated, large sparse matrices. It supports the following options:
usage: test_idemo [-h] -d DIMENSION [-n NUM_TESTS] [-s SPARSITY] [-a] [-b] [-c] [-w] [-v] [-l] [--version]
The Hvala Testing Application using randomly generated, large sparse matrices.
options:
-h, --help show this help message and exit
-d DIMENSION, --dimension DIMENSION
an integer specifying the dimensions of the square matrices
-n NUM_TESTS, --num_tests NUM_TESTS
an integer specifying the number of tests to run
-s SPARSITY, --sparsity SPARSITY
sparsity of the matrices (0.0 for dense, close to 1.0 for very sparse)
-a, --approximation enable comparison with a polynomial-time approximation approach within a factor of at most 2
-b, --bruteForce enable comparison with the exponential-time brute-force approach
-c, --count calculate the size of the vertex cover
-w, --write write the generated random matrix to a file in the current directory
-v, --verbose anable verbose output
-l, --log enable file logging
--version show program's version number and exit
Code
- Python implementation by Frank Vega.
License
- MIT License.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file hvala-0.0.1.tar.gz.
File metadata
- Download URL: hvala-0.0.1.tar.gz
- Upload date:
- Size: 18.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e0474a5d217fb9701f6871d70a3c0c25f3862f64bdb4bcf9d212a81b8212d544
|
|
| MD5 |
b48f16100c062fc1c7b2aa92af28a935
|
|
| BLAKE2b-256 |
21a2eff8e3e242387fe401270a2822387ac882d48dc27c96fad668115fd5082e
|
Provenance
The following attestation bundles were made for hvala-0.0.1.tar.gz:
Publisher:
publish.yml on frankvegadelgado/hvala
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
hvala-0.0.1.tar.gz -
Subject digest:
e0474a5d217fb9701f6871d70a3c0c25f3862f64bdb4bcf9d212a81b8212d544 - Sigstore transparency entry: 315691985
- Sigstore integration time:
-
Permalink:
frankvegadelgado/hvala@7123bf53b76edf58e056021e951727c7e9db8fbf -
Branch / Tag:
refs/tags/v0.0.1 - Owner: https://github.com/frankvegadelgado
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@7123bf53b76edf58e056021e951727c7e9db8fbf -
Trigger Event:
release
-
Statement type:
File details
Details for the file hvala-0.0.1-py3-none-any.whl.
File metadata
- Download URL: hvala-0.0.1-py3-none-any.whl
- Upload date:
- Size: 18.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5b23fee7749f64576aed05ae2f54d321bdc11db0a6fb524b6e96af5a921d1a86
|
|
| MD5 |
139db65760b76340fcfc53e1c0c22877
|
|
| BLAKE2b-256 |
2f9f9cba58be2b6fff8e9e007213398fb7537c8e9f92ee2fa1df8edbe7010935
|
Provenance
The following attestation bundles were made for hvala-0.0.1-py3-none-any.whl:
Publisher:
publish.yml on frankvegadelgado/hvala
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
hvala-0.0.1-py3-none-any.whl -
Subject digest:
5b23fee7749f64576aed05ae2f54d321bdc11db0a6fb524b6e96af5a921d1a86 - Sigstore transparency entry: 315691992
- Sigstore integration time:
-
Permalink:
frankvegadelgado/hvala@7123bf53b76edf58e056021e951727c7e9db8fbf -
Branch / Tag:
refs/tags/v0.0.1 - Owner: https://github.com/frankvegadelgado
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@7123bf53b76edf58e056021e951727c7e9db8fbf -
Trigger Event:
release
-
Statement type: