Estimating the Minimum Vertex Cover with an approximation factor of ≤ 3/2 for an undirected graph encoded in DIMACS format.
Project description
Varela: Minimum Vertex Cover Solver
This work builds upon The Unique Games Conjecture.
The Minimum Vertex Cover Problem
The Minimum Vertex Cover (MVC) problem is a classic optimization problem in computer science and graph theory. It involves finding the smallest set of vertices in a graph that covers all edges, meaning at least one endpoint of every edge is included in the set.
Formal Definition
Given an undirected graph $G = (V, E)$, a vertex cover is a subset $V' \subseteq V$ such that for every edge $(u, v) \in E$, at least one of $u$ or $v$ belongs to $V'$. The MVC problem seeks the vertex cover with the smallest cardinality.
Importance and Applications
- Theoretical Significance: MVC is a well-known NP-hard problem, central to complexity theory.
- Practical Applications:
- Network Security: Identifying critical nodes to disrupt connections.
- Bioinformatics: Analyzing gene regulatory networks.
- Wireless Sensor Networks: Optimizing sensor coverage.
Related Problems
- Maximum Independent Set: The complement of a vertex cover.
- Set Cover Problem: A generalization of MVC.
Problem Statement
Input: A Boolean Adjacency Matrix $M$.
Answer: Find a Minimum Vertex Cover.
Example Instance: 5 x 5 matrix
| c1 | c2 | c3 | c4 | c5 | |
|---|---|---|---|---|---|
| r1 | 0 | 0 | 1 | 0 | 1 |
| r2 | 0 | 0 | 0 | 1 | 0 |
| r3 | 1 | 0 | 0 | 0 | 1 |
| r4 | 0 | 1 | 0 | 0 | 0 |
| r5 | 1 | 0 | 1 | 0 | 0 |
The input for undirected graph is typically provided in DIMACS format. In this way, the previous adjacency matrix is represented in a text file using the following string representation:
p edge 5 4
e 1 3
e 1 5
e 2 4
e 3 5
This represents a 5x5 matrix in DIMACS format such that each edge $(v,w)$ appears exactly once in the input file and is not repeated as $(w,v)$. In this format, every edge appears in the form of
e W V
where the fields W and V specify the endpoints of the edge while the lower-case character e signifies that this is an edge descriptor line.
Example Solution:
Vertex Cover Found 1, 2, 3: Nodes 1, 2, and 3 constitute an optimal solution.
Our Algorithm - Polynomial Runtime
Algorithm Overview
- Input: Adjacency matrix of graph G
- Create Edge Graph:
- Nodes represent edges of G
- Connect nodes if edges share a vertex
- Find an Approximate Minimum Maximal Matching in edge graph
- Extract Vertex Cover:
- Add common vertices from edge cover
- Handle Isolated Edges:
- Add vertices for uncovered edges
- Remove Redundant Vertices
- Output: Approximate minimum vertex cover
Key Features:
- Polynomial-time complexity: O($|E|^3$)
- Approximation ratio: ≤ 3/2
- Suitable for large, sparse graphs
Correctness
-
Edge Graph Construction
- Preserves edge relationships of original graph
-
Minimum Maximal Matching Approximation
- Ensures every edge is incident to an edge in the matching
-
Vertex Cover Extraction
- Guarantees at least one endpoint of each edge is in cover
-
Isolated Edge Handling
- Covers any remaining uncovered edges
-
Redundancy Removal
- Optimizes cover size while maintaining validity
Correctness Guarantee:
- Every edge in original graph has at least one endpoint in cover
- Resulting set is a valid, approximate minimum vertex cover
Approximation Quality:
- Achieves ≤ 3/2 approximation ratio
- Trade-off between accuracy and polynomial-time efficiency
Runtime Analysis
This section analyzes the runtime and space complexity of the given vertex cover approximation algorithm.
Time Complexity Breakdown
-
Input Processing and Graph Creation: $O(|E|)$
- $|E|$ represents the number of edges in the input graph.
- This step involves converting the sparse matrix representation to a NetworkX graph, which iterates through the non-zero entries (edges).
-
Edge Graph Construction: $O(|E| \Delta)$
- $\Delta$ represents the maximum degree of any vertex in the graph.
- For each edge in the original graph, we examine its endpoints' neighbors to create corresponding edges in the edge graph. The number of neighbors is bounded by $\Delta$.
-
Approximate Minimum Maximal Matching Computation: $O(|E|\Delta)$
- This step utilizes the
nx.approximation.min_maximal_matching()function which computes the minimum maximal matching with an approximation ratio of at most 2. The complexity linearly relates to the number of edges in the edge graph (which is $|E|\Delta$). The complexity is therefore $O(|E|\Delta)$.
- This step utilizes the
-
Vertex Cover Extraction: $O(|E|)$
- This step iterates through the edges in the computed minimum edge cover, which is bounded by the number of edges in the original graph.
-
Isolated Edge Handling: $O(|E|)$
- We iterate through all edges in the original graph to handle any isolated edges.
-
Redundancy Removal: $O(k |E|)$, or $O(|V| |E|)$ in the worst case.
- $k$ is the size of the vertex cover. In the worst-case, the size of the vertex cover can be $O(|V|)$, so the overall time complexity is $O(|V| |E|)$.
- For each vertex in the (potentially large) vertex cover, we check if its removal still leaves a valid cover. This check involves examining all edges.
Overall Complexity
- Time Complexity: $O(|V| |E|)$ (dominated by the removal redundacy computation).
- Space Complexity: $O(|V| + |E| + |E|\Delta)$. In the worst-case scenario (dense graphs where $E = O(|V|^2)$ and $\Delta = O(|V|)$), this becomes $O(|V|^3)$.
Key Observations
- This algorithm provides an approximation of the minimum vertex cover, trading accuracy for a polynomial runtime.
- The practical runtime performance can often be significantly better than the worst-case theoretical bound, especially for sparse graphs.
- This approach is suitable for sparse and large graphs where computing the exact minimum vertex cover is computationally infeasible.
Compile and Environment
Prerequisites
- Python ≥ 3.10
Installation
pip install varela
Execution
-
Clone the repository:
git clone https://github.com/frankvegadelgado/varela.git cd varela
-
Run the script:
approx -i ./benchmarks/testMatrix1
utilizing the
approxcommand provided by Varela's Library to execute the Boolean adjacency matrixvarela\benchmarks\testMatrix1. The filetestMatrix1represents the example described herein. We also support.xz,.lzma,.bz2, and.bzip2compressed text files.Example Output:
testMatrix1: Vertex Cover Found 1, 2, 3This indicates nodes
1, 2, 3form a vertex cover.
Vertex Cover Size
Use the -c flag to count the nodes in the vertex cover:
approx -i ./benchmarks/testMatrix2 -c
Output:
testMatrix2: Vertex Cover Size 5
Command Options
Display help and options:
approx -h
Output:
usage: approx [-h] -i INPUTFILE [-a] [-b] [-c] [-v] [-l] [--version]
Estimating the Minimum Vertex Cover with an approximation factor of ≤ 3/2 for an undirected graph encoded in DIMACS format.
options:
-h, --help show this help message and exit
-i INPUTFILE, --inputFile INPUTFILE
input file path
-a, --approximation enable comparison with a polynomial-time approximation approach within a factor of at most 2
-b, --bruteForce enable comparison with the exponential-time brute-force approach
-c, --count calculate the size of the vertex cover
-v, --verbose enable verbose output
-l, --log enable file logging
--version show program's version number and exit
Testing Application
A command-line utility named test_approx is provided for evaluating the Algorithm using randomly generated, large sparse matrices. It supports the following options:
usage: test_approx [-h] -d DIMENSION [-n NUM_TESTS] [-s SPARSITY] [-a] [-b] [-c] [-w] [-v] [-l] [--version]
The Varela Testing Application.
options:
-h, --help show this help message and exit
-d DIMENSION, --dimension DIMENSION
an integer specifying the dimensions of the square matrices
-n NUM_TESTS, --num_tests NUM_TESTS
an integer specifying the number of tests to run
-s SPARSITY, --sparsity SPARSITY
sparsity of the matrices (0.0 for dense, close to 1.0 for very sparse)
-a, --approximation enable comparison with a polynomial-time approximation approach within a factor of at most 2
-b, --bruteForce enable comparison with the exponential-time brute-force approach
-c, --count calculate the size of the vertex cover
-w, --write write the generated random matrix to a file in the current directory
-v, --verbose enable verbose output
-l, --log enable file logging
--version show program's version number and exit
Batch Execution
Batch execution allows you to solve multiple graphs within a directory simultaneously.
To view available command-line options for the batch_approx command, use the following in your terminal or command prompt:
batch_approx -h
This will display the following help information:
usage: batch_approx [-h] -i INPUTDIRECTORY [-a] [-b] [-c] [-v] [-l] [--version]
Estimating the Minimum Vertex Cover with an approximation factor of ≤ 3/2 for all undirected graphs encoded in DIMACS format and stored in a directory.
options:
-h, --help show this help message and exit
-i INPUTDIRECTORY, --inputDirectory INPUTDIRECTORY
Input directory path
-a, --approximation enable comparison with a polynomial-time approximation approach within a factor of at most 2
-b, --bruteForce enable comparison with the exponential-time brute-force approach
-c, --count calculate the size of the vertex cover
-v, --verbose anable verbose output
-l, --log enable file logging
--version show program's version number and exit
Code
- Python implementation by Frank Vega.
Complexity
+ This result contradicts the Unique Games Conjecture, suggesting that many optimization problems may admit better solutions, revolutionizing theoretical computer science.
License
- MIT License.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file varela-0.1.0.tar.gz.
File metadata
- Download URL: varela-0.1.0.tar.gz
- Upload date:
- Size: 17.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
32750bd1c5258bec3a27448ea52f48cc355af0ff59b6b92e297790afc6bfa36f
|
|
| MD5 |
0bc028a873fb397579681e1448751036
|
|
| BLAKE2b-256 |
daf013ac09bc8138fcd1a72407e3b5722cda3ddef51be548172efa30a8b7d9a5
|
Provenance
The following attestation bundles were made for varela-0.1.0.tar.gz:
Publisher:
publish.yml on frankvegadelgado/varela
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
varela-0.1.0.tar.gz -
Subject digest:
32750bd1c5258bec3a27448ea52f48cc355af0ff59b6b92e297790afc6bfa36f - Sigstore transparency entry: 171251482
- Sigstore integration time:
-
Permalink:
frankvegadelgado/varela@0fd9b55d62898e35c3ba7a4f0a318ea3a8996af1 -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/frankvegadelgado
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@0fd9b55d62898e35c3ba7a4f0a318ea3a8996af1 -
Trigger Event:
release
-
Statement type:
File details
Details for the file varela-0.1.0-py3-none-any.whl.
File metadata
- Download URL: varela-0.1.0-py3-none-any.whl
- Upload date:
- Size: 16.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e9df56e9201a099ea15bf56d6bceb2adf827e2ed2996799782736cec7c12bfe2
|
|
| MD5 |
935e02e78b6d4d7013521095cf1de294
|
|
| BLAKE2b-256 |
4e25abcc3febc50af0b72c327e669bff119798ef0706e31dc188e1d6faf9822f
|
Provenance
The following attestation bundles were made for varela-0.1.0-py3-none-any.whl:
Publisher:
publish.yml on frankvegadelgado/varela
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
varela-0.1.0-py3-none-any.whl -
Subject digest:
e9df56e9201a099ea15bf56d6bceb2adf827e2ed2996799782736cec7c12bfe2 - Sigstore transparency entry: 171251484
- Sigstore integration time:
-
Permalink:
frankvegadelgado/varela@0fd9b55d62898e35c3ba7a4f0a318ea3a8996af1 -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/frankvegadelgado
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@0fd9b55d62898e35c3ba7a4f0a318ea3a8996af1 -
Trigger Event:
release
-
Statement type: