Skip to main content

Estimating the Minimum Vertex Cover with an approximation factor less than 2 for undirected graph encoded in DIMACS format.

Project description

Varela: Minimum Vertex Cover Solver

Honoring the Memory of Felix Varela y Morales (Cuban Catholic priest and independence leader)

This work builds upon The Unique Games Conjecture.


The Minimum Vertex Cover Problem

The Minimum Vertex Cover (MVC) problem is a classic optimization problem in computer science and graph theory. It involves finding the smallest set of vertices in a graph that covers all edges, meaning at least one endpoint of every edge is included in the set.

Formal Definition

Given an undirected graph $G = (V, E)$, a vertex cover is a subset $V' \subseteq V$ such that for every edge $(u, v) \in E$, at least one of $u$ or $v$ belongs to $V'$. The MVC problem seeks the vertex cover with the smallest cardinality.

Importance and Applications

  • Theoretical Significance: MVC is a well-known NP-hard problem, central to complexity theory.
  • Practical Applications:
    • Network Security: Identifying critical nodes to disrupt connections.
    • Bioinformatics: Analyzing gene regulatory networks.
    • Wireless Sensor Networks: Optimizing sensor coverage.

Related Problems

  • Maximum Independent Set: The complement of a vertex cover.
  • Set Cover Problem: A generalization of MVC.

Problem Statement

Input: A Boolean Adjacency Matrix $M$.

Answer: Find a Minimum Vertex Cover.

Example Instance: 5 x 5 matrix

c1 c2 c3 c4 c5
r1 0 0 1 0 1
r2 0 0 0 1 0
r3 1 0 0 0 1
r4 0 1 0 0 0
r5 1 0 1 0 0

The input for undirected graph is typically provided in DIMACS format. In this way, the previous adjacency matrix is represented in a text file using the following string representation:

p edge 5 4
e 1 3
e 1 5
e 2 4
e 3 5

This represents a 5x5 matrix in DIMACS format such that each edge $(v,w)$ appears exactly once in the input file and is not repeated as $(w,v)$. In this format, every edge appears in the form of

e W V

where the fields W and V specify the endpoints of the edge while the lower-case character e signifies that this is an edge descriptor line.

Example Solution:

Vertex Cover Found 3, 4, 5: Nodes 3, 4, and 5 constitute an optimal solution.


Approximate Vertex Cover Algorithm Analysis

Overview

The algorithm works as follows:

  1. It first checks if the graph is empty or has no edges, returning an empty set in these cases.
  2. It then iterates over all connected components of the graph.
  3. For each connected component, it finds a maximal independent set.
  4. The vertex cover for each component is computed as the complement of the maximal independent set.
  5. The final vertex cover is the union of all component-wise vertex covers.

Runtime Analysis

The runtime of this algorithm can be broken down as follows:

  1. Checking for empty graph or no edges: $O(1)$
  2. Finding connected components: $O(|V| + |E|)$, where $|V|$ is the number of vertices and $|E|$ is the number of edges.
  3. For each component:
    • Creating a subgraph: $O(|V| + |E|)$
    • Finding a maximal independent set: $O(|V| + |E|)$
    • Computing the complement: $O(|V|)$
    • Updating the final set: $O(|V|)$

The total runtime is dominated by the operations on connected components, which are performed at most once for each vertex and edge. Therefore, the overall time complexity is $O(|V| + |E|)$.

Correctness

The algorithm produces a valid vertex cover with an approximation ratio of less than 2. Here's why:

  1. The algorithm correctly handles edge cases (empty graph or no edges).
  2. For each connected component, it finds a maximal independent set. By definition, a maximal independent set is a set of vertices where no two vertices are adjacent, and no more vertices can be added while maintaining this property.
  3. The complement of a maximal independent set is a valid vertex cover. This is because any edge not covered by the complement would have both its endpoints in the independent set, which contradicts the definition of an independent set.
  4. The union of vertex covers for all components forms a valid vertex cover for the entire graph.
  5. The approximation ratio is less than 2 because:
    • The size of a maximal independent set is at least half the size of a maximum independent set.
    • The complement of a maximum independent set is a minimum vertex cover.
    • Therefore, the size of our approximate vertex cover is at most twice the size of the minimum vertex cover.

In conclusion, this algorithm provides a polynomial-time 2-approximation for the vertex cover problem, which is consistent with the known bounds for this NP-hard problem.


Compile and Environment

Prerequisites

  • Python ≥ 3.10

Installation

pip install varela

Execution

  1. Clone the repository:

    git clone https://github.com/frankvegadelgado/varela.git
    cd varela
    
  2. Run the script:

    approx -i ./benchmarks/testMatrix1
    

    utilizing the approx command provided by Varela's Library to execute the Boolean adjacency matrix varela\benchmarks\testMatrix1. The file testMatrix1 represents the example described herein. We also support .xz, .lzma, .bz2, and .bzip2 compressed text files.

    Example Output:

    testMatrix1: Vertex Cover Found 3, 4, 5
    

    This indicates nodes 3, 4, 5 form a vertex cover.


Vertex Cover Size

Use the -c flag to count the nodes in the vertex cover:

approx -i ./benchmarks/testMatrix2 -c

Output:

testMatrix2: Vertex Cover Size 5

Command Options

Display help and options:

approx -h

Output:

usage: approx [-h] -i INPUTFILE [-a] [-b] [-c] [-v] [-l] [--version]

Estimating the Minimum Vertex Cover with an approximation factor of less than 2 encoded for undirected graph in DIMACS format.

options:
  -h, --help            show this help message and exit
  -i INPUTFILE, --inputFile INPUTFILE
                        input file path
  -a, --approximation   enable comparison with another polynomial-time approximation approach within a factor of at most 2
  -b, --bruteForce      enable comparison with the exponential-time brute-force approach
  -c, --count           calculate the size of the vertex cover
  -v, --verbose         anable verbose output
  -l, --log             enable file logging
  --version             show program's version number and exit

Batch Execution

Batch execution allows you to solve multiple graphs within a directory consecutively.

To view available command-line options for the batch_approx command, use the following in your terminal or command prompt:

batch_approx -h

This will display the following help information:

usage: batch_approx [-h] -i INPUTDIRECTORY [-a] [-b] [-c] [-v] [-l] [--version]

Estimating the Minimum Vertex Cover with an approximation factor of less than 2 for all undirected graphs encoded in DIMACS format and stored in a directory.

options:
  -h, --help            show this help message and exit
  -i INPUTDIRECTORY, --inputDirectory INPUTDIRECTORY
                        Input directory path
  -a, --approximation   enable comparison with another polynomial-time approximation approach within a factor of at most 2
  -b, --bruteForce      enable comparison with the exponential-time brute-force approach
  -c, --count           calculate the size of the vertex cover
  -v, --verbose         anable verbose output
  -l, --log             enable file logging
  --version             show program's version number and exit

Testing Application

A command-line utility named test_approx is provided for evaluating the Algorithm using randomly generated, large sparse matrices. It supports the following options:

usage: test_approx [-h] -d DIMENSION [-n NUM_TESTS] [-s SPARSITY] [-a] [-b] [-c] [-w] [-v] [-l] [--version]

The Varela Testing Application using randomly generated, large sparse matrices.

options:
  -h, --help            show this help message and exit
  -d DIMENSION, --dimension DIMENSION
                        an integer specifying the dimensions of the square matrices
  -n NUM_TESTS, --num_tests NUM_TESTS
                        an integer specifying the number of tests to run
  -s SPARSITY, --sparsity SPARSITY
                        sparsity of the matrices (0.0 for dense, close to 1.0 for very sparse)
  -a, --approximation   enable comparison with another polynomial-time approximation approach within a factor of at most 2
  -b, --bruteForce      enable comparison with the exponential-time brute-force approach
  -c, --count           calculate the size of the vertex cover
  -w, --write           write the generated random matrix to a file in the current directory
  -v, --verbose         anable verbose output
  -l, --log             enable file logging
  --version             show program's version number and exit

Code

  • Python implementation by Frank Vega.

Complexity

+ This result contradicts the Unique Games Conjecture, suggesting that many optimization problems may admit better solutions, revolutionizing theoretical computer science.

License

  • MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

varela-0.1.6.tar.gz (16.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

varela-0.1.6-py3-none-any.whl (16.0 kB view details)

Uploaded Python 3

File details

Details for the file varela-0.1.6.tar.gz.

File metadata

  • Download URL: varela-0.1.6.tar.gz
  • Upload date:
  • Size: 16.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for varela-0.1.6.tar.gz
Algorithm Hash digest
SHA256 88b3f18920d9d7995ccd93a858287c8b1a2e52839d35a906cf149c62c5ebb059
MD5 f14a0cb313cccb3c6e5c300b701a17f0
BLAKE2b-256 09f682722810f9706cd7dd66dcb54976bdbf4630d1743d5f2fd1b2eb0ab7a809

See more details on using hashes here.

Provenance

The following attestation bundles were made for varela-0.1.6.tar.gz:

Publisher: publish.yml on frankvegadelgado/varela

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file varela-0.1.6-py3-none-any.whl.

File metadata

  • Download URL: varela-0.1.6-py3-none-any.whl
  • Upload date:
  • Size: 16.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for varela-0.1.6-py3-none-any.whl
Algorithm Hash digest
SHA256 323a4acb76f6509cb1fa4d0618d869db1e3af5dd0c988db51a9d279c4c665426
MD5 3ebddb02ac8527cc1c32000076e581fb
BLAKE2b-256 8d24d89be7a7b09111b790de1cea6e4fa00fa5bb8a0323511af94d39308a975e

See more details on using hashes here.

Provenance

The following attestation bundles were made for varela-0.1.6-py3-none-any.whl:

Publisher: publish.yml on frankvegadelgado/varela

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page