Educative: Interactive Courses for Software Developers

Statement#

We’re given an undirected graph consisting of $n$ nodes. The graph is represented as list called edges, of length $n$ , where edges[i] = [a, b] indicates that there is an edge between nodes a and b in the graph.

Return an edge that can be removed to make the graph a tree of $n$ nodes. If there are multiple candidates for removal, return the edge that occurs last in edges.

Constraints:

$3 \leq$ $n$ $\leq 1000$
edges.length $=$ $n$
edges[i].length $=$ 2
$1 \leq$ a $<$ b $\leq n$
$a \neq b$
There are no repeated edges.
The given graph is connected.
The graph contains only one cycle.

Solution#

So far, you have probably brainstormed some approaches and have an idea of how to solve this problem. Let’s explore some of these approaches and figure out which one to follow based on considerations such as time complexity and any implementation constraints.

Naive approach#

We can solve this problem using techniques like DFS or BFS, however, the naive approach using DFS or BFS has a time complexity of $O(n^2)$ in the worst-case scenario. This is because the DFS or BFS algorithm will explore the graph by visiting each node and its edges, which may result in visiting many nodes multiple times, and not necessarily finding the redundant connection in an efficient manner.

Optimized approach using union find#

We are going to solve this problem with the help of the union find pattern and will be using union by rank and path compression.

Union by rank: We connect the nodes that come afterward to the nodes that came before them. This allows us to add new nodes to the subset of the representative node of the larger connected component. Using ranks to connect nodes in this manner helps reduce the depth of the recursion call stack of the find function, since the trees within each disjoint set remain relatively shallow.
Path compression: On each find operation on a node of a tree, we update the parent of that node to point directly to the root. This reduces the length of the path of that node to the root, ensuring we don’t have to travel all the intermediate nodes on future find operations.

The slides below illustrate how the algorithm runs:

Note: In the following section, we will gradually build the solution. Alternatively, you can skip straight to just the code.

Step by step solution construction#

We start with the basic implementation of Union Find (without the use of rank or path compression):

File: union_find.py

In the UnionFind class, we declare the parent list with length based on the edges list.

File: main.py

In the redundant_connection function, we’ll declare an object of the given class and initialize the parent list with default values.

main.py

union_find.py

class UnionFind:
    # Constructor
    def __init__(self, n):
        self.parent = []
        for i in range(n + 1):
            self.parent.append(i)
    # Function to find which subset a particular element belongs to
    def find_parent(self, v):
        if self.parent[v] != v:
            return self.find_parent(self.parent[v])
   
    # Function to join two subsets into a single subset
    def union(self, v1, v2):
        p1, p2 = self.find_parent(v1), self.find_parent(v2)
        self.parent[p1] = p2

Enter to Rename, Shift+Enter to Preview

Redundant Connection

File: union_find.py

Next, we need to check if the vertices forming an edge have the same parent. In the union function, we’ll call the parent of the first vertex p1 and the parent of the second vertex p2. The algorithm is described as follows:

If both v1 and v2 have the same parent, i.e., p1 is equal to p2, the given edge is redundant, so we return FALSE.
Otherwise, this edge is connecting two vertices that were not already connected. So, we’ll update the parent list by making a connection based on the current edge and then return TRUE.

File: main.py

In the redundant_connection function, we traverse the edges list and for each edge, we attempt to connect its two vertices, v1, and v2, through the union(v1, v2) method.

main.py

union_find.py

class UnionFind:
    def __init__(self, n):
        self.parent = []
        self.rank = rank = [1] * (n + 1)
        for i in range(n + 1):
            self.parent.append(i)
    def find_parent(self, v):
        if self.parent[v] != v:
            self.parent[v] = self.find_parent(self.parent[v])
        return self.parent[v]
    # Function to find which subset a particular element belongs to
    # Returns FALSE if both vertices have the same parent, otherwise, updates the parent and rank lists by making a connection based on the passed edge
    # Returns TRUE if no cycle exits in the graph
    def union(self, v1, v2):
        print("\n\tChecking if the vertices have the same parent")
        print("\t\tEdge: [", v1, ", ", v2, "]", sep = "")
        # Find the root parents of v1 and v2
        p1, p2 = self.find_parent(v1), self.find_parent(v2)
        print(f"\t\t\tParent of vertex {v1}: ", p1, sep = "")
        print(f"\t\t\tParent of vertex {v2}: ", p2, sep = "")
        # If both parents are the same, a cycle exists and v1,v2 is the redundant edge
        if p1 == p2:
            print("\t\tSince both vertices have the same parent, this is a redundant edge")
            return False
        # Updates the parent and rank lists otherwise 

Enter to Rename, Shift+Enter to Preview

Redundant Connection

We will now add union by rank and path compression to the Union Find algorithm. We’ll make the following changes to the UnionFind class:

Union by rank:

We initialize a rank list (set to the length of the edges list) with $1$ s.
In the union function, if the parents, p1 and p2 of the vertices are not the same, we make the connection based on the ranks of both parents. This is done in the following way:
- If p1's rank is greater than p2's rank, we’ll update p2's parent with p1, and add p2's rank to p1's rank. For example, if rank[p1] $= 5$ and rank[p2] $= 2$ , we’ll add p2 to p1's set and update its rank to $5 + 2 = 7$ .
- Similarly, if p2's rank is greater than p1's rank, we’ll update p1's parent with p2, and add p1's rank to p2's rank.

Path compression:

In the find function, for a node v, we make the found root as the parent of v so that we don’t have to traverse all the intermediate nodes again on further find operations.

main.py

union_find.py

class UnionFind:
    def __init__(self, n):
        self.parent = []
        self.rank = rank = [1] * (n + 1)
        for i in range(n + 1):
            self.parent.append(i)
    def find_parent(self, v):
        if self.parent[v] != v:
            self.parent[v] = self.find_parent(self.parent[v])
        return self.parent[v]
    # Function to find which subset a particular element belongs to
    # Returns FALSE if both vertices have the same parent, otherwise, updates the parent and rank lists by making a connection based on the passed edge
    # Returns TRUE if no cycle exits in the graph
    def union(self, v1, v2):
        print("\n\tChecking if the vertices have the same parent")
        print("\t\tEdge: [", v1, ", ", v2, "]", sep = "")
        print("\t\tFirst vertex: ", v1, sep = "")
        print("\t\tSecond vertex: ", v2, sep = "")
        #Find the root parents of v1 and v2
        p1, p2 = self.find_parent(v1), self.find_parent(v2)
        print("\t\t\tParent of the first vertex: ", p1, sep = "")
        print("\t\t\tParent of the second vertex: ", p2, sep = "")
        # If both parents are the same, a cycle exists and v1,v2 is the redundant edge
        if p1 == p2:
            print("\t\tSince both vertices have the same parent, this is a redundant edge")

Enter to Rename, Shift+Enter to Preview

Redundant Connection

main.py

union_find.py

from union_find import UnionFind
def redundant_connection(edges):
    
    print("\tDeclaring the parent and rank lists")
    # Declares the parent and rank list with lengths based on the edges list
    graph = UnionFind(len(edges))
    print("\t\tParent: ", graph.parent, sep = "")
    print("\t\tRank: ", graph.rank, sep = "")
    
    # Traverses the edges of the graph to check for the redundant edge
    for v1, v2 in edges:
        if not graph.union(v1, v2):
            return [v1, v2]
def main():
    edges = [
        [[1, 2], [1, 3], [2, 3]], 
        [[1, 2], [2, 3], [1, 3]], 
        [[1, 2], [2, 3], [3, 4], [1, 4], [1, 5]], 
        [[1, 2], [1, 3], [1, 4], [3, 4], [2, 4]], 
        [[1, 2], [1, 3], [1, 4], [1,5], [2, 3], [2, 4], [2, 5]]
    ]
    for i in range(len(edges)):
        print(i+1, ".\tEdges: ", edges[i], sep = "")
        print("\n\tThe redundant connection in the graph is: ", redundant_connection(edges[i]), sep = "")
        print("-" * 100)
if __name__ == '__main__':

Enter to Rename, Shift+Enter to Preview

Redundant Connection

main.py

union_find.py

from union_find import UnionFind
def redundant_connection(edges):
    
    graph = UnionFind(len(edges))
    
    for v1, v2 in edges:
        if not graph.union(v1, v2):
            return [v1, v2]
def main():
    edges = [
        [[1, 2], [1, 3], [2, 3]], 
        [[1, 2], [2, 3], [1, 3]], 
        [[1, 2], [2, 3], [3, 4], [1, 4], [1, 5]], 
        [[1, 2], [1, 3], [1, 4], [3, 4], [2, 4]], 
        [[1, 2], [1, 3], [1, 4], [1,5], [2, 3], [2, 4], [2, 5]]
    ]
    for i in range(len(edges)):
        print(i+1, ".\tEdges: ", edges[i], sep = "")
        print("\tThe redundant connection in the graph is: ", redundant_connection(edges[i]), sep = "")
        print("-" * 100)
if __name__ == '__main__':
    main()

Enter to Rename, Shift+Enter to Preview

Redundant Connection

Solution summary#

To recap, the solution to this problem can be divided into the following two main parts:

Initialize parent and rank lists based on the length of the edges list.
Traverse the edges list and for each edge, compare the parents of both vertices:
- If the parents are the same, the current edge is redundant, so we return it.
- Otherwise, we connect the two vertices based on their respective ranks.

Time complexity#

The time complexity of this solution is $O(n)$ , where $n$ is the number of edges.

Explanation:

We use a loop to traverse the edges list, resulting in $O(n)$ iterations.
The time complexity of the the Union and Find functions is $O( \alpha (n))$ , since both path compression and union find by rank are being used. $\alpha (n)$ is almost constant time and grows very slowly with the input size, so the time complexity of both these functions can be considered as constant time for practical purposes.

Therefore, the overall time complexity becomes $O(n \times \alpha (n))$ , which simplifies to $O(n)$ .

Space Complexity#

The space complexity of this solution is $O(n)$ .

Solution: Redundant Connection

Statement#

Solution#

Naive approach#

Optimized approach using union find#

Step by step solution construction#

Just the code#

Solution summary#

Time complexity#

Space Complexity#