Educative: Interactive Courses for Software Developers

Statement#

Given a string, str, rearrange it so that no two adjacent characters are the same. If such a reorganization of the characters is possible, output any valid arrangement. Otherwise, return an empty string.

Constraints:

$1\leq$ str.length $\leq500$
Input string consists of lowercase English letters.

Solution#

So far, you’ve probably brainstormed some approaches and have an idea of how to solve this problem. Let’s explore some of these approaches and figure out which one to follow based on considerations such as time complexity and any implementation constraints.

Naive approach#

The naive approach is to generate all possible permutations of the given string and check if the generated string is a valid arrangement or not. If any permutation satisfies the condition of a valid reorganization of the string, return that permutation of the input string. If no permutation is a valid reorganization of the string, return an empty string.

A string of length $n$ has up to $n!$ permutations, and constructing each one may require iterating through its $n$ characters. Therefore, it takes $O(n! \times n)$ time to generate all the permutations. Then, for each permutation, we need to check that no two adjacent characters are the same. This check is a single pass over the permutation, which takes $O(n)$ time, so checking all $n!$ permutations adds another $O(n! \times n)$. Therefore, the overall time complexity of this naive approach is $O(n! \times n)$ .
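To make the cost of the naive approach concrete, here is a minimal brute-force sketch. The function name `reorganize_string_naive` is hypothetical and used only for illustration. It relies on `itertools.permutations` and is only practical for very short strings.

```python
from itertools import permutations


def reorganize_string_naive(s):
    """Brute-force sketch: try permutations until one has no equal neighbors."""
    for perm in permutations(s):
        # A permutation is valid if no character equals the one right after it.
        if all(perm[i] != perm[i + 1] for i in range(len(perm) - 1)):
            return "".join(perm)
    # No permutation qualified, so no valid arrangement exists.
    return ""


print(reorganize_string_naive("aab"))   # aba
print(reorganize_string_naive("aaab"))  # prints an empty line
```

Because the loop may visit every permutation before giving up, inputs near the upper constraint of 500 characters are far out of reach for this version.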

Optimized solution using top $k$ elements#

Let’s use the top $k$ elements technique to reorganize the input string. This technique uses a max-heap to store characters along with their frequencies so that the character with the highest frequency is always at the root of the heap. We build the result by repeatedly taking the most frequent remaining character that differs from the one we just placed. If at some point the only character left is the one we just placed, it’s impossible to rearrange the string.

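The illustration below shows the whole process:

Step-by-step solution construction#

We start by counting how many times each character occurs in the input string and storing the counts in a hash map.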

# importing libraries
from collections import Counter
import heapq
def reorganize_string(str):
    # Calculate the frequency of characters in string and store counts
    # of each character along with the character itself in hash map.
    char_counter = Counter(str)
    return char_counter
# Driver code
def main():
    test_cases = ["programming", "hello", "fofjjb",
              "abbacdde", "aba", "awesome"]
    for i in range(len(test_cases)):
        print(i+1, '. \tInput string: "', test_cases[i], '"', sep="")
        print('\tCharacter counts: ', reorganize_string(test_cases[i]))
        print("-"*100)
if __name__ == '__main__':
    main()

Reorganize String

After getting all the characters and their respective frequencies, we initialize the heap to provide quick access to the most frequently occurring characters in the string. The standard heap in some languages, such as Python’s heapq, is a min-heap, so we store the frequencies in a form that makes a min-heap behave like a max-heap.

We first iterate through the hash map and store the negative count of each character and the character itself in a heap. The reason for storing the negative count of each character is that when we pop characters from the heap, the heap will return the character with the maximum frequency.

For example, suppose the input string is aabc. The hash map stores {a: 2, b: 1, c: 1}. When we store the negative count of each character along with the character itself, the heap looks like this: [[-2, a], [-1, b], [-1, c]]. The first element popped from the heap is a, since it has the highest frequency of occurrence in the string.
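The aabc example can be checked with a short sketch. The variable names `heap` and `pop_order` are chosen for illustration only. Entries are compared as lists, so ties on the count are broken by the character.

```python
from collections import Counter
import heapq

char_counter = Counter("aabc")  # a: 2, b: 1, c: 1

# Negate each count so the smallest heap entry is the most frequent character.
heap = [[-count, char] for char, count in char_counter.items()]
heapq.heapify(heap)

# Popping repeatedly yields characters from most to least frequent.
pop_order = [heapq.heappop(heap) for _ in range(len(heap))]
print(pop_order)  # [[-2, 'a'], [-1, 'b'], [-1, 'c']]
```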

# importing libraries
from collections import Counter
import heapq
def reorganize_string(str):
    # Calculate the frequency of characters in string and store counts
    # of each character along with the character itself in hash map.
    char_counter = Counter(str)
    # Initializing a heap
    most_freq_chars = []
    # Store character and its negative frequency in the array
    for char, count in char_counter.items():
        most_freq_chars.append([-count, char])
    # Construct heap from the array
    heapq.heapify(most_freq_chars)
    return most_freq_chars
# Driver code
def main():
    test_cases = ["programming", "hello", "fofjjb",
              "abbacdde", "aba", "awesome"]
    for i in range(len(test_cases)):
        print(i+1, '. \tInput string: "', test_cases[i], '"', sep="")
        print('\tCharacter frequencies as a max-heap: \n\t', reorganize_string(test_cases[i]))
        print("-"*100)
if __name__ == '__main__':
    main()

Next, we introduce two variables, previous and result. The previous variable holds the character we used in the last iteration, together with its remaining count, so that we don’t use that character twice in a row. The result variable stores the final reorganized string.

The character with the highest frequency will always be at the root of our heap. We keep popping characters from the top of the heap to add them to the result string.

When we add a character to the result string, we won’t push this character back onto the heap right away, even if its frequency of occurrence is greater than $0$ . Instead, we add it back to the heap in the next iteration. The reason is that we want to ensure that the same characters don’t appear adjacent to each other in the result string. Therefore, we store the current character along with its frequency of occurrence in previous for use in the next iteration.

Let’s walk through an example. Suppose the input string is abcddd. The heap stores [[-3, d], [-1, a], [-1, b], [-1, c]]. In the first iteration, we add d to the result string, since it has the highest count. If we updated its count and put it back into the heap right away, the heap would become [[-2, d], [-1, a], [-1, b], [-1, c]], and we would get d again in the next iteration, since it is still the most frequently occurring character. Therefore, we store d in previous and push it onto the heap in the next iteration, which avoids identical adjacent characters.
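To see the problem with pushing a character back immediately, here is a small sketch of that flawed variant run on the abcddd heap. The helper `build_with_immediate_pushback` is hypothetical and exists only to show the failure. It is not part of the solution.

```python
import heapq


def build_with_immediate_pushback(entries):
    """Flawed variant: pops the most frequent character and pushes it straight back."""
    heap = [entry[:] for entry in entries]  # work on a copy of the input
    heapq.heapify(heap)
    result = ""
    while heap:
        count, char = heapq.heappop(heap)
        result += char
        # Counts are negative, so adding 1 uses up one occurrence.
        if count + 1 != 0:
            heapq.heappush(heap, [count + 1, char])
    return result


flawed = build_with_immediate_pushback([[-3, "d"], [-1, "a"], [-1, "b"], [-1, "c"]])
print(flawed)  # ddabcd
```

The output starts with two adjacent d characters, which is exactly what holding the character back in previous for one iteration prevents.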

# importing libraries
from collections import Counter
import heapq
def reorganize_string(str):
    # Calculate the frequency of characters in string and store counts
    # of each character along with the character itself in hash map.
    char_counter = Counter(str)
    # initializing heap
    most_freq_chars = []
    # Store character and its negative frequency in the array
    for char, count in char_counter.items():
        most_freq_chars.append([-count, char])
    # Construct heap from the array
    heapq.heapify(most_freq_chars)
    # initializing variables
    previous = None
    result = ""
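    while len(most_freq_chars) > 0 or previous:
        # NOTE: if the heap is empty here while previous is still set, the pop
        # below raises IndexError; that case is handled in the next step
        count, char = heapq.heappop(most_freq_chars)
        result = result + char
        # decrement the character count, as we've now used one occurrence of it
        count = count + 1   # as we store negative character counts, adding 1 is actually a decrement operation
        # pushing the char back to heap
        if previous:
            heapq.heappush(most_freq_chars, previous)
            previous = None
        # hold the current char back for one iteration if it has occurrences left
        if count != 0:
            previous = [count, char]
    return result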

# importing libraries
from collections import Counter
import heapq
def reorganize_string(str):
    # Calculate the frequency of characters in string and store counts
    # of each character along with the character itself in hash map.
    char_counter = Counter(str)
    # initializing heap
    most_freq_chars = []
    # Store character and its negative frequency in the array
    for char, count in char_counter.items():
        most_freq_chars.append([-count, char])
    # Construct heap from the array
    heapq.heapify(most_freq_chars)
    # initializing variables
    previous = None
    result = ""
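    while len(most_freq_chars) > 0 or previous:
        # if only the held-back character remains, it would have to sit next
        # to itself, so no valid arrangement exists
        if previous and len(most_freq_chars) == 0:
            return ""
        count, char = heapq.heappop(most_freq_chars)
        result = result + char
        # decrement the character count (counts are stored as negatives)
        count = count + 1
        # pushing the held-back char from the last iteration back to heap
        if previous:
            heapq.heappush(most_freq_chars, previous)
            previous = None
        # hold the current char back for one iteration if it has occurrences left
        if count != 0:
            previous = [count, char]
    return result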

Just the code#

# importing libraries
from collections import Counter
import heapq
def reorganize_string(str):
    char_counter = Counter(str)
    most_freq_chars = []
    for char, count in char_counter.items():
        most_freq_chars.append([-count, char])
    heapq.heapify(most_freq_chars)
    previous = None
    result = ""
    while len(most_freq_chars) > 0 or previous:
        if previous and len(most_freq_chars) == 0:
            return ""
        count, char = heapq.heappop(most_freq_chars)
        result = result + char
        count = count + 1
        if previous:
            heapq.heappush(most_freq_chars, previous)
            previous = None
        if count != 0:
            previous = [count, char]
    return result

Solution summary#

Store each character and its frequency in a hash map.
Construct a max-heap using the character frequency data so that the most frequently occurring character is at the root of the heap.
Iterate over the heap until all the characters have been considered.
- Pop the most frequently occurring character from the heap and append it to the result string.
- Decrement the count of the popped character (since we have used one occurrence of it).
- Push the popped character back onto the heap in the next iteration if the updated frequency is greater than $0$ .
After all the iterations, return the reorganized string.
If the heap becomes empty while a character is still waiting to be pushed back onto it, reorganization of the string is not possible, so return an empty string.
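Since any valid arrangement is an acceptable answer, comparing the output against one fixed string is brittle. A minimal checker sketch is given below. The name `is_valid_reorganization` is hypothetical. It also relies on a fact not stated above: a valid arrangement exists exactly when no character occurs more than half the string length, rounded up. The sketch assumes a non-empty input, as the constraints guarantee.

```python
from collections import Counter


def is_valid_reorganization(original, result):
    """Checks a candidate answer; assumes original is non-empty."""
    if result == "":
        # An empty result is only correct when no arrangement exists, i.e. one
        # character occurs more than ceil(len / 2) times (assumption noted above).
        return max(Counter(original).values()) > (len(original) + 1) // 2
    # The result must use exactly the same characters as the input...
    same_letters = Counter(original) == Counter(result)
    # ...and no character may equal its right-hand neighbor.
    no_equal_neighbors = all(a != b for a, b in zip(result, result[1:]))
    return same_letters and no_equal_neighbors


print(is_valid_reorganization("aab", "aba"))  # True
print(is_valid_reorganization("aaab", ""))    # True
print(is_valid_reorganization("aab", "aab"))  # False
```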

Time complexity#

As we iterate through the heap, every popped element may be pushed back onto the heap. This process is repeated until we have considered all the characters in the input string. Therefore, the iteration runs $O(n)$ times, where $n$ is the number of characters in the string. The worst-case time complexity of the push operation is $O(\log(c))$ , where $c$ is the number of distinct characters in the string. The time complexity is therefore $O(n \log(c))$ . Since the upper bound on $c$ is the size of the alphabet, which is $26$ , the $\log(c)$ term is effectively a constant. As a result, the overall time complexity is $O(n)$ .
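The bound can be written out term by term. This is a sketch that assumes counting the characters takes $O(n)$ , building the heap from $c$ entries takes $O(c)$ , and appending a character to the result costs $O(1)$ amortized:

$$T(n) = \underbrace{O(n)}_{\text{counting}} + \underbrace{O(c)}_{\text{heapify}} + \underbrace{n \cdot O(\log(c))}_{\text{pops and pushes}} = O(n \log(c)), \qquad c \leq 26 \;\Rightarrow\; T(n) = O(n)$$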

Space complexity#

In our solution, we employed two data structures: a hash map and a heap. The hash map stores the frequencies of characters in the input string, while the heap is used to build the desired string. Both data structures store lowercase English letters, so each holds at most $26$ entries, a fixed number. As a result, the auxiliary space complexity of our solution is $O(1)$ , not counting the $O(n)$ result string.
