Solution: Word Break

Let's solve the Word Break problem using the Dynamic Programming pattern.

Statement#

Given a string, s, and a dictionary of strings, word_dict, check if s can be segmented into a space-separated sequence of one or more dictionary words. If yes, return TRUE; else, return FALSE.

Note: The same word in the dictionary may be used multiple times.

Constraints:

  • 11 \leq s.length 250\leq 250

  • 11 \leq word_dict.length 1000\leq 1000

  • 11 \leq word_dict[i].length 20\leq 20

  • s and word_dict[i] consist of only lowercase English letters.

  • All the strings of word_dict are unique.

Solution#

So far, you’ve probably brainstormed some approaches and have an idea of how to solve this problem. Let’s explore some of these approaches and figure out which one to follow based on considerations such as time complexity and any implementation constraints.

Naive approach#

The naive approach to solve this problem is to use a basic recursive strategy, which considers all possible prefixes of a string and checks if they are present in the dictionary. If a prefix is contained in the dictionary, the rest of the string is checked in the same manner.

Although this approach can get us the desired output, it has a time complexity of O(2n)O(2^n), where nn is the length of the string. This is because at each step, we split the string in two, check the prefix in the dictionary, and repeat the process with the rest of the string. The space complexity of this approach is O(n)O(n) because the depth of the recursion tree can go up to nn.

Optimized approach using dynamic programming#

In the recursive approach, we notice that many prefixes are computed again even though they were checked in the earlier computations. For example, a prefix of two characters contains a prefix of one character that has already been checked. These redundant computations consume a lot of time that can be avoided using dynamic programming. The idea is to use a lookup table, dp, of size n+1n+1, where nn is the length of the input string and 11 is added to cater the empty string. This table will store the results of the shorter prefixes that can be used, in O(1)O(1) lookup time, for solving the longer prefixes.

The dp is initialized with FALSE except for dp[0], which is set TRUE because an empty string is assumed to be a valid word in the dictionary. Then, using two pointers, i and j, we check every possible prefix s[j..i] and whether they’re contained in the dictionary. If the substring s[j..i] is found in the dictionary, we don’t check further smaller substrings. We also check whether dp[j] is TRUE, which tells us that the prefix s[0..j] was found in the dictionary. If both conditions are fulfilled, the corresponding dp index, i.e., dp[i], is set to TRUE. We continue this process until the whole string has been processed. Finally, we return the value of dp[n], which tells us that the input string could be segmented into a space-separated sequence of dictionary words.

Let’s look at the following illustration to get a better understanding of the solution:

Created with Fabric.js 3.6.6 s v e g a n d i j dp T F F F F F F i will iterate over the input string from index 1 to the length of thestring +1. j will iterate over the substring from index 0 to i-1. dp[] is used to store the results. The 0th index of the dp is set toTRUE because an empty string is assumed to be part of the dictionary. word dict an and veg vegans

1 of 18

Created with Fabric.js 3.6.6 s v e g a n d i j The substring between i and j is "v". Since "v" is not contained in the dictionary, we don't update anything. dp T F F F F F F word dict an and veg vegans

2 of 18

Created with Fabric.js 3.6.6 s v e g a n d i j The substring between i and j is "ve". Since "ve" is not contained in the dictionary, we don't update anything. We increment j and check the next substring. dp T F F F F F F word dict an and veg vegans

3 of 18

Created with Fabric.js 3.6.6 s v e g a n d i j dp T F F F F F F The substring between i and j is "e". Since "e" is not contained in the dictionary, we don't update anything and proceed to the next step. word dict an and veg vegans

4 of 18

Created with Fabric.js 3.6.6 s v e g a n d i j dp T F F T F F F The substring "veg" is checked in the dictionary. Since it's contained in the dictionary and the value of dp[j] is also TRUE, we set dp[i] to TRUE. Since the whole prefix has been found in the dictionary, we terminate the j loop here. word dict an and veg vegans

5 of 18

Created with Fabric.js 3.6.6 s v e g a n d i j The substring between i and j is "vega". Since "vega" is not contained in the dictionary, we don't update anything. We increment j and check the next substring. dp T F F T F F F word dict an and veg vegans

6 of 18

Created with Fabric.js 3.6.6 s v e g a n d i j dp T F F T F F F The substring between i and j is "ega". Since "ega" is not contained in the dictionary, we don't update anything. We increment j and check the next substring. word dict an and veg vegans

7 of 18

Created with Fabric.js 3.6.6 s v e g a n d i j The substring between i and j is "ga". Since "ga" is not contained in the dictionary, we don't update anything. We increment j and check the next substring. dp T F F T F F F word dict an and veg vegans

8 of 18

Created with Fabric.js 3.6.6 s v e g a n d i j dp T F F T F F F The substring between i and j is "a". Since "a" is not contained in the dictionary, we don't update anything and proceed to the next step. word dict an and veg vegans

9 of 18

Created with Fabric.js 3.6.6 s v e g a n d i j The substring between i and j is "vegan". Since "vegan" is not contained in the dictionary, we don't update anything. We increment j and check the next substring. dp T F F T F F F word dict an and veg vegans

10 of 18

Created with Fabric.js 3.6.6 s v e g a n d i j The substring between i and j is "egan". Since "egan" is not contained in the dictionary, we don't update anything. We increment j and check the next substring. dp T F F T F F F word dict an and veg vegans

11 of 18

Created with Fabric.js 3.6.6 s v e g a n d i j The substring between i and j is "gan". Since "gan" is not contained in the dictionary, we don't update anything. We increment j and check the next substring. dp T F F T F F F word dict an and veg vegans

12 of 18

Created with Fabric.js 3.6.6 s v e g a n d i j The substring "an" is checked in the dictionary. Since it's contained in the dictionary and the value of dp[j] is also TRUE, we set dp[i] to TRUE. Since the whole prefix has been found in the dicitonary, we terminate the j loop here. dp T F F T F T F word dict an and veg vegans

13 of 18

Created with Fabric.js 3.6.6 The substring between i and j is "vegand". Since "vegand" is not contained in the dictionary, we don't update anything. We increment j and check the next substring. dp T F F T F T F s v e g a n d i j word dict an and veg vegans

14 of 18

Created with Fabric.js 3.6.6 The substring between i and j is "egand". Since "egand" is not contained in the dictionary, we don't update anything. We increment j and check the next substring. dp T F F T F T F s v e g a n d i j word dict an and veg vegans

15 of 18

Created with Fabric.js 3.6.6 The substring between i and j is "gand". Since "gand" is not contained in the dictionary, we don't update anything. We increment j and check the next substring. dp T F F T F T F s v e g a n d i j word dict an and veg vegans

16 of 18

Created with Fabric.js 3.6.6 dp T F F T F T T s v e g a n d i j The substring "and" is checked in the dictionary. Since it's contained in the dictionary and the value of dp[j] is also TRUE, we set dp[i] to TRUE. Since the whole prefix has been found in the dictionary, we terminate the j loop here. word dict an and veg vegans

17 of 18

Created with Fabric.js 3.6.6 dp T F F T F T T s v e g a n d i j After the whole string has been processed, we return the value of dp[n], which holds the final result. word dict an and veg vegans

18 of 18

Let's implement the above algorithm:

Word Break

Solution summary#

  1. We initialize a lookup table, dp, and store TRUE at dp[0] because an empty string is always assumed to be part of the dictionary.
  2. Using two pointers, i and j, we find all possible prefixes of the given input string.
  3. For each prefix s[j..i], we check whether it’s contained in the dictionary and if dp[j] is TRUE. If both conditions are fulfilled, the corresponding index in the lookup table, i.e., dp[i], is set to TRUE.
  4. After the whole string has been processed, we return the value of dp[n], which holds the final result.

Time complexity#

The time complexity of this algorithm is O(n2)O(n^2), where nn is the length of the input string.

Space complexity#

The space complexity of this algorithm is O(n)O(n) , where nn is the length of the input string. This is the space occupied by the lookup table.

Word Break

Maximum Profit in Job Scheduling