I am stuck on the problem Shortest Subsequence. Can someone kindly explain an optimal algorithm to solve this problem?

Codeforces (c) Copyright 2010-2021 Mike Mirzayanov

Server time: Jan/21/2021 20:28:37 (f2).

Suppose we partition the string into $$$k$$$ contiguous subsegments such that each of the letters `GCAT` appears at least once in every subsegment. Then it is clear that every $$$k$$$-character string appears as a subsequence: take its $$$i$$$-th character from the $$$i$$$-th subsegment.

We can construct such a partition greedily. Find the shortest prefix of the string that contains all of the characters `GCAT`, make that one subsegment, then recurse on the remaining string. Note that this might actually partition the string into $$$k+1$$$ subsegments, where the last subsegment is "incomplete". The last character of each subsegment (other than the incomplete one) appears exactly once in that subsegment: if it had appeared earlier in the subsegment, the greedy would have ended that subsegment earlier.

If $$$k$$$ is maximal, then we can show that there exists a string of length $$$k+1$$$ that is not a subsequence. How? We can explicitly construct it as the last character of each of the $$$k$$$ complete subsegments, plus some character not in the incomplete subsegment (or any character, if there is no incomplete subsegment).

Thanks a lot.
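For anyone who wants to see the greedy construction above in code, here is a minimal sketch in Python. The function names `shortest_non_subsequence` and `is_subsequence` are made up for illustration, and the code assumes the input contains only the characters `GCAT`.

```python
def shortest_non_subsequence(dna: str, alphabet: str = "GCAT") -> str:
    """Build one shortest string over `alphabet` that is not a subsequence of `dna`.

    Assumes every character of `dna` belongs to `alphabet`.
    """
    seen = set()   # distinct characters in the current (unfinished) subsegment
    answer = []    # last character of each complete subsegment
    for ch in dna:
        seen.add(ch)
        if len(seen) == len(alphabet):
            # The subsegment just became complete; ch is its last character
            # and occurs exactly once in it.
            answer.append(ch)
            seen.clear()
    # `seen` now describes the incomplete subsegment (possibly empty), so it
    # misses at least one character of the alphabet. Append any such character.
    for ch in alphabet:
        if ch not in seen:
            answer.append(ch)
            break
    return "".join(answer)


def is_subsequence(pattern: str, text: str) -> bool:
    """Return True if `pattern` can be obtained from `text` by deleting characters."""
    it = iter(text)
    # `ch in it` consumes the iterator up to the first match, so the matches
    # are forced to occur in increasing positions.
    return all(ch in it for ch in pattern)
```

For the string `ACGTACGT` the sketch finds two complete subsegments and returns `TTG`, which has length $$$k+1 = 3$$$. Any other character not in the incomplete subsegment would work as the final letter, so the answer is not unique.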

Shisuko, great explanation!

I have a question. Is it possible to count the number of such subsequences within a reasonable time limit? If so, what is the approach?

Very nice explanation dude

Can you explain why this is the optimal way?

All strings of length at most $$$k$$$ appear as a subsequence, so the answer has to be at least $$$k+1$$$.

But the explicit construction shows that we can actually achieve $$$k+1$$$. Thus, the minimum is exactly $$$k+1$$$.
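If it helps, the two bounds can be checked by brute force on small inputs. The sketch below is only a sanity check under the assumption that the string uses the characters `GCAT`; the helper names are hypothetical. It compares the number of complete greedy subsegments $$$k$$$ with an exhaustive search for the shortest missing length.

```python
from itertools import product


def count_complete_segments(dna: str, alphabet: str = "GCAT") -> int:
    """Number k of complete subsegments produced by the greedy partition."""
    seen, k = set(), 0
    for ch in dna:
        seen.add(ch)
        if len(seen) == len(alphabet):
            k += 1
            seen.clear()
    return k


def is_subsequence(pattern, text) -> bool:
    """True if `pattern` appears in `text` as a subsequence."""
    it = iter(text)
    return all(ch in it for ch in pattern)


def brute_force_shortest_missing_length(dna: str, alphabet: str = "GCAT") -> int:
    """Smallest length for which some string over `alphabet` is not a subsequence.

    Exponential in the answer, so only usable on tiny inputs.
    """
    length = 1
    while True:
        # Terminates: no string longer than `dna` can be a subsequence of it.
        if any(not is_subsequence(p, dna) for p in product(alphabet, repeat=length)):
            return length
        length += 1
```

On every small string I would expect `brute_force_shortest_missing_length(s)` to equal `count_complete_segments(s) + 1`, which is exactly the "at least $$$k+1$$$, and $$$k+1$$$ is achievable" argument.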

Ohh!! We have greedily selected the minimum possible. Thanks :)