Longest common substring in linear time

We know that the longest common substring of two strings can be found in O(N^2) time complexity.
Can a solution be found in only linear time?
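
For reference, the quadratic bound mentioned above is usually obtained with a dynamic program over pairs of end positions. The following is only a minimal sketch of that baseline; the function name `lcs_quadratic` is made up for illustration and is not part of the question.

```python
def lcs_quadratic(s: str, t: str) -> str:
    """Return one longest common substring of s and t in O(len(s) * len(t)) time."""
    best_len, best_end = 0, 0
    # prev[j] = length of the longest common suffix of s[:i-1] and t[:j]
    prev = [0] * (len(t) + 1)
    for i in range(1, len(s) + 1):
        cur = [0] * (len(t) + 1)
        for j in range(1, len(t) + 1):
            if s[i - 1] == t[j - 1]:
                # Extend the common suffix that ended one character earlier in both strings.
                cur[j] = prev[j - 1] + 1
                if cur[j] > best_len:
                    best_len, best_end = cur[j], i
        prev = cur
    return s[best_end - best_len:best_end]


print(lcs_quadratic("xabcdy", "zabcdw"))  # abcd
```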

edited 1 hour ago

Discrete lizard♦

4,44011537

asked 2 hours ago

Manoharsinh Rana

917

algorithms time-complexity strings longest-common-substring

3 Answers

Yes, the longest common substring of two strings can be found in $O(m+n)$ time, where $m$ and $n$ are the lengths of the two strings, assuming the size of the alphabet is constant.

Here is an excerpt from https://en.wikipedia.org/wiki/Longest_common_substring_problem.

The longest common substrings of a set of strings can be found by building a generalized suffix tree for the strings, and then finding the deepest internal nodes which have leaf nodes from all the strings in the subtree below it.

Building a generalized suffix tree for the two given strings takes $\Theta(m+n)$ time using Ukkonen's algorithm. Finding the deepest internal nodes whose subtrees contain leaves from both strings also takes $\Theta(m+n)$ time. Hence we can find the longest common substring in $\Theta(m+n)$ time.
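
A full Ukkonen construction is too long to show here, so the sketch below swaps in a different structure, a suffix automaton of the first string, which also gives $O(m+n)$ time for a constant-size alphabet (with Python dictionaries, expected time). It is not the suffix-tree method described above, only an illustration that a linear bound is achievable; the function name `longest_common_substring` is an assumption made for this example.

```python
def longest_common_substring(s: str, t: str) -> str:
    """Return one longest common substring of s and t via a suffix automaton of s."""
    nxt = [{}]      # nxt[v][ch] = target state of the transition from v on ch
    link = [-1]     # suffix links
    length = [0]    # length of the longest string in each state
    last = 0
    for ch in s:
        cur = len(nxt)
        nxt.append({})
        link.append(-1)
        length.append(length[last] + 1)
        p = last
        while p != -1 and ch not in nxt[p]:
            nxt[p][ch] = cur
            p = link[p]
        if p == -1:
            link[cur] = 0
        else:
            q = nxt[p][ch]
            if length[p] + 1 == length[q]:
                link[cur] = q
            else:
                # Split state q by cloning it with a shorter maximal length.
                clone = len(nxt)
                nxt.append(dict(nxt[q]))
                link.append(link[q])
                length.append(length[p] + 1)
                while p != -1 and nxt[p].get(ch) == q:
                    nxt[p][ch] = clone
                    p = link[p]
                link[q] = clone
                link[cur] = clone
        last = cur

    # Feed t through the automaton, tracking the longest suffix of the
    # prefix read so far that is also a substring of s.
    state, cur_len = 0, 0
    best_len, best_end = 0, 0
    for i, ch in enumerate(t):
        while state != 0 and ch not in nxt[state]:
            state = link[state]
            cur_len = length[state]
        if ch in nxt[state]:
            state = nxt[state][ch]
            cur_len += 1
        else:
            cur_len = 0  # no match at all; we are at the initial state
        if cur_len > best_len:
            best_len, best_end = cur_len, i + 1
    return t[best_end - best_len:best_end]


print(longest_common_substring("xabcdy", "zabcdw"))  # abcd
```

The automaton has at most about $2m$ states, and the matching loop does amortized constant work per character of the second string, which is where the linear bound in this sketch comes from.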

answered 34 mins ago

Apass.Jack

13.3k1939

I did not see @D.W.'s answer, possibly because I was interrupted while writing my answer.
– Apass.Jack
30 mins ago

It is unlikely that an algorithm better than quadratic exists, let alone a linear one. For the related problem of finding subsequences, this is a known result: in the paper "Tight hardness results for LCS and other sequence similarity measures" by Abboud et al., the authors show that the existence of an algorithm with a running time of $O(n^{2-\varepsilon})$, for some $\varepsilon>0$, would refute the Strong Exponential Time Hypothesis (SETH).

SETH is considered very likely to be true (although this is not universally accepted), so it is unlikely that any $O(n^{2-\varepsilon})$-time algorithm exists.

While finding a substring is a slightly different problem, it seems likely to be equally hard.

edited 1 hour ago

answered 1 hour ago

Discrete lizard♦

4,44011537

Are you talking about subsequences? I am talking about substrings.
– Manoharsinh Rana
1 hour ago

@ManoharsinhRana Ah, I see. The problems are similar, and it is hard to find results for the substring variant. I think there are similar results for the substring problem, but they are not easy to find. You could try looking at papers that cite "Quadratic conditional lower bounds for string problems and dynamic time warping" by Bringmann and Künnemann, as their program led to a lot of results related to this problem.
– Discrete lizard♦
1 hour ago

Longest common substring is much easier than longest common subsequence. See my answer.
– D.W.♦
33 mins ago

Yes. There's even a Wikipedia article about it! https://en.wikipedia.org/wiki/Longest_common_substring_problem

In particular, as Wikipedia explains, there is a linear-time algorithm, using suffix trees (or suffix arrays).
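
As a rough illustration of the suffix-array route, the sketch below concatenates the two strings with a separator, builds a suffix array, computes the LCP array with Kasai's algorithm, and takes the largest LCP between adjacent suffixes that start in different strings. Two things here are assumptions made for brevity: the separator `"\x00"` is assumed not to occur in either input, and the suffix array is built by plain sorting, which is not linear time. A linear-time construction such as SA-IS would be needed to reach the $O(m+n)$ bound. The function name `lcs_via_suffix_array` is made up for this example.

```python
def lcs_via_suffix_array(s: str, t: str) -> str:
    """Return one longest common substring of s and t using a suffix array and LCP array."""
    sep = "\x00"  # assumption: this character appears in neither s nor t
    text = s + sep + t
    n = len(text)

    # Naive suffix array: sort start positions by the suffix they begin.
    # This is NOT linear time; it stands in for a linear-time construction.
    sa = sorted(range(n), key=lambda i: text[i:])
    rank = [0] * n
    for r, i in enumerate(sa):
        rank[i] = r

    # Kasai's algorithm: lcp[r] is the common prefix length of suffixes sa[r-1] and sa[r].
    lcp = [0] * n
    h = 0
    for i in range(n):
        if rank[i] > 0:
            j = sa[rank[i] - 1]
            while i + h < n and j + h < n and text[i + h] == text[j + h]:
                h += 1
            lcp[rank[i]] = h
            if h > 0:
                h -= 1
        else:
            h = 0

    # The answer is the largest LCP between neighbours that start in different strings.
    best_len, best_start = 0, 0
    for r in range(1, n):
        a, b = sa[r - 1], sa[r]
        if (a < len(s)) != (b < len(s)) and lcp[r] > best_len:
            best_len, best_start = lcp[r], a
    return text[best_start:best_start + best_len]


print(lcs_via_suffix_array("xabcdy", "zabcdw"))  # abcd
```

Because the separator occurs exactly once, a common prefix of a suffix from the first string and a suffix from the second can never extend across it, so the reported substring lies entirely inside both inputs.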

Searching on "longest common substring" turns up that Wikipedia article as the first hit (for me). In the future, please research the problem before asking here. (See, e.g., https://meta.stackoverflow.com/q/261592/781723.)

answered 52 mins ago

D.W.♦

102k12127291
