poj 3415 Common Substrings

阿新 • • 發佈：2018-05-20

top != accepted lang dep ons pri tro vector

Common Substrings

Time Limit: 5000MS		Memory Limit: 65536K
Total Submissions: 12585		Accepted: 4228

Description

A substring of a string T is defined as:

T(i, k)=T_iT_i₊₁...T_i+k_-1, 1≤i≤i+k-1≤|T|.

Given two strings A, B and one integer K, we define S, a set of triples (i, j, k):

S = {(i, j

, k) | k≥K, A(i, k)=B(j, k)}.

You are to give the value of |S| for specific A, B and K.

Input

The input file contains several blocks of data. For each block, the first line contains one integer K, followed by two lines containing strings A and B, respectively. The input file is ended by K=0.

1 ≤ |A

|, |B| ≤ 10⁵
1 ≤ K ≤ min{|A|, |B|}
Characters of A and B are all Latin letters.

Output

For each case, output an integer |S|.

Sample Input

2
aababaa
abaabaa
1
xx
xx
0

Sample Output

22
5

題意：求兩個字符串的長度大於k的子串的數量
思路：其實就是求兩個字符串當中的任意兩個後綴的相同前綴的數量,設lcp是任意兩個後綴的相同前綴的最大長度，那麽這兩個後綴的長度大於K的相同前綴數量為lcp-K+1.
直接枚舉兩個字符串的所有後綴並累加他們的前綴數量復雜度在O(n^2)行不通。
可以利用單調棧。首先把兩個字符串s1,s2進行合並,中間可以加個不同的字符(譬如‘$‘)來區別，即s=s1+‘$‘+s2 ，求s的後綴數組和高度數組。
首先任意兩個後綴,記它們在後綴數組中位置分別為i,j，則它們的高度lcp可以表示為min(lcp[i],lcp[i+1],...,lcp[j-1])，既然如此，可以用單調棧來維護lcp
對於s2的每一個後綴B，考慮所有字典序在B前面的s1的後綴Ai，計算所有Ai與B的相同前綴的數量和，可以用單調棧優化。對於s1中的每個後綴A，計算Bi與A的相同前綴數量和與之前是類似的。
在高度數組當中把高度大於等於K的連續的序列分成一塊，一塊一塊的用單調棧考慮，具體見代碼：

AC代碼：

#define _CRT_SECURE_NO_DEPRECATE
#include<iostream>
#include<algorithm>
#include<vector>
#include<cstring>
#include<string>
#include<cmath>
using namespace std;
const int INF = 0x3f3f3f3f;
const int N_MAX = 100000 + 20;
typedef long long ll;
int n, k;
int Rank[N_MAX*2];
int tmp[N_MAX*2];
int sa[N_MAX * 2];
int lcp[N_MAX*2];
bool compare_sa(const int& i,const int& j) {
    if (Rank[i] != Rank[j])return Rank[i] < Rank[j];
    else {
        int ri = i + k <= n ? Rank[i + k] : -1;
        int rj = j + k <= n ? Rank[j + k] : -1;
        return ri < rj;
    }
}

void construct_sa(const string& S,int *sa) {
    n = S.size();
    for (int i = 0; i <= n;i++) {
        sa[i] = i;
        Rank[i] = i < n ? S[i] : -1;
    }
    for (k = 1; k <= n;k*=2) {
        sort(sa,sa+n+1,compare_sa);
        tmp[sa[0]] = 0;
        for (int i = 1; i <= n;i++) {
            tmp[sa[i]] = tmp[sa[i - 1]] + (compare_sa(sa[i - 1], sa[i]) ? 1 : 0);
        }
        for (int i = 0; i <= n;i++) {
            Rank[i] = tmp[i];
        }
    }
}
void construct_lcp(const string& S,int *sa,int *lcp){
    memset(lcp,0,sizeof(lcp));
    int n = S.length();
    for (int i = 0; i <= n; i++)Rank[sa[i]] = i;
    int h = 0;
    lcp[0] = 0;
    for (int i = 0; i < n; i++) {
        int j = sa[Rank[i] - 1];
        if (h > 0)h--;
        for (; j + h < n&&i + h < n; h++) {
            if (S[j + h] != S[i + h])break;
        }
        lcp[Rank[i] - 1] = h;
    }
}

int K;
string s1, s2, s;
ll top, accumu;
int stack[N_MAX * 2][2];//1存放人數,0存放lcp
ll find_num(int sz1,bool is_s1) {
    ll res = 0; top = accumu = 0;
    for (int i = 0; i < s.size(); i++) {
        if (lcp[i] < K) {
            top = 0; accumu = 0;
        }
        else {
            int size = 0;//統計高度為lcp[i]的人數
            if ((is_s1&&sa[i] < sz1) || (!is_s1&&sa[i] > sz1)) {//如果是s1中的後綴
                size++;
                accumu += lcp[i] - K + 1;
            }
            while (top>0&&lcp[i]<=stack[top-1][0]) {//前面的lcp高度比較高，則要削減高度直到和lcp[i]一樣，這樣之前的那些人的高度也變成lcp[i]了
                top--;
                accumu -= stack[top][1] * (stack[top][0] - lcp[i]);
                size += stack[top][1];
            }
            if (size) {
                stack[top][0] = lcp[i];
                stack[top][1] = size;
                top++;//!!!
            }
            if ((is_s1&&sa[i+1] > sz1) || (!is_s1&&sa[i+1] < sz1)) {//sa[i+1]是s2中的後綴!!!
                res += accumu;
            }
        }
    }
    return res;
}

int main() {
    while (scanf("%d",&K)&&K) {
        cin >> s1 >> s2;
        int sz1 = s1.size();
        int sz2 = s2.size();
        s = s1 + ‘$‘ + s2;
        construct_sa(s,sa);
        construct_lcp(s,sa,lcp);
        printf("%lld\n",find_num(sz1,1)+find_num(sz1,0));
    }
    return 0;
}

poj 3415 Common Substrings

POJ 3415 Common Substrings（長度不小於K的公共子串的個數+後綴數組+height數組分組思想+單調棧）

3*3 直接 math break can type strings 需要 bre http://poj.org/problem?id=3415 題意：求長度不小於K的公共子串的個數。思路：好題！！！拉丁字母讓我Wa了好久！！單調棧又讓我理解了好久！！太弱啊！！

poj 3415 Common Substrings

top != accepted lang dep ons pri tro vector Common Substrings Time Limit: 5000MS Memory Limit: 65536K Total Submissions: 12585

Common Substrings POJ - 3415(長度不小於k的公共子串的個數)

mat oid continue return src substr alt 技術 pen 題意：　　給定兩個字符串A 和 B，求長度不小於 k 的公共子串的個數（可以相同）分兩部分求和sa[i-1] > len1 sa[i] < len1 和

POJ 1458 - Common Subsequence（最長公共子串）

strlen cstring algorithm 鏈接 space %d ace -s set 此文為博主原創題解，轉載時請通知博主，並把原文鏈接放在正文醒目位置。題目鏈接：http://poj.org/problem?id=1458 AC代碼：

POJ 1458 - Common Subsequence（最長公共子序列）題解

void 方式 mem strong 輸出 inline ron eof init 此文為博主原創題解，轉載時請通知博主，並把原文鏈接放在正文醒目位置。題目鏈接：http://poj.org/problem?id=1458 題目大意：有若幹組數據，每組給出兩個字符

POJ #1458 Common Subsequence

str get 問題 des 技術分享 sin 個數 bcf std Description 　　問題的描述以及樣例在這裏：1458 Common Subsequence Sample 　　 INPUT: abcfbc a

POJ 1458 Common Subsequence（動態規劃）

Common Subsequence Time Limit: 1000MS Memory Limit: 10000K Total Submissions: 61454 &n

POJ 1458 Common Subsequence （公共最長子序列）

題目描述： A subsequence of a given sequence is the given sequence with some elements (possible none) left out. Given a sequence X = < x1,

POJ 1458 Common Subsequence

A subsequence of a given sequence is the given sequence with some elements (possible none) left out. Given a sequence X = < x1, x2, ...

POJ 1458 Common Subsequence(最長公共子序列LCS)

題意: 給你兩個字串, 要你求出兩個字串的最長公共子序列長度. 分析: 本題不用輸出子序列,很簡單,直接處理即可. 首先令dp[i][j]==x表示A串

poj 1458 Common Subsequence 最基本的LCS 最長公共子序列

剛好對應演算法導論 15.4 最長公共子序列#include <iostream> #include <stdio.h> #include <cstring> using namespace std; string strA,str

最長公共子串（Longest Common SubStrings）

Description 給出兩個字串，求出兩個字串的公共子串？ sample Input encodingmy mydecoding sample Output coding

POJ 1330 Nearest Common Ancestors（lca）

我不 ont compute data pri rst nss cfb str POJ 1330 Nearest Common Ancestors A rooted tree is a well-known data structure in computer s

POJ 1470 -- Closest Common Ancestors

bre else common 多次提交 flag 變量 spa pro int 題目鏈接：http://poj.org/problem?id=1470 Closest Common Ancestors Time Limit: 2000MS Memory Lim

poj-1330 Nearest Common Ancestors

AS PE ram stdin tinc contain repr can node A rooted tree is a well-known data structure in computer science and engineering. An example i

POJ - 1330 Nearest Common Ancestors 最近公共祖先+鏈式前向星模板題

mon represent pac different add const nod sam ger A rooted tree is a well-known data structure in computer science and engineering. An ex

POJ 1470 Closest Common Ancestors (模板題)(Tarjan離線)【LCA】

clear pac push 公共祖先 back family ble lan tarjan <題目鏈接> 題目大意：給你一棵樹，然後進行q次詢問，然後要你統計這q次詢問中指定的兩個節點最近公共祖先出現的次數。解題分析：LCA模板題，下面用的是離線Tarjan

POJ-1330-Nearest Common Ancestors(LCA+倍增模板題)

題目連結：http://poj.org/problem?id=1330 Description A rooted tree is a well-known data structure in computer science and engineering. An example is sh

POJ 1330 Nearest Common Ancestors (模板題) (LCA)【倍增】

<題目連結> 題目大意：給出一棵樹，問任意兩個點的最近公共祖先的編號。解題分析：LCA模板題，下面用的是線上倍增演算法求解。 1 #include <cstdio> 2 #include <cstring> 3 #include <algori

POJ 1330 Nearest Common Ancestors (LCA模板題)

#include<cstdio> #include<cstring> #include<algorithm> using namespace std; #define debug puts("YES"); #define rep(x,

poj 3415 Common Substrings

相關推薦