[TJU — Stupid Problem] is there an alternative solution to suffix tree? - Codeforces

→ Обратите внимание

До соревнования
Educational Codeforces Round 189 (Rated for Div. 2)
4 дня
Зарегистрироваться »

→ Трансляции

Leetcode Weekly 498 Solution Discussion

aryanc403

До начала 36:51:53

Всё →

→ Лидеры (рейтинг)

№	Пользователь	Рейтинг
1	Benq	3792
2	VivaciousAubergine	3647
3	Kevin114514	3603
4	jiangly	3583
5	turmax	3559
6	tourist	3541
7	strapple	3515
8	ksun48	3461
9	dXqwq	3436
10	Otomachi_Una	3413

Страны | Города | Организации

→ Лидеры (вклад)

№	Пользователь	Вклад
1	Qingyu	157
2	adamant	153
3	Um_nik	147
3	Proof_by_QED	147
5	Dominater069	145
6	errorgorn	142
7	cry	139
8	YuukiS	135
9	TheScrasse	134
10	chromate00	133

Всё →

→ Найти пользователя

→ Прямой эфир

Детальнее →

Блог пользователя k790alex

[TJU — Stupid Problem] is there an alternative solution to suffix tree?

Автор k790alex, 11 лет назад, По-английски

По-английски

Hello, I'm trying to solve this problem (link).

A brief description:

Given two strings (s, t) and Q queries, each query contains an integer K, you should count how many substrings of s can be equal to t with changing exactly K characters of s.

constrains:
|s| <= 10^6
|s| > |t|
Q <= 10^5
characters can be 'x' or 'y' only

I got a solution using suffix tree which runs in linear time but my problem is that it uses too much memory.

I tried to reduce it to suffix array but I couldn't.

As the constrains are high I guess I need a linear time solution, can you see how to do it using less memory?

Thanks.

Теги

suffix tree

+8

k790alex
11 лет назад
12

Комментарии

Комментарии (12)

Написать комментарий?

»

11 лет назад, скрыть # |

← Rev. 2 →

Проголосовать: нравится

+21

Проголосовать: не нравится

How did you solved this problem with suffix tree?

If we calculate hamming distance of small string with large string for every shifts, queries can be answered easily.

We can calculate those hamming distances with FFT. Overall complexity is O(NlogN)

→ Ответить

»

»

11 лет назад, скрыть # ^ |

Проголосовать: нравится

0

Проголосовать: не нравится

Thanks but I didn't know how to compute these using FFT, can you explain a little?

I'll try to describe my approach, let s = "xxyxx" and t = "xy", if I build the suffix tree for s (appending '$' as delimiter), I can traverse the tree with t counting the necessary changes to match them (see the picture).

Each node have and id and the string contained in it, there are texts explaining the results, with this I can build an array containing how many substrings can be equal to t with 0 changes, 1 changes, .. , |t| changes.

→ Ответить

»

»

»

11 лет назад, скрыть # ^ |

Проголосовать: нравится

0

Проголосовать: не нравится

How are you calculating number of mismatches? As you know suffix tree is a compressed trie that's why there might be a node that holding information about thousands of characters. Are you able to do that step in constant time?

→ Ответить

»

»

»

»

11 лет назад, скрыть # ^ |

Проголосовать: нравится

0

Проголосовать: не нравится

Yes, its a simple dfs like:

// node is the current node of the tree the dfs is visiting
// index the current index of t
// mismatches is the number of mismatches I had found to get into the node
void dfs(node, index, mismatches) {
 if ( index == |t| ) {
  // means i traverse all t
  // count(node) = number of leafs of subtree rooted at node ( O(1) after preprocessing)
  // ans[x] = number of substrings of s that can be matched to t changing exactly x chars
  ans[mismatches] += count(node);
  ret;
 }
 // node contains a substring of s
 // node.begin is start index (inclusive)
 // node.end is end index (exclusive)
 for (i = node.begin; i < node.end && index < |t|; i++, index++) {
 // traverse substring at node until we reach end of t or end of node
  if ( s[i] == '$' )
   ret; // a delimiter means node has not enough chars to match t
  // count mismatches while traversing substring at node
  mismatched += s[i] == t[index] ? 1 : 0;
 }

 // traverse children
 for each children of node called next
  dfs(next, index, mismatched);

 // i had traversed the whole t
 if ( node is a leaf )
  ans[mismatched]++;
}

→ Ответить

»

»

»

»

»

11 лет назад, скрыть # ^ |

Проголосовать: нравится

0

Проголосовать: не нравится

You are not calculating mismatches in constant time, you are doing it in linear time. It will get tle anyway, even if we can handle memory issue.

→ Ответить

»

»

»

»

»

»

11 лет назад, скрыть # ^ |

Проголосовать: нравится

0

Проголосовать: не нравится

Is linear to |s| as far as I know, every vertex in the tree is going to be visited at most 1, it may be something I can't see, can you explain me why?

→ Ответить

»

»

»

»

»

»

»

11 лет назад, скрыть # ^ |

Проголосовать: нравится

0

Проголосовать: не нравится

Please notice that your algorithm is same as simple brute force which is for each suffix do |S| comparsions. Which leads to overall O(|T||S|) complexity. There is no usage of suffix tree at all. Why do you even using it to travel on all suffixes? Only two loops would do the same!

→ Ответить

»

»

»

»

»

»

»

»

11 лет назад, скрыть # ^ |

Проголосовать: нравится

0

Проголосовать: не нравится

Whoops, I see it now.

Thanks.

→ Ответить

»

»

»

11 лет назад, скрыть # ^ |

Проголосовать: нравится

0

Проголосовать: не нравится

To calculate hamming distance with fft, one can create two polynomials where coficients is +1 for X and -1 for Y. Notice that if two characters is not same they will be counted as -1.

→ Ответить

»

»

»

»

11 лет назад, скрыть # ^ |

Проголосовать: нравится

0

Проголосовать: не нравится

I can't see it but multiplying these polynomials is going to give the hamming distance for each substring of s? if yes, thats wonderful.

→ Ответить

»

»

»

»

»

11 лет назад, скрыть # ^ |

Проголосовать: нравится

0

Проголосовать: не нравится

For each shift p denotes number of matches and q denotes number of mismatches.

p+q = lenght of small string

p-q = related coeficient coming from fft.

So we know both p and q

→ Ответить

»

»

»

»

»

»

11 лет назад, скрыть # ^ |

Проголосовать: нравится

0

Проголосовать: не нравится

I got an idea but I'm still confused, can you detail more your solution?

Thanks.

→ Ответить

Codeforces (c) Copyright 2010-2026 Михаил Мирзаянов

Соревнования по программированию 2.0

Время на сервере: 18.04.2026 00:38:07 (g2).

Десктопная версия, переключиться на мобильную.

Privacy Policy | Terms and Conditions

При поддержке