Algorithm Wildcard Searching with *

#	User	Rating
1	tourist	3985
2	jiangly	3814
3	jqdai0815	3682
4	Benq	3529
5	orzdevinwang	3526
6	ksun48	3517
7	Radewoosh	3410
8	hos.lyric	3399
9	ecnerwala	3392
9	Um_nik	3392

#	User	Contrib.
1	cry	169
2	maomao90	162
2	Um_nik	162
4	atcoder_official	161
5	djm03178	158
6	-is-this-fft-	157
7	adamant	155
8	awoo	154
8	Dominater069	154
10	luogu_official	150

I am thinking about an effient algorithm for wildcard searching with * representing any characters with any length.

aa*c, she*he, *she*he

Example:

caa find aa*c
return "DOES NOT EXIST"
hesherheshe find she*he return 2
Because sherheshe begins at index 2
sherheshe find she*he return 0 Because the whole string When I am supposed to return, say, the beginning index of the first matching instance.

Say the pattern is of length M and the document is of length N, and the pattern has K '*' signs. I can think of a solution that first uses AC Automation to find all occurences of each chunk in O(N + M), with bitmask.

While converting the bitmask to indexes takes O(N * K)

Then binary search for the last possible beginning positions for each chunk. This could take O (K log N)

So the overall time complexity is still O (N * K), any way to do better?

References

https://mirror.codeforces.com/blog/entry/111380

https://mirror.codeforces.com/problemset/problem/1023/A

https://mirror.codeforces.com/blog/entry/127169

Rev.	By	When	Δ	Comment
en7	cardcounter	2024-12-21 23:01:06	18
en6	cardcounter	2024-12-21 22:58:46	229
en5	cardcounter	2024-12-21 12:46:01	44	Tiny change: 'm/1023/A\n' -> 'm/1023/A\n\nhttps://mirror.codeforces.com/blog/entry/127169\n'
en4	cardcounter	2024-12-21 12:41:11	2	Tiny change: 'y/111380\nhttps://' -> 'y/111380\n\nhttps://'
en3	cardcounter	2024-12-21 12:40:45	108
en2	cardcounter	2024-12-21 12:32:51	7
en1	cardcounter	2024-12-21 12:30:21	746	Initial revision (published)

Rev.

Lang.

When

Comment

en7