Abstract
We present new algorithms for exact string matching of long patterns. Our algorithms read (Formula presented.) -grams at constant distances and are variations of the simplified BNDM algorithm. We demonstrate the competitiveness of our solutions through practical experiments. Many of our algorithms were faster than previous methods for English and DNA patterns between 400 and 50,000 in length. Our algorithms were still better when the preprocessing time was taken into account or when the patterns were taken from a different text of the same type.
| Original language | English |
|---|---|
| Pages (from-to) | 2160-2169 |
| Number of pages | 10 |
| Journal | Software - Practice and Experience |
| Volume | 54 |
| Issue number | 11 |
| Early online date | 2024 |
| DOIs | |
| Publication status | Published - Nov 2024 |
| MoE publication type | A1 Journal article-refereed |
Keywords
- exact string matching
- experimental comparison of algorithms
- fingerprint
- q-gram