Package: AhoCorasickTrie 0.1.3
AhoCorasickTrie: Fast Searching for Multiple Keywords in Multiple Texts
Aho-Corasick is an optimal algorithm for finding many keywords in a text. It can locate all matches in a text in O(N+M) time; i.e., the time needed scales linearly with the number of keywords (N) and the size of the text (M). Compare this to the naive approach which takes O(N*M) time to loop through each pattern and scan for it in the text. This implementation builds the trie (the generic name of the data structure) and runs the search in a single function call. If you want to search multiple texts with the same trie, the function will take a list or vector of texts and return a list of matches to each text. By default, all 128 ASCII characters are allowed in both the keywords and the text. A more efficient trie is possible if the alphabet size can be reduced. For example, DNA sequences use at most 19 distinct characters and usually only 4; protein sequences use at most 26 distinct characters and usually only 20. UTF-8 (Unicode) matching is not currently supported.
Authors:
AhoCorasickTrie_0.1.3.tar.gz
AhoCorasickTrie_0.1.3.zip(r-4.7)AhoCorasickTrie_0.1.3.zip(r-4.6)AhoCorasickTrie_0.1.3.zip(r-4.5)
AhoCorasickTrie_0.1.3.tgz(r-4.6-x86_64)AhoCorasickTrie_0.1.3.tgz(r-4.6-arm64)AhoCorasickTrie_0.1.3.tgz(r-4.5-x86_64)AhoCorasickTrie_0.1.3.tgz(r-4.5-arm64)
AhoCorasickTrie_0.1.3.tar.gz(r-4.7-arm64)AhoCorasickTrie_0.1.3.tar.gz(r-4.7-x86_64)AhoCorasickTrie_0.1.3.tar.gz(r-4.6-arm64)AhoCorasickTrie_0.1.3.tar.gz(r-4.6-x86_64)
AhoCorasickTrie_0.1.3.tgz(r-4.6-emscripten)
manual.pdf |manual.html✨
card.svg |card.png
AhoCorasickTrie/json (API)
| # Install 'AhoCorasickTrie' in R: |
| install.packages('AhoCorasickTrie', repos = c('https://chambm.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/chambm/ahocorasicktrie/issues
Last updated from:b3a09ae28e. Checks:13 OK. Indexed: yes.
| Target | Result | Time | Files | Syslog |
|---|---|---|---|---|
| linux-devel-arm64 | OK | 106 | ||
| linux-devel-x86_64 | OK | 112 | ||
| source / vignettes | OK | 152 | ||
| linux-release-arm64 | OK | 116 | ||
| linux-release-x86_64 | OK | 111 | ||
| macos-release-arm64 | OK | 122 | ||
| macos-release-x86_64 | OK | 338 | ||
| macos-oldrel-arm64 | OK | 127 | ||
| macos-oldrel-x86_64 | OK | 281 | ||
| windows-devel | OK | 86 | ||
| windows-release | OK | 103 | ||
| windows-oldrel | OK | 98 | ||
| wasm-release | OK | 94 |
Exports:AhoCorasickSearchAhoCorasickSearchList
Dependencies:Rcpp
Readme and manuals
Help Manual
| Help page | Topics |
|---|---|
| Fast searching for one or more keywords in one or more texts | AhoCorasickSearch |
| Fast searching for one or more keywords in a list of texts | AhoCorasickSearchList |
| AhoCorasickTrie: fast searching for multiple keywords in multiple texts | AhoCorasickTrie-package AhoCorasickTrie |
