Oh, I had missed that the Galil rule gives Boyer-Moore linear worst case performance. This might mean that BM is always a better choice than KMP.
I see. Does the speed remain the same if the file is in cache? Also, how many matches are there? e.g., For a 13GB file in cache, `rg 'Sherlock Holmes' OpenSubtitles2018.raw.en -c` takes 1.7s for 7673 matches, which is about 7.5 GB/s.
-
-
You got faster CPU or RAM than I do: $ time rg 'Sherlock Holmes' ~/Downloads/en.txt -c 7673 real0m16,405s user0m1,998s sys0m3,852s $ time rg 'Sherlock Holmes' ~/Downloads/en.txt -c 7673 real0m2,734s user0m1,548s sys0m1,179s $
-
Yeah that's fine. My main point here was to question whether 900 MB/s should be your target. :-) Your search there (in cache) is running at ~4.9 GB/s. It's interesting though that your out of cache search isn't saturating your disk bandwidth? Or is it what you'd expect?
- 5 more replies
New conversation -
-
-
It drops from 5.5s to 1.4s, but I also just noticed that I’ve been running rg —version 0.9.0. Trying again with… *squints* 11.0.2.
-
4.8s and 1.2s. Oddly no count is printed, though I run with `-caF --no-ignore`.
- 4 more replies
New conversation -
-
-
If I clear caches first, then it takes about 24.5 seconds, or ~543 MB/s. (I have a SATAIII SSD, so this is close to its top speed.) The input file can be download here: http://opus.nlpl.eu/download.php?f=OpenSubtitles/v2018/mono/OpenSubtitles.raw.en.gz …
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.