Updated manual

Andrew S. Rightenburg 2023-08-24 13:47:14 -04:00
parent 4821398a5f
commit 40361a4981
Signed by: rail5
GPG Key ID: A0CB570AB6629159
1 changed files with 41 additions and 0 deletions

@@ -228,6 +228,47 @@ For example: \f[V]polonius-reader -f \[dq][a-z]+[0-9]{2}\[dq] -e\f[R]
All of the above about normal searches applies also to regex searches.
Regex searches, however, are significantly slower than normal searches.
.PP
A test similar to the one described in the \f[B]search\f[R] section was
run:
.IP \[bu] 2
A \f[B]2.5GB\f[R] file was created using
randomtext (https://github.com/rail5/randomtext)
.IP \[bu] 2
The string \[lq]123-456-7890\[rq] was inserted approximately 2.4GB into
the file (very near the end)
.IP \[bu] 2
The following commands were run through the Bash \f[B]time\f[R] utility:
.RS 2
.IP "1." 3
\f[V]polonius-reader ./big-file -f \[dq][0-9]{3}-[0-9]{3}-[0-9]{4}\[dq] -e -b 16M\f[R]
.IP "2." 3
\f[V]grep -Eo \[dq][0-9]{3}-[0-9]{3}-[0-9]{4}\[dq] ./big-file\f[R]
.RE
.IP \[bu] 2
The \f[B]Polonius\f[R] command found the match and produced this output:
.IP
.nf
\f[C]
123-456-7890
real 2m46.267s
user 2m45.862s
sys 0m0.384s
\f[R]
.fi
.IP \[bu] 2
The \f[B]grep\f[R] command ran out of memory and did not report the match:
.IP
.nf
\f[C]
grep: memory exhausted
real 0m8.770s
user 0m5.506s
sys 0m3.207s
\f[R]
.fi
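.PP
The test setup above can be reproduced at a much smaller scale with a
short script. The following is a minimal Python sketch, not the method
used for the figures above: the helper name make_test_file, the filler
alphabet, and the sizes are all assumptions made for illustration, and
the random filler merely stands in for the output of randomtext.

```python
import random
import string


def make_test_file(path, total_size, needle, offset, seed=0):
    """Write total_size bytes of random filler with needle placed at offset.

    Hypothetical helper for illustration only. The filler contains no
    digits, so a digit-based needle occurs exactly once in the file.
    """
    if offset < 0 or offset + len(needle) > total_size:
        raise ValueError("needle does not fit at the requested offset")
    rng = random.Random(seed)
    alphabet = (string.ascii_lowercase + " ").encode()
    filler = bytes(rng.choice(alphabet) for _ in range(total_size))
    # Overwrite a slice of the filler so the file size stays total_size.
    data = filler[:offset] + needle + filler[offset + len(needle):]
    with open(path, "wb") as f:
        f.write(data)
    return offset


if __name__ == "__main__":
    size = 1_000_000
    # Place the needle about 96% of the way in, mirroring 2.4GB of 2.5GB.
    make_test_file("./small-file", size, b"123-456-7890", int(size * 0.96))
```

The resulting file can then be searched with the two commands listed
above, substituting ./small-file for ./big-file.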
.PP
Polonius cannot find regex matches that are longer than
the \f[I]block size\f[R] (\f[I]10KB\f[R] by default).
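.PP
The block size limit can be illustrated with a small Python sketch. This
is not Polonius's actual search algorithm, which is not described here.
It only assumes that the searcher never examines more than one block's
worth of bytes at a time. The function name find_within_window and the
sample data are hypothetical.

```python
import re


def find_within_window(data, pattern, block_size):
    """Return (offset, match) for the first match that fits in block_size bytes.

    Illustrative only: each candidate offset is tested against a window
    of at most block_size bytes, so a match longer than block_size can
    never be reported.
    """
    regex = re.compile(pattern)
    for start in range(len(data)):
        window = data[start:start + block_size]
        m = regex.match(window)  # anchored at the start of the window
        if m:
            return start, m.group()
    return None


if __name__ == "__main__":
    data = b"xx123-456-7890xx"
    pattern = rb"[0-9]{3}-[0-9]{3}-[0-9]{4}"  # always matches 12 bytes
    print(find_within_window(data, pattern, 12))  # window is large enough
    print(find_within_window(data, pattern, 11))  # window is one byte short
```

With a 12-byte window the 12-byte match is found at offset 2. With an
11-byte window the same data yields no match, which is why a larger
block size (set with \f[V]-b\f[R]) is needed when long matches are expected.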
.SS BLOCK SIZE