Commits · 3d2efec791726aa8e98215d15cf4d2a91015c623 · uplex-varnish / libvmod-selector

16 Oct, 2020 20 commits

malloc the temp array used for sorting in .compile(). · 3d2efec7
Geoff Simmons authored Jun 01, 2020
```
For large sets, workspace could be too small.
```
3d2efec7

Implement perfect hashing based on universal hashing. · 1f7815f1

Geoff Simmons authored Jun 01, 2020

Universal hashing has a sounder theoretical basis; in particular, it
doesn't have the dubious minimum hash table size below which a
perfect hash may not be possible, and which was set by trial and error.

For nearly all test data, universal hashing performs at least as
well or better. Especially better for sets with longer strings,
since the subject string is cast as an array of uint32_t, so the
hash is computed in fewer operations.

The only exception I've noticed is /usr/share/dict/words, which now
appears to have more collisions than under the previous approach.
But it appears likely that this only becomes an issue for sets that
are much larger than are probable for VCL use cases (in the 100,000
range), and if all of the sets' elements are tested for matches
about equally often (whereas real-world usage patterns tend to
match a subset much more frequently).

1f7815f1

QP_Insert() requires that strings are added in sorted order. · b956c988

Geoff Simmons authored May 31, 2020

The VMOD does this during .compile(), and QP_Insert() is no longer
called during .add(). The .compile() call is now required in all
cases, and it must be called before .create_stats().

This is because QP_Insert() was not correctly rotating the trie
when a set has overlapping prefixes, and a shorter prefix was
added before the longer one. With sorted order, shorter prefixes
are always added first, so rotation is unnecessary.

b956c988

Revert "Prune the QP prefix sub-branch search more efficiently." · b0b250a2
Geoff Simmons authored May 31, 2020
```
This reverts commit afded0f5.
```
b0b250a2
Prune the QP prefix sub-branch search more efficiently. · 36b01633
Geoff Simmons authored Mar 27, 2020

36b01633

Implement QP prefix searching without recursion. · 83364942

Geoff Simmons authored Mar 27, 2020

The new algorithm improves efficiency with iteration in place of
recursion, and in a number of other ways:

- Avoid searches into dead-end branches. The traversal of all branches
was done because of the overlapping prefix case -- "foo" and "foobar"
both in the set. Now we just search the tree for a match, but before
descending into the next branch, check if there are other branches at
which the current prefix matches a terminating node.

- Only do string comparisons when we hit a terminating node.

- Mark terminating nodes with a flag in the tree, so that we don't go
looking for the null byte in the strings table during the search.

While we're here, rename the flag for the nibble search as hinib --
non-zero if and only if we inspect the most significant nibble at that
node. Also remove some dead code from QP_Insert().

83364942

Delete some dead code. · fa021199
Geoff Simmons authored Mar 26, 2020

fa021199
PH benchmark prints the match throughput. · cb23a30e
Geoff Simmons authored Mar 26, 2020

cb23a30e
Correcty set the table size and bits for the minimum size. · 80b1c726
Geoff Simmons authored Mar 26, 2020

80b1c726
Hashing counts down from the length of the string. · 39bef809
Geoff Simmons authored Mar 24, 2020
```
May be advantageous for loop/branch prediction.
```
39bef809
Perfect hashing uses 64-bit FNV-1a with xor folding. · 105a1327
Geoff Simmons authored Mar 24, 2020
```
Theoretically, this reduces the probability of collisions. Benchmarks
don't show much of a difference.
```
105a1327

VMOD uses quadbit tries and perfect hashes instead of patricia tries. · 883a6d61

Geoff Simmons authored Mar 20, 2020

This adds the .compile() method to set objects, required for the
use of .match().

Docs for the .compile() method are currently incomplete.

883a6d61

Hash checks strings against min and max length for the set. · c74275e2

Geoff Simmons authored Mar 20, 2020

strlen() is also cheap if it has a SIMD implementation, so we can
afford this optimization to reject some strings immediately.

c74275e2

Re-order fields in struct pt_y so that it is packed to a smaller object. · 32a979db
Geoff Simmons authored Mar 20, 2020

32a979db
Rename the struct for qp tries. · cd40d26b
Geoff Simmons authored Mar 06, 2020

cd40d26b
Add a benchmark tool for the perfect hash functions. · 6e0d540f
Geoff Simmons authored Mar 06, 2020

6e0d540f
Add a perfect hash implementation, for full matches only. · 9488c9f1
Geoff Simmons authored Mar 06, 2020
```
Cannot be used for prefix matches.
```
9488c9f1
gitignore the benchmark artifacts. · f2928e66
Geoff Simmons authored Mar 03, 2020

f2928e66
Add a benchmark utility for the QP functions. · 13e3114c
Geoff Simmons authored Mar 03, 2020

13e3114c

Add the QP interface as a possible replacement for patricia tries. · 5deae778

Geoff Simmons authored Mar 03, 2020

For "quadbit patricia tries", inspired by the work of Tony Finch:
https://dotat.at/prog/qp/README.html

Radix 16 tries, examining a nibble at a time, to make the tries
smaller and reduce pointer chasing.

5deae778

01 Sep, 2020 3 commits
- Adjust to changed WS_* interface. · 62ea5cea
  Geoff Simmons authored Sep 01, 2020
  
  62ea5cea
- Changed README from the current vmodtool. · 01327a74
  Geoff Simmons authored Sep 01, 2020
  
  01327a74
- Fix make distcheck. · 7f1a7405
  Geoff Simmons authored Sep 01, 2020
  
  7f1a7405
03 Mar, 2020 5 commits
- gitignore the benachmark artifact. · ecfb9b9a
  Geoff Simmons authored Mar 03, 2020
  
  ecfb9b9a
- Add a benchmark utility for the PT functions. · 9c27ee9f
  Geoff Simmons authored Mar 03, 2020
  
  9c27ee9f
- gitignore package build artifacts. · 37747adf
  Geoff Simmons authored Mar 03, 2020
  
  37747adf
- Add some more info to the debugging dump. · d388e37e
  Geoff Simmons authored Mar 03, 2020
  
  d388e37e
- Speed up the exact match operation. · 35d24473
  Geoff Simmons authored Mar 03, 2020
```
Only call strcmp() once, when a node is reached that must be either a
hit or a miss.
```
  35d24473
28 Feb, 2020 1 commit
- Speed up prefix matches a bit. · da885ca1
  Geoff Simmons authored Feb 28, 2020
  
  da885ca1
27 Feb, 2020 1 commit

Fix a bug in match(). · 47eae6cd

Geoff Simmons authored Feb 27, 2020

The search may have matched a string that is actually a prefix of
the subject string, if a longer string with the same prefix is also
in the set.

This "happens" to give correct results for match(), but which()
would return the wrong value.

The fix uses strcmp() instead of memcmp(), but that is also
vectorized, where the C library uses vector instructions.

47eae6cd

26 Feb, 2020 4 commits

Document the new stats (setsz and nodesz). · 840a781b
Geoff Simmons authored Feb 26, 2020

840a781b
Auto-generated README reformatting. · 35ccc746
Geoff Simmons authored Feb 26, 2020

35ccc746
Add stats setsz and nodesz. · 3df6c1fa
Geoff Simmons authored Feb 26, 2020

3df6c1fa

Remove the byte-to-byte compares in match and prefix searches. · 729b8c44

Geoff Simmons authored Feb 26, 2020

Vector extensions are common hardware now, as are C libraries that
use vector instructions to implement functions like memcmp(). So
we hand off compares to the lib to get the advantage.

For the same reason, we can afford to call strlen() on the subject
string to locate the terminating null, rather than scan for it.

Also, the match function descends through the trie to find a
potential match, and does the comparison only then, as is common
for trie/critbit/patricia implementations.

729b8c44

10 Dec, 2019 1 commit

Support backend None · f95af2f6

Nils Goroll authored Dec 10, 2019

since varnish-cache ecef48518f3b3f4bbf28256e090bdbb5cd2b163c backends
can be NULL (as defined with backend <name> None)

f95af2f6

09 Dec, 2019 1 commit
- Bugfix object fini with more than one data entry (string, backend, etc). · 24e35725
  Geoff Simmons authored Dec 09, 2019
```
Fixes #1
```
  24e35725
31 Oct, 2019 1 commit

Add autotool support for generating coverage reports. · b5303dfa

Geoff Simmons authored Oct 31, 2019

configure checks if you have lcov & genhtml; these can be specified
with --with-lcov and/or --with-genhtml.

If they are available, then make coverage does the following:

- make clean, then make check with CC=gcc and CFLAGS set so that
  inputs for gcov/lcov are generated.

- lcov creates the src/coverage subdir and generates a targetfile
  there.

- genhtml generates HTML reports in src/coverage.

b5303dfa

02 Oct, 2019 2 commits
- Update README formatting. · 13b3b902
  Geoff Simmons authored Oct 02, 2019
  
  13b3b902
- Add the ZERO_OBJ workaround. · e13017aa
  Geoff Simmons authored Oct 02, 2019
  
  e13017aa
22 Aug, 2019 1 commit
- Add the integer param to .add(), and the .integer() method. · 0608f516
  Geoff Simmons authored Aug 22, 2019
  
  0608f516