1. 13 Dec, 2023 3 commits
    • Rework fellow_cache_obj_iter and read ahead · e8d54546
      Nils Goroll authored
      Issue #41 has shown a deadlock scenario where various object iterators
      would wait for memory.
      
      While reviewing this issue, we noticed a couple of shortcomings in the
      existing code:
      
      * fellow_cache_seg_ref_in() would always wait for memory
        allocations for readahead segments. Yet, under memory pressure,
        we should not wait for readahead memory at all.
      
      * fellow_cache_obj_iter() would hold references on already sent
        segments even while waiting for synchronous I/O and memory
        allocations.
      
      To address these shortcomings and further optimize the code, parts
      of fellow_cache_obj_iter() and all of the readahead code have been
      rewritten. The improvements comprise the following:
      
      * For readahead, we now use asynchronous memory allocations. If
        they succeed right away, we also issue I/O right away, but if
        allocations are delayed, we continue delivery and check back
        later, by which time the allocations will hopefully have
        succeeded (see the sketch after this list).
      
      * We decouple memory allocations from specific segments and care
        only about the size of the allocation. Because many segments
        will be chunk_bytes in size, this allows more efficient use of
        the available asynchronous allocations.
      
      * We now also de-reference already sent segments whenever we need
        to wait for anything, be it a memory allocation or I/O. This
        should improve overall efficiency and reduce memory pressure,
        because already sent segments can be LRUd earlier.

        The drawback is that we flush the VDP pipeline more often (we
        need to do so before we can deref segments).
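
      As a rough illustration of the check-back pattern from the first
      point above, the delivery loop could look like the sketch below.
      All names here (ra_slot, mem_alloc_async(), ra_top_up()) are made
      up for this example and are not the actual fellow API.

      #include <stdio.h>
      #include <stdlib.h>

      #define RA_WINDOW 4

      struct ra_slot {
              void    *mem;           /* NULL until an allocation arrives */
              int     io_issued;
      };

      /* made-up non-blocking allocator: delayed twice, then succeeds */
      static void *
      mem_alloc_async(size_t sz)
      {
              static int delay = 2;

              if (delay-- > 0)
                      return (NULL);  /* delayed, check back later */
              return (malloc(sz));
      }

      /* top up the readahead window without ever blocking on memory */
      static void
      ra_top_up(struct ra_slot slots[], size_t chunk_bytes)
      {
              int i;

              for (i = 0; i < RA_WINDOW; i++) {
                      if (slots[i].mem == NULL)
                              slots[i].mem = mem_alloc_async(chunk_bytes);
                      if (slots[i].mem != NULL && !slots[i].io_issued) {
                              /* allocation succeeded: issue I/O right away */
                              printf("issue readahead I/O for slot %d\n", i);
                              slots[i].io_issued = 1;
                      }
              }
      }

      int
      main(void)
      {
              struct ra_slot slots[RA_WINDOW] = {{ NULL, 0 }};
              int seg;

              for (seg = 0; seg < 8; seg++) {
                      ra_top_up(slots, 64 * 1024);    /* never blocks */
                      printf("deliver segment %d\n", seg);
              }
              for (seg = 0; seg < RA_WINDOW; seg++)
                      free(slots[seg].mem);
              return (0);
      }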
      
      We also cap the readahead parameter at the equivalent of 1/16 of
      memory in order to avoid the inefficiency of single requests
      holding too much of the memory cache hostage.
      
      An additional hard cap at 31 is required to keep the default esi depth
      supported with the default stack size of varnish-cache.
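
      For illustration, the effective readahead might be derived roughly
      as sketched below; the names and the exact parameter handling are
      made up for this example and are not the actual fellow code.

      #include <stdio.h>
      #include <stdint.h>

      #define RA_HARD_CAP 31  /* keep default esi depth within default stack */

      static unsigned
      effective_readahead(unsigned param, uint64_t memsz,
          uint64_t chunk_bytes)
      {
              uint64_t cap = memsz / 16 / chunk_bytes; /* 1/16 of memory */

              if (param > cap)
                      param = (unsigned)cap;
              if (param > RA_HARD_CAP)
                      param = RA_HARD_CAP;
              return (param);
      }

      int
      main(void)
      {
              /* 1 GB memory cache, 1 MB chunks, readahead parameter 64 */
              printf("%u\n", effective_readahead(64, 1ULL << 30, 1ULL << 20));
              return (0);
      }
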
    • Decide a race between obj_get() and lru · c8b01760
      Nils Goroll authored
      We used to set stobj->priv only after returning from obj_get(), so
      assert(oc->stobj->priv == fco) could trigger in the LRU thread.
      
      We now set the priv right before inserting into the LRU.
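
      A minimal single-threaded sketch of the new ordering, with
      simplified stand-in types (the real objcore, stobj and fco look
      different, of course):

      #include <assert.h>
      #include <stddef.h>

      struct fellow_cache_obj {
              int     dummy;          /* stand-in for the real FCO */
      };

      struct stobj {
              void    *priv;
      };

      struct objcore {
              struct stobj    stobj[1];
      };

      static void
      lru_insert(struct objcore *oc, struct fellow_cache_obj *fco)
      {
              /* the invariant the LRU thread relies on */
              assert(oc->stobj->priv == fco);
      }

      static void
      obj_get_fixed(struct objcore *oc, struct fellow_cache_obj *fco)
      {
              /* publish priv right before the LRU insert, not only
               * after obj_get() has returned, so the LRU thread never
               * sees a stale priv */
              oc->stobj->priv = fco;
              lru_insert(oc, fco);
      }

      int
      main(void)
      {
              struct fellow_cache_obj fco = { 0 };
              struct objcore oc = {{{ NULL }}};

              obj_get_fixed(&oc, &fco);
              return (0);
      }
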
    • Integrate fellow_busy allocation in fellow_cache_obj_new() · 2235fc09
      Nils Goroll authored
      It does not make sense to use up memory for busy objects if we
      cannot create their cache and disk counterparts in memory.
      
      Motivated by #41
  2. 11 Dec, 2023 4 commits
  3. 10 Dec, 2023 7 commits
  4. 08 Dec, 2023 1 commit
  5. 29 Nov, 2023 3 commits
  6. 28 Nov, 2023 21 commits
  7. 26 Nov, 2023 1 commit
    • Simplify allocation in fellow_cache_obj_new() · 5cc5bd6d
      Nils Goroll authored
      The disk object is always a multiple of 4K in size, and for larger
      values (like 12K), rounding up to the next power of two does not
      make sense.
      
      So, just always use two separate allocations, one for the FCO and
      one for the FDO.
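
      As a rough illustration of the waste avoided (example sizes only,
      not the actual fellow allocation code): rounding a 12K disk object
      up to the next power of two costs an extra 4K.

      #include <stdio.h>
      #include <stdint.h>

      static uint64_t
      roundup_pow2(uint64_t sz)
      {
              uint64_t p = 1;

              while (p < sz)
                      p <<= 1;
              return (p);
      }

      int
      main(void)
      {
              uint64_t fdo_sz = 12 * 1024;    /* example 12K disk object */
              uint64_t p2 = roundup_pow2(fdo_sz);

              printf("power-of-two allocation: %ju bytes (%ju wasted)\n",
                  (uintmax_t)p2, (uintmax_t)(p2 - fdo_sz));
              printf("exact-size allocation:   %ju bytes (0 wasted)\n",
                  (uintmax_t)fdo_sz);
              return (0);
      }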