Commits · e1017adc0acff1595781fd31e3455d129fac2193 · uplex-varnish / slash

07 Feb, 2024 40 commits

fellow_stream_f(): Improve comment and assertion · e1017adc
Nils Goroll authored Nov 07, 2023
```
to make clear that we understand exactly what is happening.
```
e1017adc

Fix races for streaming busy objects · aca69dac

Nils Goroll authored Nov 07, 2023

For streaming busy objects, we basically rely on the varnish-cache
ObjExtend() / ObjWaitExtend() API to never read past the object: In
fellow_stream_f(), we always wait for more data (or the end of the
object) before returning, such that fellow_cache_obj_iter(), which
iterates over segments, should never touch a segment past the final
FCS_BUSY segment.

Yet - it did, by means of the read-ahead and the peek-ahead to determine
whether or not OBJ_ITER_END should be signaled.

We fix this issue by reading/peeking ahead only for segments with a
state beyond FCS_BUSY.

There is now also extensive test infrastructure to specifically test
concurrent access ti busy objects. To keep layers separate,
fellow_cache_test uses a lightweight signal/wait implementation
analogous to the ObjExtend() / ObjWaitExtend() Varnish-Cache
interface.

An earlier version of t_busyobj() had run on my dev laptop for 3.5
hours without crashing, while without the fixes it had run into
assertion failures within seconds.

Fixes #35 and #36 (I hope)

aca69dac

Extend b62.vtc by cache reload · 83bc6afe
Nils Goroll authored Nov 06, 2023

83bc6afe
Mark a question to revisit later · ab644362
Nils Goroll authored Nov 06, 2023

ab644362

Add DBG() to fcsc_next() · 8fc211fe

Nils Goroll authored Nov 06, 2023

... to make it easier to follow the code in fellow_cache_test

motivated by #35

8fc211fe

Reorganize offsets in log info · 39700637
Nils Goroll authored Nov 03, 2023

39700637

Introduce a dynamic minimum to dsk_reserve_chunks ... · fefc08da

Nils Goroll authored Nov 03, 2023

... such that the total reserve is no less than 2MB.

This is required for stable operation of LRU when the log is full.

Ref #28

fefc08da

Add buddy_next_ptr_* · 6fc6bd78
Nils Goroll authored Oct 28, 2023

6fc6bd78
Fix single active logblock allocation for logregion-only case · 0b45d073
Nils Goroll authored Oct 26, 2023
```
Should be irrelevant in practice, because we would not flush
a single block during startup.
```
0b45d073

Fix nit in logblocks_alloc_from_logregion() with already allocated blocks · 78f2dcc4

Nils Goroll authored Oct 26, 2023

When some blocks were already allocated, we would fail to
use all of the log region, that is, the newly added assertion

	if (n > 0) AZ(logreg->free_n);

would fail

This left some blocks of the logregion unused, but was insignificant
otherwise.

78f2dcc4

Fix stupid glitch rendering logbuffer capabilities useless · 8b6e81f7

Nils Goroll authored Oct 26, 2023

Unfortunately, this was present even in the initial public
release 58ec40f9

This issue should have had no production impact, but it made hunting
down bugs unnecessary hard.

8b6e81f7

Move assertion to the right place · 108714e7

Nils Goroll authored Nov 03, 2023

When we work on the last segment, the remaining length is zero,
but we still have a current pointer and length.

This was a particularly annoying glitch because I wrote almost
the same code for varnish-cache with the equivalent assertion in
the right place :(

Sorry

Ref https://github.com/varnishcache/varnish-cache/pull/4013/commits/8ec77190d91603c8f0dead0cee013e3c9ca8fa78#diff-f79cfeda8456789ae873270aefa58e8f1e94213ee16d32ea96b8db8a7013ebf8R790
Closes #34

108714e7

Introduce a flush finish state · dac7e1da

Nils Goroll authored Nov 02, 2023

it is planned to replace the "inuse" tri-state and might turn
out helpful for debugging.

dac7e1da

Polish: use seq_inc() macro · 27841cb3
Nils Goroll authored Nov 01, 2023

27841cb3

Workaround for Varnish-Cache VC#4013: Wrong trim use, inefficient copy · 8409356f

Nils Goroll authored Oct 31, 2023

https://github.com/varnishcache/varnish-cache/pull/4013 fixes two
issues in Varnish-Cache, which are relevant for SLASH/fellow and of
which the first is the root cause of #33.

This commit works around these issues until the fix gets merged:

Because of the wrong use of the .objtrimstore API function by
varnish-cache, we remove it from our obj_methods and exploit the fact
that varnish-cache always sets the OA_LEN attribute when the object is
complete: We move the trimstore function there, effectively calling it
at the right time only.

The inefficient memory allocation fixed in the second commit of
VC#4013 is particularly relevant for fellow, because it causes the
allocation code to assume that the object might grow up to the maximum
possible size, which causes a substantial over-allocation. We work
around this issue for the case that a 304 copy is made from fellow to
fellow by using private thread-local storage to emulate basically the
same function as the #4013 fix.

Closes #33
Ref https://github.com/varnishcache/varnish-cache/pull/4013

8409356f

Assert no duplicate trimming · ce719295
Nils Goroll authored Oct 31, 2023
```
Ref #33
Ref https://github.com/varnishcache/varnish-cache/pull/4013
```
ce719295
Add PTOK() macro from varnish-cache · 5fc0b708
Nils Goroll authored Feb 07, 2024

5fc0b708
Modify b62.vtc to trigger #33 · 5b92665c
Nils Goroll authored Oct 31, 2023

5b92665c
Minor polish · 7dc2a23a
Nils Goroll authored Oct 31, 2023

7dc2a23a
Tigthen assertions in fellow_busy_body_seg_next · 9797906c
Nils Goroll authored Oct 31, 2023
```
These would have made analyzing #33 much easier. :|
```
9797906c
Add assertions · 3850cded
Nils Goroll authored Oct 30, 2023
```
motivated by #32
```
3850cded
Polish RST · 25662138
Nils Goroll authored Oct 30, 2023

25662138
Add b62.vtc · f84afcdd
Nils Goroll authored Oct 30, 2023

f84afcdd
Start a document about helpful debugging information · 2bcc41c3
Nils Goroll authored Oct 30, 2023

2bcc41c3
Add a variation of varnish-cache c62.vtc · 76180991
Nils Goroll authored Oct 30, 2023

76180991

In forkrun(), fix SIGCHLD waiting and test it · e1b3e40f

Nils Goroll authored Oct 29, 2023

Spotted by Thomas Gleixner <tglx@linutronix.de>, THANK YOU

forkrun() never properly handled the case that a child exited before
the timeout expired, because we had failed to block the signal and
hence never received a SIGCHLD. This was overlooked because this
functionality was never relevant (it only delayed test execution) and
because we did not explicitly test it.

Related to #31

e1b3e40f

Do not call the stream function again after it has failed · dc565b27
Nils Goroll authored Oct 27, 2023
```
Should fix #32
```
dc565b27
Get a weird problem out of the way for now · 2b9f729f
Nils Goroll authored Oct 26, 2023
```
See #31
```
2b9f729f

Retry flock() 3 times with 100ms delay inbetween · a17ef049

Nils Goroll authored Oct 26, 2023

It seems with the recent debian updates on my machine, some change
of timing/scheduling has come which makes flock() fail when the
lock holder is being killed by the timeout code in forkrun()

For future reference: logs/20231026_apt_history.txt

a17ef049

BUDYY_REQS_INIT(): zero the i_reqalloc struct · cbd9563a
Nils Goroll authored Oct 24, 2023

cbd9563a
BUDDY_REQS(): name argument now names the struct · 4cf2cca6
Nils Goroll authored Oct 24, 2023

4cf2cca6
BUDDY_REQS_INIT(): gc now unused argument · 7acb1991
Nils Goroll authored Oct 24, 2023

7acb1991
Refactor BUDDY_REQS(): fixed member name · 29b48098
Nils Goroll authored Oct 24, 2023

29b48098

Refactor logbuffer_grow · 159d7b19

Nils Goroll authored Oct 24, 2023

Making a full copy of the logbuffer just to access four members
was not justified. The original idea was to re-use logbuffer_fini,
but, effectively, only buddy_return1_ptr_page() was called.

159d7b19

Fix BUDDY_REQS member name · 90c10efd
Nils Goroll authored Oct 24, 2023

90c10efd
buddy: in struct i_wait, separate initiator and allocator owned members · 19e995d1
Nils Goroll authored Oct 24, 2023
```
In particular with uint8_t, we risk writes to be non atomic and
overwrite neighboring members
```
19e995d1
Add buddy_reqs_next_ready / buddy_reqs_done · 345ae72d
Nils Goroll authored Oct 24, 2023

345ae72d
Add buddy_get_next_off_* · 1902b35a
Nils Goroll authored Oct 23, 2023

1902b35a
Trivial refactor · 0c07b475
Nils Goroll authored Oct 23, 2023
```
Ref #28
```
0c07b475
Rename for clarity · 2d619a45
Nils Goroll authored Oct 23, 2023
```
Ref #28
```
2d619a45