Commits · b6d9a9ba94decab1346aaafcdfb9011726fc7f13 · uplex-varnish / slash

10 Nov, 2023 3 commits

Sort prios table · b6d9a9ba
Nils Goroll authored Oct 25, 2023

b6d9a9ba

Rename allocation priorities, Raise logblk (dsk) priority by one · 7c72a2e0
Nils Goroll authored Oct 25, 2023
```
it is more important than objects

Should also contribute to a fix for #28
```
7c72a2e0

Allocate additional log blocks early · 98e78f01

Nils Goroll authored Oct 24, 2023

This, hopefully, is part of a possible solution to the nasty issue #28:

When we do not have a sufficiently large pre-allocated log (log region)
as determined by objsize_hint in relation to the storage size, we need
to dynamically allocate disk blocks while we flush the log.

When the log flush includes object deletions (in particular when
triggered from the disk LRU), we run into a typical deadlock: To
complete the transaction to free space, we need the space...

This commit is part of an attempt to make this work by allocating
space early on: When we only have 20% of the log region left, we start
to reserve more blocks for the log.
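The 20% trigger described above can be sketched as a simple integer check. This is only an illustration: the helper name and its block-count parameters are hypothetical and not taken from fellow_log.c, and the real code may measure the remaining log region differently.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/*
 * Hypothetical helper, not from fellow_log.c: report whether the
 * pre-allocated log region is running low, such that additional log
 * blocks should be reserved before a flush needs them.
 */
static bool
logregion_needs_early_alloc(size_t free_blocks, size_t total_blocks)
{
	assert(total_blocks > 0);
	assert(free_blocks <= total_blocks);

	/* "20% left" in integer arithmetic: free / total <= 1 / 5 */
	return (free_blocks * 5 <= total_blocks);
}
```

Reserving while there is still headroom is the point: by the time a flush containing deletions runs, the blocks it needs are already owned by the log.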

The problem can, for example, be reproduced with an objsize_hint of 1MB
and an actual object size in the order of 32KB.

Ref #28

98e78f01

09 Nov, 2023 1 commit

Fix wrong assertion hitting when all discard methods fail · e7999e0a

Nils Goroll authored Nov 09, 2023

Manually tested with this modification:

```
diff --git a/src/fellow_log.c b/src/fellow_log.c
index 6075d81..45da269 100644
--- a/src/fellow_log.c
+++ b/src/fellow_log.c
@@ -1696,6 +1696,9 @@ fellow_io_regions_discard(struct fellow_fd *ffd, void *ioctx,
                r = fallocate(ffd->fd,
                    FALLOC_FL_KEEP_SIZE | FALLOC_FL_PUNCH_HOLE,
                    (off_t)todo->offset, (off_t)todo->len);
+               // XXX TEST
+               r = 1;
+               errno = EOPNOTSUPP;
                if (r == 0) {
                        if ((ffd->cap & FFD_CAN_FALLOCATE_PUNCH_URING) == 0) {
                                ffd->diag("fellow: fallocate punch"
```

Fixes #38

e7999e0a

07 Nov, 2023 5 commits

fellow_stream_f(): Improve comment and assertion · 3fccc602
Nils Goroll authored Nov 07, 2023
```
to make clear that we understand exactly what is happening.
```
3fccc602

Fix races for streaming busy objects · a1dbf0fe

Nils Goroll authored Nov 07, 2023

For streaming busy objects, we basically rely on the varnish-cache
ObjExtend() / ObjWaitExtend() API to never read past the object: In
fellow_stream_f(), we always wait for more data (or the end of the
object) before returning, such that fellow_cache_obj_iter(), which
iterates over segments, should never touch a segment past the final
FCS_BUSY segment.

Yet - it did, by means of the read-ahead and the peek-ahead to determine
whether or not OBJ_ITER_END should be signaled.

We fix this issue by reading/peeking ahead only for segments with a
state beyond FCS_BUSY.
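As a rough illustration of that rule, the sketch below limits read-ahead to segments whose state is past a busy state. The enum is a simplified stand-in: only the existence of FCS_BUSY and of states beyond it comes from this message, and the function name and signature are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/*
 * Simplified stand-in for the segment states. Only "a busy state and
 * states beyond it" is taken from the commit message; the names and
 * the ordering of the other states are assumptions.
 */
enum seg_state {
	SEG_INIT = 0,
	SEG_BUSY,
	SEG_WRITING,
	SEG_INCORE
};

/*
 * Return how many segments after index cur may be read or peeked
 * ahead, at most max. Stop at the first segment which is not yet
 * beyond SEG_BUSY.
 */
static size_t
readahead_limit(const enum seg_state *segs, size_t n, size_t cur,
    size_t max)
{
	size_t i, ok = 0;

	assert(segs != NULL);
	assert(cur < n);

	for (i = cur + 1; i < n && ok < max; i++) {
		if (segs[i] <= SEG_BUSY)
			break;
		ok++;
	}
	return (ok);
}
```

With states { INCORE, INCORE, WRITING, BUSY, INIT } and cur = 0, the limit is 2: the busy segment and everything after it stay untouched.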

There is now also extensive test infrastructure to specifically test
concurrent access to busy objects. To keep layers separate,
fellow_cache_test uses a lightweight signal/wait implementation
analogous to the ObjExtend() / ObjWaitExtend() Varnish-Cache
interface.

An earlier version of t_busyobj() had run on my dev laptop for 3.5
hours without crashing, while without the fixes it had run into
assertion failures within seconds.

Fixes #35 and #36 (I hope)

a1dbf0fe

Extend b62.vtc by cache reload · 79ed0dab
Nils Goroll authored Nov 06, 2023

79ed0dab

Mark a question to revisit later · 6fff4eed
Nils Goroll authored Nov 06, 2023

6fff4eed

Add DBG() to fcsc_next() · fbbcb962

Nils Goroll authored Nov 06, 2023

... to make it easier to follow the code in fellow_cache_test

motivated by #35

fbbcb962

03 Nov, 2023 7 commits

Reorganize offsets in log info · 2553385d
Nils Goroll authored Nov 03, 2023

2553385d

Introduce a dynamic minimum to dsk_reserve_chunks ... · 32857f4b

Nils Goroll authored Nov 03, 2023

... such that the total reserve is no less than 2MB.

This is required for stable operation of LRU when the log is full.
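A minimal sketch of such a dynamic minimum, assuming the reserve is counted in chunks of a known byte size. The function name and the chunk_bytes parameter are hypothetical; how the real code derives the chunk size is not shown here.

```c
#include <assert.h>
#include <stddef.h>

/* 2MB lower bound for the total reserve, from the commit message */
#define RESERVE_MIN_BYTES ((size_t)2 << 20)

/*
 * Hypothetical helper: raise the configured number of reserve chunks
 * such that configured * chunk_bytes is no less than 2MB.
 */
static unsigned
reserve_chunks_effective(unsigned configured, size_t chunk_bytes)
{
	size_t min_chunks;

	assert(chunk_bytes > 0);

	/* round up: smallest chunk count covering RESERVE_MIN_BYTES */
	min_chunks = (RESERVE_MIN_BYTES + chunk_bytes - 1) / chunk_bytes;
	if (configured < min_chunks)
		return ((unsigned)min_chunks);
	return (configured);
}
```

The minimum is "dynamic" in the sense that it depends on the chunk size: with 1MB chunks it is 2, with 4KB chunks it is 512.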

Ref #28

32857f4b

Add buddy_next_ptr_* · e17e841f
Nils Goroll authored Oct 28, 2023

e17e841f

Fix single active logblock allocation for logregion-only case · ad443fe9
Nils Goroll authored Oct 26, 2023
```
Should be irrelevant in practice, because we would not flush
a single block during startup.
```
ad443fe9

Fix nit in logblocks_alloc_from_logregion() with already allocated blocks · de0ed184

Nils Goroll authored Oct 26, 2023

When some blocks were already allocated, we would fail to
use all of the log region, that is, the newly added assertion

	if (n > 0) AZ(logreg->free_n);

would fail.

This left some blocks of the logregion unused, but was insignificant
otherwise.

de0ed184

Fix stupid glitch rendering logbuffer capabilities useless · 7201a3a4

Nils Goroll authored Oct 26, 2023

Unfortunately, this was present even in the initial public
release 58ec40f9

This issue should have had no production impact, but it made hunting
down bugs unnecessarily hard.

7201a3a4

Move assertion to the right place · 656cbe57

Nils Goroll authored Nov 03, 2023

When we work on the last segment, the remaining length is zero,
but we still have a current pointer and length.

This was a particularly annoying glitch because I wrote almost
the same code for varnish-cache with the equivalent assertion in
the right place :(

Sorry

Ref https://github.com/varnishcache/varnish-cache/pull/4013/commits/8ec77190d91603c8f0dead0cee013e3c9ca8fa78#diff-f79cfeda8456789ae873270aefa58e8f1e94213ee16d32ea96b8db8a7013ebf8R790
Closes #34

656cbe57

02 Nov, 2023 1 commit

Introduce a flush finish state · cab6dd10

Nils Goroll authored Nov 02, 2023

It is planned to replace the "inuse" tri-state and might turn
out to be helpful for debugging.

cab6dd10

01 Nov, 2023 2 commits

Polish: use seq_inc() macro · cf06a81f
Nils Goroll authored Nov 01, 2023

cf06a81f

Workaround for Varnish-Cache VC#4013: Wrong trim use, inefficient copy · ed0bbf80

Nils Goroll authored Oct 31, 2023

https://github.com/varnishcache/varnish-cache/pull/4013 fixes two
issues in Varnish-Cache, which are relevant for SLASH/fellow and of
which the first is the root cause of #33.

This commit works around these issues until the fix gets merged:

Because of the wrong use of the .objtrimstore API function by
varnish-cache, we remove it from our obj_methods and exploit the fact
that varnish-cache always sets the OA_LEN attribute when the object is
complete: We move the trimstore function there, effectively calling it
at the right time only.

The inefficient memory allocation fixed in the second commit of
VC#4013 is particularly relevant for fellow, because it causes the
allocation code to assume that the object might grow up to the maximum
possible size, which causes a substantial over-allocation. We work
around this issue for the case that a 304 copy is made from fellow to
fellow by using private thread-local storage to emulate basically the
same function as the #4013 fix.

Closes #33
Ref https://github.com/varnishcache/varnish-cache/pull/4013

ed0bbf80

31 Oct, 2023 4 commits
- Assert no duplicate trimming · f99a1c3f
  Nils Goroll authored Oct 31, 2023
```
Ref #33
Ref https://github.com/varnishcache/varnish-cache/pull/4013
```
  f99a1c3f
- Modify b62.vtc to trigger #33 · 602c22a9
  Nils Goroll authored Oct 31, 2023
  
  602c22a9
- Minor polish · 50f488b3
  Nils Goroll authored Oct 31, 2023
  
  50f488b3
- Tighten assertions in fellow_busy_body_seg_next · 3b6b8ff4
  Nils Goroll authored Oct 31, 2023
```
These would have made analyzing #33 much easier. :|
```
  3b6b8ff4

30 Oct, 2023 5 commits
- Add assertions · 71d4c954
  Nils Goroll authored Oct 30, 2023
```
motivated by #32
```
  71d4c954
- Polish RST · b328691a
  Nils Goroll authored Oct 30, 2023
  
  b328691a
- Add b62.vtc · 8d9f7100
  Nils Goroll authored Oct 30, 2023
  
  8d9f7100
- Start a document about helpful debugging information · 91ebf4ce
  Nils Goroll authored Oct 30, 2023
  
  91ebf4ce
- Add a variation of varnish-cache c62.vtc · 7ec9c1b5
  Nils Goroll authored Oct 30, 2023
  
  7ec9c1b5

29 Oct, 2023 1 commit

In forkrun(), fix SIGCHLD waiting and test it · f711a1d5

Nils Goroll authored Oct 29, 2023

Spotted by Thomas Gleixner <tglx@linutronix.de>, THANK YOU

forkrun() never properly handled the case that a child exited before
the timeout expired, because we had failed to block the signal and
hence never received a SIGCHLD. This was overlooked because this
functionality was never relevant (it only delayed test execution) and
because we did not explicitly test it.

Related to #31

f711a1d5

27 Oct, 2023 1 commit
- Do not call the stream function again after it has failed · 9728b48a
  Nils Goroll authored Oct 27, 2023
```
Should fix #32
```
  9728b48a

26 Oct, 2023 2 commits

Get a weird problem out of the way for now · 15325a4f
Nils Goroll authored Oct 26, 2023
```
See #31
```
15325a4f

Retry flock() 3 times with 100ms delay in between · 92b0e12e

Nils Goroll authored Oct 26, 2023

It seems that the recent Debian updates on my machine changed
timing/scheduling such that flock() fails while the
lock holder is being killed by the timeout code in forkrun().
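A minimal sketch of such a retry loop, assuming a non-blocking exclusive lock. The helper names are hypothetical, and whether "3 times" means three attempts in total or three retries after the first failure is not stated here; the sketch simply takes the number of attempts as a parameter.

```c
#include <assert.h>
#include <errno.h>
#include <fcntl.h>
#include <sys/file.h>
#include <time.h>
#include <unistd.h>

/*
 * Hypothetical helper: try to take an exclusive flock() on fd, up to
 * tries attempts with delay_ms in between. Returns 0 on success or
 * -1 with errno from the last flock() call.
 */
static int
flock_retry(int fd, int tries, long delay_ms)
{
	struct timespec ts;
	int i, r = -1;

	assert(tries > 0);
	assert(delay_ms >= 0);

	for (i = 0; i < tries; i++) {
		r = flock(fd, LOCK_EX | LOCK_NB);
		/* only a held lock is worth waiting for */
		if (r == 0 || errno != EWOULDBLOCK)
			break;
		if (i + 1 < tries) {
			ts.tv_sec = delay_ms / 1000;
			ts.tv_nsec = (delay_ms % 1000) * 1000000L;
			(void)nanosleep(&ts, NULL);
		}
	}
	return (r);
}

/* Open path and lock it; returns the locked fd or -1. */
static int
lock_file(const char *path, int tries, long delay_ms)
{
	int fd;

	fd = open(path, O_RDWR | O_CREAT, 0600);
	if (fd < 0)
		return (-1);
	if (flock_retry(fd, tries, delay_ms) != 0) {
		(void)close(fd);
		return (-1);
	}
	return (fd);
}
```

For the case described above, lock_file(path, 3, 100) would give a dying lock holder roughly 200ms to go away before giving up.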

For future reference: logs/20231026_apt_history.txt

92b0e12e

24 Oct, 2023 8 commits
- BUDDY_REQS_INIT(): zero the i_reqalloc struct · d2116ede
  Nils Goroll authored Oct 24, 2023
  
  d2116ede
- BUDDY_REQS(): name argument now names the struct · 6627211f
  Nils Goroll authored Oct 24, 2023
  
  6627211f
- BUDDY_REQS_INIT(): gc now unused argument · 99c7f955
  Nils Goroll authored Oct 24, 2023
  
  99c7f955
- Refactor BUDDY_REQS(): fixed member name · 9e22245a
  Nils Goroll authored Oct 24, 2023
  
  9e22245a
- Refactor logbuffer_grow · 0431fada
  Nils Goroll authored Oct 24, 2023
```
Making a full copy of the logbuffer just to access four members
was not justified. The original idea was to re-use logbuffer_fini,
but, effectively, only buddy_return1_ptr_page() was called.
```
  0431fada
- Fix BUDDY_REQS member name · 0f56d455
  Nils Goroll authored Oct 24, 2023
  
  0f56d455
- buddy: in struct i_wait, separate initiator and allocator owned members · 982d904b
  Nils Goroll authored Oct 24, 2023
```
In particular with uint8_t, we risk writes being non-atomic and
overwriting neighboring members
```
  982d904b
- Add buddy_reqs_next_ready / buddy_reqs_done · 60b07979
  Nils Goroll authored Oct 24, 2023
  
  60b07979