- 31 Jul, 2023 3 commits
-
-
Nils Goroll authored
Motivated by #18, but does not fix the root cause yet. For the call path in the bug ticket, the stack regionlist is supposed to be big enough, and the root cause is that it is not. But at any rate, for that call path the regionlist may legitimately be NULL and regionlist_add() should never be called. If, however, it _is_ called, the regionlist can't be NULL.
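A minimal sketch of the invariant being asserted (the signature and the regionlist internals are assumptions, not the actual fellow code):

    #include <assert.h>
    #include <stddef.h>

    struct regionlist;

    /* hypothetical signature: record an allocated region on a regionlist */
    static void
    regionlist_add(struct regionlist *rl, size_t off, size_t sz)
    {
        /*
         * For the call path in #18 the regionlist may legitimately be
         * NULL, but then this function must never be reached. Reaching
         * it with a NULL regionlist is a programming error.
         */
        assert(rl != NULL);
        (void)off;    /* ... append (off, sz) to rl ... */
        (void)sz;
    }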
-
Nils Goroll authored
-
Nils Goroll authored
Avoids:

    fellow_io_uring.c:234:1: error: ‘try_flag’ defined but not used [-Werror=unused-function]
      234 | try_flag(unsigned flag)
          | ^~~~~~~~
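One common way to keep a conditionally-used helper from tripping -Werror=unused-function is GCC's unused attribute (a generic sketch; the commit may well instead compile the function conditionally):

    /* only needed for some build configurations, hence possibly unused */
    __attribute__((unused))
    static int
    try_flag(unsigned flag)
    {
        (void)flag;
        /* ... probe whether the kernel accepts the io_uring flag ... */
        return (0);
    }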
-
- 28 Jul, 2023 2 commits
-
-
Nils Goroll authored
The lru_mtx is our most contended mutex. As a first improvement, batch LRU changes for multiple segments and maintain the effective change locally, outside the lru_mtx (but while holding the obj mtx).
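Schematically, the batching looks like this (a sketch with illustrative names; the actual fellow data structures differ):

    #include <pthread.h>
    #include <sys/queue.h>

    struct seg {
        TAILQ_ENTRY(seg)    lru_entry;
    };
    TAILQ_HEAD(seghead, seg);

    struct lru {
        pthread_mutex_t    mtx;
        struct seghead     list;
    };

    /*
     * Collect segments on a local batch list while holding only the
     * object mutex, then take the contended LRU mutex once to apply
     * the whole batch instead of once per segment.
     */
    static void
    lru_apply_batch(struct lru *lru, struct seghead *batch)
    {
        struct seg *s;

        pthread_mutex_lock(&lru->mtx);
        while ((s = TAILQ_FIRST(batch)) != NULL) {
            TAILQ_REMOVE(batch, s, lru_entry);
            TAILQ_INSERT_TAIL(&lru->list, s, lru_entry);
        }
        pthread_mutex_unlock(&lru->mtx);
    }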
-
Nils Goroll authored
-
- 24 Jul, 2023 23 commits
-
-
Nils Goroll authored
Is there a better way? https://github.com/axboe/liburing/issues/906
-
Nils Goroll authored
During error paths, we might call it multiple times.
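The standard way to make a teardown function safe to call repeatedly (a generic sketch, not the actual code):

    #include <stdlib.h>

    struct ctx {
        void    *buf;
    };

    /* safe to call more than once from error paths */
    static void
    ctx_fini(struct ctx *c)
    {
        if (c == NULL || c->buf == NULL)
            return;
        free(c->buf);
        c->buf = NULL;    /* a second call becomes a no-op */
    }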
-
Nils Goroll authored
-
Nils Goroll authored
-
Nils Goroll authored
varnish-cache does not touch objects for OA_VARY, but we need to keep FCOs, which are frequently used during lookup, in memory.

Thoughts on why this should not race LRU (a sketch follows below):
- lru_list is owned by lru_mtx
- the object can't go away, because
  - for the call from hash, we hold the oh->mtx
  - otherwise, we hold a ref
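In outline, the touch itself is just a list move under lru_mtx; the caller's oh->mtx or reference pins the object (names are illustrative, only the locking discipline matters):

    #include <pthread.h>
    #include <sys/queue.h>

    struct fco {
        TAILQ_ENTRY(fco)    lru_entry;
    };
    TAILQ_HEAD(fcohead, fco);

    struct flru {
        pthread_mutex_t    mtx;
        struct fcohead     list;
    };

    /*
     * Keep a frequently looked-up FCO hot: the list is only ever
     * changed under lru->mtx, and the caller guarantees the object
     * stays alive (oh->mtx on the hash path, a reference otherwise).
     */
    static void
    fco_touch(struct flru *lru, struct fco *fco)
    {
        pthread_mutex_lock(&lru->mtx);
        TAILQ_REMOVE(&lru->list, fco, lru_entry);
        TAILQ_INSERT_TAIL(&lru->list, fco, lru_entry);
        pthread_mutex_unlock(&lru->mtx);
    }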
-
Nils Goroll authored
... which potentially happens under the cache lock
-
Nils Goroll authored
Upfront: this is not the segment allocation, which uses parts of the busy obj region allocation and is mostly motivated by how much data we need to have in RAM at minimum.

For the region allocation, we have conflicting goals:
- To keep the log short, we want to use the fewest regions.
- To reduce fragmentation, we want to use the largest possible allocations.
- To use space efficiently, we want to split regions into power-of-two allocations.

Also, for chunked encoding, we do not have an upper limit on how much space we are going to need, so we have to use the estimate provided by fellow_busy_obj_getspace(). It cannot guess more than objsize_max.

The new region alloc algorithm takes this compromise (a sketch follows below):
- For the base case that we have run out of available regions (220), we allocate all we need without cramming.
- Otherwise, if we need less than a chunk, we request it.
- Otherwise, if we know the size, we round down to a power of two.
- Otherwise, we round up.

We then allow any cramming down to the chunk size, because that is what our LRU reservation uses.
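Condensed into a sketch, the size policy reads roughly like this (the function name, chunk_size, and the two flags are stand-ins for fellow internals; the regionlist handling and the cramming step are omitted):

    #include <stddef.h>

    /* largest power of two <= v (v >= 1) */
    static size_t
    rdown_pow2(size_t v)
    {
        size_t r = 1;

        while (r <= v / 2)
            r <<= 1;
        return (r);
    }

    /* smallest power of two >= v */
    static size_t
    rup_pow2(size_t v)
    {
        size_t r = 1;

        while (r < v)
            r <<= 1;
        return (r);
    }

    /* the result may afterwards be crammed down, but never below
     * the chunk size, matching the LRU reservation */
    static size_t
    region_request_size(size_t need, size_t chunk_size,
        int regions_exhausted, int size_known)
    {
        if (regions_exhausted)
            return (need);                  /* no cramming */
        if (need < chunk_size)
            return (need);
        if (size_known)
            return (rdown_pow2(need));      /* round down */
        return (rup_pow2(need));            /* estimate: round up */
    }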
-
Nils Goroll authored
-
Nils Goroll authored
-
Nils Goroll authored
-
Nils Goroll authored
Ref #10
-
Nils Goroll authored
-
Nils Goroll authored
-
Nils Goroll authored
-
Nils Goroll authored
Ref #10
-
Nils Goroll authored
-
Nils Goroll authored
Adjust the dsk size if the mem allocation was smaller than requested.
-
Nils Goroll authored
-
Nils Goroll authored
-
Nils Goroll authored
Could have caused #5, related to #10
-
Nils Goroll authored
-
Nils Goroll authored
This is counter-intuitive and could lead to extreme values, for example:

default: chunk_exponent = 20, dsk_reserve_chunks = 4
adjusted to: 12, 4 << 8 = 1024

now the user sets chunk_exponent = 21
adjusted to: 12, 1024 << 9 = 524288

Could have caused #5, related to #10
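In other words, the scaling needs to start from the configured values rather than compound on the previous adjustment (a sketch of the intended arithmetic, with hypothetical names):

    /*
     * Keep the reserved bytes constant when the chunk exponent is
     * clamped: reserve_bytes = dsk_reserve_chunks << chunk_exponent.
     *
     *   configured: chunk_exponent = 20, dsk_reserve_chunks = 4
     *   clamped to 12: 4 << (20 - 12) = 1024 chunks (same byte total)
     *
     * The compounding bug scaled the already-adjusted value again,
     * e.g. 1024 << (21 - 12) = 524288 chunks.
     */
    static unsigned
    adjust_reserve_chunks(unsigned configured_chunks,
        unsigned configured_exp, unsigned clamped_exp)
    {
        /* always scale from the configured value, not the last result */
        return (configured_chunks << (configured_exp - clamped_exp));
    }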
-
Nils Goroll authored
Tiny glitch.
-
- 21 Jul, 2023 7 commits
-
-
Nils Goroll authored
The main cause of #11 seems to be that the chunk size was too big in relation to the memory cache. We now clamp it at memsz >> 10 (less than 1/1024 of memsz). This can still lead to issues when the memory size is reduced and the cache is reloaded, but then at least new objects will not compete for the available memory.
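Schematically (a sketch; the lower bound of 12 is an assumption, not taken from the code):

    #include <stddef.h>

    /* clamp the chunk size to at most memsz >> 10 */
    static unsigned
    clamp_chunk_exponent(unsigned chunk_exponent, size_t memsz)
    {
        while (chunk_exponent > 12 &&
            ((size_t)1 << chunk_exponent) > (memsz >> 10))
            chunk_exponent--;
        return (chunk_exponent);
    }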
-
Nils Goroll authored
-
Nils Goroll authored
-
Nils Goroll authored
Fixes #11, I hope.
-
Nils Goroll authored
-
Nils Goroll authored
With the previous commit, we restore the objcore's oa_present field when we first read the object. On top of that, it would be nice if we could restore the field, or at least the bit for OA_VARY, when we read the log, because a cache lookup checks it and could avoid reading objects if the OA_VARY bit is not set. On the other hand, if vary is used, the object needs to be read anyway, so restoring oa_present during log read would only pay off when there is no vary.

The straightforward solution would be to add 2 bytes to struct fellow_dle_obj. This would
- require a DLE version bump and compatibility code
- bump the dle size from 72 to 80 bytes (because of alignment)
- consequently, reduce FELLOW_DISK_LOG_BLOCK_ENTRIES from 56 to 4032 / 80 = 50

All this is possible and part of the fellow design, but at this point I conclude that it is not worth the effort.

Another option would be to cram two bits into the existing fellow_dle_obj, for example by exploiting the fact that the sign bit of ban and t_origin is always zero (see the sketch below). This could break backwards compatibility, so we prepare for the option but do not implement it.

All in all, we just add a TODO to add this when we need to extend the DLE anyway.

Closes #17
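For illustration, stealing the sign bit of a non-negative double-encoded timestamp could look like this (a sketch of the option discussed above, which this commit deliberately does not implement; names are hypothetical):

    #include <stdint.h>
    #include <string.h>

    /* ban and t_origin are non-negative, so their sign bit is free */

    static double
    signbit_pack(double t, int flag)
    {
        uint64_t u;

        memcpy(&u, &t, sizeof u);
        if (flag)
            u |= (uint64_t)1 << 63;
        memcpy(&t, &u, sizeof t);
        return (t);
    }

    static double
    signbit_unpack(double t, int *flag)
    {
        uint64_t u;

        memcpy(&u, &t, sizeof u);
        *flag = (int)(u >> 63);
        u &= ~((uint64_t)1 << 63);
        memcpy(&t, &u, sizeof t);
        return (t);
    }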
-
Nils Goroll authored
-
- 20 Jul, 2023 2 commits
-
-
Nils Goroll authored
Implements most of #17, except that we would like to restore OA_VARY with the log read... This should be backwards and forwards compatible, because the default in both cases is 0, and varnish-cache will not use the bitfield if it is 0.
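The compatibility argument in a sketch (struct and field names are stand-ins; in varnish-cache the bitfield lives on the objcore):

    #include <stdint.h>

    struct oc_stub {
        uint16_t    oa_present;    /* stand-in for the objcore field */
    };

    /*
     * 0 means "not recorded": logs written by an older version read
     * back as 0 and varnish-cache then probes the attributes as
     * before, so both directions of the upgrade stay compatible.
     */
    static void
    restore_oa_present(struct oc_stub *oc, uint16_t stored)
    {
        if (stored != 0)
            oc->oa_present = stored;
    }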
-
Nils Goroll authored
-
- 15 Jul, 2023 2 commits
-
-
Nils Goroll authored
Closes #16
-
Nils Goroll authored
Ref #16
-
- 09 Jul, 2023 1 commit
-
-
Nils Goroll authored
-