Commits · d5c1528b45530557247f090222f2cdfae5c61e45 · uplex-varnish / k8s-ingress

06 Jul, 2020 1 commit

Refactor the return status for sync operations in the main loop. · d5c1528b

Geoff Simmons authored Jul 06, 2020

Previously all sync operations (add/update/delete for the resource
types that the controller watches, and the cluster changes brought
about for them if necessary) were only characterized by a Go error
variable. The only conditions that mattered were nil or not-nil.

On a nil return, success was logged, and a SyncSuccess event was
generated for the resource that was synced. But this was done even
if no change was required in the cluster, and the resource had
nothing to do with Ingress or the viking application. This led
to many superfluous Events.

On a non-nil return, a warning Event was generated the sync
operation was re-queued, using the workqueue's rate-limiting delay.
This was done regardless of the type of error. Since the initial delay
is quite rapid (subsequent re-queues begin to back off the delay), it
led to many re-queues, and many Events. But it is very common that a
brief delay is predictable, when not all necessary information is
available to the controller, so that the rapid retries just
generated a lot of noise. In other cases, retries will not improve
the situation -- an invalid config, for example, will still be
invalid on the next attempt.

This commit introduces pkg/update and the type Status, which classifies
the result of a sync operation. All of the sync methods now return
an object of this type, which in turn determines how the controller
handles errors, logs results, and generates Events. The Status types
are:

Success: a cluster change was necessary and was executed successfully.
The result is logged, and an Event with Reason SyncSuccess is generated
(as before).

Noop: no cluster change was necessary. The result is logged, but no
Event is generated. This reduces Event generation considerably.

Fatal: an unrecoverable error (retries won't help). The result is
logged, a SyncFatalError warning Event is generated, but no retries
are attempted.

Recoverable: an error that might do better on retry. The result is
logged, a SyncRecoverableError is generated, and the operation is
re-queued with the rate-limiting delay (as before).

Incomplete: a cluster change is necessary, but some information is
missing. The result is logged, a SyncIncomplete warning Event is
generated, and the operation is re-queued with a delay. The delay
is currently hard-wired to 5s, but will be made configurable.

We'll probably tweak some of the decisions about which status types
are chosen for which results. But this has already improved the
controller's error handling, and has considerably reduced its
verbosity, with respect to both logging and event generation.

d5c1528b

01 Jul, 2020 5 commits

Update BackendConfig with parameters for ExternalName Services. · 0f1ea9c5

Geoff Simmons authored Jul 01, 2020

These correspond to properties that can be set in VMOD dynamic.
Currently we have:

dnsRetryDelay: this is set as the ttl in the VMOD. Since we get
DNS TTLs from the server, it effectively sets the retry delay
after a lookup gets negative results. More recent versions of the
VMOD have a separate parameter for this purpose, so this should be
updated soon.

domainUsageTimeout: corresponds to the domain_usage_timeout param
of the VMOD director.

firstLookupTimeout: corresponds to the first_lookup_timeout param
of the VMOD director.

resolverTimeout: set with the .set_timeout() method of the VMOD
resolver object.

resolverIdleTimeout: set with the .set_idle_timeout() method of the
VMOD resolver object.

maxDNSQueries: set with the .set_limit_outstanding_queries() method
of the VMOD resolver object.

followDNSRedirects: set with the .set_follow_redirects() method of
the VMOD resolver object.

0f1ea9c5

Update copyright boilerplate in generated client code. · 20124907
Geoff Simmons authored Jul 01, 2020

20124907
Update modules for client code generation. · f9fb5a36
Geoff Simmons authored Jul 01, 2020

f9fb5a36
Bugfix a potential nil dereference. · c542e46e
Geoff Simmons authored Jul 01, 2020

c542e46e
Fix e2e test for ExternalName Services. · be739765
Geoff Simmons authored Jul 01, 2020

be739765

30 Jun, 2020 8 commits
- Apply a BackendConfig in the example for ExternalName Services. · efb6498a
  Geoff Simmons authored Jun 30, 2020
```
Ref gitlab issue #20
```
  efb6498a
- Fix generation of probes for backends from ExternalName Services. · 08f6e38d
  Geoff Simmons authored Jun 30, 2020
```
Ref gitlab issue #20
```
  08f6e38d
- Support ExternalName Services as IngressBackends (more to come). · bd43dbbd
  Geoff Simmons authored Jun 30, 2020
```
Uses VMOD dynamic, and requires that the getdns library is installed
in the image running Varnish. This allows us to use dynamic.resolve(),
in particular so that TTLs from DNS are honored.

Currently sets ttl to a hard-wired value of 30s. Since the TTLs for
lookup are obtained from DNS, this actually sets the delay until
lookups are retried after negative results (default 1h).

The next step is to test and extend BackendConfig support to configure
properties of VMOD dynamic. That will make it possible to configure
the ttl value (although we might stay with a much shorter ttl than
1h).

Partially addresses gitlab issue #20.
```
  bd43dbbd
- Minor formatting improvement. · 67bea72d
  Geoff Simmons authored Jun 30, 2020
  
  67bea72d
- Bugfix a potential nil reference. · 7a875506
  Geoff Simmons authored Jun 29, 2020
  
  7a875506
- Raise some log levels from trace to debug for Ingress update. · e1675758
  Geoff Simmons authored Jun 29, 2020
  
  e1675758
- Install VMOD dynamic version 2.1.1 in the Varnish image. · 73b04bc3
  Geoff Simmons authored Jun 29, 2020
  
  73b04bc3
- Install VMOD dynamic in the Varnish image. · 804eae10
  Geoff Simmons authored Jun 22, 2020
  
  804eae10
18 Jun, 2020 1 commit
- Helm charts: Fix naming of the admin service · 0cbc71ca
  Lars Fenneberg authored Jun 18, 2020
  
  0cbc71ca
12 Jun, 2020 6 commits
- On Ingress update, update the viking Service Endpoints. · c8d9ad8b
  Geoff Simmons authored Jun 12, 2020
```
These weren't necessarily being changed after an Endpoints update.
```
  c8d9ad8b
- Wait 1 second longer before verifying initial deployment (sigh). · 54c8490d
  Geoff Simmons authored Jun 12, 2020
  
  54c8490d
- Add a String() method to vcl.Address. · 720ad2f6
  Geoff Simmons authored Jun 12, 2020
  
  720ad2f6
- varnish.Controller.HasConfig() checks for changed viking Endpoints. · a5c865d8
  Geoff Simmons authored Jun 12, 2020
  
  a5c865d8
- Update offloader endpoints when an Ingress is updated. · aa6c23e2
  Geoff Simmons authored Jun 11, 2020
```
The Ingress update may have followed an update for Endpoints.
```
  aa6c23e2
- Verification script waits more robustly for port-forward connectivity. · 870a0bfb
  Geoff Simmons authored Jun 11, 2020
```
XXX: re-use this code wherever we wait for port-forward.
```
  870a0bfb
11 Jun, 2020 2 commits
- Fix reference to config of HAProxy extra env variables · f4c1bb39
  Emanuel Winblad authored Jun 10, 2020
  
  f4c1bb39
- Fix name of dataplane API secret · 3e6e652d
  Emanuel Winblad authored Jun 10, 2020
  
  3e6e652d
10 Jun, 2020 2 commits
- Remove some dead code. · 972269a3
  Geoff Simmons authored Jun 10, 2020
  
  972269a3
- Helm charts: sensible default extra args for viking-service · 8abed666
  Lars Fenneberg authored Jun 10, 2020
  
  8abed666
05 Jun, 2020 1 commit
- Bugfix: don't requeue if a PEM Secret is not found on delete Secret. · 15842c1e
  Geoff Simmons authored Jun 05, 2020
  
  15842c1e
04 Jun, 2020 2 commits

Add an e2e test about deletion of a TLS Secret for a non-viking Ingress. · 6cc6c1a1
Geoff Simmons authored Jun 04, 2020

6cc6c1a1

Bugfix: only act on delete TLS Secret if it's relevant to viking. · 2bf40a90

Geoff Simmons authored Jun 04, 2020

For that, the Secret must be named as the TLS Secret by an Ingress
in the same namespace that identifies out ingress.class.

This means that the controller doesn't need to try delete an element
from any PEM Secret (to remove the certificate from the haproxy
Secret volume).

2bf40a90

03 Jun, 2020 9 commits
- Bugfix: check the ingress.class annotation on delete Ingress events. · 2165e1e5
  Geoff Simmons authored Jun 03, 2020
  
  2165e1e5
- Helm charts: Fix name of update strategy field for StatefulSet · 5e926688
  Lars Fenneberg authored Jun 03, 2020
  
  5e926688
- Helm charts: Switch StatefulSet to parallel pod management policy · 21270059
  Lars Fenneberg authored Jun 03, 2020
  
  21270059
- Helm charts: Remove erroneous comment · 09a0f248
  Lars Fenneberg authored Jun 03, 2020
  
  09a0f248
- Helm charts: Beautify template output a little bit · 47a3f937
  Lars Fenneberg authored Jun 03, 2020
  
  47a3f937
- Helm charts: Add option for using a StatefulSet with persistent storage · 9598c469
  Lars Fenneberg authored Jun 03, 2020
  
  9598c469
- Helm charts: Remove unused named template · 3d7908ac
  Lars Fenneberg authored Jun 03, 2020
  
  3d7908ac
- Helm charts: Actually render extra args and envs for haproxy · c95faa59
  Lars Fenneberg authored Jun 03, 2020
  
  c95faa59
- Helm charts: Fix default values · b0e91b68
  Lars Fenneberg authored Jun 03, 2020
  
  b0e91b68
02 Jun, 2020 2 commits
- Prevent another nil dereference in varnish.Controller.HasConfig(). · 5c3ea7ac
  Geoff Simmons authored Jun 02, 2020
  
  5c3ea7ac
- Helm charts: Use klarlack image for Varnish service by default · ba35369c
  Lars Fenneberg authored Jun 02, 2020
  
  ba35369c
28 May, 2020 1 commit
- Install VMOD selector in the klarlack container. · 7e017a93
  Geoff Simmons authored May 28, 2020
  
  7e017a93