resilience

package

v0.300.0 Latest Latest Go to latest Published: Jun 29, 2025 License: Apache-2.0 Imports: 11 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/adamluzsi/frameless

Links

Open Source Insights

README ¶

Package `resilience`

The resilience package provides tooling to make your code more resilient against failures or to make downstream dependencies more protected against overloads.

retry

Using a retry policy easy as iterating with a for loop, but instead of making a condition based on a max value, we check it with resilience.RetryPolicy#ShouldTry.

Example:

package mypkg

import (
	"context"
	"fmt"
	"go.llib.dev/frameless/pkg/resilience"
)

func (ms MyStruct) MyFunc(ctx context.Context) error {
	var rp resilience.ExponentialBackoff

	for range := resilience.Retries(ctx, rp) {
		err := ms.DoAction(ctx)
		if err != nil {
			if ms.isErrTemporary(err) {
				continue
			}
			return err
		}
		return nil
	}
	return fmt.Errorf("failed to DoAction")
}

The package contains multiple strategies for retrying:

ExponentialBackoff: Implements an exponential backoff strategy where the wait time between retries doubles after each failure.
Jitter: Adds randomness (jitter) to the delay between retries to prevent thundering herd issues.
Waiter: Uses a fixed timeout since the start of the operation to determine if retry attempts should continue.
FixedDelay: Waits for a fixed amount of time between retries without exponential backoff.

ExponentialBackoff

ExponentialBackoff is a RetryPolicy implementation that uses an exponentially increasing delay between retry attempts. This strategy helps prevent overwhelming the downstream system by giving it more time to recover after each failed attempt.

Key Features:

Initial delay (default: 500ms)
Exponentially increases delay for each subsequent retry
Configurable maximum number of retries (default: 5)
Optional timeout to limit total waiting time

When to use:

When you want to gradually increase the wait time between attempts
If you need a balance between fast retries and system recovery time
When the failure rate is high but expected to decrease over time

Example:

retry := resilience.ExponentialBackoff{
    Delay:  time.Second,
    Timeout: 30 * time.Second, // set Timeout OR max Attempts
    Attempts: 10,
}

Jitter

Jitter adds randomness to the delay between retry attempts. This helps prevent multiple clients from retrying simultaneously (thundering herd problem) while still maintaining a reasonable maximum wait time.

Key Features:

Randomized delay up to a configured maximum
Configurable maximum number of retries (default: 5)
Prevents synchronization between retrying clients if the resource experiencing a temporal outage

When to use:

In distributed systems with multiple clients
When you want to avoid thundering herd issues
If you need some randomness in your retry timing

Example:

retry := resilience.Jitter{
    Delay:   10 * time.Second,
    Attempts: 7,
}

Waiter

Waiter strategy retries based on the total elapsed time since the operation started. It will keep retrying as long as the timeout hasn't been exceeded.

Key Features:

Single fixed timeout for all attempts
Doesn't count individual attempts - only tracks total time
Simple and predictable behavior

When to use:

When you need a hard time limit on total retry time
If you prefer simplicity over more complex backoff strategies
For operations with strict time constraints

Example:

retry := resilience.Waiter{
    Timeout: 30 * time.Second,
}

FixedDelay

FixedDelay retries with a constant delay between attempts. Unlike exponential backoff, the wait time doesn't increase - it stays fixed for all attempts.

Key Features:

Fixed delay between retries (default: 500ms)
Configurable maximum number of retries (default: 5)
Optional timeout to limit total waiting time
Simple and predictable timing

When to use:

When you want consistent timing between retries
If a simple retry strategy is sufficient
For operations with known, fixed recovery times

Example:

retry := resilience.FixedDelay{
    Delay:   time.Second,
    Timeout: 30 * time.Second,
    Attempts: 10,
}

Rate Limiting

Rate limiting ensures that operations are performed at a controlled rate, preventing overloads on systems or services.

SlidingWindow

SlidingWindow implements a token bucket-like approach with a sliding time window to enforce rate limits.

Key Features:

Token Management: Tracks the number of requests (events) within a configurable time window.
Rate Enforcement: Ensures operations do not exceed a specified rate, calculated as N tokens per Per duration.
Sliding Window: Dynamically adjusts the window based on request timing to prevent thundering herd issues.
Context Awareness: Honors context cancellation and returns appropriate errors.
Efficient Timing: Calculates necessary wait periods when the rate limit is exceeded.

When to use:

To enforce strict rate limits, such as API call quotas.
When you need smooth, evenly distributed requests over time.

Configuration Options:

Rate: A struct specifying the number of tokens (N) and the duration (Per) for which this rate applies. The Pace() method calculates the minimum interval between allowed requests.
- Example: Rate{N: 10, Per: time.Minute} allows up to 10 requests per minute.

How It Works

Initialization: Create a SlidingWindow instance with your desired Rate.
Usage: Call RateLimit(context.Context) before performing the operation you wish to rate limit.
Flow:
- If the context is canceled, returns immediately with the context error.
- Checks if current requests are within the allowed rate for the window.
  - If under the limit, proceeds and records the event.
  - If over the limit, calculates the wait time until the window slides enough to allow more requests.

Example

package mypkg

import (
	"context"
	"fmt"
	"time"

	"go.llib.dev/frameless/pkg/resilience"
)

func main() {
	ms := MyStruct{
		RateLimitPolicy: &resilience.SlidingWindow{
			Rate: resilience.Rate{N: 5, Per: time.Minute}, // Allow 5 requests per minute
		},
	}

	_ = ms // start your app that uses MyStruct#MyFunc multiple times that requires rate limiting.
}

type MyStruct struct {
	RateLimitPolicy resilience.RateLimitPolicy
}

func (ms MyStruct) MyFunc(ctx context.Context) error {
	if err := ms.RateLimitPolicy.RateLimit(ctx); err != nil {
		return fmt.Errorf("rate limit exceeded: %w", err)
	}

	// Perform the rate-limited operation here.
	fmt.Println("Performing request...")

	return nil
}

Behavior

Context Cancellation: If the provided context is canceled during a rate limit wait, RateLimit returns immediately with the context error.
Zero Rate Configuration: If .Rate is not set (zero value), calls to RateLimit will not block execution, allowing unlimited requests.
Even Distribution: Requests are distributed as evenly as possible within the specified window, preventing spikes and ensuring smooth operation.

When To Use

API Rate Limits: Enforce API call quotas imposed by external services.
Resource Protection: Prevent overloading of internal or external resources.
Distributed Systems: Avoid synchronized retries in distributed environments, reducing the likelihood of thundering herd issues.
Predictable Workloads: Maintain a consistent request rate for systems that require predictable load patterns.

Documentation ¶

Index ¶

func Retries[U FailureCount | StartedAt](ctx context.Context, rp RetryPolicy[U]) iter.Seq[FailureCount]
type ExponentialBackoff
- func (rs ExponentialBackoff) ShouldTry(ctx context.Context, failureCount FailureCount) bool
type FailureCount
type FixedDelay
- func (rs FixedDelay) ShouldTry(ctx context.Context, failureCount FailureCount) bool
type Jitter
- func (rs Jitter) ShouldTry(ctx context.Context, count FailureCount) bool
type Rate
type RateLimitPolicy
type RetryPolicy
type SlidingWindow
- func (rl *SlidingWindow) RateLimit(ctx context.Context) error
type StartedAt
type Waiter
- func (rs Waiter) ShouldTry(ctx context.Context, startedAt StartedAt) bool
- func (rs Waiter) While(do func() (Continue bool))

Examples ¶

ExponentialBackoff
FixedDelay
Jitter
Retries
Retries (WithFailureCountBasedRetryPolicy)
Retries (WithFailureCountRangeArgument)
Retries (WithTimeDelayBasedRetryPolicy)
SlidingWindow
Waiter.ShouldTry

Constants ¶

This section is empty.

Variables ¶

This section is empty.

Functions ¶

func Retries ¶ added in v0.298.0

func Retries[U FailureCount | StartedAt](ctx context.Context, rp RetryPolicy[U]) iter.Seq[FailureCount]

Example ¶

package main

import (
	"context"

	"go.llib.dev/frameless/pkg/resilience"
)

func main() {
	var (
		ctx = context.Background()
		rp  = resilience.ExponentialBackoff{}
	)

	for range resilience.Retries(ctx, rp) {
		// on success, break out from retries
		break
	}
}

Example (WithFailureCountBasedRetryPolicy) ¶

package main

import (
	"context"

	"go.llib.dev/frameless/pkg/resilience"
)

func main() {
	var (
		ctx = context.Background()
		rp  = resilience.ExponentialBackoff{}
	)

	for range resilience.Retries(ctx, rp) {
		// on success, break out from retries
		break
	}
}

Example (WithFailureCountRangeArgument) ¶

package main

import (
	"context"

	"go.llib.dev/frameless/pkg/resilience"
)

func main() {
	var (
		ctx = context.Background()
		rp  = resilience.ExponentialBackoff{}
	)

	for failureCount := range resilience.Retries(ctx, rp) {
		_ = failureCount // starts from zero
		// on success, break out from retries
		break
	}
}

Example (WithTimeDelayBasedRetryPolicy) ¶

package main

import (
	"context"
	"time"

	"go.llib.dev/frameless/pkg/resilience"
)

func main() {
	var (
		ctx = context.Background()
		rp  = resilience.Waiter{Timeout: time.Minute}
	)

	for range resilience.Retries(ctx, rp) {
		// on success, break out from retries
		break
	}
}

Types ¶

type ExponentialBackoff ¶

type ExponentialBackoff struct {
	// Delay is the time duration being waited.
	// Initially, it serves as the starting wait duration,
	// and then it increases based on the exponential backoff formula calculation.
	//
	// Default: 1/2 Second
	Delay time.Duration
	// Timeout is the time within the RetryPolicy is attempting further retries.
	// If the total waited time is greater than the Timeout, ExponentialBackoff will stop further attempts.
	// When Timeout is given, but MaxRetries is not, ExponentialBackoff will continue to retry until the calculated deadline is reached.
	//
	// Default: ignored
	Timeout time.Duration
	// Attempts is the amount of retry which is allowed before giving up the application.
	//
	// Default: 5 if Timeout is not set.
	Attempts int
}

ExponentialBackoff is a RetryPolicy implementation.

ExponentialBackoff will answer if retry can be made. It waits as well the amount of time based on the failure count. The waiting time before returning is doubled for each failed attempts This ensures that the system gets progressively more time to recover from any issues.

Example ¶

package main

import (
	"context"

	"go.llib.dev/frameless/pkg/resilience"
)

func main() {
	ctx := context.Background()
	rs := resilience.ExponentialBackoff{}

	for i := 0; rs.ShouldTry(ctx, i); i++ {
		// do an action
		// return on success
	}
	// return failure
}

func (ExponentialBackoff) ShouldTry ¶

func (rs ExponentialBackoff) ShouldTry(ctx context.Context, failureCount FailureCount) bool

type FailureCount ¶

type FailureCount = int

type FixedDelay ¶

type FixedDelay struct {
	// Delay is the time duration waited between attempts.
	//
	// Default: 1/2 Second
	Delay time.Duration
	// Timeout is the time within the RetryPolicy is attempting further retries.
	// If the total waited time is greater than the Timeout, ExponentialBackoff will stop further attempts.
	// When Timeout is given, but MaxRetries is not, ExponentialBackoff will continue to retry until a calculated deadline is reached.
	//
	// Default: ignored
	Timeout time.Duration
	// Attempts is the amount of retry attempt which is allowed before giving up the application.
	//
	// Default: 5 if Timeout is not set.
	Attempts int
}

FixedDelay is a RetryPolicy implementation.

FixedDelay will make retries with fixed delays between them. It is a lineral waiting time based retry policy.

Example ¶

package main

import (
	"context"
	"time"

	"go.llib.dev/frameless/pkg/resilience"
)

func main() {
	ctx := context.Background()
	rs := resilience.FixedDelay{
		Delay:   10 * time.Second,
		Timeout: 5 * time.Minute,
	}

	for i := 0; rs.ShouldTry(ctx, i); i++ {
		// do an action
		// return/break on success
	}
	// return failure
}

func (FixedDelay) ShouldTry ¶

func (rs FixedDelay) ShouldTry(ctx context.Context, failureCount FailureCount) bool

type Jitter ¶

type Jitter struct {
	// Delay is the maximum time duration that the Jitter is willing to wait between attempts.
	// There is no guarantee that it will wait the full duration.
	//
	// Default: 5 Second
	Delay time.Duration
	// Attempts is the amount of retry that is allowed before giving up the application.
	//
	// Default: 5
	Attempts int
}

Jitter is a RetryPolicy implementation.

Jitter is a random variation added to the backoff time. This helps to distribute the retry attempts evenly over time, reducing the risk of overwhelming the system and avoiding synchronization between multiple clients that might be retrying simultaneously.

Example ¶

package main

import (
	"context"

	"go.llib.dev/frameless/pkg/resilience"
)

func main() {
	ctx := context.Background()
	rs := resilience.Jitter{}

	for i := 0; rs.ShouldTry(ctx, i); i++ {
		// do an action
		// return on success
	}
	// return failure
}

func (Jitter) ShouldTry ¶

func (rs Jitter) ShouldTry(ctx context.Context, count FailureCount) bool

type Rate ¶

type Rate struct {
	// N represents the number of tokens to add or leak per the specified duration.
	N int
	// Per defines the duration over which N tokens are added or leaked (e.g., 1 second or 1 minute).
	Per time.Duration
}

func (Rate) IsZero ¶

func (r Rate) IsZero() bool

func (Rate) Pace ¶

func (r Rate) Pace() time.Duration

func (Rate) String ¶

func (r Rate) String() string

type RateLimitPolicy ¶

type RateLimitPolicy interface {
	RateLimit(context.Context) error
}

type RetryPolicy ¶

type RetryPolicy[U FailureCount | StartedAt] interface {
	// ShouldTry will tell if retry should be attempted after a given number of failed attempts.
	ShouldTry(ctx context.Context, u U) bool
}

type SlidingWindow ¶

type SlidingWindow struct {
	Rate Rate
	// contains filtered or unexported fields
}

Example ¶

package main

import (
	"context"
	"time"

	"go.llib.dev/frameless/pkg/resilience"
)

func main() {
	var (
		ctx = context.Background()
		rl  = resilience.SlidingWindow{Rate: resilience.Rate{N: 100, Per: time.Minute}}
	)
	if err := rl.RateLimit(ctx); err != nil { // err could be like context cancellation
		_ = err // return err
	}
}

func (*SlidingWindow) RateLimit ¶

func (rl *SlidingWindow) RateLimit(ctx context.Context) error

type StartedAt ¶

type StartedAt = time.Time

type Waiter ¶

type Waiter struct {
	// Timeout refers to the maximum duration we can wait
	// before a retry attempt is deemed unreasonable.
	//
	// Default: 30 seconds
	Timeout time.Duration
	// WaitDuration is the time how lone Waiter.Wait should wait between attempting a new retry during Waiter.While.
	//
	// Default: 1ms
	WaitDuration time.Duration
}

Waiter is a RetryPolicy implementation.

Waiter will check if a retry attempt should be made compared to when an operation was initially started.

func (Waiter) ShouldTry ¶

func (rs Waiter) ShouldTry(ctx context.Context, startedAt StartedAt) bool

Example ¶

package main

import (
	"context"
	"time"

	"go.llib.dev/frameless/pkg/resilience"
)

func main() {
	var (
		ctx = context.Background()
		rs  = resilience.Waiter{Timeout: time.Minute}
		now = time.Now()
	)

	for rs.ShouldTry(ctx, now) {
		// do an action
		// return on success
	}
	// return failure
}

func (Waiter) While ¶ added in v0.296.0

func (rs Waiter) While(do func() (Continue bool))

While implements the retry strategy looping part. Depending on the outcome of the condition, the RetryStrategy can decide whether further iterations can be done or not

Source Files ¶

View all Source files

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL

README ¶

Package resilience

retry

ExponentialBackoff

Jitter

Waiter

FixedDelay

Rate Limiting

SlidingWindow

How It Works

Example

Behavior

When To Use

Documentation ¶

Index ¶

Examples ¶

Constants ¶

Variables ¶

Functions ¶

func Retries ¶ added in v0.298.0

Types ¶

type ExponentialBackoff ¶

func (ExponentialBackoff) ShouldTry ¶

type FailureCount ¶

type FixedDelay ¶

func (FixedDelay) ShouldTry ¶

type Jitter ¶

func (Jitter) ShouldTry ¶

type Rate ¶

func (Rate) IsZero ¶

func (Rate) Pace ¶

func (Rate) String ¶

type RateLimitPolicy ¶

type RetryPolicy ¶

type SlidingWindow ¶

func (*SlidingWindow) RateLimit ¶

type StartedAt ¶

type Waiter ¶

func (Waiter) ShouldTry ¶

func (Waiter) While ¶ added in v0.296.0

Source Files ¶

Package `resilience`