Fluxus - Go Pipeline Library

Fluxus is a modern, type-safe pipeline orchestration library for Go that makes complex data processing elegant and efficient.

A lightweight yet powerful framework for building flexible data processing pipelines with built-in support for parallel execution, error handling, and advanced flow control. Designed for performance-critical applications where reliability and type safety matter.

Features

  • High-performance parallel processing with fine-grained concurrency control
  • 🔄 Fan-out/fan-in patterns for easy parallelization
  • 🧬 Type-safe pipeline construction using Go generics
  • 🛡️ Robust error handling with custom error strategies
  • ⏱️ Context-aware operations with proper cancellation support
  • 🔁 Retry mechanisms with configurable backoff strategies
  • 📦 Batch processing capabilities for efficient resource utilization
  • 📊 Metrics collection with customizable collectors
  • 🔍 OpenTelemetry tracing for observability
  • 🧯 Circuit breaker pattern for fault tolerance
  • 🚦 Rate limiting to control throughput
  • 🧠 Memory pooling for reduced allocations
  • 🧪 Thoroughly tested with comprehensive examples
  • 🔗 Chain stages with different input/output types

Perfect for ETL workloads, data processing services, API orchestration, and any application that needs to chain operations with reliable error handling and parallel execution.

Installation

go get github.com/synoptiq/go-fluxus

Quick Start

Check out the examples directory for practical use cases.
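
As a quick illustration, here is a minimal end-to-end pipeline built from the pieces described below. This is a sketch that uses only the StageFunc, Chain, and NewPipeline APIs shown in this README, and assumes the package at the module root is named fluxus.

package main

import (
    "context"
    "fmt"
    "strings"

    "github.com/synoptiq/go-fluxus"
)

func main() {
    // Stage 1: split a sentence into words.
    split := fluxus.StageFunc[string, []string](func(_ context.Context, s string) ([]string, error) {
        return strings.Fields(s), nil
    })

    // Stage 2: count the words.
    count := fluxus.StageFunc[[]string, int](func(_ context.Context, words []string) (int, error) {
        return len(words), nil
    })

    // Chain the stages and wrap them in a pipeline.
    pipeline := fluxus.NewPipeline(fluxus.Chain(split, count))

    n, err := pipeline.Process(context.Background(), "fluxus makes pipelines simple")
    if err != nil {
        panic(err)
    }
    fmt.Println(n) // 4
}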

Core Components

Stage

A Stage is the basic unit of processing in a pipeline. It takes an input, performs some operation, and produces an output.

// Create a stage using a function
stage := fluxus.StageFunc[Input, Output](func(ctx context.Context, input Input) (Output, error) {
    // Process input and return output
    return output, nil
})

Chain

Chain combines two stages where the output of the first becomes input to the second. This allows you to create pipelines with stages of different input/output types.

// Chain two stages with compatible types
stage1 := fluxus.StageFunc[string, int](/* ... */)
stage2 := fluxus.StageFunc[int, bool](/* ... */)

// Chain them together
chainedStage := fluxus.Chain(stage1, stage2)

// The chained stage takes a string input and returns a bool output
result, err := chainedStage.Process(ctx, "input")

Pipeline

A Pipeline represents a sequence of processing stages encapsulated in a single stage.

// Create a pipeline with a single stage (which may be a chained stage)
pipeline := fluxus.NewPipeline(stage)

// Add custom error handling
pipeline.WithErrorHandler(func(err error) error {
    log.Printf("Pipeline error: %v", err)
    return err
})

// Process input
result, err := pipeline.Process(ctx, input)

FanOut

FanOut processes an input using multiple stages in parallel and collects the results.

// Create a fan-out stage with multiple processing functions
fanOut := fluxus.NewFanOut(stage1, stage2, stage3)

// Limit concurrency
fanOut.WithConcurrency(5)

// Process input
results, err := fanOut.Process(ctx, input)

FanIn

FanIn aggregates the results from multiple parallel stages.

// Create a fan-in stage with an aggregator function
fanIn := fluxus.NewFanIn(func(inputs []Result) (Output, error) {
    // Aggregate results
    return combinedResult, nil
})

// Process multiple inputs
result, err := fanIn.Process(ctx, inputs)

Parallel

Parallel combines fan-out and fan-in to create a stage that processes an input in parallel and aggregates the results.

// Create a parallel stage
parallel := fluxus.Parallel[Input, Result, Output](
    []fluxus.Stage[Input, Result]{stage1, stage2, stage3},
    func(results []Result) (Output, error) {
        // Aggregate results
        return combinedResult, nil
    },
)

// Process input
result, err := parallel.Process(ctx, input)

Map (New!)

Map applies a single Stage[I, O] concurrently to each element of an input slice []I, producing an output slice []O. This is useful for parallelizing the same operation across multiple data items.

// Stage that processes a single item (e.g., int to string)
processItem := fluxus.StageFunc[int, string](func(ctx context.Context, item int) (string, error) {
    // ... process item ...
    return processedString, nil
})

// Create a Map stage using the item processor
mapStage := fluxus.NewMap(processItem).
    WithConcurrency(runtime.NumCPU()). // Set concurrency limit
    WithCollectErrors(true)            // Collect all errors instead of failing fast

// Process a slice of inputs
inputSlice := []int{1, 2, 3, 4, 5}
results, err := mapStage.Process(ctx, inputSlice)

// If WithCollectErrors(true), err might be a *fluxus.MultiError
// and results will contain successes and zero values for errors.
// If WithCollectErrors(false) (default), err is the first error encountered
// and results will be nil.

Advanced Features

Circuit Breaker

Circuit breaker prevents cascading failures by automatically stopping calls to a failing service.

// Create a circuit breaker that will open after 5 failures
// and attempt to reset after 10 seconds
circuitBreaker := fluxus.NewCircuitBreaker(
    stage,              // The stage to protect
    5,                  // Failure threshold
    10*time.Second,     // Reset timeout
    fluxus.WithSuccessThreshold[Input, Output](3),        // Require 3 successes to close
    fluxus.WithHalfOpenMaxRequests[Input, Output](2),     // Allow 2 test requests when half-open
)

// Process with circuit breaker protection
result, err := circuitBreaker.Process(ctx, input)
// If circuit is open, err will be fluxus.ErrCircuitOpen

Retry with Backoff

// Create a stage that retries on failure
retry := fluxus.NewRetry(stage, 3)  // 3 attempts

// Add custom backoff strategy (exponential backoff)
retry.WithBackoff(func(attempt int) int {
    return 100 * (1 << attempt)  // 100, 200, 400 ms
})

// Only retry specific errors
retry.WithShouldRetry(func(err error) bool {
    // ErrTemporary is a placeholder for your own sentinel error
    // marking transient failures that are worth retrying.
    return errors.Is(err, ErrTemporary)
})

Rate Limiting

// Create a rate limiter with 10 requests per second and burst of 5
limiter := fluxus.NewRateLimiter(
    stage,
    rate.Limit(10),  // 10 requests/second
    5,               // Burst of 5
    fluxus.WithLimiterTimeout[Input, Output](100*time.Millisecond),  // Timeout after 100ms
)

// Dynamically adjust rate limits
limiter.SetLimit(rate.Limit(5))  // Change to 5 requests/second
limiter.SetBurst(10)             // Change burst to 10

Timeout

// Create a stage with a timeout
timeout := fluxus.NewTimeout(stage, 5*time.Second)
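
If the wrapped stage has not finished within the configured duration, Process returns an error (reported as a TimeoutError, see Error Types below). A brief usage sketch:

result, err := timeout.Process(ctx, input)
if err != nil {
    // The stage exceeded 5 seconds or failed for another reason.
}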

Buffer for Batch Processing

// Create a buffer that processes items in batches of 10
buffer := fluxus.NewBuffer[Item, Result](10, func(ctx context.Context, batch []Item) ([]Result, error) {
    // Process batch
    return results, nil
})

Memory Pooling

// Create a memory pool for reducing allocations
pool := fluxus.NewObjectPool(
    func() MyObject { return MyObject{} },  // Factory function
    fluxus.WithPoolName[MyObject]("my-objects"),
    fluxus.WithMaxCapacity[MyObject](100),
)

// Get an object from the pool
obj := pool.Get()

// Return to the pool when done
pool.Put(obj)

// Use specialized slice pool
slicePool := fluxus.NewSlicePool[int](
    10,  // Initial capacity
    fluxus.WithPoolName[[]int]("int-slices"),
)

// Get a slice with specific capacity
slice := slicePool.GetWithCapacity(20)

Optimized Pooled Buffer

// Create a high-performance buffer with object pooling
pooledBuffer := fluxus.NewPooledBuffer[Item, Result](
    10,  // Batch size
    func(ctx context.Context, batch []Item) ([]Result, error) {
        // Process batch
        return results, nil
    },
    fluxus.WithBufferName[Item, Result]("pooled-buffer"),
)

Metrics Collection

// Implement the MetricsCollector interface for your metrics system
type PrometheusMetrics struct{}

func (p *PrometheusMetrics) StageStarted(ctx context.Context, stageName string) {
    // Increment counter in Prometheus
}

// ... implement other methods ...

// Use the metrics collector
collector := &PrometheusMetrics{}
fluxus.DefaultMetricsCollector = collector

// Create a metricated stage
metricated := fluxus.NewMetricatedStage(
    stage,
    fluxus.WithStageName[Input, Output]("my-stage"),
    fluxus.WithMetricsCollector[Input, Output](collector),
)

OpenTelemetry Tracing

// Create a traced stage
traced := fluxus.NewTracedStage(
    stage,
    fluxus.WithTracerName[Input, Output]("my-stage"),
    fluxus.WithTracerAttributes[Input, Output](
        attribute.String("service", "my-service"),
    ),
)

// Specialized tracing for fan-out
tracedFanOut := fluxus.NewTracedFanOut(
    fanOut,
    "parallel-processing",
    attribute.String("operation", "data-transformation"),
)

Best Practices

Chaining Stages with Different Types

Use the Chain function to connect stages with different input/output types:

// Create stages with different input/output types
parseStage := fluxus.StageFunc[string, []int](/* parse string to integers */)
sumStage := fluxus.StageFunc[[]int, int](/* sum integers */)
formatStage := fluxus.StageFunc[int, string](/* format result */)

// Chain them together
processStage := fluxus.Chain(parseStage, 
                    fluxus.Chain(sumStage, formatStage))

// Create a pipeline with the chained stage
pipeline := fluxus.NewPipeline(processStage)

Use Context for Cancellation

ctx, cancel := context.WithCancel(context.Background())
defer cancel()
result, err := pipeline.Process(ctx, input)

Handle Errors Appropriately

pipeline.WithErrorHandler(func(err error) error {
    if errors.Is(err, SomeSpecificError) {
        // Handle specific error
        return nil  // Ignore error
    }
    return err  // Propagate other errors
})

Control Resource Usage

fanOut.WithConcurrency(runtime.NumCPU())  // Limit concurrency to CPU count

Use Generics for Type Safety

stage := fluxus.StageFunc[CustomInput, CustomOutput](func(ctx context.Context, input CustomInput) (CustomOutput, error) {
    // Type-safe processing
    return output, nil
})

Enhance Observability

Combine metrics and tracing for comprehensive monitoring:

// Start with a basic stage
baseStage := fluxus.StageFunc[Input, Output](/* ... */)

// Add metrics
metricated := fluxus.NewMetricatedStage(baseStage, /* ... */)

// Add tracing on top of metrics
traced := fluxus.NewTracedStage(metricated, /* ... */)

// Use it in a pipeline
pipeline := fluxus.NewPipeline(traced)

Pre-warm Pools for Performance

// Create the pool
pool := fluxus.NewObjectPool(/* ... */)

// Pre-warm with 100 objects before high-load periods
fluxus.PreWarmPool(pool, 100)

Error Types

Fluxus provides specialized error types for different scenarios (a short handling sketch follows the list):

  • StageError: Identifies which specific stage in a pipeline failed
  • FanOutError: Contains errors from multiple failed stages in fan-out
  • RetryExhaustedError: Indicates all retry attempts were exhausted
  • TimeoutError: Indicates a stage timed out
  • BufferError: Provides details about batch processing failures
  • ErrCircuitOpen: Indicates a circuit breaker is open
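
These compose with the standard errors package. A minimal handling sketch; only ErrCircuitOpen (shown earlier with the circuit breaker) is matched directly here, the exact shape of the other typed errors is documented in the package:

result, err := pipeline.Process(ctx, input)
if err != nil {
    if errors.Is(err, fluxus.ErrCircuitOpen) {
        // The circuit breaker is open: back off and try again later.
    } else {
        // Other wrappers return typed errors (StageError, RetryExhaustedError,
        // TimeoutError, BufferError, FanOutError) that can be inspected with
        // errors.As to find out where the failure happened.
        log.Printf("pipeline failed: %v", err)
    }
}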

Performance Benchmarks

Fluxus is designed with performance in mind. Here's how to run the benchmarks:

# Run all benchmarks
go test -bench=Benchmark -benchmem ./...

# Run a specific benchmark
go test -bench=BenchmarkChainedPipeline -benchmem ./...

# Run benchmarks with more iterations for better statistical significance
go test -bench=Benchmark -benchmem -count=5 ./...

Benchmark Results

See BENCHMARK.md for detailed results.

Performance Optimization Tips

  1. Limit Concurrency: While unlimited concurrency can be faster for CPU-bound tasks, setting an appropriate concurrency limit is crucial for I/O-bound tasks to avoid resource exhaustion. Use WithConcurrency() to tune this.

  2. Buffer Size: For batch processing, choose a buffer size that balances memory usage and processing efficiency. Too small and you lose efficiency, too large and you waste memory.

  3. Chain Depth: Deeper chains incur more overhead. Consider flattening chains if extreme performance is needed.

  4. Error Handling: Custom error handlers add minimal overhead but provide significant benefits in production environments.

  5. Context Cancellation: Use contexts to cancel operations early when results are no longer needed.

  6. Object Pooling: For frequently created objects, use the provided pooling mechanisms to reduce GC pressure.

  7. Rate Limiting: Apply rate limiting at appropriate points to prevent overloading downstream services.

  8. Circuit Breakers: Place circuit breakers around unreliable services to fail fast when necessary (a combined sketch follows this list).

  9. Metrics & Tracing: In production environments, the overhead of metrics and tracing is usually negligible compared to their benefits, but use the noop implementations in performance-critical paths if needed.

  10. Memory Management: For large data processing, consider using the PooledBuffer to reduce allocations.
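
Because every wrapper is itself a stage, tips 7 and 8 compose naturally with retries: you can stack a rate limiter, circuit breaker, and retry around a single unreliable call. A sketch using the constructors shown above; Request, Response, and the thresholds are illustrative:

// callService is the stage that talks to the unreliable downstream service.
callService := fluxus.StageFunc[Request, Response](func(ctx context.Context, req Request) (Response, error) {
    // ... perform the remote call ...
    return resp, nil
})

// Innermost: retry transient failures, up to 3 attempts.
// Middle:    open the circuit after 5 failures, attempt reset after 10s.
// Outermost: cap throughput at 10 requests/second with a burst of 5.
resilient := fluxus.NewRateLimiter(
    fluxus.NewCircuitBreaker(
        fluxus.NewRetry(callService, 3),
        5,
        10*time.Second,
    ),
    rate.Limit(10),
    5,
)

result, err := resilient.Process(ctx, req)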

License

MIT License. See LICENSE for details.
