A high-performance, composable pipeline pattern library for .NET using System.Threading.Channels. Build robust data processing pipelines with ease.
Simpipe.Net provides a fluent API for constructing data processing pipelines on top of System.Threading.Channels, simplifying complex data-flow scenarios while maintaining high performance and reliability.
- Composable: Chain pipes together to create complex processing workflows
- Concurrent: Built with System.Threading.Channels for efficient parallel processing
- Back-pressure Handling: Automatic flow control prevents memory overload
- Type-Safe: Full generic type support with compile-time safety
- Testable: Clear separation of concerns makes testing straightforward
Use Simpipe.Net when you need:
- Stream processing with multiple transformation stages
- ETL (Extract, Transform, Load) pipelines
- Real-time data processing with batching capabilities
- Complex routing logic between processing stages
- Work-in-progress limiting
- ✅ Fluent API for pipeline construction
- ✅ Multiple specialized pipe types (Action, Batch)
- ✅ Async/await support throughout
- ✅ Automatic completion propagation
- ✅ Back-pressure handling via bounded capacity
- ✅ Conditional routing between pipes
- ✅ Work-in-progress limiting
- ✅ Performance monitoring (input/output/working counts)
- ✅ Graceful shutdown and cancellation support
```shell
dotnet add package Simpipe.Net
```

Here's a simple example that demonstrates basic pipeline construction:

```csharp
using Simpipe.Pipes;

// Add sentiment analysis pipe
var sentimentPipe = Pipe<Tweet>
    .Action(tweet => tweet.Sentiment = AnalyzeSentiment(tweet.Text))
    .Id("sentiment-analyzer");

// Add batch pipe for Elasticsearch indexing
var indexPipe = Pipe<Tweet>
    .Batch(100, async tweets => {
        await ElasticsearchClient.BulkIndex(tweets);
        Console.WriteLine($"Indexed {tweets.Length} tweets");
    })
    .Id("elasticsearch-indexer");

// Create a pipeline for tweet processing
var pipeline = new Pipeline<Tweet>
{
    sentimentPipe,
    indexPipe
};

// Process tweets
await pipeline.Send(new Tweet { Text = "Love this product! #awesome" });
await pipeline.Send(new Tweet { Text = "Great customer service @support" });

// Complete the pipeline
await pipeline.Complete();
```

Pipes are the fundamental building blocks of a pipeline. Each pipe:
- Receives items of type `T`
- Processes them according to its implementation
- Forwards results to the next pipe or routing target
- Tracks input, output, and working item counts via the `Block` property
A Pipeline is a container that:
- Manages a sequence of connected pipes
- Handles automatic linking between pipes
- Provides completion tracking for the entire flow
- Allows sending items to specific pipes by ID (useful when resuming processing)
Blocks are the low-level processing units that power pipes:
- ActionBlock: Executes actions with configurable parallelism
- BatchBlock: Groups items into batches
- BatchActionBlock: Processes batches with parallelism
- TimerBatchBlock: Batches with time-based flushing
- FilterBlock: Filters items at the block level
- ParallelBlock: Manages fork-join parallel execution
- NullBlock: Discards items (sink)
Pipes support sophisticated flow control:
- Filtering: Use `.Filter()` to process only matching items
- Routing: Use `.LinkTo()` with predicates for conditional routing
- Pass-through: Non-matching filtered items automatically pass to the next pipe
- Fork-Join: Split processing across parallel blocks then rejoin
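As a minimal sketch of how filtering and routing combine (the priority logic, `ComputePriority`, `urgentPipe`, and `normalPipe` are illustrative placeholders, not part of the library):

```csharp
// Only tweets matching the filter are processed; non-matching tweets
// pass straight through to the next pipe
var triagePipe = Pipe<Tweet>
    .Action(tweet => tweet.Priority = ComputePriority(tweet)) // placeholder
    .Filter(tweet => tweet.RetweetCount > 100)
    .Id("triage")
    .ToPipe();

// After processing, route each tweet by a predicate
triagePipe.LinkTo(tweet =>
    tweet.Priority > 5 ? urgentPipe : normalPipe); // placeholder pipes
```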
Simpipe.Net provides graceful shutdown:
- Call `Complete()` to signal no more items
- Completion propagates through the pipeline
- Use `await Completion` to wait for all processing
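The shutdown sequence can be sketched as:

```csharp
// Signal that no more items will be sent; completion propagates pipe-to-pipe
await pipeline.Complete();

// Wait until every pipe has drained its in-flight items
await pipeline.Completion;
```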
Executes an action for each item with configurable parallelism.
```csharp
var pipe = Pipe<Tweet>
    .Action(async tweet => {
        await EnrichTweetMetadata(tweet);
        tweet.ProcessedAt = DateTime.UtcNow;
        Console.WriteLine($"Processed tweet from @{tweet.Author}");
    })
    .DegreeOfParallelism(4)
    .BoundedCapacity(100)
    .ToPipe();
```

Groups items into fixed-size batches with optional time-based triggers.
```csharp
var pipe = Pipe<Tweet>
    .Batch(500, async tweets => {
        await ElasticsearchClient.BulkIndex(tweets);
        Console.WriteLine($"Indexed {tweets.Length} tweets to Elasticsearch");
    })
    .BatchTriggerPeriod(TimeSpan.FromSeconds(5)) // Flush incomplete batches after 5 seconds
    .ToPipe();
```

Controls total work-in-progress across a pipeline, providing back-pressure to producers.
```csharp
// Create pipeline with consistent capacity for each block (block capacity == maxWork)
var pipeline = new Pipeline<Tweet>();

// PipelineLimiter controls total work-in-progress
var limiter = new PipelineLimiter<Tweet>(
    maxWork: 50,              // Only 50 tweets in flight
    dispatch: pipeline.Send);

var validator = Pipe<Tweet>
    .Action(async tweet => await ValidateTweet(tweet))
    .BoundedCapacity(50) // Same as maxWork
    .Id("validator");

var enricher = Pipe<Tweet>
    .Action(async tweet => await EnrichTweet(tweet))
    .BoundedCapacity(50) // Same as maxWork
    .Id("enricher");

var done = Pipe<Tweet>
    .Action(limiter.TrackDone); // Signal to the limiter that the item completed

pipeline.Add(validator);
pipeline.Add(enricher);
pipeline.Add(done);

// Poll from SQS queue with automatic back-pressure
while (!cancellationToken.IsCancellationRequested)
{
    var tweets = await SQSClient.ReceiveMessages(queueUrl);
    foreach (var tweet in tweets)
        await limiter.Send(tweet); // Blocks if 50 tweets are in flight
}

await limiter.Complete();
await pipeline.Complete();
```

Execute multiple operations in parallel on the same item and wait for all to complete.
```csharp
// Define parallel enrichment operations
var sentimentBlock = Parallel<Tweet>
    .Action(async tweet => tweet.Sentiment = await AnalyzeSentiment(tweet.Text))
    .Id("sentiment")
    .DegreeOfParallelism(3);

var languageBlock = Parallel<Tweet>
    .Action(async tweet => tweet.Language = await DetectLanguage(tweet.Text))
    .Id("language")
    .DegreeOfParallelism(2);

var entitiesBlock = Parallel<Tweet>
    .Batch(100, tweets => {
        foreach (var tweet in tweets)
            tweet.Entities = ExtractEntities(tweet.Text);
    })
    .Id("entities");

var saveToDb = Pipe<Tweet>
    .Batch(500, async tweets => await db.Store(tweets))
    .Id("store");

// Create fork-join pipe
var forkPipe = Pipe<Tweet>
    .Fork(sentimentBlock, languageBlock, entitiesBlock)
    .Join(tweet => Console.WriteLine($"Enriched tweet {tweet.Id}"))
    .Id("enrichment-fork");

var pipeline = new Pipeline<Tweet>
{
    forkPipe,
    saveToDb
};

// All three enrichments run in parallel; the saveToDb pipe receives
// the tweet only after ALL parallel blocks complete
await pipeline.Send(tweet);
```

Route items to different pipes based on conditions:
```csharp
var englishPipe = CreateEnglishProcessingPipe();
var spanishPipe = CreateSpanishProcessingPipe();
var translationPipe = CreateTranslationPipe();

// Route tweets based on language
sourcePipe.LinkTo(tweet => {
    return tweet.Language switch {
        "en" => englishPipe,
        "es" => spanishPipe,
        _ => translationPipe
    };
});
```

Track pipeline performance using the `Block` metrics:
```csharp
var pipe = Pipe<Tweet>
    .Action(ProcessTweet)
    .Id("processor")
    .ToPipe();

// Monitor performance
Console.WriteLine($"Input: {pipe.Block.InputCount}, Working: {pipe.Block.WorkingCount}, Output: {pipe.Block.OutputCount}");
```

Support graceful cancellation:
```csharp
var cts = new CancellationTokenSource();

var pipe = Pipe<Tweet>
    .Action(async tweet => {
        await ProcessTweet(tweet);
    })
    .Id("tweet-processor")
    .CancellationToken(cts.Token)
    .ToPipe();

// Cancel processing
cts.Cancel();
```

All pipe builders support these configuration methods:
- `.Id(string)`: Unique identifier for the pipe
- `.Filter(Func<T, bool>)`: Predicate to filter items (non-matching items pass through)
- `.Route(Func<T, Pipe<T>>)`: Function to determine routing target
- `.ToPipe()`: Build and return the configured pipe
```csharp
Pipe<T>.Action(action)
    .Id(string)               // Pipe identifier
    .Filter(Func<T, bool>)    // Item filter
    .Route(Func<T, Pipe<T>>)  // Routing function
    .DegreeOfParallelism(int) // Max concurrent executions (default: 1)
    .BoundedCapacity(int?)    // Max items buffered (default: parallelism * 2)
    .CancellationToken(token) // Cancellation support
    .ToPipe()
```

```csharp
Pipe<T>.Batch(batchSize, action)
    .Id(string)                   // Pipe identifier
    .Filter(Func<T, bool>)        // Item filter
    .Route(Func<T, Pipe<T>>)      // Routing function
    .BatchTriggerPeriod(TimeSpan) // Timer for incomplete batches
    .DegreeOfParallelism(int)     // Concurrent batch processing (default: 1)
    .BoundedCapacity(int?)        // Max items buffered (default: batchSize)
    .CancellationToken(token)     // Cancellation support
    .ToPipe()
```

```csharp
Pipe<T>.Fork(parallelBlocks...)
    .Id(string)               // Pipe identifier
    .Filter(Func<T, bool>)    // Item filter
    .Route(Func<T, Pipe<T>>)  // Routing function
    .Join(Action<T>)          // Action when all blocks complete
    .ToPipe()
```

```csharp
// For use within Fork pipes
Parallel<T>.Action(action)
    .Id(string)               // Block identifier
    .Filter(Func<T, bool>)    // Item filter
    .DegreeOfParallelism(int) // Max concurrent executions
    .BoundedCapacity(int?)    // Max items buffered
    .CancellationToken(token) // Cancellation support

Parallel<T>.Batch(batchSize, action)
    .Id(string)                   // Block identifier
    .Filter(Func<T, bool>)        // Item filter
    .BatchTriggerPeriod(TimeSpan) // Timer for incomplete batches
    .DegreeOfParallelism(int)     // Concurrent batch processing
    .BoundedCapacity(int?)        // Max items buffered
    .CancellationToken(token)     // Cancellation support
```
- Always Complete Pipelines: Call `Complete()` and await `Completion` to ensure graceful shutdown.

  ```csharp
  await pipeline.Complete();
  await pipeline.Completion;
  ```

- Use Bounded Capacity: Prevent memory issues by setting appropriate bounds.

  ```csharp
  .BoundedCapacity(1000) // Limit to 1000 items
  ```

- Consider Batch Sizes: For I/O operations, batch to reduce overhead.

  ```csharp
  .Batch(100, items => BulkInsert(items)) // Process 100 at a time
  ```

- Monitor Performance: Track pipeline metrics for optimization.

  ```csharp
  var block = pipe.Block;
  if (block.InputCount > block.OutputCount * 2)
      Console.WriteLine("Potential bottleneck detected");
  ```
- Use bounded capacity to limit memory consumption
- Consider item size when setting batch sizes
- Monitor `WorkingCount` to identify memory pressure
- Set `DegreeOfParallelism` based on workload type
  - CPU-bound: Use `Environment.ProcessorCount`
  - I/O-bound: Can use higher values
- Larger batches reduce overhead but increase latency
- Smaller batches improve responsiveness but increase overhead
- Consider your workload characteristics when choosing batch sizes
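To illustrate the trade-off, here are two hedged configurations using the `Batch` and `BatchTriggerPeriod` calls shown earlier (`BulkStore` is a placeholder for your own storage call):

```csharp
// Throughput-oriented: large batches, infrequent flushes (higher latency)
var bulkPipe = Pipe<Tweet>
    .Batch(1000, async tweets => await BulkStore(tweets)) // placeholder handler
    .BatchTriggerPeriod(TimeSpan.FromSeconds(30))
    .ToPipe();

// Latency-oriented: small batches, frequent flushes (more overhead per item)
var responsivePipe = Pipe<Tweet>
    .Batch(10, async tweets => await BulkStore(tweets))
    .BatchTriggerPeriod(TimeSpan.FromMilliseconds(250))
    .ToPipe();
```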
```csharp
var pipeline = new Pipeline<Tweet>();

// Content moderation stage
pipeline.Add(Pipe<Tweet>
    .Action(tweet => {
        if (IsSpam(tweet) || HasProfanity(tweet))
            tweet.Status = TweetStatus.Blocked;
    })
    .Id("content-moderator")
    .DegreeOfParallelism(4)
    .ToPipe());

// Enrichment stage (only for clean tweets)
pipeline.Add(Pipe<Tweet>
    .Action(async tweet => {
        tweet.Sentiment = await AnalyzeSentiment(tweet.Text);
        tweet.Entities = ExtractEntities(tweet.Text);
    })
    .Id("enricher")
    .Filter(tweet => tweet.Status != TweetStatus.Blocked)
    .ToPipe());

// Batch for analytics storage
pipeline.Add(Pipe<Tweet>
    .Batch(1000, async tweets => {
        await BigQueryClient.InsertRows("tweets_analytics", tweets);
    })
    .Id("analytics-writer")
    .BatchTriggerPeriod(TimeSpan.FromSeconds(10))
    .ToPipe());
```

```csharp
var pipeline = new Pipeline<Tweet>();

// Filter viral tweets
var viralFilter = Pipe<Tweet>
    .Action(_ => { }) // Pass-through
    .Id("viral-filter")
    .Filter(tweet => tweet.RetweetCount > 1000 || tweet.LikeCount > 5000)
    .ToPipe();

// Route by sentiment
var positivePipe = CreateMarketingPipe();
var negativePipe = CreateSupportPipe();

viralFilter.LinkTo(tweet =>
    tweet.Sentiment == Sentiment.Positive ? positivePipe : negativePipe);

pipeline.Add(viralFilter);
```

```shell
dotnet build
dotnet test
dotnet test --collect:"XPlat Code Coverage"
```

We welcome contributions! Please follow these guidelines:
- Fork the repository
- Create a feature branch (`git checkout -b feature/amazing-feature`)
- Commit your changes (`git commit -m 'Add amazing feature'`)
- Push to the branch (`git push origin feature/amazing-feature`)
- Open a Pull Request
- Follow C# coding conventions
- Use meaningful names
- Write unit tests for new features
- Ensure all tests pass before submitting PR
This project is licensed under the MIT License - see the LICENSE file for details.
