GitHub - gdavidbutler/pthreadChannel: Yet another implementation of a "channel" construct for pthreads

pthreadChannel

Yet another implementation of a "Channel" construct for POSIX threads (pthreads).

Why Channels?

Parallel programming with shared memory is notoriously difficult. The standard approach, protecting shared data with mutexes and coordinating threads with condition variables, requires reasoning about all possible interleavings of concurrent execution.

The problems are well-known:

Deadlocks from inconsistent lock ordering across code paths
Race conditions from forgetting to lock, locking the wrong thing, or locking at the wrong granularity
Missed wakeups from condition variable signal/wait races
Priority inversion, lock convoys, and other subtle performance pathologies

Worse, these bugs are non-deterministic. Code that passes extensive testing can fail in production under different timing. A bug may manifest once per million runs, or only on certain hardware.

Channels offer a different model: instead of protecting shared data with locks, transfer ownership of data through channels. A Put hands off data; a Get receives it. The synchronization is encapsulated within the channel operations themselves. User code contains no mutexes, no condition variables and no barriers.

This library provides:

chanOp - blocking Put or Get on a single channel
chanOne - wait for the first available operation across multiple channels (like select)
chanAll - perform all operations atomically across multiple channels or none

These primitives compose safely. Two correct channel-based components, when connected, remain correct. This is not true of lock-based code, where combining two independently correct modules can introduce deadlocks neither had alone.

The cost is discipline: data must flow through channels rather than be shared. But this discipline is enforced by the API, not by programmer vigilance. The result is parallel code that humans can reason about, review, and maintain.

CSP Heritage

This library draws from Communicating Sequential Processes (CSP), where concurrent processes communicate through channels rather than sharing memory. In classic CSP, implemented in machine or assembler language using jump instructions, a rendezvous (Get and Put block until the other arrives) works well. When buffering is needed (queue, stack, etc.), it is implemented as another CSP.

But modern CSPs (coroutines, functions with local variables and recursion) have higher context switch costs. Modern processors (call/return instructions, etc.) make this acceptable. However, a pthread context switch requires operating system support and the cost is prohibitively high for simple operations like buffering.

This implementation therefore uses callbacks instead of processes where a trivial thread would otherwise be required:

Store callbacks provide buffering without a buffer thread
Blob Trn/Chn callbacks bridge channels to I/O without blocking reader/writer threads

The channel semantics remain, but the implementation avoids threads that would do nothing but shuffle data.

Channel

This library provides a Channel style programming environment in C. A Channel is an anonymous, pthread coordinating, Store of pointer (void *) sized items.

Channels facilitate pthread based SMP. (For message passing see Blob.)
Channels, by default, can store a single item. (For more, see Store.)
Any number of pthreads can Put/Get on a Channel.
- Awareness of Channel demand is supported. (See an example of lazy evaluation in squint.)
A pthread can Put/Get on any number of Channels.
- Including either one, or all (atomic broadcast), of an array of operations.

The canonical Channel use case is a transfer of a pointer to heap.

Putting pthread:

m = malloc(...);
initialize(m);
chanOp(chan, &m, chanOpPut);

Getting pthread:

chanOp(chan, &m, chanOpGet);
use(m);
free(m);

Channels can be Put/Get on Channels! This enables RPC-style request/response patterns:

     Requester                                      Responder
         │                                              │
         │  1. chanOpen(responseChan)                   │
         │     (increment ref count before sending)     │
         │                                              │
         │  2. Put responseChan    ┌─────────────┐      │
         ├────────────────────────►│ serviceChan │──────┤ 3. Get responseChan
         │                         └─────────────┘      │
         │                                              │
         │                         ┌─────────────┐      │
         ├◄────────────────────────│ responseChan│◄─────┤ 4. Put response
         │  5. Get response        └─────────────┘      │
         │                                              │
         │                                              │ 6. chanClose(responseChan)
         ▼                                              ▼    (delegated ownership)

NOTE: chanOpen the response chan_t before passing it, otherwise the responder's chanClose could deallocate the channel before the requester's Get completes.

requesting pthread:

chanOpen(responseChan);
chanOp(serviceChan, &responseChan, chanOpPut);
chanOp(responseChan, &response, chanOpGet);

responding pthread:

chanOp(serviceChan, &responseChan, chanOpGet);
chanOp(responseChan, &response, chanOpPut);
chanClose(responseChan);

Pthreads are serviced fairly, avoiding thundering herd:
- Waiters queue in order; when an item is available, only the next waiter is woken, not all of them.
- A new arrival goes to the end of the line if others are already waiting.
However, when both Puts and Gets are waiting, an arriving thread completes opportunistically:
- If Gets are waiting and Puts are also waiting, a new Get takes an item immediately (a waiting Put is about to fill the Store anyway).
- If Puts are waiting and Gets are also waiting, a new Put deposits immediately (a waiting Get is about to drain the Store anyway).
This avoids an unnecessary context switch. In effect, the channel switches from interrupt-style (wait for signal) to polling-style (proceed immediately) under load.

Channel Lifecycle

                         chanCreate
                             │
                             ▼
                    ┌─────────────────┐
                    │     Channel     │  openCnt = 0
                    └────────┬────────┘
                             │
         ┌───────────────────┴───────────────────┐
         │                                       │
         ▼                                       ▼
    Give away                               Share (retain reference)
    (pass as-is)                            (chanOpen before pass)
         │                                       │
         │                                       ▼
         │                               ┌───────────────┐
         │                               │  openCnt > 0  │
         │                               └───────┬───────┘
         │                                       │
         │                    ┌──────────────────┤
         │                    │                  │
         │               chanClose          chanClose
         │              (openCnt--)         (openCnt--)
         │                    │                  │
         │                    └──────┬───────────┘
         │                           │
         │                           │ when openCnt → 0
         │                           │
         ▼                           ▼
    ┌─────────────────────────────────────┐
    │  chanClose when openCnt == 0        │
    │            Deallocates              │
    └─────────────────────────────────────┘


            Orthogonal: chanShut
   ┌─────────────────────────────────────┐
   │  Before:  Put/Get block as needed   │
   │  After:   Put fails, Get nonblocking│
   └─────────────────────────────────────┘

Find the API in chan.h:

chanInit
- A process must supply a thread safe dynamic memory interface (e.g. realloc and free) before using Channels.
chanCreate
- Allocate a chan_t, initializing chanOpenCnt to zero. (pair with chanClose).
chanOpen
- Open a chan_t, incrementing chanOpenCnt. (pair with chanClose).
chanShut
- Shutdown a chan_t (afterwards chanOpPut fails and chanOpGet is always non-blocking).
chanClose
- Close a chan_t, decrementing chanOpenCnt. (deallocates at zero chanOpenCnt).
chanOpenCnt
- Return current number of chanOpen not yet chanClose.
chanOp
- Perform an operation on a Channel
chanOne
- Perform one operation (the first available, in array order) on an array of Channels.
chanAll
- Perform all (or no) operations on an array of Channels. (See squint.)

Store

By default, a Channel can hold a single item. A Put into an empty Channel succeeds immediately; the putter only blocks when attempting a second Put before a Get drains the first. This minimizes latency when producer and consumer rates are balanced.

When rates differ, a larger buffer reduces context switching. The producer can run ahead without blocking on every item. Store callbacks execute within the Put/Get caller's context, providing buffer semantics without a buffer thread (see CSP Heritage).

  Default (single item):        With FIFO Store:

  Put ──► [1] ◄── Get           Put ──► [1][2][3]... ◄── Get
           │                            └──────────┘
     (second Put blocks              (Puts succeed until
      until Get drains)                 Store is full)

A Store can be provided on a chanCreate call. The trade-off is latency versus throughput: larger Stores reduce context switching but increase the delay before backpressure propagates to producers. (See queueing theory.)

NOTE: These implementations preallocate heap with a maximum size to provide backpressure propagation semantics.

A maximum sized Channel FIFO Store implementation is provided. When a context is created, a size is allocated. (See pipeproxy.)

Find the API in Str/chanStrFIFO.h.

A maximum sized, latency sensitive, Channel FIFO Store (FLSO) implementation is provided. When a context is created, a maximum and initial size are specified; the Store starts at initial size.

Problem: Fixed-size buffers force a choice between low latency (small buffer) and high throughput (large buffer). Under variable load, neither choice is optimal. Small buffers can thrash under load. Large buffers can add latency.

Solution: Dynamically adjust buffer size based on observed pressure.

Inspiration: Consider a sushi cooler in a supermarket. Sushi must be fresh (low latency). The cooler observes producers (staff available to make sushi) and consumers (customers in line for sushi) and adjusts the amount of sushi it can hold. FLSO applies this principle to store sizing.

The adjustment rules, executed within Put/Get operations:

Before a Put, if the Store is empty and there are no waiting Gets, the size is decremented.
After a Put, if the Store is full and there are waiting Gets, the size is incremented.
Before a Get, if the Store is full and there are waiting Puts, the size is incremented.
After a Get, if the Store is empty and there are no waiting Puts, the size is decremented.

This achieves self-tuning latency/throughput balance without external configuration or monitoring.

Find the API in Str/chanStrFLSO.h.

A maximum sized Channel LIFO Store implementation is provided. When a context is created, a size is allocated.

Find the API in Str/chanStrLIFO.h.

Blob

Channels pass pointers for intra-process SMP where threads share memory. But pointers are meaningless across process boundaries or over a network. For inter-process or inter-system communication, data must be transmitted.

A Blob is a length-prefixed array of octets: a message. Blob Stores hold the content itself, not a pointer, enabling:

Persistence: Blobs can survive process restart (see chanStrBlbSQL, which stores blobs in SQLite)
Transport: Blobs can be sent over network connections (see chanBlb below)

Since a pthread can't both wait in a pthread_cond_wait and a poll/select/etc., a pair of blocking reader and writer pthreads are used. To limit the number of trivial threads, the API provides callback interfaces for both the Channel (Chn) and transport (Trn) sides.

                              chanBlb Instance
    ┌──────────────────────────────────────────────────────────────────┐
    │                                                                  │
    │     Egress Thread                         Ingress Thread         │
    │    ┌─────────────┐                       ┌─────────────┐         │
    │    │  chanOpGet  │                       │  chanOpPut  │         │
    │    │   egress    │                       │   ingress   │         │
    │    │   chan_t    │                       │   chan_t    │         │
    │    └──────┬──────┘                       └──────▲──────┘         │
    │           │                                     │                │
    │           ▼                                     │                │
    │    ┌─────────────┐                       ┌─────────────┐         │
    │    │     Chn     │  Optional Framer      │     Chn     │         │
    │    │   Framer    │  (Netstring, HTTP,    │   Framer    │         │
    │    │  Callback   │   FastCGI, etc.)      │  Callback   │         │
    │    └──────┬──────┘                       └──────┬──────┘         │
    │           │                                     │                │
    │           ▼                                     │                │
    │    ╔═════════════╗                       ╔═════════════╗         │
    │    ║     Trn     ║                       ║     Trn     ║         │
    │    ║  output()   ║                       ║   input()   ║         │
    │    ╚══════╤══════╝                       ╚══════╤══════╝         │
    │           │                                     │                │
    └───────────┼─────────────────────────────────────┼────────────────┘
                │                                     │
                │              Trn Context            │
                │   ┌───────────────────────────────┐ │
                │   │  Simple: FdStream             │ │
                │   │    write(fd)    read(fd)      │ │
                │   │                               │ │
                │   │  Complex: KCP                 │ │
                │   │  ┌─────────────────────────┐  │ │
                │   │  │   KCP Protocol State    │  │ │
                │   │  │  ┌───────────────────┐  │  │ │
                │   │  │  │  poll Thread      │  │  │ │
                │   │  │  │  UDP recv/send    │  │  │ │
                │   │  │  │  ikcp_update      │  │  │ │
                │   │  │  └───────────────────┘  │  │ │
                │   │  └─────────────────────────┘  │ │
                │   └───────────────────────────────┘ │
                │                                     │
                ▼           Transport Layer           │
           ═══════════════════════════════════════════════
                  (pipe, socket, UDP datagram, etc.)

Chn callbacks replace the default Egress/Ingress thread logic to frame/deframe blobs.
Trn callbacks handle byte I/O; the Trn context may encapsulate protocol state and additional threads.
chanBlb
- Blob transport via ingress and egress Channels.

Find the API in Blb/chanBlb.h.

A couple of transport side implementations are provided (file descriptor based interfaces):

chanBlbTrnFd
- For reliable non-stream interfaces (e.g. pipe, datagram socketpair), use close() on inClose() and outClose() and no finClose().
chanBlbTrnFdStream
- For reliable stream interfaces (e.g. TCP socket), use shutdown() on inClose() and outClose() and close() on finClose().

Several Channel side, "framing", implementations are provided (useful with streaming protocols):

chanBlbChnVlq
- Read and write framed using a Variable-length quantity length prefix.
chanBlbChnNetstring
- Read and write framed using Netstring.
chanBlbChnFcgi
- Read and write framed using FastCGI.
chanBlbChnNetconf10
- Read and write framed using NETCONF 1.0. (This flawed, XML specific, framer is only useful for NETCONF before a transition to NETCONF 1.1.)
chanBlbChnNetconf11
- Read and write framed using NETCONF 1.1.
chanBlbChnHttp1
- Read framed using HTTP/1.x on headers Transfer-Encoding (chunked) and Content-Length. Blob flow (repeats):
  - Header Blob
  - If "Transfer-Encoding" header with value "chunked":
    - Non-zero chunk Blob (repeats)
    - Zero chunk Blob (includes trailer)
  - Else if "Content-Length" header:
    - Non-zero content Blob
- Write is not framed.

Examples

sockproxy
- Modeled on tcpproxy.c from libtask. Connects two chanBlbs back-to-back, with Channels reversed.
- Sockproxy needs numeric values for socket type (-T, -t) and family type (-F, -f).
- The options protocol type (-P, -p), service type (-S, -s) and host name (-H, -h) can be symbolic (see getaddrinfo).
- Upper case options are for the "server" side, lower case options are for the "client" side.
- For most BSD compatible socket libraries, SOCK_STREAM is 1 and AF_INET is 2.
- For example, to listen (because of the server SOCK_STREAM socket type) for connections on any IPv4 stream socket on service 2222 and connect them to any IPv4 stream socket on service ssh at host localhost (letting the system choose the protocol):
  1. ./sockproxy -T 1 -F 2 -S 2222 -t 1 -f 2 -h localhost -s ssh &
  2. ssh -p 2222 user@localhost
pipeproxy
- Copy stdin to stdout through chanBlb pipe file descriptors preserving read boundaries using a FIFO Store and VLQ framing.
chanStrBlbSQL
- Demonstrate a SQLite based Channel Blob FIFO Store.
chanBlbTrnKcp
- Demonstrate a KCP UDP transport integration.
squint
- Implementation of M. Douglas McIlroy's "Squinting at Power Series".
floydWarshall
- Use a Channel coordinating dynamic thread pool to parallelize the Floyd-Warshall all shortest paths algorithm using blocking techniques extended to optionally save all equal next hops.

Formal Verification with SPIN

The channel implementation has been formally verified using SPIN, a model checker for concurrent systems. SPIN exhaustively explores all possible interleavings of concurrent execution to verify correctness properties.

Promela models verify different aspects of the implementation:

Model	Purpose	Key Properties Verified
`chan.pml`	General channel operations	Safety, conservation, deadlock freedom
`chanOne.pml`	Select-style first-available	First-match semantics, exactly-one completion
`chanAll.pml`	Atomic multi-channel operations	All-or-nothing atomicity, lock ladder correctness
`chanStrFIFO.pml`	FIFO Store	FIFO ordering, conservation, capacity bounds
`chanStrFLSO.pml`	Self-tuning FIFO Store	FIFO ordering, size bounds, adaptation safety

Running Verification

Requires SPIN (spin command) and a C compiler. Basic verification:

# Generate and compile verifier
spin -a chan.pml
cc -DSAFETY -o pan pan.c

# Run safety check
./pan -N atomicity

# For liveness properties (with fairness)
cc -o pan pan.c
./pan -a -f -N completion

Each model supports test scenario selection via preprocessor defines:

# chanOne.pml scenarios
spin -a chanOne.pml                    # default: first-match test
spin -DTEST_COMPETE -a chanOne.pml     # competing threads
spin -DTEST_LADDER -a chanOne.pml      # lock ladder deadlock test

# chanAll.pml scenarios
spin -a chanAll.pml                    # default: competing threads
spin -DTEST_PRODUCER_CONSUMER -a chanAll.pml

Verification Results

chan.pml - General operations (1,498 states with reduced parameters):

Property	Result	Description
`safety`	PASSED	No race conditions or invalid states
`conservation`	PASSED	Items neither created nor destroyed

chanOne.pml - Select-style operations (472-2,583 states):

Property	Result	Description
`exactly_one`	PASSED	Each chanOne completes 0 or 1 operations
`first_match`	PASSED	Returns first completable operation in array order
`progress`	PASSED	Both threads complete (with fairness)
`ladder_progress`	PASSED	No deadlock from lock ladder

chanAll.pml - Atomic multi-channel operations (319-626 states):

Property	Result	Description
`atomicity`	PASSED	All operations complete atomically or none do
`completion`	PASSED	Both threads eventually complete (with fairness)
`state_consistency`	PASSED	Channel states remain consistent

chanStrFIFO.pml - Fixed-size FIFO Store (1,238 states):

Property	Result	Description
`fifo_order`	PASSED	Items retrieved in order stored
`conservation`	PASSED	No items lost or duplicated
`bounds`	PASSED	Count never exceeds store size
`progress`	PASSED	Producer/consumer complete (with fairness)

chanStrFLSO.pml - Self-tuning FIFO Store (160,010 states):

Property	Result	Description
`fifo_order`	PASSED	FIFO ordering preserved despite resizing
`size_bounds`	PASSED	Size stays within min/max bounds
`conservation`	PASSED	No items lost during resize
`adapts_safely`	PASSED	Resizing respects bounds
`progress`	PASSED	Producer/consumer complete (with fairness)

The models verify that the lock ladder pattern (acquire locks in ascending order, retry on trylock failure) prevents deadlocks, that chanAll's all-or-nothing semantics hold under concurrent interference, and that the Store implementations maintain FIFO ordering and data integrity.

Runtime Verification

The implementation passes runtime verification with sanitizers:

# ThreadSanitizer - detects data races
cc -fsanitize=thread -g -O1 -o squint example/squint.c chan.c -I. -lpthread
./squint 10   # No warnings

# AddressSanitizer - detects memory errors
cc -fsanitize=address -g -O1 -o squint example/squint.c chan.c -I. -lpthread
./squint 10   # No warnings

# Memory leak check (macOS)
leaks --atExit -- ./squint 10

Static Analysis

Clang static analyzer reports no true positives:

clang --analyze chan.c

The analyzer reports potential NULL dereference warnings in the WAKE macro. These are false positives because the analyzer cannot prove the invariant that waiter queue flags accurately reflect queue contents (when the "not empty" flag is set, a valid waiter pointer exists).

Implementation Invariants

The following invariants are maintained by the implementation:

Invariant	Description	Enforcement
Lock ladder	Locks acquired in ascending channel address order	`chanAll`, `chanOne` retry on trylock failure
Atomic all-or-none	`chanAll` completes all operations or none	Single atomic section after lock acquisition
First-match	`chanOne` returns first satisfiable operation	Sequential scan in array order
Waiter fairness	Waiters serviced in arrival order	FIFO waiter queues per operation type
Queue/flag consistency	Empty flags match queue state	Flag set when head==tail, cleared on enqueue
Reference counting	Channel deallocated when openCnt reaches 0	`chanOpen`/`chanClose` balance
Shutdown semantics	After `chanShut`: Put fails, Get drains	`chanSu` flag checked before operations
Store conservation	Items neither created nor destroyed	Store callbacks maintain item count
FIFO ordering	Items retrieved in order stored	Circular buffer with head/tail pointers

Informal Verification with squint

Testing parallel code is as hard as writing it, bugs hide in rare interleavings. To validate this implementation the squint example implements McIlroy's "Squinting at Power Series", a deliberately pathological parallel algorithm. It spawns threads dynamically, uses demand channels to prevent thread explosion, and performs atomic multi-channel operations throughout. Execution order is completely unpredictable, varying with core count, load, and scheduler decisions. Yet after all the chaos (threads creating, exiting, channels opening, shutting, closing) the program must produce the correct coefficients of the tangent power series. It does. This is the proof that the lock ladder (acquire ascending, release descending) prevents deadlocks and that chanAll's all-or-nothing semantics hold under pressure.

Building

Use "make" or review the file "Makefile" to build.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Repository files navigation

pthreadChannel

Why Channels?

CSP Heritage

Channel

Channel Lifecycle

Store

Blob

Examples

Formal Verification with SPIN

Running Verification

Verification Results

Runtime Verification

Static Analysis

Implementation Invariants

Informal Verification with squint

Building

About

Licenses found

Uh oh!

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 229 Commits
Blb		Blb
Str		Str
example		example
.gitignore		.gitignore
COPYING		COPYING
COPYING.LESSER		COPYING.LESSER
Makefile		Makefile
README.md		README.md
SECURITY.md		SECURITY.md
chan.c		chan.c
chan.h		chan.h
chan.pml		chan.pml
chanAll.pml		chanAll.pml
chanOne.pml		chanOne.pml
chanStrFIFO.pml		chanStrFIFO.pml
chanStrFLSO.pml		chanStrFLSO.pml

License

Licenses found

gdavidbutler/pthreadChannel

Folders and files

Latest commit

History

Repository files navigation

pthreadChannel

Why Channels?

CSP Heritage

Channel

Channel Lifecycle

Store

Blob

Examples

Formal Verification with SPIN

Running Verification

Verification Results

Runtime Verification

Static Analysis

Implementation Invariants

Informal Verification with squint

Building

About

Topics

Resources

License

Licenses found

Security policy

Uh oh!

Stars

Watchers

Forks

Uh oh!

Languages