open_hypergraphs/
lib.rs

//! # Open Hypergraphs
//!
//! `open-hypergraphs` is a [GPU-accelerated](#data-parallelism) implementation of the
//! [OpenHypergraph](crate::strict::OpenHypergraph)
//! datastructure from the paper
//! ["Data-Parallel Algorithms for String Diagrams"](https://arxiv.org/pdf/2305.01041).
//! Open hypergraphs are used for representing, evaluating, and differentiating large networks of operations with multiple
//! inputs and outputs.
//!
//! Here's a drawing of an open hypergraph with labeled nodes `●` and hyperedges `□`.
//!
//! ```text
//!                    /───────────────────────────────────   x
//!                   ╱
//!   x   ───────────●
//!                 i8\      ┌─────┐
//!                    \─────┤     │        ┌─────┐
//!                          │ Sub ├───●────┤ Neg ├───●───    -(x - y)
//!   y   ─────●─────────────┤     │  i8    └─────┘  i8
//!           i8             └─────┘
//! ```
//!
//! This open hypergraph represents a circuit with two inputs, `x` and `y`.
//! The circuit computes `x` on its first output and `-(x - y)` on its second.
//! (The input/output labels `x`, `y`, and `-(x - y)` are only illustrative, and not part of the
//! datastructure.)
//!
//! <div class="warning">
//! Note carefully: in contrast to typical graph-based syntax representations,
//! operations correspond to hyperedges,
//! and values correspond to nodes!
//! This is why nodes are labeled with types like i8 and hyperedges with operations like
//! Sub.
//! </div>
//!
//! See the [datastructure](#datastructure) section for a walkthrough of this example, and the
//! [formal definition](#formal-definition) that follows it.
//!
//! # What are Open Hypergraphs For?
//!
//! Open Hypergraphs are a general, differentiable and data-parallel datastructure for *syntax*.
//! Here are a few examples of suitable uses:
//!
//! - Differentiable array programs for deep learning in [catgrad](https://catgrad.com)
//! - Terms in [first order logic](https://arxiv.org/pdf/2401.07055)
//! - Programs in the [λ-calculus](https://en.wikipedia.org/wiki/Cartesian_closed_category)
//! - [Circuits with feedback](https://arxiv.org/pdf/2201.10456)
//! - [Interaction nets](https://dl.acm.org/doi/10.1006/inco.1997.2643)
//!
//! Open Hypergraphs have some unique advantages compared to tree-based representations of syntax.
//! For example, they can represent operations with *multiple outputs*, and structures with
//! *feedback*.
//! See the [comparison to trees and graphs](#comparison-to-trees-and-graphs) for more detail.
//!
//! Differentiability of open hypergraphs (as used in [catgrad](https://catgrad.com))
//! comes from the [data-parallel algorithm](crate::strict::functor::optic::Optic) for generalised
//! ahead-of-time automatic differentiation by optic composition.
//! This algorithm is actually more general than just differentiability: read more in the papers
//! ["Categorical Foundations of Gradient-Based Learning"](https://arxiv.org/abs/2103.01931)
//! and ["Data-Parallel Algorithms for String Diagrams"](https://arxiv.org/pdf/2305.01041).
//! See the [Theory](#theory) section for more pointers.
//!
//! # Usage
//!
//! If you're new to the library, you should start with the [`crate::lax`] module.
//! This provides a mutable, imperative, single-threaded interface to building open hypergraphs
//! which should be familiar if you've used a graph library before.
//!
//! We can build the example open hypergraph above as follows:
//!
//! ```rust
//! use open_hypergraphs::lax::*;
//!
//! pub enum NodeLabel { I8 }
//! pub enum EdgeLabel { Sub, Neg }
//!
//! // Not marked `#[test]`: a test function cannot return an `OpenHypergraph`.
//! fn build() -> OpenHypergraph<NodeLabel, EdgeLabel> {
//!     use NodeLabel::*;
//!     use EdgeLabel::*;
//!
//!     // Create an empty OpenHypergraph.
//!     let mut example = OpenHypergraph::<NodeLabel, EdgeLabel>::empty();
//!
//!     // Create all 4 nodes
//!     let x = example.new_node(I8);
//!     let a = example.new_node(I8);
//!     let y = example.new_node(I8);
//!     let z = example.new_node(I8);
//!
//!     // Add the "Sub" hyperedge with source nodes `[x, y]` and targets `[a]`
//!     example.new_edge(Sub, Hyperedge { sources: vec![x, y], targets: vec![a] });
//!
//!     // Add the "Neg" hyperedge with sources `[a]` and targets `[z]`
//!     example.new_edge(Neg, Hyperedge { sources: vec![a], targets: vec![z] });
//!
//!     // Set the sources and targets of the example
//!     example.sources = vec![x, y];
//!     example.targets = vec![x, z];
//!
//!     // Return the example
//!     example
//! }
//! ```
//!
//! The [`crate::lax::var::Var`] struct is a helper on top of the imperative interface which
//! reduces some boilerplate, especially when operators are involved.
//! We can rewrite the above example as follows:
//!
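//! ```ignore
//! use std::{cell::RefCell, rc::Rc};
//!
//! pub fn example() {
//!     // Assumption: every `Var` holds a shared handle to one hypergraph, so the state
//!     // is wrapped in `Rc<RefCell<_>>` and cloned instead of being moved twice.
//!     let state = Rc::new(RefCell::new(OpenHypergraph::empty()));
//!     let x = Var::new(state.clone(), I8);
//!     let y = Var::new(state.clone(), I8);
//!     // The first output is `x` itself; the second is `-(x - y)`.
//!     let (z0, z1) = (x.clone(), -(x - y));
//! }
//! ```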
//!
//! See `examples/adder.rs` for a more complete example using this interface to build an n-bit full
//! adder from half-adder circuits.
//!
//! By contrast, the [`crate::strict`] module in principle supports GPU acceleration, but has a
//! much more complicated interface.
//!
//! # Datastructure
//!
//! Before giving the formal definition, let's revisit the example above.
//!
//! ```text
//!                  /───────────────────────────────────
//!                0╱
//!     ───────────●
//!               i8\      ┌─────┐
//!                  \─────┤     │   1    ┌─────┐   3
//!          2             │ Sub ├───●────┤ Neg ├───●───
//!     ─────●─────────────┤     │  i8    └─────┘  i8
//!         i8             └─────┘
//! ```
//!
//! There are 4 nodes in this open hypergraph, depicted as `●` with a label `i8` and a
//! node ID in the set `{0..3}`.
//! There are two hyperedges, depicted as boxes labeled `Sub` and `Neg`.
//!
//! Each hyperedge has an *ordered list* of sources and targets.
//! For example, the `Sub` edge has sources `[0, 2]` and targets `[1]`,
//! while `Neg` has sources `[1]` and targets `[3]`.
//! Note: the order is important!
//! Without it, we couldn't represent non-commutative operations like `Sub`.
//!
//! As well as the sources and targets for each *hyperedge*, the whole "open hypergraph" also has
//! sources and targets.
//! These are drawn as dangling wires on the left and right.
//! In this example, the sources are `[0, 2]`, and the targets are `[0, 3]`.
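//!
//! To make the role of these lists concrete, here is a minimal, self-contained sketch that
//! evaluates the example on `i8` inputs. It does not use this crate's API: the names `Op`,
//! `Edge`, `eval` and `example` are hypothetical, and the sketch assumes the hyperedges are
//! acyclic and already listed in dependency order.
//!
//! ```rust
//! // Hypothetical types for illustration only; not part of this crate.
//! #[derive(Clone, Copy)]
//! enum Op { Sub, Neg }
//!
//! struct Edge { op: Op, sources: Vec<usize>, targets: Vec<usize> }
//!
//! /// Evaluate an acyclic open hypergraph whose edges are given in dependency order.
//! fn eval(num_nodes: usize, edges: &[Edge], sources: &[usize], targets: &[usize], inputs: &[i8]) -> Vec<i8> {
//!     let mut values = vec![0i8; num_nodes];
//!     // Each input value is written to the node named by the matching global source.
//!     for (&node, &v) in sources.iter().zip(inputs) {
//!         values[node] = v;
//!     }
//!     // Each hyperedge reads its ordered sources and writes its single target.
//!     for e in edges {
//!         let out = match e.op {
//!             Op::Sub => values[e.sources[0]].wrapping_sub(values[e.sources[1]]),
//!             Op::Neg => values[e.sources[0]].wrapping_neg(),
//!         };
//!         values[e.targets[0]] = out;
//!     }
//!     // The outputs are the values on the global target nodes, in order.
//!     targets.iter().map(|&n| values[n]).collect()
//! }
//!
//! /// The example above: nodes `0..=3`, `Sub: [0, 2] -> [1]`, `Neg: [1] -> [3]`.
//! fn example(x: i8, y: i8) -> Vec<i8> {
//!     let edges = [
//!         Edge { op: Op::Sub, sources: vec![0, 2], targets: vec![1] },
//!         Edge { op: Op::Neg, sources: vec![1], targets: vec![3] },
//!     ];
//!     eval(4, &edges, &[0, 2], &[0, 3], &[x, y])
//! }
//! ```
//!
//! Under these assumptions, `example(5, 3)` returns `[5, -2]`: the first output is the value on
//! node `0`, and the second is `-(5 - 3)`. Swapping the `Sub` sources to `[2, 0]` would change
//! the result, which is why the order of each list matters.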
//!
//! <div class="warning">
//! There are no restrictions on how many times a node can appear as a source or target of both
//! hyperedges and the open hypergraph as a whole.
//! </div>
//!
//! For example, node `0` is a source and target of the open hypergraph, *and* a source of the
//! `Sub` edge.
//! Another example: node `1` is not a source or target of the open hypergraph, although it *is* a
//! target of the `Sub` hyperedge and a source of the `Neg` hyperedge.
//!
//! It's also possible to have nodes which are neither sources nor targets of the open hypergraph
//! *or* any hyperedge, but that isn't pictured here. See the [theory](#theory) section for more
//! detail.
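//!
//! The two examples above can be checked by counting. The following sketch hard-codes the lists
//! from the drawing and counts how often a node occurs in each role. The helpers `occurrences`
//! and `usage` are hypothetical and independent of this crate.
//!
//! ```rust
//! /// Count how often `node` occurs in a list of node IDs.
//! fn occurrences(node: usize, list: &[usize]) -> usize {
//!     list.iter().filter(|&&n| n == node).count()
//! }
//!
//! /// Returns (as global source, as global target, as edge source, as edge target).
//! fn usage(node: usize) -> (usize, usize, usize, usize) {
//!     // Global sources and targets of the example.
//!     let sources: [usize; 2] = [0, 2];
//!     let targets: [usize; 2] = [0, 3];
//!     // Per-edge lists, in the order `Sub`, `Neg`.
//!     let edge_sources: [&[usize]; 2] = [&[0, 2], &[1]];
//!     let edge_targets: [&[usize]; 2] = [&[1], &[3]];
//!     (
//!         occurrences(node, &sources),
//!         occurrences(node, &targets),
//!         edge_sources.iter().map(|s| occurrences(node, s)).sum::<usize>(),
//!         edge_targets.iter().map(|t| occurrences(node, t)).sum::<usize>(),
//!     )
//! }
//! ```
//!
//! Here `usage(0)` is `(1, 1, 1, 0)`, matching the first example, and `usage(1)` is
//! `(0, 0, 1, 1)`, matching the second.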
//!
//! # Formal Definition
//!
//! Formally, an open hypergraph is a triple of:
//!
//! 1. A Hypergraph `h` with `N ∈ ℕ` nodes
//! 2. An array `s` of length `A ∈ ℕ` whose elements `s_i ∈ {0..N-1}` are nodes
//! 3. An array `t` of length `B ∈ ℕ` whose elements `t_i ∈ {0..N-1}` are nodes
//!
//! Many different kinds of [Hypergraph](https://en.wikipedia.org/wiki/Hypergraph) exist,
//! but an *open* hypergraph uses a specific kind of directed hypergraph, which has:
//!
//! - A finite set of `N` nodes, each labeled with an element of a set `Σ₀`
//! - A finite set of `E` *hyperedges*, each labeled with an element of a set `Σ₁`
//! - For each hyperedge `e ∈ {0..E-1}`,
//!   - An ordered array of *source nodes*
//!   - An ordered array of *target nodes*
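//!
//! As a rough illustration, the definition can be transcribed into plain Rust types. This is a
//! sketch only: `HypergraphSketch`, `OpenHypergraphSketch`, `in_range`, `is_well_formed` and
//! `example` are hypothetical names, and the layout (one `Vec` per hyperedge) is chosen for
//! readability rather than taken from this crate's actual representation.
//!
//! ```rust
//! /// Directed hypergraph with `N = node_labels.len()` and `E = edge_labels.len()`.
//! struct HypergraphSketch<O, A> {
//!     node_labels: Vec<O>,           // label in Σ₀ of each node
//!     edge_labels: Vec<A>,           // label in Σ₁ of each hyperedge
//!     edge_sources: Vec<Vec<usize>>, // ordered source nodes of each hyperedge
//!     edge_targets: Vec<Vec<usize>>, // ordered target nodes of each hyperedge
//! }
//!
//! /// The triple `(h, s, t)`.
//! struct OpenHypergraphSketch<O, A> {
//!     h: HypergraphSketch<O, A>,
//!     s: Vec<usize>,
//!     t: Vec<usize>,
//! }
//!
//! /// True if every element of `xs` is a node ID in `{0..n-1}`.
//! fn in_range(xs: &[usize], n: usize) -> bool {
//!     xs.iter().all(|&i| i < n)
//! }
//!
//! impl<O, A> OpenHypergraphSketch<O, A> {
//!     /// Check that each hyperedge has both lists and that all node IDs are valid.
//!     fn is_well_formed(&self) -> bool {
//!         let n = self.h.node_labels.len();
//!         let e = self.h.edge_labels.len();
//!         self.h.edge_sources.len() == e
//!             && self.h.edge_targets.len() == e
//!             && self.h.edge_sources.iter().all(|xs| in_range(xs, n))
//!             && self.h.edge_targets.iter().all(|xs| in_range(xs, n))
//!             && in_range(&self.s, n)
//!             && in_range(&self.t, n)
//!     }
//! }
//!
//! /// The running example, with the global targets `t` left as a parameter.
//! fn example(t: Vec<usize>) -> OpenHypergraphSketch<&'static str, &'static str> {
//!     OpenHypergraphSketch {
//!         h: HypergraphSketch {
//!             node_labels: vec!["i8"; 4],
//!             edge_labels: vec!["Sub", "Neg"],
//!             edge_sources: vec![vec![0, 2], vec![1]],
//!             edge_targets: vec![vec![1], vec![3]],
//!         },
//!         s: vec![0, 2],
//!         t,
//!     }
//! }
//! ```
//!
//! With `N = 4`, `example(vec![0, 3])` is well formed, while `example(vec![0, 4])` is not,
//! because `4` is outside `{0..N-1}`.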
//!
//! # Comparison to Trees and Graphs
//!
//! Let's compare the open hypergraph representation of the example term above against *tree* and
//! *graph* representations.
//!
//! When considered as a tree, the term `(x, - (x - y))` can be drawn as follows:
//!
//! ```text
//!         Pair
//!        /    \
//!       /      Neg
//!      x        |
//!              Sub
//!             /   \
//!            x     y
//! ```
//!
//! There are two problems here:
//!
//! 1. To handle multiple outputs, we had to include a tuple constructor "Pair" in our language.
//!    This means we'd also need to add other functions to deal with pairs, "polluting" the base
//!    language.
//! 2. The "sharing" of variables is not evident from the tree structure: `x` is used twice, but we
//!    have to compare strings to "discover" that fact.
//!
//! In contrast, the open hypergraph:
//!
//! 1. Allows for terms with **multiple outputs**, without having to introduce a tuple type to the
//!    language.
//! 2. Encodes the **sharing** of variables naturally by allowing nodes to appear in multiple
//!    sources and targets.
//!
//! Another common approach is to use a *graph* for syntax where nodes are operations, and an edge
//! between two nodes indicates that the *output* of the source node is the *input* of the target.
//! This approach also has two problems:
//!
//! 1. Nodes don't distinguish the order of edges, so argument order has to be tracked separately.
//! 2. There is no notion of input or output to the whole system.
//!
//! In contrast, the open hypergraph:
//!
//! 1. Naturally handles operations with multiple ordered inputs and outputs (as *hyperedges*)
//! 2. Comes equipped with global source and target nodes
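//!
//! The difference in how sharing is detected can be sketched in a few lines. The `Term` type
//! and the helpers `count_var`, `tree_example` and `count_node` below are hypothetical and only
//! illustrate the comparison; they are not part of this crate.
//!
//! ```rust
//! /// Tree syntax: needs a `Pair` constructor, and variables are identified by name.
//! enum Term {
//!     Var(&'static str),
//!     Neg(Box<Term>),
//!     Sub(Box<Term>, Box<Term>),
//!     Pair(Box<Term>, Box<Term>),
//! }
//!
//! /// Count the leaves named `name`; sharing is only found by comparing strings.
//! fn count_var(term: &Term, name: &str) -> usize {
//!     match term {
//!         Term::Var(v) => (*v == name) as usize,
//!         Term::Neg(t) => count_var(t, name),
//!         Term::Sub(l, r) | Term::Pair(l, r) => count_var(l, name) + count_var(r, name),
//!     }
//! }
//!
//! /// The tree for the term `(x, -(x - y))` drawn above.
//! fn tree_example() -> Term {
//!     use Term::*;
//!     Pair(
//!         Box::new(Var("x")),
//!         Box::new(Neg(Box::new(Sub(Box::new(Var("x")), Box::new(Var("y")))))),
//!     )
//! }
//!
//! /// In the open hypergraph, the same fact is structural: a node is used wherever its
//! /// ID occurs, here in the `Sub` sources `[0, 2]` and the global targets `[0, 3]`.
//! fn count_node(node: usize) -> usize {
//!     let sub_sources = [0usize, 2];
//!     let global_targets = [0usize, 3];
//!     sub_sources.iter().chain(global_targets.iter()).filter(|&&n| n == node).count()
//! }
//! ```
//!
//! Both `count_var(&tree_example(), "x")` and `count_node(0)` give `2`, but the second needs no
//! names at all: the two uses refer to the same node ID.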
//!
//! Open Hypergraphs have general utility because they model any system which can be described in terms of symmetric monoidal
//! categories.
//! Some examples are listed [above](#what-are-open-hypergraphs-for);
//! see the [Theory](#theory) section for more pointers to detail on the mathematical
//! underpinnings.
//!
//! # Theory
//!
//! Formally, an `OpenHypergraph<Σ₀, Σ₁>` is an arrow of
//! the free [symmetric monoidal category](https://en.wikipedia.org/wiki/Symmetric_monoidal_category)
//! presented by the signature `(Σ₀, Σ₁)` plus "Special Frobenius" structure.
//!
//! This extra structure is sometimes useful (e.g. in autodiff), but can be removed by restricting
//! the open hypergraph so that it contains no cycles and every node appears in exactly one source
//! and exactly one target.
//! This condition is called "monogamous acyclicity".
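//!
//! The counting half of this condition can be sketched as follows. This is one reading of
//! "exactly one source and target", stated here as an assumption: every node is *produced*
//! exactly once (as a global source or as the target of a hyperedge) and *consumed* exactly once
//! (as a global target or as the source of a hyperedge). The sketch does not check for cycles,
//! and `Edge`, `is_monogamous` and `example` are hypothetical names, not this crate's API.
//!
//! ```rust
//! struct Edge { sources: Vec<usize>, targets: Vec<usize> }
//!
//! /// Check that every node is produced exactly once and consumed exactly once.
//! fn is_monogamous(num_nodes: usize, edges: &[Edge], sources: &[usize], targets: &[usize]) -> bool {
//!     let mut produced = vec![0usize; num_nodes];
//!     let mut consumed = vec![0usize; num_nodes];
//!     // Global sources produce values; global targets consume them.
//!     for &n in sources { produced[n] += 1; }
//!     for &n in targets { consumed[n] += 1; }
//!     // A hyperedge consumes its sources and produces its targets.
//!     for e in edges {
//!         for &n in &e.targets { produced[n] += 1; }
//!         for &n in &e.sources { consumed[n] += 1; }
//!     }
//!     produced.iter().zip(&consumed).all(|(&p, &c)| p == 1 && c == 1)
//! }
//!
//! /// The running example, with the global targets left as a parameter.
//! fn example(targets: &[usize]) -> bool {
//!     let edges = [
//!         Edge { sources: vec![0, 2], targets: vec![1] }, // Sub
//!         Edge { sources: vec![1], targets: vec![3] },    // Neg
//!     ];
//!     is_monogamous(4, &edges, &[0, 2], targets)
//! }
//! ```
//!
//! With targets `[0, 3]` the check fails, because node `0` is consumed twice (by `Sub` and as a
//! global target); this copying is where the extra structure is used. With targets `[3]` alone,
//! the check passes.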
//!
//! A complete mathematical explanation can be found in the papers
//! [String Diagram Rewrite Theory I](https://arxiv.org/abs/2012.01847),
//! [II](https://arxiv.org/abs/2104.14686),
//! and
//! [III](https://arxiv.org/abs/2109.06049),
//! which also include details on how to *rewrite* open hypergraphs.
//!
//! The implementation in *this* library is based on the data-parallel algorithms described in the
//! paper [Data-Parallel Algorithms for String Diagrams](https://arxiv.org/pdf/2305.01041).
//! In particular, the "generalised autodiff" algorithm can be found in Section 10 ("Optic
//! Composition using Frobenius Structure") of that paper.

pub mod array;
pub mod category;
pub mod finite_function;
pub mod indexed_coproduct;
pub mod operations;
pub mod semifinite;

// Strict open hypergraphs
pub mod strict;

// imperative interface to building open hypergraphs
pub mod lax;