[−][src]Crate ocaml_interop
Zinc-iron alloy coating is used in parts that need very good corrosion protection.
ocaml-interop is an OCaml<->Rust FFI with an emphasis on safety inspired by caml-oxide and ocaml-rs.
Table of Contents
How does it work
ocaml-interop, just like caml-oxide, encodes the invariants of OCaml's garbage collector into the rules of Rust's borrow checker. Any violation of these invariants results in a compilation error produced by Rust's borrow checker.
This requires that the user is explicit about delimiting blocks that interact with the OCaml runtime, and that calls into the OCaml runtime are done only inside these blocks, and wrapped by a few special macros.
Usage
Rules
There are a few rules that have to be followed when calling into the OCaml runtime:
Rule 1: OCaml function calls, allocations and the GC Frame
Calls into the OCaml runtime that perform allocations should only occur inside ocaml_frame!
blocks, wrapped by either the ocaml_call!
(for declared OCaml functions) or ocaml_alloc!
(for allocation or conversion functions) macros.
Example:
ocaml_frame!(gc, { let arg1 = ocaml_alloc!(arg1.to_ocaml(gc)); // ... let result = ocaml_call!(ocaml_function(gc, arg1, /* ..., argN */)); let ocaml_string: OCaml<String> = ocaml_alloc!(a_string.to_ocaml(gc)); // ... })
Without the macros, this error is produced, because without the macros an incorrect token is passed as the first argument:
error[E0308]: mismatched types
--> example.rs
|
| let result = ocaml_function(gc, arg1, ..., argN);
| ^^ expected struct `ocaml_interop::OCamlAllocToken`, found `&mut ocaml_interop::GCFrame<'_>`
Rule 2: OCaml value references
OCaml values that are obtained as a result of calling an OCaml function can only be referenced directly until another call to an OCaml function happens. This is enforced by Rust's borrow checker. If a value has to be referenced after other OCaml function calls, a special reference has to be kept.
Example:
ocaml_frame!(gc, { let arg1 = ocaml_alloc!(arg1.to_ocaml(gc)); let result = ocaml_call!(ocaml_function(gc, arg1, /* ..., argN */)).unwrap(); let result_ref = &gc.keep(result); let arg2 = ocaml_alloc!(arg2.to_ocaml(gc)); let another_result = ocaml_call!(ocaml_function(gc, arg2, /* ..., argN */)).unwrap(); // ... let more_results = ocaml_call!(another_ocaml_function(gc, gc.get(result_ref))).unwrap(); // ... })
If the value is not kept with gc.keep
, and instead is attempted to be re-used directly, Rust's borrow checker will complain:
error[E0502]: cannot borrow `*gc` as mutable because it is also borrowed as immutable
--> example.rs
|
| let result = ocaml_call!(ocaml_function(gc, arg1, ..., argN)).unwrap();
| ------------------------------------ immutable borrow occurs here
...
| let another_result = ocaml_call!(ocaml_function(gc, arg1, ..., argN)).unwrap();
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ mutable borrow occurs here
...
| let more_results = ocaml_call!(another_ocaml_function(gc, result)).unwrap();
| ------ immutable borrow later used here
|
There is no need to keep values that are used immediately without any calls into the OCaml runtime in-between their allocation and use.
Rule 3: Liveness and scope of OCaml values
OCaml values that are the result of an allocation by the OCaml runtime cannot escape the ocaml_frame!
block inside which they where created. This is enforced by Rust's borrow checker.
Example:
let s = ocaml_frame!(gc, { let arg1 = ocaml_alloc!(arg1.to_ocaml(gc)); let result = ocaml_call!(ocaml_function(gc, arg1, /* ..., argN */)).unwrap(); String::from_ocaml(result) }); // ...
If the result escapes the block, Rust's borrow checker will complain:
error[E0597]: `frame` does not live long enough
--> example.rs
|
| let s = ocaml_frame!(gc, {
| _________-___^
| | |
| | borrow later stored here
| | let result = let result = ocaml_call!(ocaml_function(gc, arg1, ..., argN)).unwrap();
| | result
| | });
| | ^
| | |
| |______borrowed value does not live long enough
| `frame` dropped here while still borrowed
|
TODO: show escape hatch for values that need to escape the frame scope using raw OCaml values.
Converting between OCaml and Rust data
FromOCaml
trait
The FromOCaml
trait implements conversion from OCaml values into Rust values, using the from_ocaml
function.
IntoRust
trait
IntoRust
is the counterpart to FromOCaml
just like Into
is to From
. Using ocaml_val.into_rust()
instead of Type::from_ocaml(ocaml_val)
is usually more convenient, specially when more complicated types are involved.
ToOCaml
trait
The ToOCaml
trait implements conversion from Rust values into OCaml values, using the to_ocaml
function. to_ocaml
can only be called when wrapped by the ocaml_alloc!
macro form, and it takes a single parameter that must be a handle to the current GC frame.
Calling into OCaml from Rust
The following code defines two OCaml functions and registers them using the Callback.register
mechanism:
let increment_bytes bytes first_n =
let limit = (min (Bytes.length bytes) first_n) - 1 in
for i = 0 to limit do
let value = (Bytes.get_uint8 bytes i) + 1 in
Bytes.set_uint8 bytes i value
done;
bytes
let twice x = 2 * x
let () =
Callback.register "increment_bytes" increment_bytes;
Callback.register "twice" twice
To be able to call these from Rust, there are a few things that need to be done:
- The OCaml runtime has to be initialized. If the driving program is a Rust application, it has to be done explicitly by doing
let runtime = OCamlRuntime::init()
, but if the driving program is an OCaml application, this is not required. Whenruntime
goes out of scope it will be dropped and the OCaml runtime cleanup functions will be executed. - Functions that were exported from the OCaml side with
Callback.register
have to be declared using theocaml!
macro. - Blocks of code that call OCaml functions, or allocate OCaml values, must be wrapped by the
ocaml_frame!
macro. - Calls to functions that allocate OCaml values must be wrapped by the
ocaml_alloc!
macro. These always return a value and cannot signal failure. - Calls to functions exported by OCaml with
Callback.register
must be wrapped by theocaml_call!
macro. These return a value of typeResult<OCaml<T>, ocaml_interop::Error>
, with the error being returned to signal that an exception was raised by the called OCaml code.
Example
use ocaml_interop::{ ocaml_alloc, ocaml_call, ocaml_frame, to_ocaml, IntoRust, FromOCaml, OCaml, OCamlRef, ToOCaml, OCamlRuntime }; // To call an OCaml function, it first has to be declared inside an `ocaml!` macro block: mod ocaml_funcs { use ocaml_interop::{ocaml, OCamlInt}; ocaml! { // OCaml: `val increment_bytes: bytes -> int -> bytes` // registered with `Callback.register "increment_bytes" increment_bytes` pub fn increment_bytes(bytes: String, first_n: OCamlInt) -> String; // OCaml: `val twice: int -> int` // registered with `Callback.register "twice" twice` pub fn twice(num: OCamlInt) -> OCamlInt; } // The two OCaml functions declared above can now be invoked with the // `ocaml_call!` macro: `ocaml_call!(func_name(gc, args...))`. // Note the first `gc` parameter, it is an OCaml Garbage Collector handle, and // it is obtained by opening a new GC frame block, sung the `ocaml_frame!` macro. } fn increment_bytes(bytes1: String, bytes2: String, first_n: usize) -> (String, String) { // Any calls into the OCaml runtime have to happen inside an // `ocaml_frame!` block. Inside this block, OCaml allocations and references // to OCaml allocated values are tracked and validated by Rust's borrow checker. // The first argument to the macro is a name for the GC handle, the second // is the block of code that will run inside that frame. ocaml_frame!(gc, { // The `ToOCaml` trait provides the `to_ocaml` method to convert Rust // values into OCaml values. Because such conversions usually require // the OCaml runtime to perform an allocation, calls to `to_ocaml` have // to be wrapped by the `ocaml_alloc!` macro. A shorter version uses // the `to_ocaml!` macro. let ocaml_bytes1: OCaml<String> = to_ocaml!(gc, bytes1); // `ocaml_bytes1` is going to be referenced later, but there calls into the // OCaml runtime that perform allocations happening before this value is used again. // Those calls into the OCaml runtime invalidate this reference, so it has to be // kept alive somehow. To do so, `gc.keep(ocaml_bytes1)` is used. It returns // a reference to an OCaml value that is going to be valid during the scope of // the current `ocaml_frame!` block. Later `gc.get(the_reference)` can be used // to obtain the kept value. let bytes1_ref: &OCamlRef<String> = &gc.keep(ocaml_bytes1); // A shorter way to write the above two lines is: // let bytes1_ref = &to_ocaml!(gc, bytes1).keep(gc); // Same as above. Note that if we waited to perform this conversion // until after `ocaml_bytes1` is used, no references would have to be // kept for either of the two OCaml values, because they would be // used immediately, with no allocations being performed by the // OCaml runtime in-between. let bytes2_ref = &to_ocaml!(gc, bytes2).keep(gc); // Rust `i64` integers can be converted into OCaml fixnums with `OCaml::of_i64`. // Such conversion doesn't require any allocation on the OCaml side, // so this call doesn't have to be wrapped by `ocaml_alloc!` or `to_ocaml!`, // and no GC handle is passed as an argument. let ocaml_first_n = unsafe { OCaml::of_i64(first_n as i64) }; // To call an OCaml function (declared above in a `ocaml!` block) the // `ocaml_call!` macro is used. The GC handle has to be passed as the first argument, // before all the other declared arguments. // The result of this call is a Result<OCamlValue<T>, ocaml_interop::Error>, with `Err(...)` // being the result of calls for which the OCaml runtime raises an exception. let result1 = ocaml_call!(ocaml_funcs::increment_bytes( gc, // The reference created above is used here to obtain the value // of `ocaml_bytes1` gc.get(bytes1_ref), ocaml_first_n )).unwrap(); // Perform the conversion of the OCaml result value into a // Rust value while the reference is still valid because the // `ocaml_call!` that follows will invalidate it. // Alternatively, the result of `gc.keep(result1)` could be used // to be able to reference the value later through an `OCamlRef` value. let new_bytes1: String = result1.into_rust(); let result2 = ocaml_call!(ocaml_funcs::increment_bytes( gc, gc.get(bytes2_ref), ocaml_first_n )).unwrap(); // The `FromOCaml` trait provides the `from_ocaml` method to convert from // OCaml values into OCaml values. Unlike the `to_ocaml` method, it doesn't // require a GC handle argument, because no allocation is performed by the // OCaml runtime when converting into Rust values. // A more convenient alternative, is to use the `into_rust` method as // above when `result1` was converted. (new_bytes1, String::from_ocaml(result2)) }) } fn twice(num: usize) -> usize { ocaml_frame!(gc, { let ocaml_num = unsafe { OCaml::of_i64(num as i64) }; let result = ocaml_call!(ocaml_funcs::twice(gc, ocaml_num)); i64::from_ocaml(result.unwrap()) as usize }) } fn entry_point() { // IMPORTANT: the OCaml runtime has to be initialized first. let ocaml_runtime = OCamlRuntime::init(); let first_n = twice(5); let bytes1 = "000000000000000".to_owned(); let bytes2 = "aaaaaaaaaaaaaaa".to_owned(); println!("Bytes1 before: {}", bytes1); println!("Bytes2 before: {}", bytes2); let (result1, result2) = increment_bytes(bytes1, bytes2, first_n); println!("Bytes1 after: {}", result1); println!("Bytes2 after: {}", result2); // At this point the `ocaml_runtime` handle will be dropped, triggering // the execution of the necessary cleanup by the OCaml runtime }
Calling into Rust from OCaml
To be able to call a Rust function from OCaml, it has to be defined in a way that exposes it to OCaml. This can be done with the ocaml_export!
macro.
Example
use ocaml_interop::{to_ocaml, ocaml_export, ocaml_frame, FromOCaml, OCamlInt, OCaml, OCamlBytes, ToOCaml}; // `ocaml_export` expands the function definitions by adding `pub` visibility and // the required `#[no_mangle]` and `extern` declarations. It also takes care of // binding the GC frame handle to the name provided as the first parameter. ocaml_export! { // The first parameter is a name to which the GC frame handle will be bound to. // The remaining parameters and return value must have a declared type of `OCaml<T>`. fn rust_twice(_gc, num: OCaml<OCamlInt>) -> OCaml<OCamlInt> { let num = i64::from_ocaml(num); unsafe { OCaml::of_i64(num * 2) } } fn rust_increment_bytes(gc, bytes: OCaml<OCamlBytes>, first_n: OCaml<OCamlInt>) -> OCaml<OCamlBytes> { let first_n = i64::from_ocaml(first_n) as usize; let mut vec = Vec::from_ocaml(bytes); for i in 0..first_n { vec[i] += 1; } // Note that unlike in `ocaml_frame!` blocks, where values of type `OCaml<T>` // cannot escape, in functions defined inside `ocaml_export!` blocks, // only results of type `OCaml<T>` are valid. to_ocaml!(gc, vec) } }
Then in OCaml, these functions can be referred to in the same way as C functions:
external rust_twice: int -> int = "rust_twice"
external rust_increment_bytes: bytes -> int -> bytes = "rust_increment_bytes"
References and links
- OCaml Manual: Chapter 20 Interfacing C with OCaml.
- Safely Mixing OCaml and Rust paper by Stephen Dolan.
- Safely Mixing OCaml and Rust talk by Stephen Dolan.
- CAMLroot: revisiting the OCaml FFI.
- caml-oxide, the code from that paper.
- ocaml-rs, another OCaml<->Rust FFI library.
Macros
impl_conv_ocaml_record | Implements conversion between a Rust struct and an OCaml record. |
impl_conv_ocaml_variant | Implements conversion between a Rust enum and an OCaml variant. |
impl_from_ocaml_record | Implements |
impl_from_ocaml_variant | Implements |
impl_to_ocaml_record | Implements |
impl_to_ocaml_variant | Implements |
ocaml | Declares OCaml functions and allocators. |
ocaml_alloc | Calls an OCaml allocator function. |
ocaml_alloc_record | Allocates an OCaml record built from a Rust record |
ocaml_alloc_tagged_block | Allocates an OCaml memory block tagged with the specified value. |
ocaml_alloc_variant | Allocates an OCaml variant, mapped from a Rust enum. |
ocaml_call | Calls an OCaml function |
ocaml_export | Defines Rust functions callable from OCaml. |
ocaml_frame | Opens a new frame inside which new OCaml values can be allocated, and OCaml functions called. |
ocaml_unpack_record | Unpacks an OCaml record into a Rust record |
ocaml_unpack_variant | Unpacks an OCaml variant and maps it into a Rust enum. |
to_ocaml | Converts Rust values into OCaml values. |
Structs
OCaml | Representation of OCaml values inside |
OCamlAllocResult | Intermediary allocation result. |
OCamlAllocToken | Token used by allocation functions. Used internally. |
OCamlBytes |
|
OCamlException | An OCaml exception value. |
OCamlInt32 |
|
OCamlInt64 |
|
OCamlList |
|
OCamlRef |
|
OCamlRuntime | OCaml runtime handle. |
Enums
OCamlError |
Traits
FromOCaml | Implements conversion from OCaml values into Rust values. |
IntoRust | Counterpart to |
ToOCaml | Implements conversion from Rust values into OCaml values. |
Type Definitions
OCamlFn1 | OCaml function that accepts one argument. |
OCamlFn2 | OCaml function that accepts two arguments. |
OCamlFn3 | OCaml function that accepts three arguments. |
OCamlFn4 | OCaml function that accepts four arguments. |
OCamlFn5 | OCaml function that accepts five arguments. |
OCamlInt |
|
OCamlResult | The result of calls to OCaml functions. Can be a value or an error. |
RawOCaml | Represent OCaml |