Struct Allocator

Source
pub struct Allocator { /* private fields */ }
Expand description

A bump-allocated memory arena.

§Anatomy of an Allocator

Allocator is flexibly sized. It grows as required as you allocate data into it.

To do that, an Allocator consists of multiple memory chunks.

Allocator::new creates a new allocator without any chunks. When you first allocate an object into it, it will lazily create an initial chunk, the size of which is determined by the size of that first allocation.

As more data is allocated into the Allocator, it will likely run out of capacity. At that point, a new memory chunk is added, and further allocations will use this new chunk (until it too runs out of capacity, and another chunk is added).

The data from the 1st chunk is not copied into the 2nd one. It stays where it is, which means & or &mut references to data in the first chunk remain valid. This is unlike e.g. Vec which copies all existing data when it grows.

Each chunk is at least double the size of the last one, so growth in capacity is exponential.

Allocator::reset keeps only the last chunk (the biggest one), and discards any other chunks, returning their memory to the global allocator. The last chunk has its cursor rewound back to the start, so it’s empty, ready to be re-used for allocating more data.

§Recycling allocators

For good performance, it’s ideal to create an Allocator, and re-use it over and over, rather than repeatedly creating and dropping Allocators.

// This is good!
use oxc_allocator::Allocator;
let mut allocator = Allocator::new();

for i in 0..100 {
    do_stuff(i, &allocator);
    // Reset the allocator, freeing the memory used by `do_stuff`
    allocator.reset();
}
// DON'T DO THIS!
for i in 0..100 {
    let allocator = Allocator::new();
    do_stuff(i, &allocator);
}
// DON'T DO THIS EITHER!
for i in 0..100 {
    do_stuff(i, &allocator);
    // We haven't reset the allocator, so we haven't freed the memory used by `do_stuff`.
    // The allocator will grow and grow, consuming more and more memory.
}

§Why is re-using an Allocator good for performance?

3 reasons:

§1. Avoid expensive system calls

Creating an Allocator is a fairly expensive operation as it involves a call into global allocator, which in turn will likely make a system call. Ditto when the Allocator is dropped. Re-using an existing Allocator avoids these costs.

§2. CPU cache

Re-using an existing allocator means you’re re-using the same block of memory. If that memory was recently accessed, it’s likely to be warm in the CPU cache, so memory accesses will be much faster than accessing “cold” sections of main memory.

This can have a very significant positive impact on performance.

§3. Capacity stabilization

The most efficient Allocator is one with only 1 chunk which has sufficient capacity for everything you’re going to allocate into it.

Why?

  1. Every allocation will occur without the allocator needing to grow.

  2. This makes the “is there sufficient capacity to allocate this?” check in alloc completely predictable (the answer is always “yes”). The CPU’s branch predictor swiftly learns this, speeding up operation.

  3. When the Allocator is reset, there are no excess chunks to discard, so no system calls.

Because reset keeps only the biggest chunk (see above), re-using the same Allocator for multiple similar workloads will result in the Allocator swiftly stabilizing at a capacity which is sufficient to service those workloads with a single chunk.

If workload is completely uniform, it reaches stable state on the 3rd round.

let mut allocator = Allocator::new();

fn workload(allocator: &Allocator) {
    // Allocate 4 MB of data in small chunks
    for i in 0..1_000_000u32 {
        allocator.alloc(i);
    }
}

// 1st round
workload(&allocator);

// `allocator` has capacity for 4 MB data, but split into many chunks.
// `reset` throws away all chunks except the last one which will be approx 2 MB.
allocator.reset();

// 2nd round
workload(&allocator);

// `workload` filled the 2 MB chunk, so a 2nd chunk was created of double the size (4 MB).
// `reset` discards the smaller chunk, leaving only a single 4 MB chunk.
allocator.reset();

// 3rd round
// `allocator` now has sufficient capacity for all allocations in a single 4 MB chunk.
workload(&allocator);

// `reset` has no chunks to discard. It keeps the single 4 MB chunk. No system calls.
allocator.reset();

// More rounds
// All serviced without needing to grow the allocator, and with no system calls.
for _ in 0..100 {
  workload(&allocator);
  allocator.reset();
}

§No Drops

Objects allocated into Oxc memory arenas are never Dropped. Memory is released in bulk when the allocator is dropped, without dropping the individual objects in the arena.

Therefore, it would produce a memory leak if you allocated Drop types into the arena which own memory allocations outside the arena.

Static checks make this impossible to do. Allocator::alloc, Box::new_in, Vec::new_in, HashMap::new_in, and all other methods which store data in the arena will refuse to compile if called with a Drop type.

use oxc_allocator::{Allocator, Box};

let allocator = Allocator::new();

struct Foo {
    pub a: i32
}

impl std::ops::Drop for Foo {
    fn drop(&mut self) {}
}

// This will fail to compile because `Foo` implements `Drop`
let foo = Box::new_in(Foo { a: 0 }, &allocator);

struct Bar {
    v: std::vec::Vec<u8>,
}

// This will fail to compile because `Bar` contains a `std::vec::Vec`, and it implements `Drop`
let bar = Box::new_in(Bar { v: vec![1, 2, 3] }, &allocator);

§Examples

Consumers of the oxc umbrella crate pass Allocator references to other tools.

use oxc::{allocator::Allocator, parser::Parser, span::SourceType};

let allocator = Allocator::default();
let parsed = Parser::new(&allocator, "let x = 1;", SourceType::default());
assert!(parsed.errors.is_empty());

Implementations§

Source§

impl Allocator

Source

pub fn new() -> Self

Create a new Allocator with no initial capacity.

This method does not reserve any memory to back the allocator. Memory for allocator’s initial chunk will be reserved lazily, when you make the first allocation into this Allocator (e.g. with Allocator::alloc, Box::new_in, Vec::new_in, HashMap::new_in).

If you can estimate the amount of memory the allocator will require to fit what you intend to allocate into it, it is generally preferable to create that allocator with with_capacity, which reserves that amount of memory upfront. This will avoid further system calls to allocate further chunks later on. This point is less important if you’re re-using the allocator multiple times.

See Allocator docs for more information on efficient use of Allocator.

Source

pub fn with_capacity(capacity: usize) -> Self

Create a new Allocator with specified capacity.

See Allocator docs for more information on efficient use of Allocator.

Source

pub fn alloc<T>(&self, val: T) -> &mut T

Allocate an object in this Allocator and return an exclusive reference to it.

§Panics

Panics if reserving space for T fails.

§Examples
use oxc_allocator::Allocator;

let allocator = Allocator::default();
let x = allocator.alloc([1u8; 20]);
assert_eq!(x, &[1u8; 20]);
Source

pub fn alloc_str<'alloc>(&'alloc self, src: &str) -> &'alloc mut str

Copy a string slice into this Allocator and return a reference to it.

§Panics

Panics if reserving space for the string fails.

§Examples
use oxc_allocator::Allocator;
let allocator = Allocator::default();
let hello = allocator.alloc_str("hello world");
assert_eq!(hello, "hello world");
Source

pub fn reset(&mut self)

Reset this allocator.

Performs mass deallocation on everything allocated in this arena by resetting the pointer into the underlying chunk of memory to the start of the chunk. Does not run any Drop implementations on deallocated objects.

If this arena has allocated multiple chunks to bump allocate into, then the excess chunks are returned to the global allocator.

§Examples
use oxc_allocator::Allocator;

let mut allocator = Allocator::default();

// Allocate a bunch of things.
{
    for i in 0..100 {
        allocator.alloc(i);
    }
}

// Reset the arena.
allocator.reset();

// Allocate some new things in the space previously occupied by the
// original things.
for j in 200..400 {
    allocator.alloc(j);
}
Source

pub fn capacity(&self) -> usize

Calculate the total capacity of this Allocator including all chunks, in bytes.

Note: This is the total amount of memory the Allocator owns NOT the total size of data that’s been allocated in it. If you want the latter, use used_bytes instead.

§Examples
use oxc_allocator::Allocator;

let capacity = 64 * 1024; // 64 KiB
let mut allocator = Allocator::with_capacity(capacity);
allocator.alloc(123u64); // 8 bytes

// Result is the capacity (64 KiB), not the size of allocated data (8 bytes).
// `Allocator::with_capacity` may allocate a bit more than requested.
assert!(allocator.capacity() >= capacity);
Source

pub fn used_bytes(&self) -> usize

Calculate the total size of data used in this Allocator, in bytes.

This is the total amount of memory that has been used in the Allocator, NOT the amount of memory the Allocator owns. If you want the latter, use capacity instead.

The result includes:

  1. Padding bytes between objects which have been allocated to preserve alignment of types where they have different alignments or have larger-than-typical alignment.
  2. Excess capacity in Vecs, Strings and HashMaps.
  3. Objects which were allocated but later dropped. Allocator does not re-use allocations, so anything which is allocated into arena continues to take up “dead space”, even after it’s no longer referenced anywhere.
  4. “Dead space” left over where a Vec, String or HashMap has grown and had to make a new allocation to accommodate its new larger size. Its old allocation continues to take up “dead” space in the allocator, unless it was the most recent allocation.

In practice, this almost always means that the result returned from this function will be an over-estimate vs the amount of “live” data in the arena.

However, if you are using the result of this method to create a new Allocator to clone an AST into, it is theoretically possible (though very unlikely) that it may be a slight under-estimate of the capacity required in new allocator to clone the AST into, depending on the order that &strs were allocated into arena in parser vs the order they get allocated during cloning. The order allocations are made in affects the amount of padding bytes required.

§Examples
use oxc_allocator::{Allocator, Vec};

let capacity = 64 * 1024; // 64 KiB
let mut allocator = Allocator::with_capacity(capacity);

allocator.alloc(1u8); // 1 byte with alignment 1
allocator.alloc(2u8); // 1 byte with alignment 1
allocator.alloc(3u64); // 8 bytes with alignment 8

// Only 10 bytes were allocated, but 16 bytes were used, in order to align `3u64` on 8
assert_eq!(allocator.used_bytes(), 16);

allocator.reset();

let mut vec = Vec::<u64>::with_capacity_in(2, &allocator);

// Allocate something else, so `vec`'s allocation is not the most recent
allocator.alloc(123u64);

// `vec` has to grow beyond it's initial capacity
vec.extend([1, 2, 3, 4]);

// `vec` takes up 32 bytes, and `123u64` takes up 8 bytes = 40 total.
// But there's an additional 16 bytes consumed for `vec`'s original capacity of 2,
// which is still using up space
assert_eq!(allocator.used_bytes(), 56);

Trait Implementations§

Source§

impl Allocator for &Allocator

Source§

fn allocate(&self, layout: Layout) -> Result<NonNull<[u8]>, AllocError>

Attempts to allocate a block of memory. Read more
Source§

unsafe fn deallocate(&self, ptr: NonNull<u8>, layout: Layout)

Deallocates the memory referenced by ptr. Read more
Source§

unsafe fn shrink( &self, ptr: NonNull<u8>, old_layout: Layout, new_layout: Layout, ) -> Result<NonNull<[u8]>, AllocError>

Attempts to shrink the memory block. Read more
Source§

unsafe fn grow( &self, ptr: NonNull<u8>, old_layout: Layout, new_layout: Layout, ) -> Result<NonNull<[u8]>, AllocError>

Attempts to extend the memory block. Read more
Source§

unsafe fn grow_zeroed( &self, ptr: NonNull<u8>, old_layout: Layout, new_layout: Layout, ) -> Result<NonNull<[u8]>, AllocError>

Behaves like grow, but also ensures that the new contents are set to zero before being returned. Read more
Source§

fn allocate_zeroed(&self, layout: Layout) -> Result<NonNull<[u8]>, AllocError>

Behaves like allocate, but also ensures that the returned memory is zero-initialized. Read more
Source§

fn by_ref(&self) -> &Self
where Self: Sized,

Creates a “by reference” adapter for this instance of Allocator. Read more
Source§

impl Default for Allocator

Source§

fn default() -> Allocator

Returns the “default value” for a type. Read more
Source§

impl Send for Allocator

SAFETY: Not actually safe, but for enabling Send for downstream crates.

Source§

impl Sync for Allocator

SAFETY: Not actually safe, but for enabling Sync for downstream crates.

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<'a, T> FromIn<'a, T> for T

Source§

fn from_in(t: T, _: &'a Allocator) -> T

Converts to this type from the input type within the given allocator.
Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<'a, T, U> IntoIn<'a, U> for T
where U: FromIn<'a, T>,

Source§

fn into_in(self, allocator: &'a Allocator) -> U

Converts this type into the (usually inferred) input type within the given allocator.
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.