Skip to main content

InternedStringMap

Struct InternedStringMap 

Source
pub struct InternedStringMap { /* private fields */ }
Expand description

The short-string intern table, shaped like C-Lua’s stringtable (lstring.c): power-of-two hash buckets of GcRef<LuaString> chained by Vec instead of u.hnext. Compared to the previous HashMap<Box<[u8]>, GcRef<LuaString>>:

  • lookup hashes the input bytes ONCE and never allocates (the entry-API shape boxed the key on every call — rejected experiment intern-hitpath-borrowed-lookup documents why a partial fix loses);
  • insert reuses the same hash, no second probe;
  • bytes are stored once (in the LuaString), not duplicated in a map key;
  • dead strings are removed O(dead) by (hash, identity) pairs collected during the GC mark phase, replacing the O(table·log live) sort + binary-search retain that dominated churn-heavy profiles (concat_chain 20260609T2201Z: intern machinery ~25% of wall).

Implementations§

Source§

impl InternedStringMap

Source

pub fn len(&self) -> usize

Source

pub fn is_empty(&self) -> bool

Source

pub fn bucket_count(&self) -> usize

Number of hash buckets currently allocated (the power-of-two table size, C’s strt.size). Exposed for the shrink-policy test, which asserts the array grows under a flood and shrinks back toward 64 once the interned strings are collected.

Source

pub fn find(&self, bytes: &[u8], hash: u32) -> Option<GcRef<LuaString>>

Source

pub fn insert(&mut self, s: GcRef<LuaString>)

C’s luaS_resize growth rule: keep average chain length ~1.

Source

pub fn shrink_if_sparse(&mut self)

C’s luaS_resize shrink path, driven by lgc.c:checkSizes (if (g->strt.nuse < g->strt.size / 4) luaS_resize(L, size/2)). Shrinks the bucket array when the live load factor falls below 25% (count * 4 < buckets.len()), down to next_power_of_two(count) floored at the initial 64 (C’s MINSTRTABSIZE). The 4× gap is hysteresis: a table at load factor just under 1.0 will not thrash, since the next grow-then-shrink cycle needs the population to drop fourfold first. Rehashing only relocates the surviving GcRef<LuaString> entries by their cached hash(); it never derefs a string, so it is safe to call from the post-mark/sweep GC hook AFTER all dead entries have been removed (only live refs remain).

Source

pub fn remove(&mut self, hash: u32, identity: usize)

O(dead): removes one entry located by its cached hash + GC identity.

Source

pub fn iter(&self) -> impl Iterator<Item = &GcRef<LuaString>>

Source

pub fn contains_key(&self, bytes: &[u8]) -> bool

Trait Implementations§

Source§

impl Default for InternedStringMap

Source§

fn default() -> Self

Returns the “default value” for a type. Read more

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.