Expand description
§mem_dbg
Traits and associated procedural macros to inspect recursively the memory usage and layout of a value.
The trait MemSize can be used to compute the overall memory usage of a value
in bytes; the standard library function std::mem::size_of returns the
stack size of a type in bytes, but it does not take into consideration heap
memory. We provide implementations for most basic types, a derive macro for
structs and enums whose fields implement MemSize, and support for a few other
crates via optional features.
The trait MemDbg, which depends on MemSize, can be used to display the
recursive layout of a value, together with the size of each part and the
associated padding bytes.
§Why MemSize
Other traits partially provide the functionality of MemSize, but either they
require implementing manually a trait, which is prone to error, or they do not
provide the flexibility necessary for MemDbg. Most importantly, MemSize
uses the type system to avoid iterating over the content of a container (a
vector, etc.) when it is not necessary, making it possible to compute instantly
the size of values occupying hundreds of gigabytes of heap memory.
This is the result of the benchmark btree_set_comp contained in the examples
directory. It builds a B-tree set with a hundred million usize entries and
then measures its heap size:
Allocated: 3428571500
get_size: 1600000024 349387500 ns
deep_size_of: 1800000024 284149583 ns
mem_size: 3416666554 41 nsThe first line is the number of bytes allocated by the program as returned by
cap. Then, we display the result of get-size, deepsize, and our own
MemSize. Note that the first two crates are just measuring the space used by
the items, and not by the data structure (i.e., they are not taking into account
the load factor and the power-of-two size constraint of the hash map). Moreover,
all other crates are about six orders of magnitude slower than our
implementation, due to the necessity to iterate over all elements.
In general, while the size estimation of BTreeSet, BTreeMap, HashSet,
and HashMap is heuristic in all libraries, mem_dbg is significantly more
precise, and takes into account the load factor and the power-of-two size
constraint of the hash table.
The following table compares the MemSize trait from this crate against the
crates deepsize and get-size. The true memory usage (0% error) is
again calculated using the allocator from the cap crate.
| Type | Container | Crate | Error (%) | Type | Container | Crate | Error (%) |
|---|---|---|---|---|---|---|---|
| usize | BTreeMap | deep_size_of | 36.51 ± 20.88 | String | BTreeMap | deep_size_of | 21.53 ± 8.88 |
get_size | 42.01 ± 21.73 | get_size | 16.04 ± 7.74 | ||||
mem_size | 1.25 ± 2.59 | mem_size | 3.17 ± 1.45 | ||||
| BTreeSet | deep_size_of | 28.18 ± 12.41 | BTreeSet | deep_size_of | 18.58 ± 8.51 | ||
get_size | 46.91 ± 23.70 | get_size | 17.54 ± 8.38 | ||||
mem_size | 1.57 ± 2.97 | mem_size | 3.73 ± 2.65 | ||||
| HashMap | deep_size_of | 15.64 ± 6.34 | HashMap | deep_size_of | 6.12 ± 3.42 | ||
get_size | 15.64 ± 6.34 | get_size | 6.12 ± 3.42 | ||||
mem_size | 0.00 ± 0.00 | mem_size | 0.62 ± 1.30 | ||||
| HashSet | deep_size_of | 19.57 ± 7.91 | HashSet | deep_size_of | 6.91 ± 3.82 | ||
get_size | 19.57 ± 7.91 | get_size | 6.91 ± 3.82 | ||||
mem_size | 0.00 ± 0.00 | mem_size | 1.13 ± 2.32 |
§References
Two flags, SizeFlags::FOLLOW_REFS and DbgFlags::FOLLOW_REFS, make it
possible to follow references when computing the size or displaying the layout
of a value. Analogously, SizeFlags::FOLLOW_RC and DbgFlags::FOLLOW_RC
make it possible to follow Rc/Arc smart pointers.
§Padding
The trait MemDbg is useful to display the layout of a value and understand
how much memory is used by each part. In particular, it exploits the new stable
macro std::mem::offset_of to display the padding of each field in square
brackets; moreover, the flag DbgFlags::RUST_LAYOUT makes it possible to
display structures in the layout used by the Rust compiler, rather than
that given by declaration order.
These features are also available for enums using the feature offset_of_enum,
which however needs the nightly compiler, as it enables the unstable feature
offset_of_enum.
§Features
std: enables the use of the standard library; this is enabled by default.derive: enables the derive macrosMemSizeandMemDbg; this is enabled by default.offset_of_enum: support for padding and for theDbgFlags::RUST_LAYOUTflag for enums. Requires the nightly compiler as it enables the unstable featureoffset_of_enum. Callingmem_dbgwith the flagDbgFlags::RUST_LAYOUTwithout this feature enabled will result in a panic.half: support for thehalfcrate.maligned: support for themalignedcrate.mmap-rs: support for themmap-rscrate.rand: support for therandcrate.
§Examples
This is an example program using MemSize and MemDbg. Note that we cannot
visualize the effect of the useful DbgFlags::COLOR flag, which colorizes
sizes depending on their magnitude.
use mem_dbg::*;
#[derive(MemSize, MemDbg)]
struct Struct<A, B> {
a: A,
b: B,
test: isize,
}
#[derive(MemSize, MemDbg)]
struct Data<A> {
a: A,
b: Vec<i32>,
c: (u8, String),
}
#[derive(MemSize, MemDbg)]
union SingletonUnion<A: Copy> {
a: A
}
#[derive(MemSize, MemDbg)]
enum TestEnum {
Unit,
Unit2(),
Unit3 {},
Union(SingletonUnion<u8>),
Unnamed(usize, u8),
Named { first: usize, second: u8 },
}
let b = Vec::with_capacity(100);
let s = Struct {
a: TestEnum::Unnamed(0, 16),
b: Data {
a: vec![0x42_u8; 700],
b,
c: (1, "foo".to_owned()),
},
test: -0xbadf00d,
};
println!("size: {}", s.mem_size(SizeFlags::default()));
println!("capacity: {}", s.mem_size(SizeFlags::CAPACITY));
println!();
s.mem_dbg(DbgFlags::empty())?;
println!();
println!("size: {}", s.mem_size(SizeFlags::default()));
println!("capacity: {}", s.mem_size(SizeFlags::CAPACITY));
println!();
s.mem_dbg(DbgFlags::default() | DbgFlags::CAPACITY | DbgFlags::HUMANIZE)?;
#[cfg(feature = "offset_of_enum")]
{
println!();
println!("size: {}", s.mem_size(SizeFlags::default()));
println!("capacity: {}", s.mem_size(SizeFlags::CAPACITY));
println!();
s.mem_dbg(DbgFlags::empty() | DbgFlags::RUST_LAYOUT)?;
}The previous program prints:
size: 807
capacity: 1207
807 B ⏺
16 B ├╴a
│ ├╴Variant: Unnamed
8 B │ ├╴0
1 B │ ╰╴1
783 B ├╴b
724 B │ ├╴a
24 B │ ├╴b
35 B │ ╰╴c
1 B │ ├╴0 [7B]
27 B │ ╰╴1
8 B ╰╴test
size: 807
capacity: 1207
1.207 kB 100.00% ⏺: readme::main::Struct<readme::main::TestEnum, readme::main::Data<alloc::vec::Vec<u8>>>
16 B 1.33% ├╴a: readme::main::TestEnum
│ ├╴Variant: Unnamed
8 B 0.66% │ ├╴0: usize
1 B 0.08% │ ╰╴1: u8
1.183 kB 98.01% ├╴b: readme::main::Data<alloc::vec::Vec<u8>>
724 B 59.98% │ ├╴a: alloc::vec::Vec<u8>
424 B 35.13% │ ├╴b: alloc::vec::Vec<i32>
35 B 2.90% │ ╰╴c: (u8, alloc::string::String)
1 B 0.08% │ ├╴0: u8 [7B]
27 B 2.24% │ ╰╴1: alloc::string::String
8 B 0.66% ╰╴test: isizeIf run with the feature offset_of_enum, it prints:
size: 807
capacity: 1207
807 B ⏺
16 B ├╴a
│ ├╴Variant: Unnamed
8 B │ ├╴0
1 B │ ╰╴1 [6B]
783 B ├╴b
724 B │ ├╴a
24 B │ ├╴b
35 B │ ╰╴c
1 B │ ├╴0 [7B]
27 B │ ╰╴1
8 B ╰╴test
size: 807
capacity: 1207
1.207 kB 100.00% ⏺: readme::main::Struct<readme::main::TestEnum, readme::main::Data<alloc::vec::Vec<u8>>>
16 B 1.33% ├╴a: readme::main::TestEnum
│ ├╴Variant: Unnamed
8 B 0.66% │ ├╴0: usize
1 B 0.08% │ ╰╴1: u8 [6B]
1.183 kB 98.01% ├╴b: readme::main::Data<alloc::vec::Vec<u8>>
724 B 59.98% │ ├╴a: alloc::vec::Vec<u8>
424 B 35.13% │ ├╴b: alloc::vec::Vec<i32>
35 B 2.90% │ ╰╴c: (u8, alloc::string::String)
1 B 0.08% │ ├╴0: u8 [7B]
27 B 2.24% │ ╰╴1: alloc::string::String
8 B 0.66% ╰╴test: isize
size: 807
capacity: 1207
807 B ⏺
783 B ├╴b
724 B │ ├╴a
24 B │ ├╴b
35 B │ ╰╴c
1 B │ ├╴0 [7B]
27 B │ ╰╴1
16 B ├╴a
│ ├╴Variant: Unnamed
1 B │ ├╴1 [6B]
8 B │ ╰╴0
8 B ╰╴test§Caveats
-
We support out-of-the-box most basic types, and tuples up to size ten. The derive macros
MemSize/MemDbgwill generate implementations for structs and enums whose fields implement the associated interface: if this is not the case (e.g., because of the orphan rule) one can implement the traits manually. -
If you invoke the methods of this crate on a shared reference, the compiler will automatically dereference it, and the method will be invoked on the referenced type:
use mem_dbg::*;
let mut x: [i32; 4] = [0, 0, 0, 0];
assert_eq!(
(&x).mem_size(SizeFlags::default()),
std::mem::size_of::<[i32; 4]>()
);
assert_eq!(
(&mut x).mem_size(SizeFlags::default()),
std::mem::size_of::<&mut [i32; 4]>()
);
assert_eq!(
<&[i32; 4] as MemSize>::mem_size(&&x, SizeFlags::default()),
std::mem::size_of::<&[i32; 4]>()
);-
Computation of the size of arrays, slices, and vectors will be performed by iterating over their elements unless the type is a copy type that does not contain non-
'staticreferences and it is declared as such using the attribute#[copy_type]. SeeCopyTypefor more details. -
The content of vectors and slices is not expanded recursively as the output might be too complex; this might change in the future (e.g., via a flag) should interesting use cases arise.
-
Regarding
unions, we only support completely the special case of the single fieldunion, for which we implement both the derive macrosMemSize/MemDbg. For the more complex cases of unions with multiple fields, we only provide theMemSizederive macro with partial support, excluding support for theSizeFlags::FOLLOW_REFSflag. If full support for derive macrosMemSize/MemDbgin the case of an union with multiple fields, one can implement the traits manually.
Structs§
- DbgFlags
- Flags for
MemDbg. - False
- One of the two possible implementations of
Boolean. - Size
Flags - Flags for
MemSize. - True
- One of the two possible implementations of
Boolean.
Traits§
- Boolean
- Internal trait used within
CopyTypeto implementMemSizedepending on whether a type isCopyor not. - Copy
Type - Marker trait for copy types.
- MemDbg
- A trait providing methods to display recursively the content and size of a structure.
- MemDbg
Impl - Inner trait used to implement
MemDbg. - MemSize
- A trait to compute recursively the overall size or capacity of a structure,
as opposed to the stack size returned by
core::mem::size_of().
Functions§
- color
- Returns the color code corresponding to the size.
- humanize_
float - Given a size in bytes, returns it in a human readable format using SI suffixes.
- n_
of_ digits - Returns the number of digits of a number.
- reset_
color - Returns the color code to reset the color.
- type_
color - Returns the color used to print types.