pub struct Tensor { /* private fields */ }
High-performance multi-dimensional tensor with automatic differentiation support
The core data structure for machine learning operations, designed for maximum performance with zero-cost abstractions. Supports arbitrary dimensionality, SIMD optimization, gradient tracking, device placement, and natural mathematical expressions through operator overloading.
§Key Features
- Raw Pointer Storage: Zero-overhead memory access for maximum performance
- SIMD Optimization: AVX2 alignment and vectorized operations
- Memory Efficiency: Optimized alignment strategies for different tensor sizes
- gradtrack Integration: Built-in gradient tracking and computation
- Device Support: CPU and future CUDA device placement
- View Tensors: Zero-copy tensor views with shared memory management
- Thread Safety: Send + Sync implementation for concurrent usage
- Operator Overloading: Natural mathematical expressions (+, -, *, /, +=, -=, *=, /=)
§Memory Layout
Tensors use row-major memory layout with size-dependent alignment (see the sketch after this list):
- Small tensors (≤8 elements): 16-byte SSE alignment
- Medium tensors (8-1024 elements): 32-byte AVX2 alignment
- Large tensors (>1024 elements): 64-byte cache-line alignment
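The thresholds above can be summarized with a small helper. The function below is purely illustrative (it is not part of the crate API; the actual allocation logic is internal):
// Hypothetical helper mirroring the documented thresholds; illustration only
fn alignment_for(elements: usize) -> usize {
    if elements <= 8 {
        16 // SSE alignment for small tensors
    } else if elements <= 1024 {
        32 // AVX2 alignment for medium tensors
    } else {
        64 // cache-line alignment for large tensors
    }
}
assert_eq!(alignment_for(6), 16);
assert_eq!(alignment_for(32 * 32), 32);
assert_eq!(alignment_for(1000 * 1000), 64);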
§Performance Characteristics
- Memory Overhead: ~64 bytes per tensor (excluding data)
- SIMD Ready: Properly aligned for vectorized operations
- Cache Friendly: Optimized memory layout for CPU cache hierarchies
- Zero-Cost Views: View tensors share memory without copying
- Thread Safe: Atomic ID generation and lock-free operations
- Operator Performance: Zero-cost operator overloading for mathematical expressions
§Safety
This struct uses unsafe code for performance. The following invariants must be maintained:
- data must be valid for shape.size elements
- data must be properly aligned for f32
- data must not be aliased while the tensor exists
- shape.size must match the actual allocated memory
- allocation_owner must be valid if present
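These invariants are upheld automatically when tensors are created and initialized through the public API. A minimal sketch, assuming only the constructors shown in the examples on this page:
use train_station::Tensor;
// Prefer constructors that return initialized data
let zeros = Tensor::zeros(vec![2, 3]);
let from_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
// When new() is used, initialize the data before any read
let mut raw = Tensor::new(vec![2, 3]);
raw.fill(0.0);
assert_eq!(raw.size(), zeros.size());
assert_eq!(from_data.size(), 4);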
§Examples
§Basic Tensor Operations
use train_station::Tensor;
// Create tensors with different configurations
let tensor = Tensor::new(vec![2, 3]);
let tensor_with_grad = Tensor::ones(vec![10, 10]).with_requires_grad();
// Access tensor properties
assert_eq!(tensor.size(), 6);
assert_eq!(tensor.shape().dims(), vec![2, 3]);
assert!(tensor.is_contiguous());
§Operator Overloading
use train_station::Tensor;
// Create tensors for operations
let a = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let b = Tensor::from_slice(&[5.0, 6.0, 7.0, 8.0], vec![2, 2]).unwrap();
// Tensor operations with operators
let result = a.clone() + b.clone(); // Tensor addition
let result = a.clone() * b.clone(); // Element-wise multiplication
let result = a.clone() - b.clone(); // Tensor subtraction
let result = a.clone() / b.clone(); // Element-wise division
// Scalar operations
let result = a.clone() + 5.0; // Tensor + scalar
let result = 5.0 + a.clone(); // Scalar + tensor
let result = a.clone() * 3.0; // Tensor * scalar
let result = 3.0 * a.clone(); // Scalar * tensor
// Compound expressions
let result = (a.clone() + b.clone()) * 2.0 - 1.0; // Complex mathematical expressions
// Assignment operators
let mut c = a.clone();
c += b.clone(); // In-place addition
c *= 2.0; // In-place scalar multiplication
// Negation
let result = -a; // Negate all elements
§Thread Safety
This type is Send + Sync and can be safely shared between threads.
All operations are thread-safe through atomic ID generation and
thread-local gradtrack storage.
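A minimal sketch of concurrent read-only access, assuming only the constructors and accessors shown on this page: because the tensor is Send + Sync, an Arc-wrapped tensor can be read from several threads.
use std::sync::Arc;
use std::thread;
use train_station::Tensor;
// Share a read-only tensor across threads via Arc
let shared = Arc::new(Tensor::ones(vec![4, 4]));
let handles: Vec<_> = (0..4)
    .map(|_| {
        let tensor = Arc::clone(&shared);
        thread::spawn(move || tensor.size())
    })
    .collect();
for handle in handles {
    assert_eq!(handle.join().unwrap(), 16);
}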
Implementations§
impl Tensor
pub fn capacity_elems(&self) -> usize
Returns the allocated capacity in elements, which may be padded beyond logical size
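A brief illustrative sketch of the expected relationship between logical size and allocated capacity (the exact amount of padding is an implementation detail and may vary):
use train_station::Tensor;
let tensor = Tensor::new(vec![2, 3]);
// Logical size is 6 elements; the allocation may include alignment padding
assert_eq!(tensor.size(), 6);
assert!(tensor.capacity_elems() >= tensor.size());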
pub fn new(shape_dims: Vec<usize>) -> Self
Creates a new tensor with the specified shape and optimized memory layout
Allocates memory with size-dependent alignment for optimal performance:
- Small tensors (≤8 elements): 16-byte SSE alignment
- Medium tensors (8-1024 elements): 32-byte AVX2 alignment
- Large tensors (>1024 elements): 64-byte cache-line alignment
§Arguments
shape_dims - Vector of dimension sizes defining the tensor shape
§Returns
A new tensor with uninitialized data. The data must be initialized before use to avoid undefined behavior.
§Performance
- Memory Allocation: Single allocation with optimized alignment
- SIMD Ready: Properly aligned for vectorized operations
- Cache Friendly: Optimized for CPU cache hierarchies
- Thread Safe: Atomic ID generation for gradtrack tracking
§Safety
The returned tensor contains uninitialized memory. You must initialize the data before performing any operations that read from it.
§Examples
use train_station::Tensor;
// Create tensors of different sizes
let small_tensor = Tensor::new(vec![2, 3]); // 16-byte alignment
let medium_tensor = Tensor::new(vec![32, 32]); // 32-byte alignment
let large_tensor = Tensor::new(vec![1000, 1000]); // 64-byte alignment
// Initialize data before use
let mut tensor = Tensor::new(vec![2, 3]);
tensor.fill(0.0); // Initialize with zeros
Examples found in repository?
42fn demonstrate_tensor_creation() {
43 println!("--- Tensor Creation ---");
44
45 // Create tensors with different initializations
46 let zeros = Tensor::zeros(vec![2, 3]);
47 println!(
48 "Zeros tensor: shape {:?}, data: {:?}",
49 zeros.shape().dims(),
50 zeros.data()
51 );
52
53 let ones = Tensor::ones(vec![3, 2]);
54 println!(
55 "Ones tensor: shape {:?}, data: {:?}",
56 ones.shape().dims(),
57 ones.data()
58 );
59
60 // Create tensor from slice
61 let data = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0];
62 let from_slice = Tensor::from_slice(&data, vec![2, 3]).unwrap();
63 println!(
64 "From slice: shape {:?}, data: {:?}",
65 from_slice.shape().dims(),
66 from_slice.data()
67 );
68
69 // Create tensor with specific value
70 let mut filled = Tensor::new(vec![2, 2]);
71 {
72 let data = filled.data_mut();
73 for value in data.iter_mut() {
74 *value = 42.0;
75 }
76 }
77 println!("Filled with 42: {:?}", filled.data());
78
79 // Create tensor with random data
80 let random = Tensor::randn(vec![2, 2], Some(42));
81 println!(
82 "Random tensor: shape {:?}, data: {:?}",
83 random.shape().dims(),
84 random.data()
85 );
86}
pub fn shape(&self) -> &Shape
Returns the shape and dimensional information of the tensor
Provides access to the tensor’s dimensions, size, strides, and memory layout information. This is used for shape validation, memory access calculations, and optimization decisions.
§Returns
Reference to the tensor’s shape information containing dimensions, size, strides, and memory layout type.
§Performance
- Time Complexity: O(1) - direct field access
- Memory: No allocation - returns reference to existing data
§Examples
use train_station::Tensor;
let tensor = Tensor::new(vec![2, 3, 4]);
let shape = tensor.shape();
assert_eq!(shape.dims(), vec![2, 3, 4]);
assert_eq!(shape.size(), 24);
assert_eq!(shape.rank(), 3);
Examples found in repository?
77 fn triple(t: &Tensor) -> (usize, usize, usize) {
78 let d = t.shape().dims();
79 (d[0], d[1], d[2])
80 }
81}
82
83#[allow(unused)]
84fn main() -> Result<(), Box<dyn std::error::Error>> {
85 println!("=== Basic Decoder Example ===");
86
87 let batch = 2usize;
88 let src = 7usize;
89 let tgt = 5usize;
90 let embed = 32usize;
91 let heads = 4usize;
92
93 let memory = Tensor::randn(vec![batch, src, embed], Some(21));
94 let tgt_in = Tensor::randn(vec![batch, tgt, embed], Some(22));
95
96 let mut dec = DecoderBlock::new(embed, heads, Some(456));
97 let out = dec.forward(&tgt_in, &memory, None, None);
98 println!("Output shape: {:?}", out.shape().dims());
99
100 let mut opt = Adam::with_learning_rate(0.01);
101 let mut params = dec.parameters();
102 for p in &params {
103 opt.add_parameter(p);
104 }
105 let mut loss = out.mean();
106 loss.backward(None);
107 opt.step(&mut params);
108 opt.zero_grad(&mut params);
109 println!("Loss: {:.6}", loss.value());
110 println!("=== Done ===");
111 Ok(())
112}
More examples
66 fn triple(t: &Tensor) -> (usize, usize, usize) {
67 let d = t.shape().dims();
68 (d[0], d[1], d[2])
69 }
70}
71
72#[allow(unused)]
73fn main() -> Result<(), Box<dyn std::error::Error>> {
74 println!("=== Basic Encoder Example ===");
75
76 let batch = 2usize;
77 let seq = 6usize;
78 let embed = 32usize;
79 let heads = 4usize;
80
81 let input = Tensor::randn(vec![batch, seq, embed], Some(11));
82 let mut enc = EncoderBlock::new(embed, heads, Some(123));
83
84 // Example: no mask (set Some(mask) to use masking)
85 let out = enc.forward(&input, None);
86 println!("Output shape: {:?}", out.shape().dims());
87
88 // Verify gradients/optimization
89 let mut opt = Adam::with_learning_rate(0.01);
90 let mut params = enc.parameters();
91 for p in &params {
92 opt.add_parameter(p);
93 }
94 let mut loss = out.mean();
95 loss.backward(None);
96 opt.step(&mut params);
97 opt.zero_grad(&mut params);
98 println!("Loss: {:.6}", loss.value());
99 println!("=== Done ===");
100 Ok(())
101}
153fn demonstrate_layer_creation() {
154 println!("--- Layer Creation ---");
155
156 let layer = LinearLayer::new(3, 2, Some(42));
157
158 println!("Created linear layer:");
159 println!(" Input size: {}", layer.input_size);
160 println!(" Output size: {}", layer.output_size);
161 println!(" Parameter count: {}", layer.parameter_count());
162 println!(" Weight shape: {:?}", layer.weight.shape().dims());
163 println!(" Bias shape: {:?}", layer.bias.shape().dims());
164 println!(" Weight requires grad: {}", layer.weight.requires_grad());
165 println!(" Bias requires grad: {}", layer.bias.requires_grad());
166}
167
168/// Demonstrate forward pass with gradient tracking
169fn demonstrate_forward_pass() {
170 println!("\n--- Forward Pass (with gradients) ---");
171
172 let layer = LinearLayer::new(3, 2, Some(43));
173
174 // Single input
175 let input = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![1, 3]).unwrap();
176 let output = layer.forward(&input);
177
178 println!("Single input:");
179 println!(" Input: {:?}", input.data());
180 println!(" Output: {:?}", output.data());
181 println!(" Output requires grad: {}", output.requires_grad());
182
183 // Batch input
184 let batch_input = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
185 let batch_output = layer.forward(&batch_input);
186
187 println!("Batch input:");
188 println!(" Input shape: {:?}", batch_input.shape().dims());
189 println!(" Output shape: {:?}", batch_output.shape().dims());
190 println!(" Output requires grad: {}", batch_output.requires_grad());
191}
192
193/// Demonstrate forward pass without gradient tracking
194fn demonstrate_forward_pass_no_grad() {
195 println!("\n--- Forward Pass (no gradients) ---");
196
197 let layer = LinearLayer::new(3, 2, Some(44));
198
199 // Single input
200 let input = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![1, 3]).unwrap();
201 let output = layer.forward_no_grad(&input);
202
203 println!("Single input (no grad):");
204 println!(" Input: {:?}", input.data());
205 println!(" Output: {:?}", output.data());
206 println!(" Output requires grad: {}", output.requires_grad());
207
208 // Compare with grad version
209 let output_with_grad = layer.forward(&input);
210 println!("Comparison:");
211 println!(
212 " Same values: {}",
213 output.data() == output_with_grad.data()
214 );
215 println!(" No grad requires grad: {}", output.requires_grad());
216 println!(
217 " With grad requires grad: {}",
218 output_with_grad.requires_grad()
219 );
220}
221
222/// Demonstrate complete training loop
223fn demonstrate_training_loop() -> Result<(), Box<dyn std::error::Error>> {
224 println!("\n--- Training Loop ---");
225
226 // Create layer and training data
227 let mut layer = LinearLayer::new(2, 1, Some(45));
228
229 // Simple regression task: y = 2*x1 + 3*x2 + 1
230 let x_data = Tensor::from_slice(
231 &[
232 1.0, 1.0, // x1=1, x2=1 -> y=6
233 2.0, 1.0, // x1=2, x2=1 -> y=8
234 1.0, 2.0, // x1=1, x2=2 -> y=9
235 2.0, 2.0, // x1=2, x2=2 -> y=11
236 ],
237 vec![4, 2],
238 )
239 .unwrap();
240
241 let y_true = Tensor::from_slice(&[6.0, 8.0, 9.0, 11.0], vec![4, 1]).unwrap();
242
243 println!("Training data:");
244 println!(" X shape: {:?}", x_data.shape().dims());
245 println!(" Y shape: {:?}", y_true.shape().dims());
246 println!(" Target function: y = 2*x1 + 3*x2 + 1");
247
248 // Create optimizer
249 let config = AdamConfig {
250 learning_rate: 0.01,
251 beta1: 0.9,
252 beta2: 0.999,
253 eps: 1e-8,
254 weight_decay: 0.0,
255 amsgrad: false,
256 };
257
258 let mut optimizer = Adam::with_config(config);
259 let params = layer.parameters();
260 for param in &params {
261 optimizer.add_parameter(param);
262 }
263
264 println!("Optimizer setup complete. Starting training...");
265
266 // Training loop
267 let num_epochs = 100;
268 let mut losses = Vec::new();
269
270 for epoch in 0..num_epochs {
271 // Forward pass
272 let y_pred = layer.forward(&x_data);
273
274 // Compute loss: MSE
275 let diff = y_pred.sub_tensor(&y_true);
276 let mut loss = diff.pow_scalar(2.0).mean();
277
278 // Backward pass
279 loss.backward(None);
280
281 // Optimizer step
282 let mut params = layer.parameters();
283 optimizer.step(&mut params);
284 optimizer.zero_grad(&mut params);
285
286 losses.push(loss.value());
287
288 // Print progress
289 if epoch % 20 == 0 || epoch == num_epochs - 1 {
290 println!("Epoch {:3}: Loss = {:.6}", epoch, loss.value());
291 }
292 }
293
294 // Evaluate final model
295 let final_predictions = layer.forward_no_grad(&x_data);
296
297 println!("\nFinal model evaluation:");
298 println!(" Learned weights: {:?}", layer.weight.data());
299 println!(" Learned bias: {:?}", layer.bias.data());
300 println!(" Target weights: [2.0, 3.0]");
301 println!(" Target bias: [1.0]");
302
303 println!(" Predictions vs True:");
304 for i in 0..4 {
305 let pred = final_predictions.data()[i];
306 let true_val = y_true.data()[i];
307 println!(
308 " Sample {}: pred={:.3}, true={:.1}, error={:.3}",
309 i + 1,
310 pred,
311 true_val,
312 (pred - true_val).abs()
313 );
314 }
315
316 // Training analysis
317 let initial_loss = losses[0];
318 let final_loss = losses[losses.len() - 1];
319 let loss_reduction = (initial_loss - final_loss) / initial_loss * 100.0;
320
321 println!("\nTraining Analysis:");
322 println!(" Initial loss: {:.6}", initial_loss);
323 println!(" Final loss: {:.6}", final_loss);
324 println!(" Loss reduction: {:.1}%", loss_reduction);
325
326 Ok(())
327}
328
329/// Demonstrate single vs batch inference
330fn demonstrate_single_vs_batch_inference() {
331 println!("\n--- Single vs Batch Inference ---");
332
333 let layer = LinearLayer::new(4, 3, Some(46));
334
335 // Single inference
336 println!("Single inference:");
337 let single_input = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![1, 4]).unwrap();
338 let single_output = layer.forward_no_grad(&single_input);
339 println!(" Input shape: {:?}", single_input.shape().dims());
340 println!(" Output shape: {:?}", single_output.shape().dims());
341 println!(" Output: {:?}", single_output.data());
342
343 // Batch inference
344 println!("Batch inference:");
345 let batch_input = Tensor::from_slice(
346 &[
347 1.0, 2.0, 3.0, 4.0, // Sample 1
348 5.0, 6.0, 7.0, 8.0, // Sample 2
349 9.0, 10.0, 11.0, 12.0, // Sample 3
350 ],
351 vec![3, 4],
352 )
353 .unwrap();
354 let batch_output = layer.forward_no_grad(&batch_input);
355 println!(" Input shape: {:?}", batch_input.shape().dims());
356 println!(" Output shape: {:?}", batch_output.shape().dims());
357
358 // Verify batch consistency - first sample should match single inference
359 let _first_batch_sample = batch_output.view(vec![3, 3]); // Reshape to access first sample
360 let first_sample_data = &batch_output.data()[0..3]; // First 3 elements
361 let single_sample_data = single_output.data();
362
363 println!("Consistency check:");
364 println!(" Single output: {:?}", single_sample_data);
365 println!(" First batch sample: {:?}", first_sample_data);
366 println!(
367 " Match: {}",
368 single_sample_data
369 .iter()
370 .zip(first_sample_data.iter())
371 .all(|(a, b)| (a - b).abs() < 1e-6)
372 );
373}
- examples/iterators/element_iteration.rs
- examples/getting_started/tensor_basics.rs
- examples/getting_started/tensor_operators.rs
- examples/getting_started/optimizer_basics.rs
- examples/neural_networks/feedforward_network.rs
- examples/neural_networks/basic_transformer.rs
- examples/getting_started/serialization_basics.rs
- examples/neural_networks/multi_head_attention.rs
- examples/iterators/advanced_patterns.rs
pub fn size(&self) -> usize
Returns the total number of elements in the tensor
Provides the total count of elements across all dimensions. This is used for memory allocation, iteration bounds, and performance optimization.
§Returns
Total number of elements as usize
§Performance
- Time Complexity: O(1) - direct field access
- Memory: No allocation - returns stored value
§Examples
use train_station::Tensor;
let tensor = Tensor::new(vec![2, 3, 4]);
assert_eq!(tensor.size(), 24); // 2 * 3 * 4
let scalar = Tensor::new(vec![1]);
assert_eq!(scalar.size(), 1);
let empty = Tensor::new(vec![0]);
assert_eq!(empty.size(), 0);
Examples found in repository?
179fn demonstrate_utility_functions() {
180 println!("\n--- Utility Functions ---");
181
182 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
183
184 // Basic properties
185 println!("Shape: {:?}", tensor.shape().dims());
186 println!("Size: {}", tensor.size());
187 println!("Is contiguous: {}", tensor.is_contiguous());
188 println!("Device: {:?}", tensor.device());
189
190 // Mathematical operations
191 let sum = tensor.sum();
192 println!("Sum: {}", sum.value());
193
194 let mean = tensor.mean();
195 println!("Mean: {}", mean.value());
196
197 let norm = tensor.norm();
198 println!("Norm: {}", norm.value());
199
200 // Device placement
201 let cpu_tensor = Tensor::zeros_on_device(vec![3, 3], train_station::Device::cpu());
202 println!(
203 "CPU tensor: shape {:?}, device: {:?}",
204 cpu_tensor.shape().dims(),
205 cpu_tensor.device()
206 );
207}
More examples
162fn demonstrate_memory_optimization() -> Result<(), Box<dyn std::error::Error>> {
163 println!("\n--- Memory Optimization ---");
164
165 // Create a large tensor for memory testing
166 let size = 10000;
167 let data: Vec<f32> = (0..size).map(|i| i as f32).collect();
168 let tensor = Tensor::from_slice(&data, vec![size])?;
169
170 println!("Processing tensor of size: {}", size);
171
172 // Pattern 1: Streaming processing with iterator chunks (process in blocks, collect with shape)
173 println!("\nPattern 1: Streaming Processing");
174 let chunk_size = 1000;
175 let start = Instant::now();
176 let flattened = tensor.view(vec![size as i32]);
177 let _streamed_result: Tensor = flattened
178 .chunks(chunk_size)
179 .map(|c| c.pow_scalar(2.0).sqrt())
180 .collect_shape(vec![size]);
181 let streamed_time = start.elapsed();
182
183 // Pattern 2: Full processing
184 let start = Instant::now();
185 let _full_result: Tensor = tensor
186 .iter_elements()
187 .map(|elem| elem.pow_scalar(2.0).sqrt())
188 .collect_shape(vec![size]);
189 let full_time = start.elapsed();
190
191 println!(" Streaming time: {:?}", streamed_time);
192 println!(" Full processing time: {:?}", full_time);
193 println!(
194 " Memory efficiency ratio: {:.2}x",
195 full_time.as_nanos() as f64 / streamed_time.as_nanos() as f64
196 );
197
198 // Pattern 3: Lazy evaluation with take
199 println!("\nPattern 2: Lazy Evaluation");
200 let start = Instant::now();
201 let lazy_result: Tensor = tensor
202 .iter_elements()
203 .take(1000) // Only process first 1000 elements
204 .map(|elem| elem.pow_scalar(2.0).sqrt())
205 .collect_shape(vec![1000]);
206 let lazy_time = start.elapsed();
207
208 println!(" Lazy processing (1000 elements): {:?}", lazy_time);
209 println!(" Lazy result size: {}", lazy_result.size());
210
211 // Pattern 4: Memory-efficient filtering
212 println!("\nPattern 3: Memory-Efficient Filtering");
213 let start = Instant::now();
214 let filtered_result: Tensor = tensor
215 .iter_elements()
216 .filter(|elem| elem.value() > size as f32 / 2.0) // Keep only large values
217 .map(|elem| elem.mul_scalar(2.0))
218 .collect();
219 let filtered_time = start.elapsed();
220
221 println!(" Filtered processing: {:?}", filtered_time);
222 println!(
223 " Filtered result size: {} (reduced from {})",
224 filtered_result.size(),
225 size
226 );
227
228 Ok(())
229}
230
231/// Demonstrate large-scale processing techniques
232///
233/// Shows how to efficiently process very large datasets using
234/// iterator patterns and optimization strategies.
235fn demonstrate_large_scale_processing() -> Result<(), Box<dyn std::error::Error>> {
236 println!("\n--- Large-Scale Processing ---");
237
238 // Simulate large dataset processing
239 let sizes = vec![10000, 50000, 100000];
240
241 for size in sizes {
242 println!("\nProcessing dataset of size: {}", size);
243
244 // Generate large dataset
245 let data: Vec<f32> = (0..size)
246 .map(|i| {
247 let x = i as f32 / size as f32;
248 x * x + 0.1 * (i % 10) as f32 // Quadratic with noise
249 })
250 .collect();
251
252 let tensor = Tensor::from_slice(&data, vec![size])?;
253
254 // Technique 1: Batch processing
255 let batch_size = 1000;
256 let start = Instant::now();
257
258 let mut batch_results = Vec::new();
259 for batch_start in (0..size).step_by(batch_size) {
260 let batch_end = (batch_start + batch_size).min(size);
261 let batch: Tensor = tensor
262 .iter_range(batch_start, batch_end)
263 .map(|elem| elem.pow_scalar(2.0).add_scalar(1.0))
264 .collect();
265 batch_results.push(batch);
266 }
267 let batch_time = start.elapsed();
268
269 // Technique 2: Parallel-like processing with stride
270 let start = Instant::now();
271 let stride = 4;
272 let strided_result: Tensor = tensor
273 .iter()
274 .enumerate()
275 .filter(|(i, _)| i % stride == 0)
276 .map(|(_, elem)| elem.pow_scalar(2.0).add_scalar(1.0))
277 .collect();
278 let strided_time = start.elapsed();
279
280 // Technique 3: Hierarchical processing
281 let start = Instant::now();
282 let coarse: Tensor = tensor
283 .iter()
284 .enumerate()
285 .filter(|(i, _)| i % 10 == 0) // Every 10th element
286 .map(|(_, elem)| elem.pow_scalar(2.0).add_scalar(1.0))
287 .collect();
288 let fine: Tensor = tensor
289 .iter()
290 .enumerate()
291 .filter(|(i, _)| i % 10 != 0) // Rest of elements
292 .map(|(_, elem)| elem.pow_scalar(1.5).add_scalar(0.5))
293 .collect();
294 let hierarchical_time = start.elapsed();
295
296 // Report performance
297 println!(" Batch processing: {:?}", batch_time);
298 println!(" Strided processing: {:?}", strided_time);
299 println!(" Hierarchical processing: {:?}", hierarchical_time);
300
301 // Memory usage analysis
302 let total_batches = size.div_ceil(batch_size);
303 println!(" Batch count: {}", total_batches);
304 println!(" Strided result size: {}", strided_result.size());
305 println!(
306 " Hierarchical: coarse={}, fine={}",
307 coarse.size(),
308 fine.size()
309 );
310 }
311
312 Ok(())
313}
314
315/// Demonstrate advanced optimization techniques
316///
317/// Shows sophisticated optimization strategies and techniques
318/// for maximizing performance in tensor iterator operations.
319fn demonstrate_optimization_techniques() -> Result<(), Box<dyn std::error::Error>> {
320 println!("\n--- Optimization Techniques ---");
321
322 let size = 50000;
323 let data: Vec<f32> = (0..size).map(|i| i as f32).collect();
324 let tensor = Tensor::from_slice(&data, vec![size])?;
325
326 println!("Optimizing processing for size: {}", size);
327
328 // Technique 1: Operation fusion
329 println!("\nTechnique 1: Operation Fusion");
330 let start = Instant::now();
331 let fused_result: Tensor = tensor
332 .iter()
333 .map(|elem| {
334 // Fuse multiple operations into single chain
335 elem.mul_scalar(2.0).add_scalar(1.0).pow_scalar(2.0).sqrt()
336 })
337 .collect();
338 let fused_time = start.elapsed();
339
340 // Technique 2: Conditional optimization
341 println!("\nTechnique 2: Conditional Optimization");
342 let start = Instant::now();
343 let conditional_result: Tensor = tensor
344 .iter()
345 .map(|elem| {
346 let val = elem.value();
347 if val < size as f32 / 2.0 {
348 elem.mul_scalar(2.0) // Simple operation for small values
349 } else {
350 elem.pow_scalar(2.0).sqrt() // Complex operation for large values
351 }
352 })
353 .collect();
354 let conditional_time = start.elapsed();
355
356 // Technique 3: Cache-friendly processing
357 println!("\nTechnique 3: Cache-Friendly Processing");
358 let start = Instant::now();
359 let cache_friendly_result: Tensor = tensor
360 .iter()
361 .take(1000) // Process in cache-friendly chunks
362 .map(|elem| elem.mul_scalar(2.0))
363 .collect();
364 let cache_friendly_time = start.elapsed();
365
366 // Technique 4: Memory pooling simulation
367 println!("\nTechnique 4: Memory Pooling Simulation");
368 let start = Instant::now();
369 let pooled_result: Tensor = tensor
370 .iter()
371 .enumerate()
372 .filter(|(i, _)| i % 100 == 0) // Process every 100th element
373 .map(|(_, elem)| elem.pow_scalar(2.0))
374 .collect();
375 let pooled_time = start.elapsed();
376
377 // Report optimization results
378 println!(" Fused operations: {:?}", fused_time);
379 println!(" Conditional optimization: {:?}", conditional_time);
380 println!(" Cache-friendly processing: {:?}", cache_friendly_time);
381 println!(" Memory pooling simulation: {:?}", pooled_time);
382
383 // Performance analysis
384 let fastest = fused_time
385 .min(conditional_time)
386 .min(cache_friendly_time)
387 .min(pooled_time);
388 println!(" Fastest technique: {:?}", fastest);
389
390 // Memory efficiency analysis
391 println!(" Fused result size: {}", fused_result.size());
392 println!(" Conditional result size: {}", conditional_result.size());
393 println!(
394 " Cache-friendly result size: {}",
395 cache_friendly_result.size()
396 );
397 println!(" Pooled result size: {}", pooled_result.size());
398
399 // Technique 5: Gradient optimization
400 println!("\nTechnique 5: Gradient Optimization");
401 let grad_tensor = tensor.with_requires_grad();
402 let start = Instant::now();
403
404 let grad_result: Tensor = grad_tensor
405 .iter()
406 .map(|elem| elem.pow_scalar(2.0).add_scalar(1.0))
407 .collect();
408
409 let mut loss = grad_result.sum();
410 loss.backward(None);
411 let grad_time = start.elapsed();
412
413 println!(" Gradient computation: {:?}", grad_time);
414 println!(
415 " Gradient tracking enabled: {}",
416 grad_result.requires_grad()
417 );
418
419 Ok(())
420}
87fn demonstrate_data_pipeline() -> Result<(), Box<dyn std::error::Error>> {
88 println!("\n--- Data Processing Pipeline ---");
89
90 // Simulate raw sensor data with noise
91 let raw_data: Vec<f32> = (0..20)
92 .map(|i| {
93 let base = i as f32 * 0.5;
94 let noise = (i % 3) as f32 * 0.1;
95 base + noise
96 })
97 .collect();
98
99 let tensor = Tensor::from_slice(&raw_data, vec![20])?;
100 println!("Raw sensor data: {:?}", tensor.data());
101
102 // Multi-stage processing pipeline
103 println!("\nProcessing pipeline:");
104 println!("1. Normalize data (z-score)");
105 println!("2. Apply smoothing filter");
106 println!("3. Detect outliers");
107 println!("4. Apply feature scaling");
108
109 // Stage 1: Normalization
110 let mean = tensor.mean().value();
111 let std = tensor.std().value();
112 let normalized: Tensor = tensor
113 .iter()
114 .map(|elem| elem.sub_scalar(mean).div_scalar(std))
115 .collect();
116 println!(
117 " Normalized (mean={:.3}, std={:.3}): {:?}",
118 mean,
119 std,
120 normalized.data()
121 );
122
123 // Stage 2: Smoothing (simple moving average)
124 let smoothed: Tensor = normalized
125 .iter()
126 .enumerate()
127 .map(|(i, elem)| {
128 if i == 0 || i == normalized.size() - 1 {
129 elem.clone()
130 } else {
131 // Simple 3-point average
132 let prev = normalized.element_view(i - 1);
133 let next = normalized.element_view(i + 1);
134 elem.add_tensor(&prev).add_tensor(&next).div_scalar(3.0)
135 }
136 })
137 .collect();
138 println!(" Smoothed: {:?}", smoothed.data());
139
140 // Stage 3: Outlier detection and removal
141 let outlier_threshold = 2.0;
142 let cleaned: Tensor = smoothed
143 .iter()
144 .filter(|elem| elem.value().abs() < outlier_threshold)
145 .collect();
146 println!(
147 " Outliers removed (threshold={}): {:?}",
148 outlier_threshold,
149 cleaned.data()
150 );
151
152 // Stage 4: Feature scaling to [0, 1] range
153 let min_val = cleaned
154 .iter()
155 .map(|e| e.value())
156 .fold(f32::INFINITY, f32::min);
157 let max_val = cleaned
158 .iter()
159 .map(|e| e.value())
160 .fold(f32::NEG_INFINITY, f32::max);
161 let scaled: Tensor = cleaned
162 .iter()
163 .map(|elem| elem.sub_scalar(min_val).div_scalar(max_val - min_val))
164 .collect();
165 println!(" Scaled to [0,1]: {:?}", scaled.data());
166
167 Ok(())
168}
169
170/// Demonstrate conditional processing patterns
171///
172/// Shows how to implement dynamic filtering and transformation
173/// based on data characteristics and conditions.
174fn demonstrate_conditional_processing() -> Result<(), Box<dyn std::error::Error>> {
175 println!("\n--- Conditional Processing ---");
176
177 // Create data with mixed characteristics
178 let data = vec![1.0, -2.0, 3.0, -4.0, 5.0, -6.0, 7.0, -8.0, 9.0, -10.0];
179 let tensor = Tensor::from_slice(&data, vec![10])?;
180 println!("Input data: {:?}", tensor.data());
181
182 // Conditional transformation based on sign
183 println!("\nConditional transformation (positive/negative handling):");
184 let processed: Tensor = tensor
185 .iter()
186 .map(|elem| {
187 let val = elem.value();
188 if val > 0.0 {
189 elem.pow_scalar(2.0) // Square positive values
190 } else {
191 elem.mul_scalar(-1.0).sqrt() // Square root of absolute negative values
192 }
193 })
194 .collect();
195 println!(" Processed: {:?}", processed.data());
196
197 // Adaptive filtering based on local statistics
198 println!("\nAdaptive filtering (remove values > 2 std from local mean):");
199 let window_size = 3;
200 let adaptive_filtered: Tensor = tensor
201 .iter()
202 .enumerate()
203 .filter(|(i, elem)| {
204 let start = i.saturating_sub(window_size / 2);
205 let end = (i + window_size / 2 + 1).min(tensor.size());
206
207 // Calculate local mean and std
208 let local_values: Vec<f32> = (start..end)
209 .map(|j| tensor.element_view(j).value())
210 .collect();
211
212 let local_mean = local_values.iter().sum::<f32>() / local_values.len() as f32;
213 let local_variance = local_values
214 .iter()
215 .map(|v| (v - local_mean).powi(2))
216 .sum::<f32>()
217 / local_values.len() as f32;
218 let local_std = local_variance.sqrt();
219
220 let threshold = local_mean + 2.0 * local_std;
221 elem.value() <= threshold
222 })
223 .map(|(_, elem)| elem)
224 .collect();
225 println!(" Adaptive filtered: {:?}", adaptive_filtered.data());
226
227 // Multi-condition processing
228 println!("\nMulti-condition processing:");
229 let multi_processed: Tensor = tensor
230 .iter()
231 .map(|elem| {
232 let val = elem.value();
233 match () {
234 _ if val > 5.0 => elem.mul_scalar(2.0), // Double large values
235 _ if val < -5.0 => elem.div_scalar(2.0), // Halve small values
236 _ if val.abs() < 2.0 => elem.add_scalar(1.0), // Add 1 to small values
237 _ => elem.clone(), // Keep others unchanged
238 }
239 })
240 .collect();
241 println!(" Multi-condition: {:?}", multi_processed.data());
242
243 Ok(())
244}
245
246/// Demonstrate batch processing operations
247///
248/// Shows efficient processing of large datasets using iterator
249/// patterns and batch operations for performance optimization.
250fn demonstrate_batch_operations() -> Result<(), Box<dyn std::error::Error>> {
251 println!("\n--- Batch Operations ---");
252
253 // Create a larger dataset for batch processing
254 let size = 100;
255 let data: Vec<f32> = (0..size)
256 .map(|i| {
257 let x = i as f32 / size as f32;
258 x * x + 0.1 * (i % 7) as f32 // Quadratic with some noise
259 })
260 .collect();
261
262 let tensor = Tensor::from_slice(&data, vec![size])?;
263 println!("Dataset size: {}", tensor.size());
264
265 // Batch processing with windowing (iterator views)
266 println!("\nBatch processing with sliding windows:");
267 let batch_size = 10;
268 let batches: Vec<Tensor> = tensor
269 .iter()
270 .collect::<Vec<_>>()
271 .chunks(batch_size)
272 .map(|chunk| {
273 // Process each batch independently
274 chunk
275 .iter()
276 .map(|elem| elem.pow_scalar(2.0).add_scalar(1.0))
277 .collect()
278 })
279 .collect();
280
281 println!(
282 " Processed {} batches of size {}",
283 batches.len(),
284 batch_size
285 );
286 for (i, batch) in batches.iter().enumerate() {
287 println!(
288 " Batch {}: mean={:.3}, std={:.3}",
289 i,
290 batch.mean().value(),
291 batch.std().value()
292 );
293 }
294
295 // Parallel-like processing with stride
296 println!("\nStrided processing (every nth element):");
297 let stride = 5;
298 let strided: Tensor = tensor
299 .iter()
300 .enumerate()
301 .filter(|(i, _)| i % stride == 0)
302 .map(|(_, elem)| elem)
303 .collect();
304 println!(" Strided (every {}th): {:?}", stride, strided.data());
305
306 // Hierarchical processing
307 println!("\nHierarchical processing (coarse to fine):");
308 let coarse: Tensor = tensor
309 .iter()
310 .enumerate()
311 .filter(|(i, _)| i % 4 == 0) // Take every 4th element
312 .map(|(_, elem)| elem)
313 .collect();
314
315 let fine: Tensor = tensor
316 .iter()
317 .enumerate()
318 .filter(|(i, _)| i % 4 != 0) // Take the rest
319 .map(|(_, elem)| elem)
320 .collect();
321
322 println!(" Coarse (every 4th): {:?}", coarse.data());
323 println!(" Fine (rest): {:?}", fine.data());
324
325 // Combine coarse and fine with different processing
326 let combined: Tensor = coarse
327 .iter()
328 .map(|elem| elem.mul_scalar(2.0)) // Scale coarse
329 .chain(fine.iter().map(|elem| elem.div_scalar(2.0))) // Scale fine
330 .collect();
331 println!(" Combined: {:?}", combined.data());
332
333 Ok(())
334}
335
336/// Demonstrate real-world processing scenarios
337///
338/// Shows practical applications of iterator patterns for
339/// common data processing tasks in machine learning and analytics.
340fn demonstrate_real_world_scenarios() -> Result<(), Box<dyn std::error::Error>> {
341 println!("\n--- Real-world Scenarios ---");
342
343 // Scenario 1: Time series analysis
344 println!("\nScenario 1: Time Series Analysis");
345 let time_series: Vec<f32> = (0..24)
346 .map(|hour| {
347 let base = 20.0 + 10.0 * (hour as f32 * std::f32::consts::PI / 12.0).sin();
348 base + (hour % 3) as f32 * 2.0 // Add some noise
349 })
350 .collect();
351
352 let series = Tensor::from_slice(&time_series, vec![24])?;
353 println!(" Time series (24 hours): {:?}", series.data());
354
355 // Calculate moving average with view-based iteration
356 let window_size = 3;
357 let moving_avg: Tensor = series
358 .iter()
359 .enumerate()
360 .map(|(i, _)| {
361 let start = i.saturating_sub(window_size / 2);
362 let end = (i + window_size / 2 + 1).min(series.size());
363 let window = series.iter_range(start, end);
364 window.fold(0.0, |acc, elem| acc + elem.value()) / (end - start) as f32
365 })
366 .map(|val| Tensor::from_slice(&[val], vec![1]).unwrap())
367 .collect();
368 println!(
369 " Moving average (window={}): {:?}",
370 window_size,
371 moving_avg.data()
372 );
373
374 // Inference pipeline with NoGrad + streaming
375 println!("\nInference pipeline (NoGrad + streaming)");
376 let features = Tensor::from_slice(
377 &(0..48).map(|i| i as f32 * 0.125).collect::<Vec<_>>(),
378 vec![6, 8],
379 )?;
380 let fast = with_no_grad(|| {
381 // Stream values directly, apply light affine, and collect back to same shape
382 features
383 .data()
384 .iter()
385 .copied()
386 .map(|x| 0.75 * x + 0.1)
387 .collect_shape(vec![6, 8])
388 });
389 println!(
390 " NoGrad streamed transform shape: {:?}",
391 fast.shape().dims()
392 );
393
394 // Row-wise iteration with shape-preserving collection (GradTrack-friendly)
395 let per_row: Tensor = features
396 .iter()
397 .map(|row| row.mul_scalar(0.5).add_scalar(2.0))
398 .collect_shape(vec![6, 8]);
399 println!(" Row-wise mapped shape: {:?}", per_row.shape().dims());
400
401 // Scenario 2: Feature engineering
402 println!("\nScenario 2: Feature Engineering");
403 let features = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5])?;
404 println!(" Original features: {:?}", features.data());
405
406 // Create polynomial features
407 let poly_features: Tensor = features
408 .iter()
409 .flat_map(|elem| {
410 vec![
411 elem.clone(), // x^1
412 elem.pow_scalar(2.0), // x^2
413 elem.pow_scalar(3.0), // x^3
414 ]
415 })
416 .collect();
417 println!(
418 " Polynomial features (x, x^2, x^3): {:?}",
419 poly_features.data()
420 );
421
422 // Scenario 3: Data augmentation
423 println!("\nScenario 3: Data Augmentation");
424 let original = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3])?;
425 println!(" Original data: {:?}", original.data());
426
427 // Augment with noise and scaling
428 let augmented: Tensor = original
429 .iter()
430 .flat_map(|elem| {
431 vec![
432 elem.clone(), // Original
433 elem.add_scalar(0.1), // Add noise
434 elem.sub_scalar(0.1), // Subtract noise
435 elem.mul_scalar(1.1), // Scale up
436 elem.mul_scalar(0.9), // Scale down
437 ]
438 })
439 .collect();
440 println!(" Augmented data: {:?}", augmented.data());
441
442 // Scenario 4: Statistical analysis
443 println!("\nScenario 4: Statistical Analysis");
444 let sample_data = Tensor::from_slice(&[1.1, 2.3, 1.8, 2.1, 1.9, 2.0, 1.7, 2.2], vec![8])?;
445 println!(" Sample data: {:?}", sample_data.data());
446
447 // Calculate various statistics
448 let mean = sample_data.mean().value();
449 let std = sample_data.std().value();
450 let min = sample_data
451 .iter()
452 .map(|e| e.value())
453 .fold(f32::INFINITY, f32::min);
454 let max = sample_data
455 .iter()
456 .map(|e| e.value())
457 .fold(f32::NEG_INFINITY, f32::max);
458
459 // Z-score normalization
460 let z_scores: Tensor = sample_data
461 .iter()
462 .map(|elem| elem.sub_scalar(mean).div_scalar(std))
463 .collect();
464
465 println!(
466 " Statistics: mean={:.3}, std={:.3}, min={:.3}, max={:.3}",
467 mean, std, min, max
468 );
469 println!(" Z-scores: {:?}", z_scores.data());
470
471 Ok(())
472}
pub fn device(&self) -> Device
Returns the device where this tensor is located
Provides the physical location of the tensor data (CPU/GPU). This determines which operations can be performed on the tensor and where computations will be executed.
§Returns
Device enum indicating the tensor’s physical location
§Performance
- Time Complexity: O(1) - direct field access
- Memory: No allocation - returns stored value
§Examples
use train_station::Tensor;
let tensor = Tensor::new(vec![2, 3]);
assert!(tensor.device().is_cpu());
assert!(!tensor.device().is_cuda());
Examples found in repository?
179fn demonstrate_utility_functions() {
180 println!("\n--- Utility Functions ---");
181
182 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
183
184 // Basic properties
185 println!("Shape: {:?}", tensor.shape().dims());
186 println!("Size: {}", tensor.size());
187 println!("Is contiguous: {}", tensor.is_contiguous());
188 println!("Device: {:?}", tensor.device());
189
190 // Mathematical operations
191 let sum = tensor.sum();
192 println!("Sum: {}", sum.value());
193
194 let mean = tensor.mean();
195 println!("Mean: {}", mean.value());
196
197 let norm = tensor.norm();
198 println!("Norm: {}", norm.value());
199
200 // Device placement
201 let cpu_tensor = Tensor::zeros_on_device(vec![3, 3], train_station::Device::cpu());
202 println!(
203 "CPU tensor: shape {:?}, device: {:?}",
204 cpu_tensor.shape().dims(),
205 cpu_tensor.device()
206 );
207}
pub fn new_on_device(shape_dims: Vec<usize>, device: Device) -> Self
Creates a new tensor with the specified shape on a specific device
Allocates memory on the specified device with the same optimized alignment
strategy as new(). Currently supports CPU device with future CUDA support.
§Arguments
shape_dims - Vector of dimension sizes defining the tensor shape
device - The device where the tensor should be allocated
§Returns
A new tensor with uninitialized data on the specified device
§Performance
- Memory Allocation: Device-specific allocation with optimized alignment
- SIMD Ready: Properly aligned for vectorized operations on target device
- Thread Safe: Atomic ID generation for gradtrack tracking
§Panics
Panics if the specified device is not supported (e.g., CUDA without feature flag)
§Examples
use train_station::Tensor;
let tensor = Tensor::new_on_device(vec![2, 3], train_station::Device::cpu());
assert!(tensor.device().is_cpu());
assert_eq!(tensor.size(), 6);
pub fn with_requires_grad(self) -> Self
Enable gradient computation for this tensor
Builder method that enables automatic gradient tracking for this tensor. When enabled, all operations involving this tensor will be recorded in the computation graph for gradient computation during backward pass.
§Returns
self with gradient tracking enabled
§Performance
- Time Complexity: O(1) - simple field assignment
- Memory: No additional allocation
- Overhead: Minimal gradtrack tracking overhead when gradients computed
§Examples
use train_station::Tensor;
let tensor = Tensor::ones(vec![2, 3]).with_requires_grad();
assert!(tensor.requires_grad());
Examples found in repository?
53 pub fn new(input_size: usize, output_size: usize, seed: Option<u64>) -> Self {
54 // Xavier/Glorot initialization: scale by sqrt(1/input_size)
55 let scale = (1.0 / input_size as f32).sqrt();
56
57 let weight = Tensor::randn(vec![input_size, output_size], seed)
58 .mul_scalar(scale)
59 .with_requires_grad();
60 let bias = Tensor::zeros(vec![output_size]).with_requires_grad();
61
62 Self {
63 weight,
64 bias,
65 input_size,
66 output_size,
67 }
68 }
69
70 /// Forward pass: output = input @ weight + bias
71 pub fn forward(&self, input: &Tensor) -> Tensor {
72 // Matrix multiplication: [batch_size, input_size] @ [input_size, output_size] = [batch_size, output_size]
73 let output = input.matmul(&self.weight);
74 // Add bias: [batch_size, output_size] + [output_size] = [batch_size, output_size]
75 output.add_tensor(&self.bias)
76 }
77
78 /// Forward pass without gradients (for inference)
79 #[allow(unused)]
80 pub fn forward_no_grad(&self, input: &Tensor) -> Tensor {
81 let _guard = NoGradTrack::new();
82 self.forward(input)
83 }
84
85 /// Get all parameters for optimization
86 pub fn parameters(&mut self) -> Vec<&mut Tensor> {
87 vec![&mut self.weight, &mut self.bias]
88 }
89
90 /// Save layer parameters to JSON
91 #[allow(unused)]
92 pub fn save_json(&self, path: &str) -> Result<(), Box<dyn std::error::Error>> {
93 // Create directory if it doesn't exist
94 if let Some(parent) = std::path::Path::new(path).parent() {
95 fs::create_dir_all(parent)?;
96 }
97
98 let weight_path = format!("{}_weight.json", path);
99 let bias_path = format!("{}_bias.json", path);
100
101 self.weight.save_json(&weight_path)?;
102 self.bias.save_json(&bias_path)?;
103
104 println!("Saved linear layer to {} (weight and bias)", path);
105 Ok(())
106 }
107
108 /// Load layer parameters from JSON
109 #[allow(unused)]
110 pub fn load_json(
111 path: &str,
112 input_size: usize,
113 output_size: usize,
114 ) -> Result<Self, Box<dyn std::error::Error>> {
115 let weight_path = format!("{}_weight.json", path);
116 let bias_path = format!("{}_bias.json", path);
117
118 let weight = Tensor::load_json(&weight_path)?.with_requires_grad();
119 let bias = Tensor::load_json(&bias_path)?.with_requires_grad();
120
121 Ok(Self {
122 weight,
123 bias,
124 input_size,
125 output_size,
126 })
127}
175fn demonstrate_gradient_tracking() -> Result<(), Box<dyn std::error::Error>> {
176 println!("\n--- Gradient Tracking ---");
177
178 // Create a tensor with gradient tracking enabled
179 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3])?.with_requires_grad();
180 println!("Input tensor (requires_grad): {:?}", tensor.data());
181
182 // Perform element-wise operations through iteration
183 let result: Tensor = tensor
184 .iter()
185 .map(|elem| {
186 // Apply a complex transformation: (x^2 + 1) * 2
187 elem.pow_scalar(2.0).add_scalar(1.0).mul_scalar(2.0)
188 })
189 .collect();
190
191 println!("Result tensor: {:?}", result.data());
192 println!("Result requires_grad: {}", result.requires_grad());
193
194 // Compute gradients
195 let mut loss = result.sum();
196 loss.backward(None);
197
198 println!("Loss: {:.6}", loss.value());
199 println!("Input gradients: {:?}", tensor.grad().map(|g| g.data()));
200
201 Ok(())
202}
200 pub fn load_json(
201 path: &str,
202 config: FeedForwardConfig,
203 ) -> Result<Self, Box<dyn std::error::Error>> {
204 let mut layers = Vec::new();
205 let mut current_size = config.input_size;
206 let mut layer_idx = 0;
207
208 // Load hidden layers
209 for &hidden_size in &config.hidden_sizes {
210 let layer_path = format!("{}_layer_{}", path, layer_idx);
211 let weight_path = format!("{}_weight.json", layer_path);
212 let bias_path = format!("{}_bias.json", layer_path);
213
214 let weight = Tensor::load_json(&weight_path)?.with_requires_grad();
215 let bias = Tensor::load_json(&bias_path)?.with_requires_grad();
216
217 layers.push(LinearLayer {
218 weight,
219 bias,
220 input_size: current_size,
221 output_size: hidden_size,
222 });
223
224 current_size = hidden_size;
225 layer_idx += 1;
226 }
227
228 // Load output layer
229 let layer_path = format!("{}_layer_{}", path, layer_idx);
230 let weight_path = format!("{}_weight.json", layer_path);
231 let bias_path = format!("{}_bias.json", layer_path);
232
233 let weight = Tensor::load_json(&weight_path)?.with_requires_grad();
234 let bias = Tensor::load_json(&bias_path)?.with_requires_grad();
235
236 layers.push(LinearLayer {
237 weight,
238 bias,
239 input_size: current_size,
240 output_size: config.output_size,
241 });
242
243 Ok(Self { layers, config })
244}
47fn demonstrate_basic_optimizer_setup() {
48 println!("--- Basic Optimizer Setup ---");
49
50 // Create parameters that require gradients
51 let weight = Tensor::randn(vec![3, 2], Some(42)).with_requires_grad();
52 let bias = Tensor::zeros(vec![2]).with_requires_grad();
53
54 println!("Created parameters:");
55 println!(
56 " Weight: shape {:?}, requires_grad: {}",
57 weight.shape().dims(),
58 weight.requires_grad()
59 );
60 println!(
61 " Bias: shape {:?}, requires_grad: {}",
62 bias.shape().dims(),
63 bias.requires_grad()
64 );
65
66 // Create Adam optimizer with default configuration
67 let mut optimizer = Adam::new();
68 println!(
69 "Created Adam optimizer with learning rate: {}",
70 optimizer.learning_rate()
71 );
72
73 // Add parameters to optimizer
74 optimizer.add_parameter(&weight);
75 optimizer.add_parameter(&bias);
76 println!(
77 "Added {} parameters to optimizer",
78 optimizer.parameter_count()
79 );
80
81 // Create optimizer with custom configuration
82 let config = AdamConfig {
83 learning_rate: 0.01,
84 beta1: 0.9,
85 beta2: 0.999,
86 eps: 1e-8,
87 weight_decay: 0.0,
88 amsgrad: false,
89 };
90
91 let mut custom_optimizer = Adam::with_config(config);
92 custom_optimizer.add_parameter(&weight);
93 custom_optimizer.add_parameter(&bias);
94
95 println!(
96 "Created custom optimizer with learning rate: {}",
97 custom_optimizer.learning_rate()
98 );
99
100 // Demonstrate parameter linking
101 println!("Parameter linking completed successfully");
102}
103
104/// Demonstrate simple linear regression training
105fn demonstrate_linear_regression() -> Result<(), Box<dyn std::error::Error>> {
106 println!("\n--- Linear Regression Training ---");
107
108 // Create model parameters
109 let mut weight = Tensor::randn(vec![1, 1], Some(43)).with_requires_grad();
110 let mut bias = Tensor::zeros(vec![1]).with_requires_grad();
111
112 // Create optimizer
113 let mut optimizer = Adam::with_learning_rate(0.01);
114 optimizer.add_parameter(&weight);
115 optimizer.add_parameter(&bias);
116
117 // Create simple training data: y = 2*x + 1
118 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5, 1]).unwrap();
119 let y_true = Tensor::from_slice(&[3.0, 5.0, 7.0, 9.0, 11.0], vec![5, 1]).unwrap();
120
121 println!("Training data:");
122 println!(" X: {:?}", x_data.data());
123 println!(" Y: {:?}", y_true.data());
124 println!(" Target: y = 2*x + 1");
125
126 // Training loop
127 let num_epochs = 100;
128 let mut losses = Vec::new();
129
130 for epoch in 0..num_epochs {
131 // Forward pass: y_pred = x * weight + bias
132 let y_pred = x_data.matmul(&weight) + &bias;
133
134 // Compute loss: MSE
135 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
136
137 // Backward pass
138 loss.backward(None);
139
140 // Optimizer step
141 optimizer.step(&mut [&mut weight, &mut bias]);
142 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
143
144 losses.push(loss.value());
145
146 // Print progress every 20 epochs
147 if epoch % 20 == 0 || epoch == num_epochs - 1 {
148 println!("Epoch {:3}: Loss = {:.6}", epoch, loss.value());
149 }
150 }
151
152 // Evaluate final model
153 let final_predictions = x_data.matmul(&weight) + &bias;
154 println!("\nFinal model evaluation:");
155 println!(" Learned weight: {:.6}", weight.value());
156 println!(" Learned bias: {:.6}", bias.value());
157 println!(" Predictions vs True:");
158
159 for i in 0..5 {
160 let x1 = x_data.data()[i];
161 let pred = final_predictions.data()[i];
162 let true_val = y_true.data()[i];
163 println!(
164 " x={:.1}: pred={:.3}, true={:.1}, error={:.3}",
165 x1,
166 pred,
167 true_val,
168 (pred - true_val).abs()
169 );
170 }
171
172 Ok(())
173}
174
175/// Demonstrate advanced training patterns
176fn demonstrate_advanced_training() -> Result<(), Box<dyn std::error::Error>> {
177 println!("\n--- Advanced Training Patterns ---");
178
179 // Create a more complex model
180 let mut weight = Tensor::randn(vec![1, 2], Some(44)).with_requires_grad();
181 let mut bias = Tensor::zeros(vec![2]).with_requires_grad();
182
183 // Create optimizer with different learning rate
184 let mut optimizer = Adam::with_learning_rate(0.005);
185 optimizer.add_parameter(&weight);
186 optimizer.add_parameter(&bias);
187
188 // Create training data: y = 2*x + [1, 3]
189 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5, 1]).unwrap();
190 let y_true = Tensor::from_slice(
191 &[3.0, 5.0, 7.0, 9.0, 11.0, 6.0, 8.0, 10.0, 12.0, 14.0],
192 vec![5, 2],
193 )
194 .unwrap();
195
196 println!("Advanced training with monitoring:");
197 println!(" Initial learning rate: {}", optimizer.learning_rate());
198
199 // Training loop with monitoring
200 let num_epochs = 50;
201 let mut losses = Vec::new();
202 let mut weight_norms = Vec::new();
203 let mut gradient_norms = Vec::new();
204
205 for epoch in 0..num_epochs {
206 // Forward pass
207 let y_pred = x_data.matmul(&weight) + &bias;
208 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
209
210 // Backward pass
211 loss.backward(None);
212
213 // Compute gradient norm before optimizer step
214 let gradient_norm = weight.grad_owned().unwrap().norm();
215
216 // Optimizer step
217 optimizer.step(&mut [&mut weight, &mut bias]);
218 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
219
220 // Learning rate scheduling: reduce every 10 epochs
221 if epoch > 0 && epoch % 10 == 0 {
222 let current_lr = optimizer.learning_rate();
223 let new_lr = current_lr * 0.5;
224 optimizer.set_learning_rate(new_lr);
225 println!(
226 "Epoch {:2}: Reduced learning rate from {:.3} to {:.3}",
227 epoch, current_lr, new_lr
228 );
229 }
230
231 // Record metrics
232 losses.push(loss.value());
233 weight_norms.push(weight.norm().value());
234 gradient_norms.push(gradient_norm.value());
235
236 // Print detailed progress
237 if epoch % 10 == 0 || epoch == num_epochs - 1 {
238 println!(
239 "Epoch {:2}: Loss = {:.6}, Weight Norm = {:.6}, Gradient Norm = {:.6}",
240 epoch,
241 loss.value(),
242 weight.norm().value(),
243 gradient_norm.value()
244 );
245 }
246 }
247
248 println!("Final learning rate: {}", optimizer.learning_rate());
249
250 // Analyze training progression
251 let initial_loss = losses[0];
252 let final_loss = losses[losses.len() - 1];
253 let loss_reduction = (initial_loss - final_loss) / initial_loss * 100.0;
254
255 println!("\nTraining Analysis:");
256 println!(" Initial loss: {:.6}", initial_loss);
257 println!(" Final loss: {:.6}", final_loss);
258 println!(" Loss reduction: {:.1}%", loss_reduction);
259 println!(" Final weight norm: {:.6}", weight.norm().value());
260 println!(" Final bias: {:?}", bias.data());
261
262 Ok(())
263}
264
265/// Demonstrate learning rate scheduling
266fn demonstrate_learning_rate_scheduling() -> Result<(), Box<dyn std::error::Error>> {
267 println!("\n--- Learning Rate Scheduling ---");
268
269 // Create simple model
270 let mut weight = Tensor::randn(vec![1, 1], Some(45)).with_requires_grad();
271 let mut bias = Tensor::zeros(vec![1]).with_requires_grad();
272
273 // Create optimizer with high initial learning rate
274 let mut optimizer = Adam::with_learning_rate(0.1);
275 optimizer.add_parameter(&weight);
276 optimizer.add_parameter(&bias);
277
278 // Simple data
279 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3, 1]).unwrap();
280 let y_true = Tensor::from_slice(&[2.0, 4.0, 6.0], vec![3, 1]).unwrap();
281
282 println!("Initial learning rate: {}", optimizer.learning_rate());
283
284 // Training loop with learning rate scheduling
285 let num_epochs = 50;
286 let mut losses = Vec::new();
287
288 for epoch in 0..num_epochs {
289 // Forward pass
290 let y_pred = x_data.matmul(&weight) + &bias;
291 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
292
293 // Backward pass
294 loss.backward(None);
295
296 // Optimizer step
297 optimizer.step(&mut [&mut weight, &mut bias]);
298 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
299
300 // Learning rate scheduling: reduce every 10 epochs
301 if epoch > 0 && epoch % 10 == 0 {
302 let current_lr = optimizer.learning_rate();
303 let new_lr = current_lr * 0.5;
304 optimizer.set_learning_rate(new_lr);
305 println!(
306 "Epoch {:2}: Reduced learning rate from {:.3} to {:.3}",
307 epoch, current_lr, new_lr
308 );
309 }
310
311 losses.push(loss.value());
312
313 // Print progress
314 if epoch % 10 == 0 || epoch == num_epochs - 1 {
315 println!(
316 "Epoch {:2}: Loss = {:.6}, LR = {:.3}",
317 epoch,
318 loss.value(),
319 optimizer.learning_rate()
320 );
321 }
322 }
323
324 println!("Final learning rate: {}", optimizer.learning_rate());
325
326 Ok(())
327}
328
329/// Demonstrate training monitoring and analysis
330fn demonstrate_training_monitoring() -> Result<(), Box<dyn std::error::Error>> {
331 println!("\n--- Training Monitoring ---");
332
333 // Create model
334 let mut weight = Tensor::randn(vec![1, 1], Some(46)).with_requires_grad();
335 let mut bias = Tensor::zeros(vec![1]).with_requires_grad();
336
337 // Create optimizer
338 let mut optimizer = Adam::with_learning_rate(0.01);
339 optimizer.add_parameter(&weight);
340 optimizer.add_parameter(&bias);
341
342 // Training data
343 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![4, 1]).unwrap();
344 let y_true = Tensor::from_slice(&[3.0, 5.0, 7.0, 9.0], vec![4, 1]).unwrap();
345
346 // Training loop with comprehensive monitoring
347 let num_epochs = 30;
348 let mut losses = Vec::new();
349 let mut weight_history = Vec::new();
350 let mut bias_history = Vec::new();
351
352 for epoch in 0..num_epochs {
353 // Forward pass
354 let y_pred = x_data.matmul(&weight) + &bias;
355 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
356
357 // Backward pass
358 loss.backward(None);
359
360 // Optimizer step
361 optimizer.step(&mut [&mut weight, &mut bias]);
362 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
363
364 // Record history
365 losses.push(loss.value());
366 weight_history.push(weight.value());
367 bias_history.push(bias.value());
368
369 // Print detailed monitoring
370 if epoch % 5 == 0 || epoch == num_epochs - 1 {
371 println!(
372 "Epoch {:2}: Loss = {:.6}, Weight = {:.6}, Bias = {:.6}",
373 epoch,
374 loss.value(),
375 weight.value(),
376 bias.value()
377 );
378 }
379 }
380
381 // Analyze training progression
382 println!("\nTraining Analysis:");
383 println!(" Initial loss: {:.6}", losses[0]);
384 println!(" Final loss: {:.6}", losses[losses.len() - 1]);
385 println!(
386 " Loss reduction: {:.1}%",
387 (losses[0] - losses[losses.len() - 1]) / losses[0] * 100.0
388 );
389
390 // Compute statistics
391 let loss_mean = compute_mean(&losses);
392 let loss_std = compute_std(&losses);
393 let weight_change = (weight_history[weight_history.len() - 1] - weight_history[0]).abs();
394 let bias_change = (bias_history[bias_history.len() - 1] - bias_history[0]).abs();
395
396 println!(" Average loss: {:.6} ± {:.6}", loss_mean, loss_std);
397 println!(" Weight change: {:.6}", weight_change);
398 println!(" Bias change: {:.6}", bias_change);
399 println!(" Final weight norm: {:.6}", weight.norm().value());
400 println!(" Final bias: {:.6}", bias.value());
401
402 Ok(())
403}
109fn demonstrate_optimizer_serialization() -> Result<(), Box<dyn std::error::Error>> {
110 println!("\n--- Optimizer Serialization ---");
111
112 // Create an optimizer with some parameters
113 let mut weight = Tensor::randn(vec![2, 2], Some(42)).with_requires_grad();
114 let mut bias = Tensor::randn(vec![2], Some(43)).with_requires_grad();
115
116 let config = AdamConfig {
117 learning_rate: 0.001,
118 beta1: 0.9,
119 beta2: 0.999,
120 eps: 1e-8,
121 weight_decay: 0.0,
122 amsgrad: false,
123 };
124
125 let mut optimizer = Adam::with_config(config);
126 optimizer.add_parameter(&weight);
127 optimizer.add_parameter(&bias);
128
129 println!(
130 "Created optimizer with {} parameters",
131 optimizer.parameter_count()
132 );
133 println!("Learning rate: {}", optimizer.learning_rate());
134
135 // Simulate some training steps
136 for _ in 0..3 {
137 let mut loss = weight.sum() + bias.sum();
138 loss.backward(None);
139 optimizer.step(&mut [&mut weight, &mut bias]);
140 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
141 }
142
143 // Save optimizer state
144 let optimizer_path = "temp_optimizer.json";
145 optimizer.save_json(optimizer_path)?;
146 println!("Saved optimizer to: {}", optimizer_path);
147
148 // Load optimizer state
149 let loaded_optimizer = Adam::load_json(optimizer_path)?;
150 println!(
151 "Loaded optimizer with {} parameters",
152 loaded_optimizer.parameter_count()
153 );
154 println!("Learning rate: {}", loaded_optimizer.learning_rate());
155
156 // Verify optimizer state
157 assert_eq!(
158 optimizer.parameter_count(),
159 loaded_optimizer.parameter_count()
160 );
161 assert_eq!(optimizer.learning_rate(), loaded_optimizer.learning_rate());
162 println!("Optimizer serialization verification: PASSED");
163
164 Ok(())
165}
166
167/// Demonstrate format comparison and performance characteristics
168fn demonstrate_format_comparison() -> Result<(), Box<dyn std::error::Error>> {
169 println!("\n--- Format Comparison ---");
170
171 // Create a larger tensor for comparison
172 let tensor = Tensor::randn(vec![10, 10], Some(44));
173
174 // Save in both formats
175 tensor.save_json("temp_comparison.json")?;
176 tensor.save_binary("temp_comparison.bin")?;
177
178 // Compare file sizes
179 let json_size = fs::metadata("temp_comparison.json")?.len();
180 let binary_size = fs::metadata("temp_comparison.bin")?.len();
181
182 println!("JSON file size: {} bytes", json_size);
183 println!("Binary file size: {} bytes", binary_size);
184 println!(
185 "Compression ratio: {:.2}x",
186 json_size as f64 / binary_size as f64
187 );
188
189 // Load and verify both formats
190 let json_tensor = Tensor::load_json("temp_comparison.json")?;
191 let binary_tensor = Tensor::load_binary("temp_comparison.bin")?;
192
193 assert_eq!(tensor.shape().dims(), json_tensor.shape().dims());
194 assert_eq!(tensor.shape().dims(), binary_tensor.shape().dims());
195 assert_eq!(tensor.data(), json_tensor.data());
196 assert_eq!(tensor.data(), binary_tensor.data());
197
198 println!("Format comparison verification: PASSED");
199
200 Ok(())
201}
202
203/// Demonstrate a basic model checkpointing workflow
204fn demonstrate_model_checkpointing() -> Result<(), Box<dyn std::error::Error>> {
205 println!("\n--- Model Checkpointing ---");
206
207 // Create a simple model (weights and bias)
208 let mut weights = Tensor::randn(vec![2, 1], Some(45)).with_requires_grad();
209 let mut bias = Tensor::randn(vec![1], Some(46)).with_requires_grad();
210
211 // Create optimizer
212 let mut optimizer = Adam::with_learning_rate(0.01);
213 optimizer.add_parameter(&weights);
214 optimizer.add_parameter(&bias);
215
216 println!("Initial weights: {:?}", weights.data());
217 println!("Initial bias: {:?}", bias.data());
218
219 // Simulate training
220 for epoch in 0..5 {
221 let mut loss = weights.sum() + bias.sum();
222 loss.backward(None);
223 optimizer.step(&mut [&mut weights, &mut bias]);
224 optimizer.zero_grad(&mut [&mut weights, &mut bias]);
225
226 if epoch % 2 == 0 {
227 // Save checkpoint
228 let checkpoint_dir = format!("checkpoint_epoch_{}", epoch);
229 fs::create_dir_all(&checkpoint_dir)?;
230
231 weights.save_json(format!("{}/weights.json", checkpoint_dir))?;
232 bias.save_json(format!("{}/bias.json", checkpoint_dir))?;
233 optimizer.save_json(format!("{}/optimizer.json", checkpoint_dir))?;
234
235 println!("Saved checkpoint for epoch {}", epoch);
236 }
237 }
238
239 // Load from checkpoint
240 let loaded_weights = Tensor::load_json("checkpoint_epoch_4/weights.json")?;
241 let loaded_bias = Tensor::load_json("checkpoint_epoch_4/bias.json")?;
242 let loaded_optimizer = Adam::load_json("checkpoint_epoch_4/optimizer.json")?;
243
244 println!("Loaded weights: {:?}", loaded_weights.data());
245 println!("Loaded bias: {:?}", loaded_bias.data());
246 println!(
247 "Loaded optimizer learning rate: {}",
248 loaded_optimizer.learning_rate()
249 );
250
251 // Verify checkpoint integrity
252 assert_eq!(weights.shape().dims(), loaded_weights.shape().dims());
253 assert_eq!(bias.shape().dims(), loaded_bias.shape().dims());
254 assert_eq!(optimizer.learning_rate(), loaded_optimizer.learning_rate());
255
256 println!("Checkpointing verification: PASSED");
257
258 Ok(())
258}
pub fn set_requires_grad(&mut self, requires_grad: bool)
Set gradient tracking for this tensor
Controls whether the gradtrack system tracks operations on this tensor and computes gradients during backward pass. When disabled, clears any existing gradients and gradient functions.
§Arguments
requires_grad - Whether to track gradients for this tensor
§Performance
- Time Complexity: O(1) - simple field assignment
- Memory: May free gradient storage when disabled
- Overhead: Zero overhead when gradients disabled
§Examples
use train_station::Tensor;
let mut tensor = Tensor::ones(vec![2, 3]);
tensor.set_requires_grad(true);
assert!(tensor.requires_grad());
// Disable gradient tracking
tensor.set_requires_grad(false);
assert!(!tensor.requires_grad());
Examples found in repository
100 fn set_requires_grad_all(&mut self, enable: bool) {
101 for l in &mut self.layers {
102 l.weight.set_requires_grad(enable);
103 l.bias.set_requires_grad(enable);
104 }
105 }
106
107 // In-place copy (preserve tensor IDs and optimizer links on targets)
108 fn copy_from(&mut self, other: &Self) {
109 for (t, s) in self.layers.iter_mut().zip(other.layers.iter()) {
110 {
111 let src = s.weight.data();
112 let dst = t.weight.data_mut();
113 dst.copy_from_slice(src);
114 }
115 {
116 let src = s.bias.data();
117 let dst = t.bias.data_mut();
118 dst.copy_from_slice(src);
119 }
120 t.weight.set_requires_grad(false);
121 t.bias.set_requires_grad(false);
122 }
123    }
More examples
107 fn set_requires_grad_all(&mut self, enable: bool) {
108 for l in &mut self.layers {
109 l.weight.set_requires_grad(enable);
110 l.bias.set_requires_grad(enable);
111 }
112 }
113
114 fn copy_from(&mut self, other: &Self) {
115 for (t, s) in self.layers.iter_mut().zip(other.layers.iter()) {
116 {
117 let src = s.weight.data();
118 let dst = t.weight.data_mut();
119 dst.copy_from_slice(src);
120 }
121 {
122 let src = s.bias.data();
123 let dst = t.bias.data_mut();
124 dst.copy_from_slice(src);
125 }
126 t.weight.set_requires_grad(false);
127 t.bias.set_requires_grad(false);
128 }
129 }
130
131 fn soft_update_from(&mut self, source: &Self, tau: f32) {
132 let _ng = NoGradTrack::new();
133 for (t, s) in self.layers.iter_mut().zip(source.layers.iter()) {
134 // In-place Polyak update to preserve tensor IDs (no optimizer relink needed)
135 let new_w = t
136 .weight
137 .mul_scalar(1.0 - tau)
138 .add_tensor(&s.weight.mul_scalar(tau));
139 let new_b = t
140 .bias
141 .mul_scalar(1.0 - tau)
142 .add_tensor(&s.bias.mul_scalar(tau));
143 {
144 let src = new_w.data();
145 let dst = t.weight.data_mut();
146 dst.copy_from_slice(src);
147 }
148 {
149 let src = new_b.data();
150 let dst = t.bias.data_mut();
151 dst.copy_from_slice(src);
152 }
153 t.weight.set_requires_grad(false);
154 t.bias.set_requires_grad(false);
155 }
156    }
pub fn retain_grad(self) -> Self
Mark this tensor to retain gradients after backward, even if it is non-leaf.
Builder-style API: returns self with retain_grad=true.
Call materialize_grad() or grad_or_fetch() after backward to copy the
accumulated gradient from the GradGraph into self.grad so grad() works.
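A minimal sketch of the builder-style flow, assuming the mul_scalar, sum, backward, and materialize_grad APIs documented elsewhere on this page:
use train_station::Tensor;
let x = Tensor::ones(vec![2, 2]).with_requires_grad();
// retain_grad() marks the non-leaf result so its gradient can be cached after backward
let mut y = x.mul_scalar(2.0).retain_grad();
let mut loss = y.sum();
loss.backward(None);
y.materialize_grad(); // copy the accumulated gradient into y.grad
assert!(y.grad().is_some());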
pub fn retain_grad_(&mut self, enable: bool)
In-place variant to enable or disable gradient retention for non-leaf tensors
pub fn requires_grad(&self) -> bool
Check if this tensor requires gradients
§Returns
true if gradient tracking is enabled for this tensor
§Examples
use train_station::Tensor;
let tensor = Tensor::new(vec![2, 3]);
assert!(!tensor.requires_grad());
let grad_tensor = Tensor::ones(vec![2, 3]).with_requires_grad();
assert!(grad_tensor.requires_grad());
Examples found in repository
153fn demonstrate_layer_creation() {
154 println!("--- Layer Creation ---");
155
156 let layer = LinearLayer::new(3, 2, Some(42));
157
158 println!("Created linear layer:");
159 println!(" Input size: {}", layer.input_size);
160 println!(" Output size: {}", layer.output_size);
161 println!(" Parameter count: {}", layer.parameter_count());
162 println!(" Weight shape: {:?}", layer.weight.shape().dims());
163 println!(" Bias shape: {:?}", layer.bias.shape().dims());
164 println!(" Weight requires grad: {}", layer.weight.requires_grad());
165 println!(" Bias requires grad: {}", layer.bias.requires_grad());
166}
167
168/// Demonstrate forward pass with gradient tracking
169fn demonstrate_forward_pass() {
170 println!("\n--- Forward Pass (with gradients) ---");
171
172 let layer = LinearLayer::new(3, 2, Some(43));
173
174 // Single input
175 let input = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![1, 3]).unwrap();
176 let output = layer.forward(&input);
177
178 println!("Single input:");
179 println!(" Input: {:?}", input.data());
180 println!(" Output: {:?}", output.data());
181 println!(" Output requires grad: {}", output.requires_grad());
182
183 // Batch input
184 let batch_input = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
185 let batch_output = layer.forward(&batch_input);
186
187 println!("Batch input:");
188 println!(" Input shape: {:?}", batch_input.shape().dims());
189 println!(" Output shape: {:?}", batch_output.shape().dims());
190 println!(" Output requires grad: {}", batch_output.requires_grad());
191}
192
193/// Demonstrate forward pass without gradient tracking
194fn demonstrate_forward_pass_no_grad() {
195 println!("\n--- Forward Pass (no gradients) ---");
196
197 let layer = LinearLayer::new(3, 2, Some(44));
198
199 // Single input
200 let input = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![1, 3]).unwrap();
201 let output = layer.forward_no_grad(&input);
202
203 println!("Single input (no grad):");
204 println!(" Input: {:?}", input.data());
205 println!(" Output: {:?}", output.data());
206 println!(" Output requires grad: {}", output.requires_grad());
207
208 // Compare with grad version
209 let output_with_grad = layer.forward(&input);
210 println!("Comparison:");
211 println!(
212 " Same values: {}",
213 output.data() == output_with_grad.data()
214 );
215 println!(" No grad requires grad: {}", output.requires_grad());
216 println!(
217 " With grad requires grad: {}",
218 output_with_grad.requires_grad()
219 );
220}More examples
175fn demonstrate_gradient_tracking() -> Result<(), Box<dyn std::error::Error>> {
176 println!("\n--- Gradient Tracking ---");
177
178 // Create a tensor with gradient tracking enabled
179 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3])?.with_requires_grad();
180 println!("Input tensor (requires_grad): {:?}", tensor.data());
181
182 // Perform element-wise operations through iteration
183 let result: Tensor = tensor
184 .iter()
185 .map(|elem| {
186 // Apply a complex transformation: (x^2 + 1) * 2
187 elem.pow_scalar(2.0).add_scalar(1.0).mul_scalar(2.0)
188 })
189 .collect();
190
191 println!("Result tensor: {:?}", result.data());
192 println!("Result requires_grad: {}", result.requires_grad());
193
194 // Compute gradients
195 let mut loss = result.sum();
196 loss.backward(None);
197
198 println!("Loss: {:.6}", loss.value());
199 println!("Input gradients: {:?}", tensor.grad().map(|g| g.data()));
200
201 Ok(())
202}47fn demonstrate_basic_optimizer_setup() {
48 println!("--- Basic Optimizer Setup ---");
49
50 // Create parameters that require gradients
51 let weight = Tensor::randn(vec![3, 2], Some(42)).with_requires_grad();
52 let bias = Tensor::zeros(vec![2]).with_requires_grad();
53
54 println!("Created parameters:");
55 println!(
56 " Weight: shape {:?}, requires_grad: {}",
57 weight.shape().dims(),
58 weight.requires_grad()
59 );
60 println!(
61 " Bias: shape {:?}, requires_grad: {}",
62 bias.shape().dims(),
63 bias.requires_grad()
64 );
65
66 // Create Adam optimizer with default configuration
67 let mut optimizer = Adam::new();
68 println!(
69 "Created Adam optimizer with learning rate: {}",
70 optimizer.learning_rate()
71 );
72
73 // Add parameters to optimizer
74 optimizer.add_parameter(&weight);
75 optimizer.add_parameter(&bias);
76 println!(
77 "Added {} parameters to optimizer",
78 optimizer.parameter_count()
79 );
80
81 // Create optimizer with custom configuration
82 let config = AdamConfig {
83 learning_rate: 0.01,
84 beta1: 0.9,
85 beta2: 0.999,
86 eps: 1e-8,
87 weight_decay: 0.0,
88 amsgrad: false,
89 };
90
91 let mut custom_optimizer = Adam::with_config(config);
92 custom_optimizer.add_parameter(&weight);
93 custom_optimizer.add_parameter(&bias);
94
95 println!(
96 "Created custom optimizer with learning rate: {}",
97 custom_optimizer.learning_rate()
98 );
99
100 // Demonstrate parameter linking
101 println!("Parameter linking completed successfully");
102}313fn demonstrate_forward_pass() {
314 println!("\n--- Forward Pass ---");
315
316 let config = FeedForwardConfig {
317 input_size: 3,
318 hidden_sizes: vec![5, 3],
319 output_size: 2,
320 use_bias: true,
321 };
322 let network = FeedForwardNetwork::new(config, Some(43));
323
324 // Single input
325 let input = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![1, 3]).unwrap();
326 let output = network.forward(&input);
327
328 println!("Single input forward pass:");
329 println!(" Input shape: {:?}", input.shape().dims());
330 println!(" Output shape: {:?}", output.shape().dims());
331 println!(" Output: {:?}", output.data());
332 println!(" Output requires grad: {}", output.requires_grad());
333
334 // Batch input
335 let batch_input = Tensor::from_slice(
336 &[
337 1.0, 2.0, 3.0, // Sample 1
338 4.0, 5.0, 6.0, // Sample 2
339 7.0, 8.0, 9.0, // Sample 3
340 ],
341 vec![3, 3],
342 )
343 .unwrap();
344 let batch_output = network.forward(&batch_input);
345
346 println!("Batch input forward pass:");
347 println!(" Input shape: {:?}", batch_input.shape().dims());
348 println!(" Output shape: {:?}", batch_output.shape().dims());
349 println!(" Output requires grad: {}", batch_output.requires_grad());
350
351 // Compare with no-grad version
352 let output_no_grad = network.forward_no_grad(&input);
353 println!("No-grad comparison:");
354 println!(" Same values: {}", output.data() == output_no_grad.data());
355 println!(" With grad requires grad: {}", output.requires_grad());
356 println!(
357 " No grad requires grad: {}",
358 output_no_grad.requires_grad()
359 );
360}319fn demonstrate_optimization_techniques() -> Result<(), Box<dyn std::error::Error>> {
320 println!("\n--- Optimization Techniques ---");
321
322 let size = 50000;
323 let data: Vec<f32> = (0..size).map(|i| i as f32).collect();
324 let tensor = Tensor::from_slice(&data, vec![size])?;
325
326 println!("Optimizing processing for size: {}", size);
327
328 // Technique 1: Operation fusion
329 println!("\nTechnique 1: Operation Fusion");
330 let start = Instant::now();
331 let fused_result: Tensor = tensor
332 .iter()
333 .map(|elem| {
334 // Fuse multiple operations into single chain
335 elem.mul_scalar(2.0).add_scalar(1.0).pow_scalar(2.0).sqrt()
336 })
337 .collect();
338 let fused_time = start.elapsed();
339
340 // Technique 2: Conditional optimization
341 println!("\nTechnique 2: Conditional Optimization");
342 let start = Instant::now();
343 let conditional_result: Tensor = tensor
344 .iter()
345 .map(|elem| {
346 let val = elem.value();
347 if val < size as f32 / 2.0 {
348 elem.mul_scalar(2.0) // Simple operation for small values
349 } else {
350 elem.pow_scalar(2.0).sqrt() // Complex operation for large values
351 }
352 })
353 .collect();
354 let conditional_time = start.elapsed();
355
356 // Technique 3: Cache-friendly processing
357 println!("\nTechnique 3: Cache-Friendly Processing");
358 let start = Instant::now();
359 let cache_friendly_result: Tensor = tensor
360 .iter()
361 .take(1000) // Process in cache-friendly chunks
362 .map(|elem| elem.mul_scalar(2.0))
363 .collect();
364 let cache_friendly_time = start.elapsed();
365
366 // Technique 4: Memory pooling simulation
367 println!("\nTechnique 4: Memory Pooling Simulation");
368 let start = Instant::now();
369 let pooled_result: Tensor = tensor
370 .iter()
371 .enumerate()
372 .filter(|(i, _)| i % 100 == 0) // Process every 100th element
373 .map(|(_, elem)| elem.pow_scalar(2.0))
374 .collect();
375 let pooled_time = start.elapsed();
376
377 // Report optimization results
378 println!(" Fused operations: {:?}", fused_time);
379 println!(" Conditional optimization: {:?}", conditional_time);
380 println!(" Cache-friendly processing: {:?}", cache_friendly_time);
381 println!(" Memory pooling simulation: {:?}", pooled_time);
382
383 // Performance analysis
384 let fastest = fused_time
385 .min(conditional_time)
386 .min(cache_friendly_time)
387 .min(pooled_time);
388 println!(" Fastest technique: {:?}", fastest);
389
390 // Memory efficiency analysis
391 println!(" Fused result size: {}", fused_result.size());
392 println!(" Conditional result size: {}", conditional_result.size());
393 println!(
394 " Cache-friendly result size: {}",
395 cache_friendly_result.size()
396 );
397 println!(" Pooled result size: {}", pooled_result.size());
398
399 // Technique 5: Gradient optimization
400 println!("\nTechnique 5: Gradient Optimization");
401 let grad_tensor = tensor.with_requires_grad();
402 let start = Instant::now();
403
404 let grad_result: Tensor = grad_tensor
405 .iter()
406 .map(|elem| elem.pow_scalar(2.0).add_scalar(1.0))
407 .collect();
408
409 let mut loss = grad_result.sum();
410 loss.backward(None);
411 let grad_time = start.elapsed();
412
413 println!(" Gradient computation: {:?}", grad_time);
414 println!(
415 " Gradient tracking enabled: {}",
416 grad_result.requires_grad()
417 );
418
419 Ok(())
420}
pub fn grad(&self) -> Option<&Tensor>
Get a reference to this tensor’s locally cached gradient (if any)
This accessor returns only the gradient cached on this tensor’s grad field.
It does NOT query the global/autograd gradient storage. For leaf tensors,
gradients are accumulated and tracked by the grad engine and are not
automatically written back to the local grad field.
To make grad() return Some(&Tensor):
- Enable retention on this tensor (typically for non-leaf tensors) with retain_grad_(&mut tensor, true) or tensor.retain_grad()
- After backward(), call tensor.materialize_grad() or tensor.grad_or_fetch() to copy the accumulated gradient from the autograd engine into self.grad
If you want to read gradients without caching them locally, prefer
grad_owned, which consults the global gradient
storage.
§Returns
Optional reference to the locally cached gradient tensor, or None if
not materialized on this tensor.
§Examples
use train_station::Tensor;
let mut x = Tensor::ones(vec![2, 3]).with_requires_grad();
let mut loss = x.sum();
loss.backward(None);
// Without materialization, grad() typically returns None for leaves
assert!(x.grad().is_none());
// Materialize to cache locally so grad() works
x.retain_grad_(true);
x.materialize_grad();
assert!(x.grad().is_some());
Examples found in repository
175fn demonstrate_gradient_tracking() -> Result<(), Box<dyn std::error::Error>> {
176 println!("\n--- Gradient Tracking ---");
177
178 // Create a tensor with gradient tracking enabled
179 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3])?.with_requires_grad();
180 println!("Input tensor (requires_grad): {:?}", tensor.data());
181
182 // Perform element-wise operations through iteration
183 let result: Tensor = tensor
184 .iter()
185 .map(|elem| {
186 // Apply a complex transformation: (x^2 + 1) * 2
187 elem.pow_scalar(2.0).add_scalar(1.0).mul_scalar(2.0)
188 })
189 .collect();
190
191 println!("Result tensor: {:?}", result.data());
192 println!("Result requires_grad: {}", result.requires_grad());
193
194 // Compute gradients
195 let mut loss = result.sum();
196 loss.backward(None);
197
198 println!("Loss: {:.6}", loss.value());
199 println!("Input gradients: {:?}", tensor.grad().map(|g| g.data()));
200
201 Ok(())
202}
pub fn materialize_grad(&mut self) -> bool
Fetch the accumulated gradient after backward and cache it on this tensor if retain_grad is enabled.
Returns true if a gradient was found and cached. After a successful call,
grad() will return Some(&Tensor) even for non-leaf tensors.
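A short sketch of the expected flow, assuming retention is enabled as in the grad() example above:
use train_station::Tensor;
let mut x = Tensor::ones(vec![2, 3]).with_requires_grad();
x.retain_grad_(true);
let mut loss = x.sum();
loss.backward(None);
let cached = x.materialize_grad(); // true when a gradient was found and cached locally
assert!(cached);
assert!(x.grad().is_some());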
pub fn grad_or_fetch(&mut self) -> Option<&Tensor>
Convenience accessor: if retain_grad is enabled, fetch and cache the gradient
on first access so callers can immediately get a reference.
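A hedged sketch combining fetch and local caching in one call, assuming the same retention setup as in the examples above:
use train_station::Tensor;
let mut x = Tensor::ones(vec![2, 3]).with_requires_grad();
x.retain_grad_(true);
let mut loss = x.sum();
loss.backward(None);
// First access fetches from the autograd engine and caches the gradient locally
let g = x.grad_or_fetch().expect("gradient should be available after backward");
assert_eq!(g.shape().dims(), vec![2, 3]);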
pub fn grad_owned(&self) -> Option<Tensor>
Get the accumulated gradient from the autograd engine as an owned tensor
This accessor queries the global/shared gradient storage for this tensor’s
ID and returns the current accumulated gradient by value. It complements
grad, which only returns a locally cached reference.
- Works for leaf tensors without any prior materialization step
- Does not modify or clear internal gradient state
- Suitable for optimizers and logging that need the latest gradients
§Returns
Some(Tensor) containing the current accumulated gradient when available,
otherwise None.
§Examples
use train_station::Tensor;
let mut x = Tensor::ones(vec![2, 3]).with_requires_grad();
let mut loss = x.sum();
loss.backward(None);
// Fetch directly from autograd storage (no materialization required)
let g = x.grad_owned().unwrap();
assert_eq!(g.shape().dims(), vec![2, 3]);
Examples found in repository
277fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
278 let mut total_sq = 0.0f32;
279 for p in parameters.iter() {
280 if let Some(g) = p.grad_owned() {
281 for &v in g.data() {
282 total_sq += v * v;
283 }
284 }
285 }
286 let norm = total_sq.sqrt();
287 if norm > max_norm {
288 let scale = max_norm / (norm + eps);
289 for p in parameters.iter_mut() {
290 if let Some(g) = p.grad_owned() {
291 p.set_grad(g.mul_scalar(scale));
292 }
293 }
294 }
295}
296
297fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
298 let mut total_sq = 0.0f32;
299 for p in parameters.iter_mut() {
300 if let Some(g) = p.grad_owned() {
301 for &v in g.data() {
302 total_sq += v * v;
303 }
304 }
305 }
306 total_sq.sqrt()
307}
308
309fn params_l2_norm(parameters: &mut [&mut Tensor]) -> f32 {
310 let _ng = NoGradTrack::new();
311 let mut total_sq = 0.0f32;
312 for p in parameters.iter_mut() {
313 for &v in p.data() {
314 total_sq += v * v;
315 }
316 }
317 total_sq.sqrt()
318}
319
320// Pseudo-Huber loss: sqrt(1 + diff^2) - 1 (smooth, robust)
321fn pseudo_huber_mean(diff: &Tensor) -> Tensor {
322 diff.pow_scalar(2.0)
323 .add_scalar(1.0)
324 .sqrt()
325 .sub_scalar(1.0)
326 .mean()
327}
328
329// -------------------------------
330// Main
331// -------------------------------
332
333pub fn main() -> Result<(), Box<dyn std::error::Error>> {
334 println!("=== DQN Example (YardEnv discrete) ===");
335
336 // Dims
337 let state_dim = 3usize;
338 let action_dim = 3usize;
339
340 // Hparams
341 let gamma = 0.99f32;
342 let batch_size = 64usize;
343 let start_steps = 200usize;
344 let target_update_interval = 200usize; // hard update cadence
345 let max_grad_norm = 1.0f32;
346 let mut epsilon = 1.0f32;
347 let eps_min = 0.05f32;
348 let eps_decay_steps = 2_000usize; // linear decay
349 let total_steps = std::env::var("DQN_STEPS")
350 .ok()
351 .and_then(|v| v.parse::<usize>().ok())
352 .unwrap_or(3000usize);
353
354 // Models
355 let mut q_net = QNet::new(state_dim, action_dim, Some(7));
356 let mut q_targ = QNet::new(state_dim, action_dim, Some(8));
357 q_targ.net.copy_from(&q_net.net);
358 q_targ.set_requires_grad_all(false);
359
360 // Optimizer
361 let mut q_opt = Adam::with_learning_rate(3e-4);
362 for p in q_net.parameters() {
363 q_opt.add_parameter(p);
364 }
365
366 // Replay + env
367 let mut rb = ReplayBuffer::new(100_000, state_dim);
368 let mut env = YardEnv::new(12345);
369 let mut rng = SmallRng::new(999_111);
370
371 // Metrics
372 let mut state = env.reset();
373 let mut episode_return = 0.0f32;
374 let mut episode = 0usize;
375 let mut ema_return: Option<f32> = None;
376 let ema_alpha = 0.05f32;
377 let mut best_return = f32::NEG_INFINITY;
378
379 for t in 0..total_steps {
380 // Epsilon-greedy action
381 let action_index = if t < start_steps || rng.next_f32() < epsilon {
382 rng.sample_index(action_dim)
383 } else {
384 let _ng = NoGradTrack::new();
385 let q_vals = q_net.forward(&state);
386 let row = q_vals.data();
387 let mut best_i = 0usize;
388 let mut best_v = row[0];
389 for (i, &r) in row.iter().enumerate().take(action_dim).skip(1) {
390 if r > best_v {
391 best_v = r;
392 best_i = i;
393 }
394 }
395 best_i
396 };
397
398 // Env step
399 let (next_state, reward, done) = env.step(action_index);
400 episode_return += reward;
401
402 // Store
403 let s_slice = state.data().to_vec();
404 let s2_slice = next_state.data().to_vec();
405 rb.push(
406 &s_slice,
407 action_index,
408 reward,
409 if done { 1.0 } else { 0.0 },
410 &s2_slice,
411 );
412
413 // Reset on done
414 state = if done {
415 let st = env.reset();
416 ema_return = Some(match ema_return {
417 None => episode_return,
418 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
419 });
420 if episode_return > best_return {
421 best_return = episode_return;
422 }
423 println!(
424 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3} | rb_size={}",
425 t,
426 episode,
427 episode_return,
428 ema_return.unwrap_or(episode_return),
429 best_return,
430 rb.size
431 );
432 episode_return = 0.0;
433 episode += 1;
434 st
435 } else {
436 next_state
437 };
438
439 // Epsilon linear decay
440 if t < eps_decay_steps {
441 epsilon = (1.0 - (t as f32) / (eps_decay_steps as f32)) * (1.0 - eps_min) + eps_min;
442 }
443
444 // Train
445 if rb.can_sample(batch_size) {
446 let (s, a_idx, r, d, s2) = rb.sample(batch_size, &mut rng);
447
448 // Double DQN target: a* = argmax_a Q_online(s2,a); y = r + (1-d)*gamma*Q_target(s2, a*)
449 let target_q = {
450 let _ng = NoGradTrack::new();
451 let q_online_s2 = q_net.forward(&s2);
452 // argmax per row (manual on CPU)
453 let row_stride = action_dim;
454 let qd = q_online_s2.data();
455 let mut next_actions: Vec<usize> = Vec::with_capacity(batch_size);
456 for i in 0..batch_size {
457 let base = i * row_stride;
458 let mut bi = 0usize;
459 let mut bv = qd[base];
460 for j in 1..action_dim {
461 let v = qd[base + j];
462 if v > bv {
463 bv = v;
464 bi = j;
465 }
466 }
467 next_actions.push(bi);
468 }
469 let q_targ_s2 = q_targ.forward(&s2);
470 let q_targ_g = q_targ_s2.gather(1, &next_actions, &[batch_size, 1]);
471 let not_done = Tensor::ones(vec![batch_size, 1]).sub_tensor(&d);
472 r.add_tensor(¬_done.mul_scalar(gamma).mul_tensor(&q_targ_g))
473 };
474
475 // Q(s,a) for current actions
476 // Zero grads first
477 {
478 let mut params = q_net.parameters();
479 q_opt.zero_grad(&mut params);
480 }
481
482 let q_all = q_net.forward(&s);
483 let q_sa = q_all.gather(1, &a_idx, &[batch_size, 1]);
484 let diff = q_sa.sub_tensor(&target_q);
485 let mut loss = pseudo_huber_mean(&diff);
486 loss.backward(None);
487
488 // Step (filter only params with grads)
489 {
490 let params = q_net.parameters();
491 let mut with_grads: Vec<&mut Tensor> = Vec::new();
492 for p in params {
493 if p.grad_owned().is_some() {
494 with_grads.push(p);
495 }
496 }
497 if !with_grads.is_empty() {
498 let gn = grad_global_norm(&mut with_grads);
499 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
500 q_opt.step(&mut with_grads);
501 q_opt.zero_grad(&mut with_grads);
502 if t % 100 == 0 {
503 let mut pn = q_net.parameters();
504 let pn_l2 = params_l2_norm(&mut pn);
505 let q_mean = q_all.mean().value();
506 println!(
507 "t={:5} | loss={:.4} | q_mean={:.3} | grad_norm={:.3} | param_norm={:.3} | eps={:.3}",
508 t, loss.value(), q_mean, gn, pn_l2, epsilon
509 );
510 }
511 }
512 }
513
514 // Target hard update
515 if t % target_update_interval == 0 {
516 q_targ.net.copy_from(&q_net.net);
517 }
518
519 // Clear graphs
520 clear_all_graphs_known();
521 }
522 }
523
524 println!("=== DQN training finished ===");
525 Ok(())
526}
More examples
293fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
294 let mut total_sq = 0.0f32;
295 for p in parameters.iter() {
296 if let Some(g) = p.grad_owned() {
297 for &v in g.data() {
298 total_sq += v * v;
299 }
300 }
301 }
302 let norm = total_sq.sqrt();
303 if norm > max_norm {
304 let scale = max_norm / (norm + eps);
305 for p in parameters.iter_mut() {
306 if let Some(g) = p.grad_owned() {
307 p.set_grad(g.mul_scalar(scale));
308 }
309 }
310 }
311}
312
313fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
314 let mut total_sq = 0.0f32;
315 for p in parameters.iter_mut() {
316 if let Some(g) = p.grad_owned() {
317 for &v in g.data() {
318 total_sq += v * v;
319 }
320 }
321 }
322 total_sq.sqrt()
323}
324
325// -------------------------------
326// Main
327// -------------------------------
328
329pub fn main() -> Result<(), Box<dyn std::error::Error>> {
330 println!("=== PPO Continuous Example (YardEnv) ===");
331
332 let state_dim = 3usize;
333 let action_dim = 1usize;
334
335 // Hparams
336 let total_steps = std::env::var("PPO_STEPS")
337 .ok()
338 .and_then(|v| v.parse::<usize>().ok())
339 .unwrap_or(4000usize);
340 let horizon = 128usize; // rollout length per update
341 let epochs = 4usize; // PPO epochs per update
342 let mini_batch_size = 64usize; // minibatch from horizon
343 let gamma = 0.99f32;
344 let lam = 0.95f32; // GAE lambda
345 let clip_eps = 0.2f32;
346 let vf_coef = 0.5f32;
347 let ent_coef = 0.0f32;
348 let max_grad_norm = 1.0f32;
349
350 // Models
351 let mut actor = Actor::new(state_dim, action_dim, Some(101));
352 let mut critic = Critic::new(state_dim, Some(202));
353
354 // Opts
355 let mut actor_opt = Adam::with_learning_rate(3e-4);
356 for p in actor.parameters() {
357 actor_opt.add_parameter(p);
358 }
359 let mut critic_opt = Adam::with_learning_rate(3e-4);
360 for p in critic.parameters() {
361 critic_opt.add_parameter(p);
362 }
363
364 // Env and RNG
365 let mut env = YardEnv::new(42);
366 let mut rng = SmallRng::new(999);
367 let mut state = env.reset();
368
369 // Metrics
370 let mut episode_return = 0.0f32;
371 let mut episode = 0usize;
372 let mut ema_return: Option<f32> = None;
373 let ema_alpha = 0.05f32;
374 let mut best_return = f32::NEG_INFINITY;
375
376 let mut t = 0usize;
377 while t < total_steps {
378 // Collect a rollout
379 let mut batch = RolloutBatch::new(horizon, state_dim);
380 for _ in 0..horizon {
381 // Policy forward (detached sampling to not blow graph; we use stored log_probs)
382 let (mean, log_std_row) = actor.forward(&state);
383 let mean_v = mean.data()[0];
384 let log_std_v = log_std_row.data()[0];
385 let std_v = log_std_v.exp();
386 let noise = rng.normal();
387 let action_v = (mean_v + std_v * noise).clamp(-1.0, 1.0);
388
389 // Build action tensor [1, A] for log_prob calculation with autograd
390 let action_t = Tensor::from_slice(&[action_v], vec![1, action_dim]).unwrap();
391 let log_prob_t = gaussian_log_prob(&action_t, &mean, &log_std_row);
392 let log_prob_v = log_prob_t.data()[0];
393
394 // Step env
395 let (next_state, reward, done) = env.step(action_v);
396 episode_return += reward;
397
398 // Value
399 let value_t = critic.forward(&state);
400 let value_v = value_t.data()[0];
401
402 // Push
403 batch.push(
404 state.data(),
405 action_v,
406 log_prob_v,
407 reward,
408 if done { 1.0 } else { 0.0 },
409 value_v,
410 next_state.data(),
411 );
412
413 // Reset
414 state = if done {
415 let st = env.reset();
416 ema_return = Some(match ema_return {
417 None => episode_return,
418 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
419 });
420 if episode_return > best_return {
421 best_return = episode_return;
422 }
423 println!(
424 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
425 t,
426 episode,
427 episode_return,
428 ema_return.unwrap_or(episode_return),
429 best_return
430 );
431 episode_return = 0.0;
432 episode += 1;
433 st
434 } else {
435 next_state
436 };
437
438 t += 1;
439 if t >= total_steps {
440 break;
441 }
442 }
443
444 // Bootstrap next values for GAE
445 let next_values: Vec<f32> = {
446 let mut out = Vec::with_capacity(batch.len());
447 for i in 0..batch.len() {
448 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
449 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
450 let v2 = critic.forward(&s2_t).data()[0];
451 out.push(v2);
452 }
453 out
454 };
455
456 // Compute returns and advantages
457 let mut returns = vec![0.0f32; batch.len()];
458 let mut adv = vec![0.0f32; batch.len()];
459 compute_gae(
460 &mut returns,
461 &mut adv,
462 &batch.rewards,
463 &batch.dones,
464 &batch.values,
465 &next_values,
466 gamma,
467 lam,
468 );
469 normalize_in_place(&mut adv, 1e-8);
470
471 // Prepare tensors for training
472 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
473 let actions_t = Tensor::from_slice(&batch.actions, vec![batch.len(), action_dim]).unwrap();
474 let old_logp_t = Tensor::from_slice(&batch.log_probs, vec![batch.len(), 1]).unwrap();
475 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
476 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
477
478 // PPO epochs over the rollout
479 let num_minibatches = batch.len().div_ceil(mini_batch_size);
480 for e in 0..epochs {
481 for mb in 0..num_minibatches {
482 let start = mb * mini_batch_size;
483 let end = (start + mini_batch_size).min(batch.len());
484 if start >= end {
485 break;
486 }
487
488 // Slice views
489 let s_mb = states_t.slice_view(start * state_dim, 1, (end - start) * state_dim);
490 let s_mb = s_mb.reshape(vec![(end - start) as i32, state_dim as i32]);
491 let a_mb = actions_t
492 .slice_view(start * action_dim, 1, (end - start) * action_dim)
493 .reshape(vec![(end - start) as i32, action_dim as i32]);
494 let oldlp_mb = old_logp_t
495 .slice_view(start, 1, end - start)
496 .reshape(vec![(end - start) as i32, 1]);
497 let ret_mb = returns_t
498 .slice_view(start, 1, end - start)
499 .reshape(vec![(end - start) as i32, 1]);
500 let adv_mb = adv_t
501 .slice_view(start, 1, end - start)
502 .reshape(vec![(end - start) as i32, 1]);
503
504 // Zero grads
505 {
506 let mut ps = actor.parameters();
507 actor_opt.zero_grad(&mut ps);
508 }
509 {
510 let mut ps = critic.parameters();
511 critic_opt.zero_grad(&mut ps);
512 }
513
514 // Forward actor and critic
515 let (mean_mb, log_std_row) = actor.forward(&s_mb);
516 let logp_mb = gaussian_log_prob(&a_mb, &mean_mb, &log_std_row);
517 let ratio = logp_mb.sub_tensor(&oldlp_mb).exp(); // exp(new-old)
518 let clip_low =
519 Tensor::from_slice(&vec![1.0 - clip_eps; end - start], vec![end - start, 1])
520 .unwrap();
521 let clip_high =
522 Tensor::from_slice(&vec![1.0 + clip_eps; end - start], vec![end - start, 1])
523 .unwrap();
524 // ratio_clipped = min(max(ratio, low), high) using ReLU identities
525 let ratio_ge_low = ratio.sub_tensor(&clip_low).relu().add_tensor(&clip_low);
526 let ratio_clipped =
527 clip_high.sub_tensor(&ratio_ge_low.sub_tensor(&clip_high).relu());
528 let pg1 = ratio.mul_tensor(&adv_mb);
529 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
530 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
531 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
532 let actor_loss = actor_min.mul_scalar(-1.0).mean();
533
534 let v_pred = critic.forward(&s_mb);
535 let v_loss = v_pred
536 .sub_tensor(&ret_mb)
537 .pow_scalar(2.0)
538 .mean()
539 .mul_scalar(vf_coef);
540
541 // Entropy (approx Gaussian entropy per action)
542 let entropy = log_std_row
543 .add_scalar(0.5 * (2.0 * std::f32::consts::PI * std::f32::consts::E).ln())
544 .sum_dims(&[1], true)
545 .mean()
546 .mul_scalar(ent_coef);
547
548 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&entropy);
549 loss.backward(None);
550
551 // Step actor
552 {
553 let params = actor.parameters();
554 let mut with_grads: Vec<&mut Tensor> = Vec::new();
555 for p in params {
556 if p.grad_owned().is_some() {
557 with_grads.push(p);
558 }
559 }
560 if !with_grads.is_empty() {
561 let _ = grad_global_norm(&mut with_grads);
562 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
563 actor_opt.step(&mut with_grads);
564 actor_opt.zero_grad(&mut with_grads);
565 }
566 }
567
568 // Step critic
569 {
570 let params = critic.parameters();
571 let mut with_grads: Vec<&mut Tensor> = Vec::new();
572 for p in params {
573 if p.grad_owned().is_some() {
574 with_grads.push(p);
575 }
576 }
577 if !with_grads.is_empty() {
578 let _ = grad_global_norm(&mut with_grads);
579 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
580 critic_opt.step(&mut with_grads);
581 critic_opt.zero_grad(&mut with_grads);
582 }
583 }
584
585 // Occasionally log
586 if e == 0 && mb == 0 {
587 println!(
588 "update@t={} | actor_loss={:.4} v_loss={:.4}",
589 t,
590 actor_loss.value(),
591 v_loss.value()
592 );
593 }
594
595 clear_all_graphs_known();
596 }
597 }
598 }
599
600 println!("=== PPO training finished ===");
601 Ok(())
602}
252fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
253 let mut total_sq = 0.0f32;
254 for p in parameters.iter() {
255 if let Some(g) = p.grad_owned() {
256 for &v in g.data() {
257 total_sq += v * v;
258 }
259 }
260 }
261 let norm = total_sq.sqrt();
262 if norm > max_norm {
263 let scale = max_norm / (norm + eps);
264 for p in parameters.iter_mut() {
265 if let Some(g) = p.grad_owned() {
266 p.set_grad(g.mul_scalar(scale));
267 }
268 }
269 }
270}
271
272// log-softmax for selected actions: given logits [B,A] and actions Vec<usize> -> log_prob [B,1]
273fn log_prob_actions(
274 logits: &Tensor,
275 actions: &[usize],
276 batch: usize,
277 _action_dim: usize,
278) -> Tensor {
279 let max_logits = logits.max_dims(&[1], true); // [B,1]
280 let shifted = logits.sub_tensor(&max_logits);
281 let exp = shifted.exp();
282 let sum_exp = exp.sum_dims(&[1], true); // [B,1]
283 let log_sum_exp = sum_exp.log(); // [B,1]
284 let log_softmax = shifted.sub_tensor(&log_sum_exp); // [B,A]
285 // gather selected action log-probs
286 log_softmax.gather(1, actions, &[batch, 1])
287}
288
289// probability ratio = exp(new_logp - old_logp)
290fn ratio_from_logps(new_logp: &Tensor, old_logp: &Tensor) -> Tensor {
291 new_logp.sub_tensor(old_logp).exp()
292}
293
294// Clamp ratio to [1-clip, 1+clip] using ReLU-based clamp (no custom ops)
295fn clamp_ratio(ratio: &Tensor, clip_eps: f32) -> Tensor {
296 let b = ratio.shape().dims()[0];
297 let low = Tensor::from_slice(&vec![1.0 - clip_eps; b], vec![b, 1]).unwrap();
298 let high = Tensor::from_slice(&vec![1.0 + clip_eps; b], vec![b, 1]).unwrap();
299 let ge_low = ratio.sub_tensor(&low).relu().add_tensor(&low);
300 high.sub_tensor(&ge_low.sub_tensor(&high).relu())
301}
302
303fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
304 let mut total_sq = 0.0f32;
305 for p in parameters.iter_mut() {
306 if let Some(g) = p.grad_owned() {
307 for &v in g.data() {
308 total_sq += v * v;
309 }
310 }
311 }
312 total_sq.sqrt()
313}
314
315// -------------------------------
316// Main
317// -------------------------------
318
319pub fn main() -> Result<(), Box<dyn std::error::Error>> {
320 println!("=== PPO Discrete Example (YardEnv) ===");
321
322 let state_dim = 3usize;
323 let action_dim = 3usize;
324 let total_steps = std::env::var("PPOD_STEPS")
325 .ok()
326 .and_then(|v| v.parse::<usize>().ok())
327 .unwrap_or(3500usize);
328 let horizon = 128usize;
329 let epochs = 4usize;
330 let mini_batch_size = 64usize;
331 let gamma = 0.99f32;
332 let lam = 0.95f32;
333 let clip_eps = 0.2f32;
334 let vf_coef = 0.5f32;
335 let ent_coef = 0.0f32;
336 let max_grad_norm = 1.0f32;
337
338 let mut actor = Actor::new(state_dim, action_dim, Some(111));
339 let mut critic = Critic::new(state_dim, Some(222));
340 let mut actor_opt = Adam::with_learning_rate(3e-4);
341 for p in actor.parameters() {
342 actor_opt.add_parameter(p);
343 }
344 let mut critic_opt = Adam::with_learning_rate(3e-4);
345 for p in critic.parameters() {
346 critic_opt.add_parameter(p);
347 }
348
349 let mut env = YardEnv::new(1234);
350 let mut rng = SmallRng::new(98765);
351 let mut state = env.reset();
352 let mut episode_return = 0.0f32;
353 let mut episode = 0usize;
354 let mut ema_return: Option<f32> = None;
355 let ema_alpha = 0.05f32;
356 let mut best_return = f32::NEG_INFINITY;
357
358 let mut t = 0usize;
359 while t < total_steps {
360 let mut batch = RolloutBatch::new(horizon, state_dim);
361 for _ in 0..horizon {
362 // Actor logits and categorical sampling
363 let logits = actor.forward(&state); // [1, A]
364 let probs = logits.softmax(1); // [1, A]
365 // sample action from probs (CPU sampling)
366 let p = probs.data();
367 let (p0, p1, _p2) = (p[0], p[1], p[2]);
368 let u = rng.next_f32();
369 let a_idx = if u < p0 {
370 0
371 } else if u < p0 + p1 {
372 1
373 } else {
374 2
375 };
376
377 let old_logp = {
378 let _ng = NoGradTrack::new();
379 let lp = log_prob_actions(&logits, &[a_idx], 1, action_dim);
380 lp.data()[0]
381 };
382
383 // Step env
384 let (next_state, reward, done) = env.step(a_idx);
385 episode_return += reward;
386
387 // Critic value
388 let value_t = critic.forward(&state);
389 let value_v = value_t.data()[0];
390
391 batch.push(
392 state.data(),
393 a_idx,
394 old_logp,
395 reward,
396 if done { 1.0 } else { 0.0 },
397 value_v,
398 next_state.data(),
399 );
400
401 state = if done {
402 let st = env.reset();
403 ema_return = Some(match ema_return {
404 None => episode_return,
405 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
406 });
407 if episode_return > best_return {
408 best_return = episode_return;
409 }
410 println!(
411 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
412 t,
413 episode,
414 episode_return,
415 ema_return.unwrap_or(episode_return),
416 best_return
417 );
418 episode_return = 0.0;
419 episode += 1;
420 st
421 } else {
422 next_state
423 };
424
425 t += 1;
426 if t >= total_steps {
427 break;
428 }
429 }
430
431 // Bootstrap values for GAE
432 let next_values: Vec<f32> = {
433 let mut out = Vec::with_capacity(batch.len());
434 for i in 0..batch.len() {
435 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
436 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
437 out.push(critic.forward(&s2_t).data()[0]);
438 }
439 out
440 };
441
442 let mut returns = vec![0.0f32; batch.len()];
443 let mut adv = vec![0.0f32; batch.len()];
444 compute_gae(
445 &mut returns,
446 &mut adv,
447 &batch.rewards,
448 &batch.dones,
449 &batch.values,
450 &next_values,
451 gamma,
452 lam,
453 );
454 normalize_in_place(&mut adv, 1e-8);
455
456 // Tensors for training
457 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
458 let actions_vec = batch.actions.clone();
459 let old_logp_t = Tensor::from_slice(&batch.old_logps, vec![batch.len(), 1]).unwrap();
460 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
461 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
462
463 // PPO epochs
464 let num_minibatches = batch.len().div_ceil(mini_batch_size);
465 for e in 0..epochs {
466 for mb in 0..num_minibatches {
467 let start = mb * mini_batch_size;
468 let end = (start + mini_batch_size).min(batch.len());
469 if start >= end {
470 break;
471 }
472
473 // Views
474 let s_mb = states_t
475 .slice_view(start * state_dim, 1, (end - start) * state_dim)
476 .reshape(vec![(end - start) as i32, state_dim as i32]);
477 let oldlp_mb = old_logp_t
478 .slice_view(start, 1, end - start)
479 .reshape(vec![(end - start) as i32, 1]);
480 let ret_mb = returns_t
481 .slice_view(start, 1, end - start)
482 .reshape(vec![(end - start) as i32, 1]);
483 let adv_mb = adv_t
484 .slice_view(start, 1, end - start)
485 .reshape(vec![(end - start) as i32, 1]);
486 let a_slice = &actions_vec[start..end];
487
488 // Zero grads
489 {
490 let mut ps = actor.parameters();
491 actor_opt.zero_grad(&mut ps);
492 }
493 {
494 let mut ps = critic.parameters();
495 critic_opt.zero_grad(&mut ps);
496 }
497
498 // Forward
499 let logits_mb = actor.forward(&s_mb); // [B,A]
500 let new_logp_mb = log_prob_actions(&logits_mb, a_slice, end - start, action_dim); // [B,1]
501 let ratio = ratio_from_logps(&new_logp_mb, &oldlp_mb);
502 let ratio_clipped = clamp_ratio(&ratio, clip_eps);
503 let pg1 = ratio.mul_tensor(&adv_mb);
504 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
505 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
506 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
507 let actor_loss = actor_min.mul_scalar(-1.0).mean();
508
509 let v_pred = critic.forward(&s_mb);
510 let v_loss = v_pred
511 .sub_tensor(&ret_mb)
512 .pow_scalar(2.0)
513 .mean()
514 .mul_scalar(vf_coef);
515
516 // Entropy bonus from logits (categorical entropy) ≈ -sum p*logp
517 let probs_mb = logits_mb.softmax(1);
518 let logp_all = probs_mb.add_scalar(1e-8).log();
519 let ent = probs_mb
520 .mul_tensor(&logp_all)
521 .sum_dims(&[1], true)
522 .mul_scalar(-1.0)
523 .mean()
524 .mul_scalar(ent_coef);
525
526 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&ent);
527 loss.backward(None);
528
529 // Step actor
530 {
531 let params = actor.parameters();
532 let mut with_grads: Vec<&mut Tensor> = Vec::new();
533 for p in params {
534 if p.grad_owned().is_some() {
535 with_grads.push(p);
536 }
537 }
538 if !with_grads.is_empty() {
539 let _ = grad_global_norm(&mut with_grads);
540 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
541 actor_opt.step(&mut with_grads);
542 actor_opt.zero_grad(&mut with_grads);
543 }
544 }
545
546 // Step critic
547 {
548 let params = critic.parameters();
549 let mut with_grads: Vec<&mut Tensor> = Vec::new();
550 for p in params {
551 if p.grad_owned().is_some() {
552 with_grads.push(p);
553 }
554 }
555 if !with_grads.is_empty() {
556 let _ = grad_global_norm(&mut with_grads);
557 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
558 critic_opt.step(&mut with_grads);
559 critic_opt.zero_grad(&mut with_grads);
560 }
561 }
562
563 if e == 0 && mb == 0 {
564 println!(
565 "update@t={} | actor_loss={:.4} v_loss={:.4}",
566 t,
567 actor_loss.value(),
568 v_loss.value()
569 );
570 }
571
572 clear_all_graphs_known();
573 }
574 }
575 }
576
577 println!("=== PPO discrete training finished ===");
578 Ok(())
579}
23fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
24 let mut total_sq = 0.0f32;
25 for p in parameters.iter() {
26 if let Some(g) = p.grad_owned() {
27 for &v in g.data() {
28 total_sq += v * v;
29 }
30 }
31 }
32 let norm = total_sq.sqrt();
33 if norm > max_norm {
34 let scale = max_norm / (norm + eps);
35 for p in parameters.iter_mut() {
36 if let Some(g) = p.grad_owned() {
37 p.set_grad(g.mul_scalar(scale));
38 }
39 }
40 }
41}
42
43fn accuracy(pred: &Tensor, targets: &Tensor) -> f32 {
44 // pred: [B,1] with sigmoid; threshold at 0.5
45 let p = pred.data();
46 let t = targets.data();
47 let mut correct = 0usize;
48 for i in 0..p.len() {
49 let yhat = if p[i] >= 0.5 { 1.0 } else { 0.0 };
50 if (yhat - t[i]).abs() < 1e-6 {
51 correct += 1;
52 }
53 }
54 correct as f32 / (p.len() as f32)
55}
56
57// Numerically stable BCE with logits:
58// L = mean( relu(z) - z*y + log(1 + exp(-|z|)) )
59fn bce_with_logits(logits: &Tensor, targets: &Tensor) -> Tensor {
60 let relu_z = logits.relu();
61 let zy = logits.mul_tensor(targets);
62 // |z| = relu(z) + relu(-z)
63 let abs_z = relu_z.add_tensor(&logits.mul_scalar(-1.0).relu());
64 let log_term = abs_z.mul_scalar(-1.0).exp().add_scalar(1.0).log();
65 relu_z.sub_tensor(&zy).add_tensor(&log_term).mean()
66}
67
68pub fn main() -> Result<(), Box<dyn std::error::Error>> {
69 println!("=== Supervised FFN Example (XOR) ===");
70
71 // Dataset: XOR (repeat to form a small batch)
72 let inputs: Vec<f32> = vec![
73 0.0, 0.0, // -> 0
74 0.0, 1.0, // -> 1
75 1.0, 0.0, // -> 1
76 1.0, 1.0, // -> 0
77 ];
78 let targets: Vec<f32> = vec![0.0, 1.0, 1.0, 0.0];
79
80 // Repeat the base patterns to stabilize training
81 let repeats = 64usize; // effective batch = 4 * repeats = 256
82 let mut xs = Vec::with_capacity(repeats * inputs.len());
83 let mut ys = Vec::with_capacity(repeats * targets.len());
84 for _ in 0..repeats {
85 xs.extend_from_slice(&inputs);
86 ys.extend_from_slice(&targets);
87 }
88
89 let batch = xs.len() / 2; // two features
90 let x_t = Tensor::from_slice(&xs, vec![batch, 2]).unwrap();
91 let y_t = Tensor::from_slice(&ys, vec![batch, 1]).unwrap();
92
93 // Model config: 2 -> 32 -> 32 -> 1, final sigmoid via loss path
94 let cfg = FeedForwardConfig {
95 input_size: 2,
96 hidden_sizes: vec![32, 32],
97 output_size: 1,
98 use_bias: true,
99 };
100 let mut net = FeedForwardNetwork::new(cfg, Some(777));
101
102 // Optimizer and parameter linking
103 let mut opt = Adam::with_learning_rate(1e-3);
104 for p in net.parameters() {
105 opt.add_parameter(p);
106 }
107
108 let epochs = 1000usize;
109 let max_grad_norm = 1.0f32;
110 let mut best_loss = f32::INFINITY;
111 let mut best_acc = 0.0f32;
112
113 for e in 0..epochs {
114 // Zero grads each iteration
115 {
116 let mut params = net.parameters();
117 opt.zero_grad(&mut params);
118 }
119
120 // Forward -> logits; use numerically stable BCE-with-logits for loss
121 let logits = net.forward(&x_t);
122 let mut loss = bce_with_logits(&logits, &y_t);
123 loss.backward(None);
124
125 // Step only params with grads
126 {
127 let params = net.parameters();
128 let mut with_grads: Vec<&mut Tensor> = Vec::new();
129 for p in params {
130 if p.grad_owned().is_some() {
131 with_grads.push(p);
132 }
133 }
134 if !with_grads.is_empty() {
135 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
136 opt.step(&mut with_grads);
137 opt.zero_grad(&mut with_grads);
138 }
139 }
140
141 // Metrics (use sigmoid only for reporting accuracy)
142 let preds = logits.sigmoid();
143 let acc = accuracy(&preds, &y_t);
144 if loss.value() < best_loss {
145 best_loss = loss.value();
146 }
147 if acc > best_acc {
148 best_acc = acc;
149 }
150 if e % 10 == 0 || e + 1 == epochs {
151 println!(
152 "epoch {:4} | loss={:.5} acc={:.3} | best_loss={:.5} best_acc={:.3}",
153 e,
154 loss.value(),
155 acc,
156 best_loss,
157 best_acc
158 );
159 }
160
161 // Clear graphs to avoid stale accumulation across epochs
162 clear_all_graphs_known();
163 }
164
165 // Quick sanity check predictions
166 let test = Tensor::from_slice(&inputs, vec![4, 2]).unwrap();
167 let out = net.forward(&test).sigmoid();
168 println!("predictions (approx): {:?}", out.data());
169
170 println!("=== Supervised training finished ===");
171 Ok(())
172}
23fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
24 let mut total_sq = 0.0f32;
25 for p in parameters.iter() {
26 if let Some(g) = p.grad_owned() {
27 for &v in g.data() {
28 total_sq += v * v;
29 }
30 }
31 }
32 let norm = total_sq.sqrt();
33 if norm > max_norm {
34 let scale = max_norm / (norm + eps);
35 for p in parameters.iter_mut() {
36 if let Some(g) = p.grad_owned() {
37 p.set_grad(g.mul_scalar(scale));
38 }
39 }
40 }
41}
42
43// Cross-entropy over logits: CE = -mean(log_softmax(logits)[range, labels])
44fn cross_entropy_logits(
45 logits: &Tensor,
46 labels: &[usize],
47 batch: usize,
48 _num_classes: usize,
49) -> Tensor {
50 // log_softmax = logits - logsumexp(logits, dim=1)
51 let max_logits = logits.max_dims(&[1], true);
52 let shifted = logits.sub_tensor(&max_logits);
53 let exp = shifted.exp();
54 let sum_exp = exp.sum_dims(&[1], true);
55 let log_sum_exp = sum_exp.log();
56 let log_softmax = shifted.sub_tensor(&log_sum_exp);
57 let ll = log_softmax.gather(1, labels, &[batch, 1]); // selected log-probs
58 ll.mul_scalar(-1.0).mean()
59}
60
61fn accuracy_from_logits(
62 logits: &Tensor,
63 labels: &[usize],
64 batch: usize,
65 num_classes: usize,
66) -> f32 {
67 let row = logits.data();
68 let mut correct = 0usize;
69 for (i, &label) in labels.iter().enumerate().take(batch) {
70 let base = i * num_classes;
71 let mut best_j = 0usize;
72 let mut best_v = row[base];
73 for j in 1..num_classes {
74 let v = row[base + j];
75 if v > best_v {
76 best_v = v;
77 best_j = j;
78 }
79 }
80 if best_j == label {
81 correct += 1;
82 }
83 }
84 correct as f32 / batch as f32
85}
86
87pub fn main() -> Result<(), Box<dyn std::error::Error>> {
88 println!("=== Supervised Classification Example (Cross-Entropy) ===");
89
90 // Synthetic 2D inputs, 3 classes with linear-ish separations
91 let n = 1200usize;
92 let classes = 3usize;
93 let mut xs: Vec<f32> = Vec::with_capacity(n * 2);
94 let mut ys: Vec<usize> = Vec::with_capacity(n);
95
96 // Simple RNG
97 let mut state: u64 = 424242;
98 let mut rand_f32 = || {
99 state = state.wrapping_mul(1664525).wrapping_add(1013904223);
100 (state >> 16) as f32 / (u32::MAX as f32)
101 };
102
103 for _ in 0..n {
104 let x1 = rand_f32() * 4.0 - 2.0;
105 let x2 = rand_f32() * 4.0 - 2.0;
106 // Class by quadrant-ish rule with noise
107 let mut c = if x1 + 0.5 * x2 > 0.5 {
108 0
109 } else if x1 - x2 < -0.5 {
110 1
111 } else {
112 2
113 };
114 if rand_f32() < 0.05 {
115 c = (c + 1) % classes;
116 }
117 xs.push(x1);
118 xs.push(x2);
119 ys.push(c);
120 }
121
122 // Normalize inputs per-feature to [-1, 1]
123 let mut min1 = f32::INFINITY;
124 let mut max1 = f32::NEG_INFINITY;
125 let mut min2 = f32::INFINITY;
126 let mut max2 = f32::NEG_INFINITY;
127 for i in (0..xs.len()).step_by(2) {
128 let a = xs[i];
129 let b = xs[i + 1];
130 if a < min1 {
131 min1 = a;
132 }
133 if a > max1 {
134 max1 = a;
135 }
136 if b < min2 {
137 min2 = b;
138 }
139 if b > max2 {
140 max2 = b;
141 }
142 }
143 let rng1 = (max1 - min1).max(1e-8);
144 let rng2 = (max2 - min2).max(1e-8);
145 for i in (0..xs.len()).step_by(2) {
146 let a = xs[i];
147 let b = xs[i + 1];
148 xs[i] = 2.0 * (a - min1) / rng1 - 1.0;
149 xs[i + 1] = 2.0 * (b - min2) / rng2 - 1.0;
150 }
151
152 // Train/Val split (80/20)
153 let n_train = (n as f32 * 0.8) as usize;
154 let x_train = Tensor::from_slice(&xs[..n_train * 2], vec![n_train, 2]).unwrap();
155 let y_train = ys[..n_train].to_vec();
156 let x_val = Tensor::from_slice(&xs[n_train * 2..], vec![n - n_train, 2]).unwrap();
157 let y_val = ys[n_train..].to_vec();
158
159 // Model: 2 -> 64 -> 64 -> 3 (logits)
160 let cfg = FeedForwardConfig {
161 input_size: 2,
162 hidden_sizes: vec![64, 64],
163 output_size: classes,
164 use_bias: true,
165 };
166 let mut net = FeedForwardNetwork::new(cfg, Some(303));
167
168 // Optimizer
169 let mut opt = Adam::with_learning_rate(1e-3);
170 for p in net.parameters() {
171 opt.add_parameter(p);
172 }
173
174 let epochs = 300usize;
175 let max_grad_norm = 1.0f32;
176 let mut best_val_acc = 0.0f32;
177 let mut best_val_loss = f32::INFINITY;
178
179 for e in 0..epochs {
180 // Zero grads
181 {
182 let mut params = net.parameters();
183 opt.zero_grad(&mut params);
184 }
185
186 // Forward logits
187 let logits = net.forward(&x_train);
188 let mut loss = cross_entropy_logits(&logits, &y_train, n_train, classes);
189 loss.backward(None);
190
191 // Step clipped
192 {
193 let params = net.parameters();
194 let mut with_grads: Vec<&mut Tensor> = Vec::new();
195 for p in params {
196 if p.grad_owned().is_some() {
197 with_grads.push(p);
198 }
199 }
200 if !with_grads.is_empty() {
201 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
202 opt.step(&mut with_grads);
203 opt.zero_grad(&mut with_grads);
204 }
205 }
206
207 // Metrics
208 let train_acc = accuracy_from_logits(&logits, &y_train, n_train, classes);
209 let val_logits = net.forward(&x_val);
210 let val_loss = cross_entropy_logits(&val_logits, &y_val, n - n_train, classes).value();
211 let val_acc = accuracy_from_logits(&val_logits, &y_val, n - n_train, classes);
212 if val_acc > best_val_acc {
213 best_val_acc = val_acc;
214 }
215 if val_loss < best_val_loss {
216 best_val_loss = val_loss;
217 }
218
219 if e % 10 == 0 || e + 1 == epochs {
220 println!(
221 "epoch {:4} | loss={:.4} acc={:.3} | val_loss={:.4} val_acc={:.3} | best_val_acc={:.3}",
222 e, loss.value(), train_acc, val_loss, val_acc, best_val_acc
223 );
224 }
225
226 clear_all_graphs_known();
227 }
228
229 // Quick sample preds via softmax
230 let samples = Tensor::from_slice(&[-1.0, -1.0, 0.0, 0.0, 1.0, 1.0], vec![3, 2]).unwrap();
231 let sm = net.forward(&samples).softmax(1);
232 println!("sample class probs: {:?}", sm.data());
233
234 println!("=== Supervised classification finished ===");
235 Ok(())
236}
22fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
23 let mut total_sq = 0.0f32;
24 for p in parameters.iter() {
25 if let Some(g) = p.grad_owned() {
26 for &v in g.data() {
27 total_sq += v * v;
28 }
29 }
30 }
31 let norm = total_sq.sqrt();
32 if norm > max_norm {
33 let scale = max_norm / (norm + eps);
34 for p in parameters.iter_mut() {
35 if let Some(g) = p.grad_owned() {
36 p.set_grad(g.mul_scalar(scale));
37 }
38 }
39 }
40}
41
42fn mse(pred: &Tensor, target: &Tensor) -> Tensor {
43 pred.sub_tensor(target).pow_scalar(2.0).mean()
44}
45
46fn rmse(pred: &Tensor, target: &Tensor) -> f32 {
47 mse(pred, target).sqrt().value()
48}
49
50fn r2_score(pred: &Tensor, target: &Tensor) -> f32 {
51 // R^2 = 1 - SS_res / SS_tot
52 let y = target;
53 let y_mean = y.mean();
54 let ss_res = pred.sub_tensor(y).pow_scalar(2.0).sum();
55 let ss_tot = y.sub_tensor(&y_mean).pow_scalar(2.0).sum();
56 let ss_res_v = ss_res.value();
57 let ss_tot_v = ss_tot.value().max(1e-12); // avoid divide by zero
58 1.0 - (ss_res_v / ss_tot_v)
59}
60
61pub fn main() -> Result<(), Box<dyn std::error::Error>> {
62 println!("=== Supervised Regression Example (MSE) ===");
63
64 // Generate simple synthetic data: y = 2*x1 - 3*x2 + 0.5 + noise
65 let n = 1024usize;
66 let mut xs: Vec<f32> = Vec::with_capacity(n * 2);
67 let mut ys: Vec<f32> = Vec::with_capacity(n);
68 // Simple LCG RNG for reproducibility
69 let mut state: u64 = 123456789;
70 let mut rand_f32 = || {
71 state = state.wrapping_mul(1664525).wrapping_add(1013904223);
72 (state >> 16) as f32 / (u32::MAX as f32)
73 };
74 for _ in 0..n {
75 let x1 = rand_f32() * 2.0 - 1.0;
76 let x2 = rand_f32() * 2.0 - 1.0;
77 let noise = (rand_f32() * 2.0 - 1.0) * 0.05;
78 let y = 2.0 * x1 - 3.0 * x2 + 0.5 + noise;
79 xs.push(x1);
80 xs.push(x2);
81 ys.push(y);
82 }
83
84 // Normalize targets to [-1, 1] (max-abs scaling) for reasonable loss magnitudes
85 let mut max_abs = 0.0f32;
86 for &v in &ys {
87 let a = v.abs();
88 if a > max_abs {
89 max_abs = a;
90 }
91 }
92 if max_abs < 1e-8 {
93 max_abs = 1.0;
94 }
95 for v in ys.iter_mut() {
96 *v /= max_abs;
97 }
98
99 // Normalize inputs per-feature to [-1, 1] (min-max scaling)
100 let mut min1 = f32::INFINITY;
101 let mut max1 = f32::NEG_INFINITY;
102 let mut min2 = f32::INFINITY;
103 let mut max2 = f32::NEG_INFINITY;
104 for i in (0..xs.len()).step_by(2) {
105 let a = xs[i];
106 let b = xs[i + 1];
107 if a < min1 {
108 min1 = a;
109 }
110 if a > max1 {
111 max1 = a;
112 }
113 if b < min2 {
114 min2 = b;
115 }
116 if b > max2 {
117 max2 = b;
118 }
119 }
120 let rng1 = (max1 - min1).max(1e-8);
121 let rng2 = (max2 - min2).max(1e-8);
122 for i in (0..xs.len()).step_by(2) {
123 let a = xs[i];
124 let b = xs[i + 1];
125 xs[i] = 2.0 * (a - min1) / rng1 - 1.0;
126 xs[i + 1] = 2.0 * (b - min2) / rng2 - 1.0;
127 }
128
129 // Train/Val split (80/20)
130 let n_train = (n as f32 * 0.8) as usize;
131 let x_train = Tensor::from_slice(&xs[..n_train * 2], vec![n_train, 2]).unwrap();
132 let y_train = Tensor::from_slice(&ys[..n_train], vec![n_train, 1]).unwrap();
133 let x_val = Tensor::from_slice(&xs[n_train * 2..], vec![n - n_train, 2]).unwrap();
134 let y_val = Tensor::from_slice(&ys[n_train..], vec![n - n_train, 1]).unwrap();
135
136 // Model config: 2 -> 64 -> 64 -> 1
137 let cfg = FeedForwardConfig {
138 input_size: 2,
139 hidden_sizes: vec![64, 64],
140 output_size: 1,
141 use_bias: true,
142 };
143 let mut net = FeedForwardNetwork::new(cfg, Some(2025));
144
145 // Optimizer and parameter linking
146 let mut opt = Adam::with_learning_rate(1e-3);
147 for p in net.parameters() {
148 opt.add_parameter(p);
149 }
150
151 let epochs = 400usize;
152 let max_grad_norm = 1.0f32;
153 let mut best_val_rmse = f32::INFINITY;
154 let mut best_val_r2 = -f32::INFINITY;
155
156 for e in 0..epochs {
157 // Zero grads
158 {
159 let mut params = net.parameters();
160 opt.zero_grad(&mut params);
161 }
162
163 // Forward
164 let pred = net.forward(&x_train);
165 let mut loss = mse(&pred, &y_train);
166 loss.backward(None);
167
168 // Step
169 {
170 let params = net.parameters();
171 let mut with_grads: Vec<&mut Tensor> = Vec::new();
172 for p in params {
173 if p.grad_owned().is_some() {
174 with_grads.push(p);
175 }
176 }
177 if !with_grads.is_empty() {
178 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
179 opt.step(&mut with_grads);
180 opt.zero_grad(&mut with_grads);
181 }
182 }
183
184 // Metrics
185 let train_rmse = rmse(&pred, &y_train);
186 let train_r2 = r2_score(&pred, &y_train);
187 let val_pred = net.forward(&x_val);
188 let val_rmse = rmse(&val_pred, &y_val);
189 let val_r2 = r2_score(&val_pred, &y_val);
190 if val_rmse < best_val_rmse {
191 best_val_rmse = val_rmse;
192 }
193 if val_r2 > best_val_r2 {
194 best_val_r2 = val_r2;
195 }
196
197 if e % 20 == 0 || e + 1 == epochs {
198 // Clamp displayed R^2 to avoid huge negative prints on early epochs
199 let train_r2_disp = train_r2.max(-10.0);
200 let val_r2_disp = val_r2.max(-10.0);
201 println!(
202 "epoch {:4} | train_rmse={:.4} r2={:.3} | val_rmse={:.4} r2={:.3} | best_val_rmse={:.4} best_val_r2={:.3}",
203 e, train_rmse, train_r2_disp, val_rmse, val_r2_disp, best_val_rmse, best_val_r2
204 );
205 }
206
207 clear_all_graphs_known();
208 }
209
210 // Quick sanity predictions on small samples
211 let sample = Tensor::from_slice(&[0.5, -0.25, -0.8, 0.3], vec![2, 2]).unwrap();
212 let sample_pred = net.forward(&sample);
213 println!("samples pred: {:?}", sample_pred.data());
214
215 println!("=== Supervised regression finished ===");
216 Ok(())
217}
Sourcepub fn id(&self) -> usize
pub fn id(&self) -> usize
Get the unique ID of this tensor
Returns the unique identifier assigned to this tensor during creation. This ID is used by the gradtrack system for gradient tracking and for tensor identification.
§Returns
Unique tensor ID as usize
§Examples
use train_station::Tensor;
let tensor1 = Tensor::new(vec![2, 3]);
let tensor2 = Tensor::new(vec![2, 3]);
assert_ne!(tensor1.id(), tensor2.id()); // Each tensor has unique ID
Sourcepub fn detach(&self) -> Self
pub fn detach(&self) -> Self
Detach this tensor from the computation graph
Returns a new tensor with the same data but no gradient tracking. This is useful when you want to use a tensor for inference without affecting the computation graph.
§Returns
A new tensor with the same data but gradient tracking disabled
§Examples
use train_station::Tensor;
let tensor = Tensor::ones(vec![2, 3]).with_requires_grad();
let detached = tensor.detach();
assert!(!detached.requires_grad());
assert_eq!(tensor.size(), detached.size());
Sourcepub fn detach_(&mut self)
pub fn detach_(&mut self)
Detach this tensor from the computation graph in place
Similar to detach() but modifies this tensor in place. This is useful when you want to disable gradient tracking for the current tensor without creating a copy.
§Examples
use train_station::Tensor;
let mut tensor = Tensor::ones(vec![2, 3]).with_requires_grad();
assert!(tensor.requires_grad());
tensor.detach_();
assert!(!tensor.requires_grad());
Sourcepub fn backward(&mut self, grad_output: Option<Tensor>)
pub fn backward(&mut self, grad_output: Option<Tensor>)
Entry point for backward pass on this tensor
Computes gradients for all tensors in the computation graph that have
requires_grad set to true. This is the main entry point for automatic
differentiation.
§Arguments
grad_output - Optional gradient tensor for the output. If None, assumes the tensor is a scalar (e.g., loss value) and uses a tensor of ones.
§Examples
use train_station::Tensor;
let mut tensor = Tensor::ones(vec![2, 3]).with_requires_grad();
let mut result = tensor.add_scalar(5.0);
result.backward(None);
// Note: Gradient computation depends on the gradtrack system implementation
Examples found in repository?
73fn main() -> Result<(), Box<dyn std::error::Error>> {
74 println!("=== Basic Encoder Example ===");
75
76 let batch = 2usize;
77 let seq = 6usize;
78 let embed = 32usize;
79 let heads = 4usize;
80
81 let input = Tensor::randn(vec![batch, seq, embed], Some(11));
82 let mut enc = EncoderBlock::new(embed, heads, Some(123));
83
84 // Example: no mask (set Some(mask) to use masking)
85 let out = enc.forward(&input, None);
86 println!("Output shape: {:?}", out.shape().dims());
87
88 // Verify gradients/optimization
89 let mut opt = Adam::with_learning_rate(0.01);
90 let mut params = enc.parameters();
91    for p in &params {
92 opt.add_parameter(p);
93 }
94 let mut loss = out.mean();
95 loss.backward(None);
96 opt.step(&mut params);
97 opt.zero_grad(&mut params);
98 println!("Loss: {:.6}", loss.value());
99 println!("=== Done ===");
100 Ok(())
101}
More examples
84fn main() -> Result<(), Box<dyn std::error::Error>> {
85 println!("=== Basic Decoder Example ===");
86
87 let batch = 2usize;
88 let src = 7usize;
89 let tgt = 5usize;
90 let embed = 32usize;
91 let heads = 4usize;
92
93 let memory = Tensor::randn(vec![batch, src, embed], Some(21));
94 let tgt_in = Tensor::randn(vec![batch, tgt, embed], Some(22));
95
96 let mut dec = DecoderBlock::new(embed, heads, Some(456));
97 let out = dec.forward(&tgt_in, &memory, None, None);
98 println!("Output shape: {:?}", out.shape().dims());
99
100 let mut opt = Adam::with_learning_rate(0.01);
101 let mut params = dec.parameters();
102    for p in &params {
103 opt.add_parameter(p);
104 }
105 let mut loss = out.mean();
106 loss.backward(None);
107 opt.step(&mut params);
108 opt.zero_grad(&mut params);
109 println!("Loss: {:.6}", loss.value());
110 println!("=== Done ===");
111 Ok(())
112}
175fn demonstrate_gradient_tracking() -> Result<(), Box<dyn std::error::Error>> {
176 println!("\n--- Gradient Tracking ---");
177
178 // Create a tensor with gradient tracking enabled
179 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3])?.with_requires_grad();
180 println!("Input tensor (requires_grad): {:?}", tensor.data());
181
182 // Perform element-wise operations through iteration
183 let result: Tensor = tensor
184 .iter()
185 .map(|elem| {
186 // Apply a complex transformation: (x^2 + 1) * 2
187 elem.pow_scalar(2.0).add_scalar(1.0).mul_scalar(2.0)
188 })
189 .collect();
190
191 println!("Result tensor: {:?}", result.data());
192 println!("Result requires_grad: {}", result.requires_grad());
193
194 // Compute gradients
195 let mut loss = result.sum();
196 loss.backward(None);
197
198 println!("Loss: {:.6}", loss.value());
199 println!("Input gradients: {:?}", tensor.grad().map(|g| g.data()));
200
201 Ok(())
202}
148    pub fn train_non_autoregressive_steps(
149 &mut self,
150 src: &Tensor,
151 tgt: &Tensor,
152 steps: usize,
153 lr: f32,
154 ) {
155 let mut opt = Adam::with_learning_rate(lr);
156 {
157 let params_once = self.parameters();
158            for p in &params_once {
159 opt.add_parameter(p);
160 }
161 }
162 for step in 0..steps {
163 // forward + backward scope (immutable borrow)
164 {
165 let pred = self.forward(src, tgt);
166 let diff = pred.sub_tensor(tgt);
167 let mut loss = diff.pow_scalar(2.0).mean();
168 if step == 0 || step + 1 == steps {
169 println!("NAR train step {}: loss={:.6}", step, loss.value());
170 }
171 loss.backward(None);
172 }
173 // step + zero_grad scope (mutable borrow)
174 let mut params_step = self.parameters();
175 opt.step(&mut params_step);
176 opt.zero_grad(&mut params_step);
177 }
178 }
179
180 /// Auto-regressive training (teacher forcing): predict next token with causal mask
181 pub fn train_autoregressive_steps(
182 &mut self,
183 src: &Tensor,
184 tgt: &Tensor,
185 steps: usize,
186 lr: f32,
187 ) {
188 let mut opt = Adam::with_learning_rate(lr);
189 {
190 let params_once = self.parameters();
191            for p in &params_once {
192 opt.add_parameter(p);
193 }
194 }
195
196 // Build encoder memory once (static dataset demo)
197 let mut memory = src.clone();
198 for enc in &self.encoders {
199 memory = enc.forward(&memory, None);
200 }
201
202 let (b, t, _e) = Self::triple(tgt);
203 // Predict y[t] from y[:t] using causal mask; here we simply predict full seq with mask
204 let causal = Self::build_causal_mask_static(b, self.num_heads, t);
205 for step in 0..steps {
206 // forward + backward scope
207 {
208 let mut out = tgt.clone();
209 for dec in &self.decoders {
210 out = dec.forward(&out, &memory, Some(&causal), None);
211 }
212 let diff = out.sub_tensor(tgt);
213 let mut loss = diff.pow_scalar(2.0).mean();
214 if step == 0 || step + 1 == steps {
215 println!("AR train step {}: loss={:.6}", step, loss.value());
216 }
217 loss.backward(None);
218 }
219 let mut params_step = self.parameters();
220 opt.step(&mut params_step);
221 opt.zero_grad(&mut params_step);
222 }
223 }
224
225 fn triple(t: &Tensor) -> (usize, usize, usize) {
226 let d = t.shape().dims();
227 (d[0], d[1], d[2])
228 }
229}
230
231fn main() -> Result<(), Box<dyn std::error::Error>> {
232 println!("=== Basic Transformer Example ===");
233
234 let batch = 2usize;
235 let src_len = 8usize;
236 let tgt_len = 6usize;
237 let embed = 32usize;
238 let heads = 4usize;
239 let layers = 2usize;
240
241 let src = Tensor::randn(vec![batch, src_len, embed], Some(1001));
242 let tgt = Tensor::randn(vec![batch, tgt_len, embed], Some(1002));
243
244 let mut trf = BasicTransformer::new(embed, heads, layers, Some(999));
245 let out = trf.forward(&src, &tgt);
246 println!("Output shape: {:?}", out.shape().dims());
247
248 // Quick optimization step
249 let mut opt = Adam::with_learning_rate(0.005);
250 let mut params = trf.parameters();
251    for p in &params {
252 opt.add_parameter(p);
253 }
254 let mut loss = out.mean();
255 loss.backward(None);
256 opt.step(&mut params);
257 opt.zero_grad(&mut params);
258 println!("Loss: {:.6}", loss.value());
259
260 // Demo: non auto-regressive inference (single pass)
261 let nar = trf.infer_non_autoregressive(&src, tgt_len);
262 println!("NAR output shape: {:?}", nar.shape().dims());
263
264 // Demo: auto-regressive inference (toy)
265 let ar = trf.infer_autoregressive(&src, 3);
266 println!("AR output shape: {:?}", ar.shape().dims());
267
268 // NAR training demo
269 let nar_tgt = tgt.clone();
270 trf.train_non_autoregressive_steps(&src, &nar_tgt, 3, 0.01);
271
272 // AR training demo (teacher-forced)
273 let ar_tgt = tgt.clone();
274 trf.train_autoregressive_steps(&src, &ar_tgt, 3, 0.01);
275 println!("=== Done ===");
276 Ok(())
277}
109fn demonstrate_optimizer_serialization() -> Result<(), Box<dyn std::error::Error>> {
110 println!("\n--- Optimizer Serialization ---");
111
112 // Create an optimizer with some parameters
113 let mut weight = Tensor::randn(vec![2, 2], Some(42)).with_requires_grad();
114 let mut bias = Tensor::randn(vec![2], Some(43)).with_requires_grad();
115
116 let config = AdamConfig {
117 learning_rate: 0.001,
118 beta1: 0.9,
119 beta2: 0.999,
120 eps: 1e-8,
121 weight_decay: 0.0,
122 amsgrad: false,
123 };
124
125 let mut optimizer = Adam::with_config(config);
126 optimizer.add_parameter(&weight);
127 optimizer.add_parameter(&bias);
128
129 println!(
130 "Created optimizer with {} parameters",
131 optimizer.parameter_count()
132 );
133 println!("Learning rate: {}", optimizer.learning_rate());
134
135 // Simulate some training steps
136 for _ in 0..3 {
137 let mut loss = weight.sum() + bias.sum();
138 loss.backward(None);
139 optimizer.step(&mut [&mut weight, &mut bias]);
140 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
141 }
142
143 // Save optimizer state
144 let optimizer_path = "temp_optimizer.json";
145 optimizer.save_json(optimizer_path)?;
146 println!("Saved optimizer to: {}", optimizer_path);
147
148 // Load optimizer state
149 let loaded_optimizer = Adam::load_json(optimizer_path)?;
150 println!(
151 "Loaded optimizer with {} parameters",
152 loaded_optimizer.parameter_count()
153 );
154 println!("Learning rate: {}", loaded_optimizer.learning_rate());
155
156 // Verify optimizer state
157 assert_eq!(
158 optimizer.parameter_count(),
159 loaded_optimizer.parameter_count()
160 );
161 assert_eq!(optimizer.learning_rate(), loaded_optimizer.learning_rate());
162 println!("Optimizer serialization verification: PASSED");
163
164 Ok(())
165}
166
167/// Demonstrate format comparison and performance characteristics
168fn demonstrate_format_comparison() -> Result<(), Box<dyn std::error::Error>> {
169 println!("\n--- Format Comparison ---");
170
171 // Create a larger tensor for comparison
172 let tensor = Tensor::randn(vec![10, 10], Some(44));
173
174 // Save in both formats
175 tensor.save_json("temp_comparison.json")?;
176 tensor.save_binary("temp_comparison.bin")?;
177
178 // Compare file sizes
179 let json_size = fs::metadata("temp_comparison.json")?.len();
180 let binary_size = fs::metadata("temp_comparison.bin")?.len();
181
182 println!("JSON file size: {} bytes", json_size);
183 println!("Binary file size: {} bytes", binary_size);
184 println!(
185 "Compression ratio: {:.2}x",
186 json_size as f64 / binary_size as f64
187 );
188
189 // Load and verify both formats
190 let json_tensor = Tensor::load_json("temp_comparison.json")?;
191 let binary_tensor = Tensor::load_binary("temp_comparison.bin")?;
192
193 assert_eq!(tensor.shape().dims(), json_tensor.shape().dims());
194 assert_eq!(tensor.shape().dims(), binary_tensor.shape().dims());
195 assert_eq!(tensor.data(), json_tensor.data());
196 assert_eq!(tensor.data(), binary_tensor.data());
197
198 println!("Format comparison verification: PASSED");
199
200 Ok(())
201}
202
203/// Demonstrate a basic model checkpointing workflow
204fn demonstrate_model_checkpointing() -> Result<(), Box<dyn std::error::Error>> {
205 println!("\n--- Model Checkpointing ---");
206
207 // Create a simple model (weights and bias)
208 let mut weights = Tensor::randn(vec![2, 1], Some(45)).with_requires_grad();
209 let mut bias = Tensor::randn(vec![1], Some(46)).with_requires_grad();
210
211 // Create optimizer
212 let mut optimizer = Adam::with_learning_rate(0.01);
213 optimizer.add_parameter(&weights);
214 optimizer.add_parameter(&bias);
215
216 println!("Initial weights: {:?}", weights.data());
217 println!("Initial bias: {:?}", bias.data());
218
219 // Simulate training
220 for epoch in 0..5 {
221 let mut loss = weights.sum() + bias.sum();
222 loss.backward(None);
223 optimizer.step(&mut [&mut weights, &mut bias]);
224 optimizer.zero_grad(&mut [&mut weights, &mut bias]);
225
226 if epoch % 2 == 0 {
227 // Save checkpoint
228 let checkpoint_dir = format!("checkpoint_epoch_{}", epoch);
229 fs::create_dir_all(&checkpoint_dir)?;
230
231 weights.save_json(format!("{}/weights.json", checkpoint_dir))?;
232 bias.save_json(format!("{}/bias.json", checkpoint_dir))?;
233 optimizer.save_json(format!("{}/optimizer.json", checkpoint_dir))?;
234
235 println!("Saved checkpoint for epoch {}", epoch);
236 }
237 }
238
239 // Load from checkpoint
240 let loaded_weights = Tensor::load_json("checkpoint_epoch_4/weights.json")?;
241 let loaded_bias = Tensor::load_json("checkpoint_epoch_4/bias.json")?;
242 let loaded_optimizer = Adam::load_json("checkpoint_epoch_4/optimizer.json")?;
243
244 println!("Loaded weights: {:?}", loaded_weights.data());
245 println!("Loaded bias: {:?}", loaded_bias.data());
246 println!(
247 "Loaded optimizer learning rate: {}",
248 loaded_optimizer.learning_rate()
249 );
250
251 // Verify checkpoint integrity
252 assert_eq!(weights.shape().dims(), loaded_weights.shape().dims());
253 assert_eq!(bias.shape().dims(), loaded_bias.shape().dims());
254 assert_eq!(optimizer.learning_rate(), loaded_optimizer.learning_rate());
255
256 println!("Checkpointing verification: PASSED");
257
258 Ok(())
259}
165fn main() -> Result<(), Box<dyn std::error::Error>> {
166 println!("=== Multi-Head Attention Example ===");
167
168 let batch = 2usize;
169 let src_len = 5usize;
170 let tgt_len = 4usize;
171 let embed = 16usize;
172 let heads = 4usize;
173
174 let query = Tensor::randn(vec![batch, tgt_len, embed], Some(7));
175 let key = Tensor::randn(vec![batch, src_len, embed], Some(8));
176 let value = Tensor::randn(vec![batch, src_len, embed], Some(9));
177
178 let mut mha = MultiHeadAttention::new(embed, heads, Some(42));
179
180 // Simple causal mask for target self-attention shape [b, h, tq, tk]
181 let mut mask = Tensor::zeros(vec![batch, heads, tgt_len, src_len]);
182 // Disallow attending to future positions when tgt_len <= src_len by adding -1e9
183 // Here, just demonstrate mask broadcast/add mechanics with a light mask on last head
184 if src_len >= tgt_len {
185 // set upper triangle to a large negative value for head 0
186 for i in 0..tgt_len {
187 for j in (i + 1)..src_len {
188 let idx = [0usize, 0usize, i, j];
189 // Quick set via data_mut using a slice view
190 let offset = mask.memory_offset(&idx);
191 let data = mask.data_mut();
192 data[offset] = -1e9;
193 }
194 }
195 }
196
197 let out = mha.forward(&query, &key, &value, Some(&mask));
198 println!("Output shape: {:?}", out.shape().dims());
199
200 // Tiny training step to confirm gradients are wired
201 let mut optimizer = Adam::with_learning_rate(0.01);
202 let mut params = mha.parameters();
203    for p in &params {
204 optimizer.add_parameter(p);
205 }
206
207 // Dummy loss = mean of output
208 let mut loss = out.mean();
209 loss.backward(None);
210 optimizer.step(&mut params);
211 optimizer.zero_grad(&mut params);
212
213 println!("Loss: {:.6}", loss.value());
214 println!("=== Done ===");
215 Ok(())
216}
- examples/optimizers/adam_configurations.rs
- examples/optimizers/learning_rate_scheduling.rs
- examples/getting_started/optimizer_basics.rs
- examples/neural_networks/feedforward_network.rs
- examples/RL_training/../neural_networks/basic_linear_layer.rs
- examples/supervised_training/supervised_bce.rs
- examples/iterators/performance_optimization.rs
- examples/supervised_training/supervised_classification.rs
- examples/supervised_training/supervised_regression.rs
- examples/RL_training/dqn.rs
- examples/RL_training/ppo_discrete.rs
- examples/RL_training/ppo_continuous.rs
- examples/RL_training/td3.rs
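For a non-scalar output you can pass the upstream gradient explicitly instead of relying on the implicit tensor of ones. A minimal sketch, assuming the usual convention that grad_output matches the output's shape:
use train_station::Tensor;
let x = Tensor::ones(vec![2, 2]).with_requires_grad();
let mut y = x.mul_scalar(3.0); // non-scalar output
// Upstream gradient with the same shape as y (assumed convention)
y.backward(Some(Tensor::ones(vec![2, 2])));
println!("dL/dx: {:?}", x.grad().map(|g| g.data()));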
Sourcepub unsafe fn as_ptr(&self) -> *const f32
pub unsafe fn as_ptr(&self) -> *const f32
Returns a raw pointer to the tensor data for unsafe operations
§Safety
This is unsafe because it provides direct access to the underlying memory. The caller must ensure:
- The tensor is not dropped while the pointer is used
- No concurrent mutable access occurs
- Bounds are respected
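A minimal usage sketch that upholds the invariants above (the tensor outlives the pointer, no concurrent mutation, reads stay in bounds); for ordinary reads the safe data() accessor is preferable:
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let first = unsafe {
    // Valid only while `tensor` is alive; the read stays within tensor.size()
    *tensor.as_ptr()
};
assert_eq!(first, 1.0);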
Sourcepub unsafe fn as_mut_ptr(&mut self) -> *mut f32
pub unsafe fn as_mut_ptr(&mut self) -> *mut f32
Returns a mutable raw pointer to the tensor data for unsafe operations
§Safety
This is unsafe because it provides direct mutable access to the underlying memory. The caller must ensure:
- The tensor is not dropped while the pointer is used
- No concurrent access occurs
- Bounds are respected
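A matching sketch for in-place writes through the mutable pointer, again requiring exclusive access for the pointer's lifetime:
use train_station::Tensor;
let mut tensor = Tensor::zeros(vec![2, 2]);
unsafe {
    // Exclusive access: nothing else reads or writes while the pointer is live
    let ptr = tensor.as_mut_ptr();
    *ptr.add(3) = 7.0; // last element in row-major order
}
assert_eq!(tensor.get(&[1, 1]), 7.0);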
Sourcepub fn grad_fn(&self) -> &GradFn
pub fn grad_fn(&self) -> &GradFn
Get a reference to the gradient function (for gradtrack)
Returns a reference to the gradient function associated with this tensor. This is used internally by the gradtrack system to compute gradients.
§Returns
Reference to the gradient function
§Implementation Details
This method is used by the gradtrack engine to access the gradient computation function during the backward pass.
Sourcepub fn set_grad(&mut self, grad: Tensor)
pub fn set_grad(&mut self, grad: Tensor)
Set gradient from external source
Sets the gradient tensor for this tensor. This is used internally by the gradtrack system to set gradients during the backward pass.
§Arguments
grad - The gradient tensor to set
§Implementation Details
This method is used internally by the gradtrack engine to set gradients during the backward pass. It only sets the gradient if gradient tracking is enabled for this tensor.
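The repository examples below use set_grad together with grad_owned to rescale gradients for clipping. A minimal sketch of that pattern:
use train_station::Tensor;
let mut param = Tensor::ones(vec![2, 2]).with_requires_grad();
let mut loss = param.sum();
loss.backward(None);
// Replace the accumulated gradient with a scaled copy (as the clipping helpers do)
if let Some(g) = param.grad_owned() {
    param.set_grad(g.mul_scalar(0.5));
}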
Examples found in repository?
277fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
278 let mut total_sq = 0.0f32;
279 for p in parameters.iter() {
280 if let Some(g) = p.grad_owned() {
281 for &v in g.data() {
282 total_sq += v * v;
283 }
284 }
285 }
286 let norm = total_sq.sqrt();
287 if norm > max_norm {
288 let scale = max_norm / (norm + eps);
289 for p in parameters.iter_mut() {
290 if let Some(g) = p.grad_owned() {
291 p.set_grad(g.mul_scalar(scale));
292 }
293 }
294 }
295}
More examples
293fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
294 let mut total_sq = 0.0f32;
295 for p in parameters.iter() {
296 if let Some(g) = p.grad_owned() {
297 for &v in g.data() {
298 total_sq += v * v;
299 }
300 }
301 }
302 let norm = total_sq.sqrt();
303 if norm > max_norm {
304 let scale = max_norm / (norm + eps);
305 for p in parameters.iter_mut() {
306 if let Some(g) = p.grad_owned() {
307 p.set_grad(g.mul_scalar(scale));
308 }
309 }
310 }
311}
252fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
253 let mut total_sq = 0.0f32;
254 for p in parameters.iter() {
255 if let Some(g) = p.grad_owned() {
256 for &v in g.data() {
257 total_sq += v * v;
258 }
259 }
260 }
261 let norm = total_sq.sqrt();
262 if norm > max_norm {
263 let scale = max_norm / (norm + eps);
264 for p in parameters.iter_mut() {
265 if let Some(g) = p.grad_owned() {
266 p.set_grad(g.mul_scalar(scale));
267 }
268 }
269 }
270}
23fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
24 let mut total_sq = 0.0f32;
25 for p in parameters.iter() {
26 if let Some(g) = p.grad_owned() {
27 for &v in g.data() {
28 total_sq += v * v;
29 }
30 }
31 }
32 let norm = total_sq.sqrt();
33 if norm > max_norm {
34 let scale = max_norm / (norm + eps);
35 for p in parameters.iter_mut() {
36 if let Some(g) = p.grad_owned() {
37 p.set_grad(g.mul_scalar(scale));
38 }
39 }
40 }
41}
23fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
24 let mut total_sq = 0.0f32;
25 for p in parameters.iter() {
26 if let Some(g) = p.grad_owned() {
27 for &v in g.data() {
28 total_sq += v * v;
29 }
30 }
31 }
32 let norm = total_sq.sqrt();
33 if norm > max_norm {
34 let scale = max_norm / (norm + eps);
35 for p in parameters.iter_mut() {
36 if let Some(g) = p.grad_owned() {
37 p.set_grad(g.mul_scalar(scale));
38 }
39 }
40 }
41}
22fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
23 let mut total_sq = 0.0f32;
24 for p in parameters.iter() {
25 if let Some(g) = p.grad_owned() {
26 for &v in g.data() {
27 total_sq += v * v;
28 }
29 }
30 }
31 let norm = total_sq.sqrt();
32 if norm > max_norm {
33 let scale = max_norm / (norm + eps);
34 for p in parameters.iter_mut() {
35 if let Some(g) = p.grad_owned() {
36 p.set_grad(g.mul_scalar(scale));
37 }
38 }
39 }
40}
Sourcepub fn zero_grad(&mut self)
pub fn zero_grad(&mut self)
Clear accumulated gradients for this tensor
This method is used by optimizers to zero gradients before each backward pass. It clears any accumulated gradients, allowing for fresh gradient computation.
§Examples
use train_station::Tensor;
let mut tensor = Tensor::ones(vec![2, 3]).with_requires_grad();
tensor.set_grad(Tensor::ones(vec![2, 3]));
assert!(tensor.grad().is_some());
tensor.zero_grad();
assert!(tensor.grad().is_none());
Sourcepub fn is_contiguous(&self) -> bool
pub fn is_contiguous(&self) -> bool
Checks if the tensor data is stored contiguously in memory
§Returns
true if the tensor data is contiguous, enabling optimized SIMD operations
§Examples
use train_station::Tensor;
let tensor = Tensor::new(vec![2, 3, 4]);
assert!(tensor.is_contiguous());
Examples found in repository?
179fn demonstrate_utility_functions() {
180 println!("\n--- Utility Functions ---");
181
182 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
183
184 // Basic properties
185 println!("Shape: {:?}", tensor.shape().dims());
186 println!("Size: {}", tensor.size());
187 println!("Is contiguous: {}", tensor.is_contiguous());
188 println!("Device: {:?}", tensor.device());
189
190 // Mathematical operations
191 let sum = tensor.sum();
192 println!("Sum: {}", sum.value());
193
194 let mean = tensor.mean();
195 println!("Mean: {}", mean.value());
196
197 let norm = tensor.norm();
198 println!("Norm: {}", norm.value());
199
200 // Device placement
201 let cpu_tensor = Tensor::zeros_on_device(vec![3, 3], train_station::Device::cpu());
202 println!(
203 "CPU tensor: shape {:?}, device: {:?}",
204 cpu_tensor.shape().dims(),
205 cpu_tensor.device()
206 );
207}
Sourcepub fn memory_offset(&self, indices: &[usize]) -> usize
pub fn memory_offset(&self, indices: &[usize]) -> usize
Calculates the linear memory offset for given multi-dimensional indices
§Arguments
indices - Vector of indices for each dimension
§Returns
Linear memory offset for direct memory access
§Examples
use train_station::Tensor;
let tensor = Tensor::new(vec![2, 3, 4]);
let offset = tensor.memory_offset(&[1, 2, 3]);
// offset = 1*12 + 2*4 + 3*1 = 23
Examples found in repository?
76 pub fn infer_autoregressive(&self, src: &Tensor, max_steps: usize) -> Tensor {
77 let (b, _s, e) = Self::triple(src);
78 let mut memory = src.clone();
79 for enc in &self.encoders {
80 memory = enc.forward(&memory, None);
81 }
82
83 let mut out_seq: Vec<Tensor> = Vec::new();
84 // Start token: zeros
85 let mut current = Tensor::zeros(vec![b, 1, e]);
86 for _step in 0..max_steps {
87 // Build causal mask for length t
88 let t = current.shape().dims()[1];
89 let mut causal = Tensor::ones(vec![b, self.num_heads, t, t]);
90 // Upper triangle as false -> masked for all batches and heads
91 for bb in 0..b {
92 for hh in 0..self.num_heads {
93 for i in 0..t {
94 for j in (i + 1)..t {
95 let offset = causal.memory_offset(&[bb, hh, i, j]);
96 let data = causal.data_mut();
97 data[offset] = 0.0;
98 }
99 }
100 }
101 }
102 let mut step_out = current.clone();
103 for dec in &self.decoders {
104 step_out = dec.forward(&step_out, &memory, Some(&causal), None);
105 }
106 // (Toy) append placeholder token; real models would project last token
107 out_seq.push(step_out.clone());
108 // Append a zero token to grow sequence by 1 for next causal computation
109 current = Tensor::zeros(vec![b, t + 1, e]);
110 }
111 // Simple return of final sequence placeholder
112 current
113 }
114
115 /// Non auto-regressive inference: single forward pass
116 pub fn infer_non_autoregressive(&self, src: &Tensor, tgt_len: usize) -> Tensor {
117 let (b, _s, e) = Self::triple(src);
118 let mut memory = src.clone();
119 for enc in &self.encoders {
120 memory = enc.forward(&memory, None);
121 }
122 let tgt = Tensor::zeros(vec![b, tgt_len, e]);
123 let mut out = tgt.clone();
124 for dec in &self.decoders {
125 out = dec.forward(&out, &memory, None, None);
126 }
127 out
128 }
129
130 /// Helper: build boolean-like causal mask [b, heads, t, t] with 1.0 keep, 0.0 masked
131 fn build_causal_mask_static(batch: usize, heads: usize, t: usize) -> Tensor {
132 let mut mask = Tensor::ones(vec![batch, heads, t, t]);
133 for bb in 0..batch {
134 for hh in 0..heads {
135 for i in 0..t {
136 for j in (i + 1)..t {
137 let offset = mask.memory_offset(&[bb, hh, i, j]);
138 let data = mask.data_mut();
139 data[offset] = 0.0;
140 }
141 }
142 }
143 }
144 mask
145}
More examples
165fn main() -> Result<(), Box<dyn std::error::Error>> {
166 println!("=== Multi-Head Attention Example ===");
167
168 let batch = 2usize;
169 let src_len = 5usize;
170 let tgt_len = 4usize;
171 let embed = 16usize;
172 let heads = 4usize;
173
174 let query = Tensor::randn(vec![batch, tgt_len, embed], Some(7));
175 let key = Tensor::randn(vec![batch, src_len, embed], Some(8));
176 let value = Tensor::randn(vec![batch, src_len, embed], Some(9));
177
178 let mut mha = MultiHeadAttention::new(embed, heads, Some(42));
179
180 // Simple causal mask for target self-attention shape [b, h, tq, tk]
181 let mut mask = Tensor::zeros(vec![batch, heads, tgt_len, src_len]);
182 // Disallow attending to future positions when tgt_len <= src_len by adding -1e9
183 // Here, just demonstrate mask broadcast/add mechanics with a light mask on last head
184 if src_len >= tgt_len {
185 // set upper triangle to a large negative value for head 0
186 for i in 0..tgt_len {
187 for j in (i + 1)..src_len {
188 let idx = [0usize, 0usize, i, j];
189 // Quick set via data_mut using a slice view
190 let offset = mask.memory_offset(&idx);
191 let data = mask.data_mut();
192 data[offset] = -1e9;
193 }
194 }
195 }
196
197 let out = mha.forward(&query, &key, &value, Some(&mask));
198 println!("Output shape: {:?}", out.shape().dims());
199
200 // Tiny training step to confirm gradients are wired
201 let mut optimizer = Adam::with_learning_rate(0.01);
202 let mut params = mha.parameters();
203    for p in &params {
204 optimizer.add_parameter(p);
205 }
206
207 // Dummy loss = mean of output
208 let mut loss = out.mean();
209 loss.backward(None);
210 optimizer.step(&mut params);
211 optimizer.zero_grad(&mut params);
212
213 println!("Loss: {:.6}", loss.value());
214 println!("=== Done ===");
215 Ok(())
216}
Sourcepub fn memory_alignment(&self) -> usize
pub fn memory_alignment(&self) -> usize
Gets the memory alignment of the tensor data
§Returns
The memory alignment in bytes (typically 32 for SIMD optimization)
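A minimal sketch, assuming only that the reported alignment is a power of two in bytes:
use train_station::Tensor;
let tensor = Tensor::new(vec![16, 16]);
let align = tensor.memory_alignment();
assert!(align.is_power_of_two());
println!("data alignment: {} bytes", align);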
Sourcepub fn is_broadcastable_with(&self, other: &Tensor) -> bool
pub fn is_broadcastable_with(&self, other: &Tensor) -> bool
Checks if this tensor is broadcastable with another tensor
§Arguments
other - The other tensor to check broadcasting compatibility
§Returns
true if the tensors are broadcastable according to NumPy broadcasting rules
§Examples
use train_station::Tensor;
let a = Tensor::new(vec![2, 3, 4]);
let b = Tensor::new(vec![1, 3, 4]);
assert!(a.is_broadcastable_with(&b));
Sourcepub fn memory_footprint(&self) -> usize
pub fn memory_footprint(&self) -> usize
Sourcepub fn get(&self, indices: &[usize]) -> f32
pub fn get(&self, indices: &[usize]) -> f32
Get a single element from the tensor at the specified indices
§Arguments
indices - Multi-dimensional indices to access the element
§Returns
The value at the specified position
§Panics
Panics if indices are out of bounds or indices length doesn’t match tensor rank
§Examples
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let value = tensor.get(&[0, 1]);
assert_eq!(value, 2.0);
Examples found in repository?
156fn demonstrate_data_access() {
157 println!("\n--- Data Access ---");
158
159 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
160
161 // Access individual elements
162 println!("Element [0, 0]: {}", tensor.get(&[0, 0]));
163 println!("Element [0, 1]: {}", tensor.get(&[0, 1]));
164 println!("Element [1, 0]: {}", tensor.get(&[1, 0]));
165 println!("Element [1, 1]: {}", tensor.get(&[1, 1]));
166
167 // Access data as slice
168 let data = tensor.data();
169 println!("Data as slice: {:?}", data);
170
171 // Iterate over elements
172 println!("Elements:");
173 for (i, &value) in data.iter().enumerate() {
174 println!(" [{}]: {}", i, value);
175 }
176}
Sourcepub fn set(&mut self, indices: &[usize], value: f32)
pub fn set(&mut self, indices: &[usize], value: f32)
Set a single element in the tensor at the specified indices
§Arguments
indices - Multi-dimensional indices to set the element
value - The value to set
§Panics
Panics if indices are out of bounds or indices length doesn’t match tensor rank
§Examples
use train_station::Tensor;
let mut tensor = Tensor::new(vec![2, 2]);
tensor.set(&[0, 1], 42.0);
assert_eq!(tensor.get(&[0, 1]), 42.0);
Sourcepub fn data(&self) -> &[f32]
pub fn data(&self) -> &[f32]
Returns a safe slice of the tensor’s underlying data
Provides safe access to the tensor’s data without requiring unsafe pointer operations. This is the preferred way to access tensor data for reading values, comparisons, and other operations that don’t require direct pointer manipulation.
§Returns
A slice containing all tensor elements in row-major order
§Performance
- Zero-Cost: Direct slice creation with no copying
- Cache-Friendly: Sequential memory access pattern
- Safe: No unsafe code required for basic data access
§Examples
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let data = tensor.data();
// Safe indexing and comparisons
assert_eq!(data[0], 1.0);
assert_eq!(data.len(), tensor.size());
Examples found in repository?
108 fn copy_from(&mut self, other: &Self) {
109 for (t, s) in self.layers.iter_mut().zip(other.layers.iter()) {
110 {
111 let src = s.weight.data();
112 let dst = t.weight.data_mut();
113 dst.copy_from_slice(src);
114 }
115 {
116 let src = s.bias.data();
117 let dst = t.bias.data_mut();
118 dst.copy_from_slice(src);
119 }
120 t.weight.set_requires_grad(false);
121 t.bias.set_requires_grad(false);
122 }
123 }
124}
125
126// -------------------------------
127// Q-Network (state -> Q-values over actions)
128// -------------------------------
129
130struct QNet {
131 net: Mlp,
132}
133
134impl QNet {
135 fn new(state_dim: usize, action_dim: usize, seed: Option<u64>) -> Self {
136 let net = Mlp::new(&[state_dim, 64, 64, action_dim], seed);
137 Self { net }
138 }
139 fn forward(&self, state: &Tensor) -> Tensor {
140 self.net.forward(state, None)
141 }
142 fn parameters(&mut self) -> Vec<&mut Tensor> {
143 self.net.parameters()
144 }
145 fn set_requires_grad_all(&mut self, enable: bool) {
146 self.net.set_requires_grad_all(enable);
147 }
148}
149
150// -------------------------------
151// Discrete YardEnv (3 actions: -1, 0, +1)
152// -------------------------------
153
154struct YardEnv {
155 pos: f32,
156 vel: f32,
157 steps: usize,
158 max_steps: usize,
159 rng: SmallRng,
160}
161
162impl YardEnv {
163 const ACTIONS: [f32; 3] = [-1.0, 0.0, 1.0];
164
165 fn new(seed: u64) -> Self {
166 let mut env = Self {
167 pos: 0.0,
168 vel: 0.0,
169 steps: 0,
170 max_steps: 200,
171 rng: SmallRng::new(seed),
172 };
173 env.reset();
174 env
175 }
176
177 fn reset(&mut self) -> Tensor {
178 self.pos = self.rng.uniform(-0.5, 0.5);
179 self.vel = self.rng.uniform(-0.1, 0.1);
180 self.steps = 0;
181 self.state_tensor()
182 }
183
184 fn state_tensor(&self) -> Tensor {
185 Tensor::from_slice(&[self.pos, self.vel, 0.0], vec![1, 3]).unwrap()
186 }
187
188 fn step(&mut self, action_index: usize) -> (Tensor, f32, bool) {
189 let a = Self::ACTIONS[action_index.min(2)];
190 self.vel += 0.1 * a - 0.01 * self.pos;
191 self.pos += self.vel;
192 self.steps += 1;
193 let reward = -(self.pos * self.pos) - 0.05 * (a * a);
194 let done = self.pos.abs() > 3.0 || self.steps >= self.max_steps;
195 (self.state_tensor(), reward, done)
196 }
197}
198
199// -------------------------------
200// Replay Buffer
201// -------------------------------
202
203struct ReplayBuffer {
204 capacity: usize,
205 size: usize,
206 pos: usize,
207 state_dim: usize,
208 states: Vec<f32>,
209 actions: Vec<usize>,
210 rewards: Vec<f32>,
211 dones: Vec<f32>,
212 next_states: Vec<f32>,
213}
214
215impl ReplayBuffer {
216 fn new(capacity: usize, state_dim: usize) -> Self {
217 Self {
218 capacity,
219 size: 0,
220 pos: 0,
221 state_dim,
222 states: vec![0.0; capacity * state_dim],
223 actions: vec![0usize; capacity],
224 rewards: vec![0.0; capacity],
225 dones: vec![0.0; capacity],
226 next_states: vec![0.0; capacity * state_dim],
227 }
228 }
229
230 fn push(&mut self, s: &[f32], a_idx: usize, r: f32, d: f32, s2: &[f32]) {
231 let i = self.pos;
232 let so = i * self.state_dim;
233 self.states[so..so + self.state_dim].copy_from_slice(s);
234 self.actions[i] = a_idx;
235 self.rewards[i] = r;
236 self.dones[i] = d;
237 self.next_states[so..so + self.state_dim].copy_from_slice(s2);
238 self.pos = (self.pos + 1) % self.capacity;
239 self.size = self.size.saturating_add(1).min(self.capacity);
240 }
241
242 fn can_sample(&self, batch_size: usize) -> bool {
243 self.size >= batch_size
244 }
245
246 fn sample(
247 &self,
248 batch_size: usize,
249 rng: &mut SmallRng,
250 ) -> (Tensor, Vec<usize>, Tensor, Tensor, Tensor) {
251 let mut s_vec = Vec::with_capacity(batch_size * self.state_dim);
252 let mut a_idx = Vec::with_capacity(batch_size);
253 let mut r_vec = Vec::with_capacity(batch_size);
254 let mut d_vec = Vec::with_capacity(batch_size);
255 let mut s2_vec = Vec::with_capacity(batch_size * self.state_dim);
256 for _ in 0..batch_size {
257 let idx = rng.sample_index(self.size);
258 let so = idx * self.state_dim;
259 s_vec.extend_from_slice(&self.states[so..so + self.state_dim]);
260 a_idx.push(self.actions[idx]);
261 r_vec.push(self.rewards[idx]);
262 d_vec.push(self.dones[idx]);
263 s2_vec.extend_from_slice(&self.next_states[so..so + self.state_dim]);
264 }
265 let s = Tensor::from_slice(&s_vec, vec![batch_size, self.state_dim]).unwrap();
266 let r = Tensor::from_slice(&r_vec, vec![batch_size, 1]).unwrap();
267 let d = Tensor::from_slice(&d_vec, vec![batch_size, 1]).unwrap();
268 let s2 = Tensor::from_slice(&s2_vec, vec![batch_size, self.state_dim]).unwrap();
269 (s, a_idx, r, d, s2)
270 }
271}
272
273// -------------------------------
274// Helpers
275// -------------------------------
276
277fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
278 let mut total_sq = 0.0f32;
279 for p in parameters.iter() {
280 if let Some(g) = p.grad_owned() {
281 for &v in g.data() {
282 total_sq += v * v;
283 }
284 }
285 }
286 let norm = total_sq.sqrt();
287 if norm > max_norm {
288 let scale = max_norm / (norm + eps);
289 for p in parameters.iter_mut() {
290 if let Some(g) = p.grad_owned() {
291 p.set_grad(g.mul_scalar(scale));
292 }
293 }
294 }
295}
296
297fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
298 let mut total_sq = 0.0f32;
299 for p in parameters.iter_mut() {
300 if let Some(g) = p.grad_owned() {
301 for &v in g.data() {
302 total_sq += v * v;
303 }
304 }
305 }
306 total_sq.sqrt()
307}
308
309fn params_l2_norm(parameters: &mut [&mut Tensor]) -> f32 {
310 let _ng = NoGradTrack::new();
311 let mut total_sq = 0.0f32;
312 for p in parameters.iter_mut() {
313 for &v in p.data() {
314 total_sq += v * v;
315 }
316 }
317 total_sq.sqrt()
318}
319
320// Pseudo-Huber loss: sqrt(1 + diff^2) - 1 (smooth, robust)
321fn pseudo_huber_mean(diff: &Tensor) -> Tensor {
322 diff.pow_scalar(2.0)
323 .add_scalar(1.0)
324 .sqrt()
325 .sub_scalar(1.0)
326 .mean()
327}
328
329// -------------------------------
330// Main
331// -------------------------------
332
333pub fn main() -> Result<(), Box<dyn std::error::Error>> {
334 println!("=== DQN Example (YardEnv discrete) ===");
335
336 // Dims
337 let state_dim = 3usize;
338 let action_dim = 3usize;
339
340 // Hparams
341 let gamma = 0.99f32;
342 let batch_size = 64usize;
343 let start_steps = 200usize;
344 let target_update_interval = 200usize; // hard update cadence
345 let max_grad_norm = 1.0f32;
346 let mut epsilon = 1.0f32;
347 let eps_min = 0.05f32;
348 let eps_decay_steps = 2_000usize; // linear decay
349 let total_steps = std::env::var("DQN_STEPS")
350 .ok()
351 .and_then(|v| v.parse::<usize>().ok())
352 .unwrap_or(3000usize);
353
354 // Models
355 let mut q_net = QNet::new(state_dim, action_dim, Some(7));
356 let mut q_targ = QNet::new(state_dim, action_dim, Some(8));
357 q_targ.net.copy_from(&q_net.net);
358 q_targ.set_requires_grad_all(false);
359
360 // Optimizer
361 let mut q_opt = Adam::with_learning_rate(3e-4);
362 for p in q_net.parameters() {
363 q_opt.add_parameter(p);
364 }
365
366 // Replay + env
367 let mut rb = ReplayBuffer::new(100_000, state_dim);
368 let mut env = YardEnv::new(12345);
369 let mut rng = SmallRng::new(999_111);
370
371 // Metrics
372 let mut state = env.reset();
373 let mut episode_return = 0.0f32;
374 let mut episode = 0usize;
375 let mut ema_return: Option<f32> = None;
376 let ema_alpha = 0.05f32;
377 let mut best_return = f32::NEG_INFINITY;
378
379 for t in 0..total_steps {
380 // Epsilon-greedy action
381 let action_index = if t < start_steps || rng.next_f32() < epsilon {
382 rng.sample_index(action_dim)
383 } else {
384 let _ng = NoGradTrack::new();
385 let q_vals = q_net.forward(&state);
386 let row = q_vals.data();
387 let mut best_i = 0usize;
388 let mut best_v = row[0];
389 for (i, &r) in row.iter().enumerate().take(action_dim).skip(1) {
390 if r > best_v {
391 best_v = r;
392 best_i = i;
393 }
394 }
395 best_i
396 };
397
398 // Env step
399 let (next_state, reward, done) = env.step(action_index);
400 episode_return += reward;
401
402 // Store
403 let s_slice = state.data().to_vec();
404 let s2_slice = next_state.data().to_vec();
405 rb.push(
406 &s_slice,
407 action_index,
408 reward,
409 if done { 1.0 } else { 0.0 },
410 &s2_slice,
411 );
412
413 // Reset on done
414 state = if done {
415 let st = env.reset();
416 ema_return = Some(match ema_return {
417 None => episode_return,
418 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
419 });
420 if episode_return > best_return {
421 best_return = episode_return;
422 }
423 println!(
424 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3} | rb_size={}",
425 t,
426 episode,
427 episode_return,
428 ema_return.unwrap_or(episode_return),
429 best_return,
430 rb.size
431 );
432 episode_return = 0.0;
433 episode += 1;
434 st
435 } else {
436 next_state
437 };
438
439 // Epsilon linear decay
440 if t < eps_decay_steps {
441 epsilon = (1.0 - (t as f32) / (eps_decay_steps as f32)) * (1.0 - eps_min) + eps_min;
442 }
443
444 // Train
445 if rb.can_sample(batch_size) {
446 let (s, a_idx, r, d, s2) = rb.sample(batch_size, &mut rng);
447
448 // Double DQN target: a* = argmax_a Q_online(s2,a); y = r + (1-d)*gamma*Q_target(s2, a*)
449 let target_q = {
450 let _ng = NoGradTrack::new();
451 let q_online_s2 = q_net.forward(&s2);
452 // argmax per row (manual on CPU)
453 let row_stride = action_dim;
454 let qd = q_online_s2.data();
455 let mut next_actions: Vec<usize> = Vec::with_capacity(batch_size);
456 for i in 0..batch_size {
457 let base = i * row_stride;
458 let mut bi = 0usize;
459 let mut bv = qd[base];
460 for j in 1..action_dim {
461 let v = qd[base + j];
462 if v > bv {
463 bv = v;
464 bi = j;
465 }
466 }
467 next_actions.push(bi);
468 }
469 let q_targ_s2 = q_targ.forward(&s2);
470 let q_targ_g = q_targ_s2.gather(1, &next_actions, &[batch_size, 1]);
471 let not_done = Tensor::ones(vec![batch_size, 1]).sub_tensor(&d);
472 r.add_tensor(¬_done.mul_scalar(gamma).mul_tensor(&q_targ_g))
473 };
474
475 // Q(s,a) for current actions
476 // Zero grads first
477 {
478 let mut params = q_net.parameters();
479 q_opt.zero_grad(&mut params);
480 }
481
482 let q_all = q_net.forward(&s);
483 let q_sa = q_all.gather(1, &a_idx, &[batch_size, 1]);
484 let diff = q_sa.sub_tensor(&target_q);
485 let mut loss = pseudo_huber_mean(&diff);
486 loss.backward(None);
487
488 // Step (filter only params with grads)
489 {
490 let params = q_net.parameters();
491 let mut with_grads: Vec<&mut Tensor> = Vec::new();
492 for p in params {
493 if p.grad_owned().is_some() {
494 with_grads.push(p);
495 }
496 }
497 if !with_grads.is_empty() {
498 let gn = grad_global_norm(&mut with_grads);
499 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
500 q_opt.step(&mut with_grads);
501 q_opt.zero_grad(&mut with_grads);
502 if t % 100 == 0 {
503 let mut pn = q_net.parameters();
504 let pn_l2 = params_l2_norm(&mut pn);
505 let q_mean = q_all.mean().value();
506 println!(
507 "t={:5} | loss={:.4} | q_mean={:.3} | grad_norm={:.3} | param_norm={:.3} | eps={:.3}",
508 t, loss.value(), q_mean, gn, pn_l2, epsilon
509 );
510 }
511 }
512 }
513
514 // Target hard update
515 if t % target_update_interval == 0 {
516 q_targ.net.copy_from(&q_net.net);
517 }
518
519 // Clear graphs
520 clear_all_graphs_known();
521 }
522 }
523
524 println!("=== DQN training finished ===");
525 Ok(())
526}
More examples
114 fn copy_from(&mut self, other: &Self) {
115 for (t, s) in self.layers.iter_mut().zip(other.layers.iter()) {
116 {
117 let src = s.weight.data();
118 let dst = t.weight.data_mut();
119 dst.copy_from_slice(src);
120 }
121 {
122 let src = s.bias.data();
123 let dst = t.bias.data_mut();
124 dst.copy_from_slice(src);
125 }
126 t.weight.set_requires_grad(false);
127 t.bias.set_requires_grad(false);
128 }
129 }
130
131 fn soft_update_from(&mut self, source: &Self, tau: f32) {
132 let _ng = NoGradTrack::new();
133 for (t, s) in self.layers.iter_mut().zip(source.layers.iter()) {
134 // In-place Polyak update to preserve tensor IDs (no optimizer relink needed)
135 let new_w = t
136 .weight
137 .mul_scalar(1.0 - tau)
138 .add_tensor(&s.weight.mul_scalar(tau));
139 let new_b = t
140 .bias
141 .mul_scalar(1.0 - tau)
142 .add_tensor(&s.bias.mul_scalar(tau));
143 {
144 let src = new_w.data();
145 let dst = t.weight.data_mut();
146 dst.copy_from_slice(src);
147 }
148 {
149 let src = new_b.data();
150 let dst = t.bias.data_mut();
151 dst.copy_from_slice(src);
152 }
153 t.weight.set_requires_grad(false);
154 t.bias.set_requires_grad(false);
155 }
156 }
157}
158
159// -------------------------------
160// Actor and Critic
161// -------------------------------
162
163struct Actor {
164 net: Mlp,
165}
166
167impl Actor {
168 fn new(state_dim: usize, action_dim: usize, seed: Option<u64>) -> Self {
169 // Smaller net for faster demo: sd -> 64 -> 64 -> ad, tanh output
170 let net = Mlp::new(&[state_dim, 64, 64, action_dim], seed);
171 Self { net }
172 }
173 fn forward(&self, state: &Tensor) -> Tensor {
174 self.net.forward(state, Some(tanh_bounded))
175 }
176 fn parameters(&mut self) -> Vec<&mut Tensor> {
177 self.net.parameters()
178 }
179 fn set_requires_grad_all(&mut self, enable: bool) {
180 self.net.set_requires_grad_all(enable);
181 }
182}
183
184struct Critic {
185 net: Mlp,
186}
187
188impl Critic {
189 fn new(state_dim: usize, action_dim: usize, seed: Option<u64>) -> Self {
190 let net = Mlp::new(&[state_dim + action_dim, 64, 64, 1], seed);
191 Self { net }
192 }
193 fn forward(&self, state: &Tensor, action: &Tensor) -> Tensor {
194 // Concatenate along feature dim (dim=1) for batched inputs
195 // IMPORTANT: use views to preserve gradient graph; cloning would detach autograd
196 let s_view = state.view(state.shape().dims().iter().map(|&d| d as i32).collect());
197 let a_view = action.view(action.shape().dims().iter().map(|&d| d as i32).collect());
198 let sa = Tensor::cat(&[s_view, a_view], 1);
199 self.net.forward(&sa, None)
200 }
201 fn parameters(&mut self) -> Vec<&mut Tensor> {
202 self.net.parameters()
203 }
204 fn set_requires_grad_all(&mut self, enable: bool) {
205 self.net.set_requires_grad_all(enable);
206 }
207}
208
209// -------------------------------
210// Simple continuous control environment: YardEnv
211// State: normalized features [pos/3, clamp(vel/1, -1..1), bias(=0)] ; Action: scalar in [-1, 1]
212// Dynamics: vel += 0.1*act - 0.01*pos; pos += vel
213// Reward: -(pos^2) - 0.1*act^2 ; Episode ends if |pos| > 3 or step >= max_steps
214// -------------------------------
215
216struct YardEnv {
217 pos: f32,
218 vel: f32,
219 steps: usize,
220 max_steps: usize,
221 rng: SmallRng,
222}
223
224impl YardEnv {
225 fn new(seed: u64) -> Self {
226 let mut env = Self {
227 pos: 0.0,
228 vel: 0.0,
229 steps: 0,
230 max_steps: 200,
231 rng: SmallRng::new(seed),
232 };
233 env.reset();
234 env
235 }
236
237 fn reset(&mut self) -> Tensor {
238 self.pos = self.rng.uniform(-0.5, 0.5);
239 self.vel = self.rng.uniform(-0.1, 0.1);
240 self.steps = 0;
241 self.state_tensor()
242 }
243
244 fn state_tensor(&self) -> Tensor {
245 // Normalize to keep critic inputs bounded:
246 // - Position is bounded by termination at |pos|>3 → scale by 3 to [-1,1]
247 // - Velocity scaled by 1.0 and clamped to [-1,1]
248 let pos_n = self.pos / 3.0;
249 let vel_n = self.vel.clamp(-1.0, 1.0);
250 Tensor::from_slice(&[pos_n, vel_n, 0.0], vec![1, 3]).unwrap()
251 }
252
253 fn step(&mut self, action_value: f32) -> (Tensor, f32, bool) {
254 let a = action_value.clamp(-1.0, 1.0);
255 self.vel += 0.1 * a - 0.01 * self.pos;
256 self.pos += self.vel;
257 self.steps += 1;
258
259 let reward = -(self.pos * self.pos) - 0.1 * (a * a);
260 let done = self.pos.abs() > 3.0 || self.steps >= self.max_steps;
261 (self.state_tensor(), reward, done)
262 }
263}
264
265// -------------------------------
266// Replay Buffer
267// -------------------------------
268
269struct ReplayBuffer {
270 capacity: usize,
271 size: usize,
272 pos: usize,
273 state_dim: usize,
274 action_dim: usize,
275 states: Vec<f32>,
276 actions: Vec<f32>,
277 rewards: Vec<f32>,
278 dones: Vec<f32>,
279 next_states: Vec<f32>,
280}
281
282impl ReplayBuffer {
283 fn new(capacity: usize, state_dim: usize, action_dim: usize) -> Self {
284 Self {
285 capacity,
286 size: 0,
287 pos: 0,
288 state_dim,
289 action_dim,
290 states: vec![0.0; capacity * state_dim],
291 actions: vec![0.0; capacity * action_dim],
292 rewards: vec![0.0; capacity],
293 dones: vec![0.0; capacity],
294 next_states: vec![0.0; capacity * state_dim],
295 }
296 }
297
298 fn push(&mut self, s: &[f32], a: &[f32], r: f32, d: f32, s2: &[f32]) {
299 let i = self.pos;
300 let so = i * self.state_dim;
301 let ao = i * self.action_dim;
302 self.states[so..so + self.state_dim].copy_from_slice(s);
303 self.actions[ao..ao + self.action_dim].copy_from_slice(a);
304 self.rewards[i] = r;
305 self.dones[i] = d;
306 self.next_states[so..so + self.state_dim].copy_from_slice(s2);
307
308 self.pos = (self.pos + 1) % self.capacity;
309 self.size = self.size.saturating_add(1).min(self.capacity);
310 }
311
312 fn can_sample(&self, batch_size: usize) -> bool {
313 self.size >= batch_size
314 }
315
316 fn sample(
317 &self,
318 batch_size: usize,
319 rng: &mut SmallRng,
320 ) -> (Tensor, Tensor, Tensor, Tensor, Tensor) {
321 let mut s_vec = Vec::with_capacity(batch_size * self.state_dim);
322 let mut a_vec = Vec::with_capacity(batch_size * self.action_dim);
323 let mut r_vec = Vec::with_capacity(batch_size);
324 let mut d_vec = Vec::with_capacity(batch_size);
325 let mut s2_vec = Vec::with_capacity(batch_size * self.state_dim);
326
327 for _ in 0..batch_size {
328 let idx = rng.sample_index(self.size);
329 let so = idx * self.state_dim;
330 let ao = idx * self.action_dim;
331 s_vec.extend_from_slice(&self.states[so..so + self.state_dim]);
332 a_vec.extend_from_slice(&self.actions[ao..ao + self.action_dim]);
333 r_vec.push(self.rewards[idx]);
334 d_vec.push(self.dones[idx]);
335 s2_vec.extend_from_slice(&self.next_states[so..so + self.state_dim]);
336 }
337
338 let s = Tensor::from_slice(&s_vec, vec![batch_size, self.state_dim]).unwrap();
339 let a = Tensor::from_slice(&a_vec, vec![batch_size, self.action_dim]).unwrap();
340 let r = Tensor::from_slice(&r_vec, vec![batch_size, 1]).unwrap();
341 let d = Tensor::from_slice(&d_vec, vec![batch_size, 1]).unwrap();
342 let s2 = Tensor::from_slice(&s2_vec, vec![batch_size, self.state_dim]).unwrap();
343 (s, a, r, d, s2)
344 }
345}
346
347// -------------------------------
348// Helper: gradient clipping by global norm
349// -------------------------------
350
351fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
352 // Compute global L2 norm of all grads
353 let mut total_sq = 0.0f32;
354 for p in parameters.iter() {
355 if let Some(g) = p.grad_owned() {
356 for &v in g.data() {
357 total_sq += v * v;
358 }
359 }
360 }
361 let norm = total_sq.sqrt();
362 if norm > max_norm {
363 let scale = max_norm / (norm + eps);
364 for p in parameters.iter_mut() {
365 if let Some(g) = p.grad_owned() {
366 let scaled = g.mul_scalar(scale);
367 p.set_grad(scaled);
368 }
369 }
370 }
371}
372
373// Compute global L2 norm of gradients across a parameter list (read-only)
374fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
375 let mut total_sq = 0.0f32;
376 for p in parameters.iter_mut() {
377 if let Some(g) = p.grad_owned() {
378 for &v in g.data() {
379 total_sq += v * v;
380 }
381 }
382 }
383 total_sq.sqrt()
384}
385
386// Compute L2 norm of parameters (weights/biases) across a parameter list
387fn params_l2_norm(parameters: &mut [&mut Tensor]) -> f32 {
388 let _ng = NoGradTrack::new();
389 let mut total_sq = 0.0f32;
390 for p in parameters.iter_mut() {
391 for &v in p.data() {
392 total_sq += v * v;
393 }
394 }
395 total_sq.sqrt()
396}
397
398// -------------------------------
399// Main: TD3 training on YardEnv
400// -------------------------------
401
402pub fn main() -> Result<(), Box<dyn std::error::Error>> {
403 println!("=== TD3 Example (YardEnv) ===");
404
405 // Environment / problem dims
406 let state_dim = 3usize;
407 let action_dim = 1usize;
408
409 // Hyperparameters (small for demo)
410 let gamma = 0.99f32;
411 let tau = 0.005f32; // Polyak
412 let policy_noise = 0.2f32; // target smoothing noise stddev
413 let exploration_noise = 0.1f32; // behavior policy noise stddev
414 let policy_delay = 2usize;
415 let batch_size = 64usize;
416 let start_steps = 500usize; // random exploration steps
417 let total_steps = 1500usize;
418 let max_grad_norm = 1.0f32;
419
420 // Models
421 let mut actor = Actor::new(state_dim, action_dim, Some(11));
422 let mut actor_targ = Actor::new(state_dim, action_dim, Some(12));
423 actor_targ.net.copy_from(&actor.net);
424 actor_targ.set_requires_grad_all(false);
425
426 let mut critic1 = Critic::new(state_dim, action_dim, Some(21));
427 let mut critic2 = Critic::new(state_dim, action_dim, Some(22));
428 let mut critic1_targ = Critic::new(state_dim, action_dim, Some(23));
429 let mut critic2_targ = Critic::new(state_dim, action_dim, Some(24));
430 critic1_targ.net.copy_from(&critic1.net);
431 critic2_targ.net.copy_from(&critic2.net);
432 critic1_targ.set_requires_grad_all(false);
433 critic2_targ.set_requires_grad_all(false);
434
435 // Optimizers
436 let mut actor_opt = Adam::with_learning_rate(1e-3);
437 for p in actor.parameters() {
438 actor_opt.add_parameter(p);
439 }
440
441 let mut critic_opt = Adam::with_learning_rate(1e-4);
442 for p in critic1.parameters() {
443 critic_opt.add_parameter(p);
444 }
445 for p in critic2.parameters() {
446 critic_opt.add_parameter(p);
447 }
448
449 // Replay buffer and env
450 let mut rb = ReplayBuffer::new(100_000, state_dim, action_dim);
451 let mut env = YardEnv::new(1234);
452 let mut rng = SmallRng::new(987654321);
453
454 // Reset & metric trackers
455 let mut state = env.reset(); // [1, state_dim]
456 let mut episode_return = 0.0f32;
457 let mut episode = 0usize;
458 let mut ema_return: Option<f32> = None;
459 let ema_alpha = 0.05f32; // smooth short-term
460 let mut best_return = f32::NEG_INFINITY;
461 let mut policy_updates: usize = 0;
462
463 for t in 0..total_steps {
464 // Select action
465 let action_tensor = if t < start_steps {
466 let a = rng.uniform(-1.0, 1.0);
467 Tensor::from_slice(&[a], vec![1, action_dim]).unwrap()
468 } else {
469 // Behavior policy with exploration noise
470 let _ng = NoGradTrack::new();
471 let det = actor.forward(&state);
472 let noise = Tensor::randn(vec![1, action_dim], None).mul_scalar(exploration_noise);
473 tanh_bounded(&det.add_tensor(&noise))
474 };
475 let action_value = action_tensor.data()[0];
476
477 // Environment step
478 let (next_state, reward, done) = env.step(action_value);
479 episode_return += reward;
480
481 // Store transition
482 let s_slice = state.data().to_vec();
483 let a_slice = action_tensor.data().to_vec();
484 let s2_slice = next_state.data().to_vec();
485 rb.push(
486 &s_slice,
487 &a_slice,
488 reward,
489 if done { 1.0 } else { 0.0 },
490 &s2_slice,
491 );
492
493 state = if done {
494 let st = env.reset();
495 // Metrics: update EMA and best
496 ema_return = Some(match ema_return {
497 None => episode_return,
498 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
499 });
500 if episode_return > best_return {
501 best_return = episode_return;
502 }
503 println!(
504 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3} | rb_size={} | policy_updates={}",
505 t,
506 episode,
507 episode_return,
508 ema_return.unwrap_or(episode_return),
509 best_return,
510 rb.size,
511 policy_updates
512 );
513 episode_return = 0.0;
514 episode += 1;
515 st
516 } else {
517 next_state
518 };
519
520 // Training
521 if rb.can_sample(batch_size) {
522 // Sample batch
523 let (s, a, r, d, s2) = rb.sample(batch_size, &mut rng);
524
525 // Compute target values y = r + (1-d)*gamma*min(Q1', Q2') using target networks (no grad)
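            // Taking the elementwise min of Q1' and Q2' is TD3's clipped double-Q estimate, which guards against value overestimation.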
526 let target_q = {
527 let _ng = NoGradTrack::new();
528 // Target actions with smoothing noise (tanh bounds)
529 let noise =
530 Tensor::randn(vec![batch_size, action_dim], None).mul_scalar(policy_noise);
531 let a_targ = tanh_bounded(&actor_targ.forward(&s2).add_tensor(&noise));
532 let q1_t = critic1_targ.forward(&s2, &a_targ);
533 let q2_t = critic2_targ.forward(&s2, &a_targ);
534
535 // Elementwise min via data() since this path is no-grad
536 let q1d = q1_t.data();
537 let q2d = q2_t.data();
538 let mut min_vec = Vec::with_capacity(batch_size);
539 for i in 0..batch_size {
540 let v1 = q1d[i];
541 let v2 = q2d[i];
542 min_vec.push(v1.min(v2));
543 }
544 let min_q = Tensor::from_slice(&min_vec, vec![batch_size, 1]).unwrap();
545 let not_done = Tensor::ones(vec![batch_size, 1]).sub_tensor(&d);
546 r.add_tensor(&not_done.mul_scalar(gamma).mul_tensor(&min_q))
547 };
548
549 // Critic update (both critics)
550 // Zero grads in a short scope, then drop borrows before forward
551 {
552 let mut params = {
553 let c_params = critic1.parameters();
554 let c2_params = critic2.parameters();
555 let mut tmp: Vec<&mut Tensor> = Vec::new();
556 tmp.extend(c_params);
557 tmp.extend(c2_params);
558 tmp
559 };
560 critic_opt.zero_grad(&mut params);
561 }
562
563 // Forward current Q estimates
564 let q1 = critic1.forward(&s, &a);
565 let q2 = critic2.forward(&s, &a);
566 let diff1 = q1.sub_tensor(&target_q);
567 let diff2 = q2.sub_tensor(&target_q);
568 let mut critic_loss = diff1
569 .pow_scalar(2.0)
570 .mean()
571 .add_tensor(&diff2.pow_scalar(2.0).mean());
572
573 // Backward
574 critic_loss.backward(None);
575
576 // Optional gradient clipping + step (only for params that received grads)
577 {
578 let params = {
579 let c_params = critic1.parameters();
580 let c2_params = critic2.parameters();
581 let mut tmp: Vec<&mut Tensor> = Vec::new();
582 tmp.extend(c_params);
583 tmp.extend(c2_params);
584 tmp
585 };
586 let mut with_grads: Vec<&mut Tensor> = Vec::new();
587 for p in params {
588 if p.grad_owned().is_some() {
589 with_grads.push(p);
590 }
591 }
592 if !with_grads.is_empty() {
593 // Pre-step metrics
594 let grad_norm_before = grad_global_norm(&mut with_grads);
595 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
596 critic_opt.step(&mut with_grads);
597 critic_opt.zero_grad(&mut with_grads);
598
599 // Post-step metrics (param norm)
600 let mut for_norm_params = {
601 let c_params = critic1.parameters();
602 let c2_params = critic2.parameters();
603 let mut tmp: Vec<&mut Tensor> = Vec::new();
604 tmp.extend(c_params);
605 tmp.extend(c2_params);
606 tmp
607 };
608 let param_norm = params_l2_norm(&mut for_norm_params);
609
610 // Print compact critic metrics occasionally
611 if t % 100 == 0 {
612 let q1_mean = q1.mean().value();
613 let q2_mean = q2.mean().value();
614 let tq_mean = target_q.mean().value();
615 println!(
616 "t={:5} | critic_loss={:.4} | q1_mean={:.3} q2_mean={:.3} tq_mean={:.3} | grad_norm={:.3} | crit_param_norm={:.3}",
617 t,
618 critic_loss.value(),
619 q1_mean,
620 q2_mean,
621 tq_mean,
622 grad_norm_before,
623 param_norm
624 );
625 }
626 }
627 }
628
629 // Delayed policy update
630 if t % policy_delay == 0 {
631 // Actor update: maximize Q1(s, actor(s)) -> minimize -Q1
632 // Zero actor grads before backward
633 {
634 let mut a_params: Vec<&mut Tensor> = actor.parameters();
635 actor_opt.zero_grad(&mut a_params);
636 }
637
638 let a_pred = actor.forward(&s);
639 let q_for_actor = critic1.forward(&s, &a_pred);
640 let mut actor_loss = q_for_actor.mul_scalar(-1.0).mean();
641 actor_loss.backward(None);
642
643 {
644 let a_params: Vec<&mut Tensor> = actor.parameters();
645 let mut with_grads: Vec<&mut Tensor> = Vec::new();
646 for p in a_params {
647 if p.grad_owned().is_some() {
648 with_grads.push(p);
649 }
650 }
651 if !with_grads.is_empty() {
652 let grad_norm_before = grad_global_norm(&mut with_grads);
653 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
654 actor_opt.step(&mut with_grads);
655 actor_opt.zero_grad(&mut with_grads);
656
657 // Post-step param norm
658 let mut for_norm_params = actor.parameters();
659 let param_norm = params_l2_norm(&mut for_norm_params);
660
661 policy_updates += 1;
662 if t % 200 == 0 {
663 println!(
664 "t={:5} | actor_loss={:.4} | act_grad_norm={:.3} | act_param_norm={:.3} | lr_a={:.4e} lr_c={:.4e} | policy_updates={}",
665 t,
666 actor_loss.value(),
667 grad_norm_before,
668 param_norm,
669 actor_opt.learning_rate(),
670 critic_opt.learning_rate(),
671 policy_updates
672 );
673 }
674 }
675 }
676
677 // Target updates (Polyak averaging, no grad)
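            // Polyak update: target <- (1 - tau) * target + tau * online, applied to every weight and bias.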
678 actor_targ.net.soft_update_from(&actor.net, tau);
679 critic1_targ.net.soft_update_from(&critic1.net, tau);
680 critic2_targ.net.soft_update_from(&critic2.net, tau);
681 }
682
683 // Clear entire graphs to avoid stale accumulation across iterations
684 clear_all_graphs_known();
685 }
686 }
687
688 println!("=== TD3 training finished ===");
689 Ok(())
690}

293fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
294 let mut total_sq = 0.0f32;
295 for p in parameters.iter() {
296 if let Some(g) = p.grad_owned() {
297 for &v in g.data() {
298 total_sq += v * v;
299 }
300 }
301 }
302 let norm = total_sq.sqrt();
303 if norm > max_norm {
304 let scale = max_norm / (norm + eps);
305 for p in parameters.iter_mut() {
306 if let Some(g) = p.grad_owned() {
307 p.set_grad(g.mul_scalar(scale));
308 }
309 }
310 }
311}
312
313fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
314 let mut total_sq = 0.0f32;
315 for p in parameters.iter_mut() {
316 if let Some(g) = p.grad_owned() {
317 for &v in g.data() {
318 total_sq += v * v;
319 }
320 }
321 }
322 total_sq.sqrt()
323}
324
325// -------------------------------
326// Main
327// -------------------------------
328
329pub fn main() -> Result<(), Box<dyn std::error::Error>> {
330 println!("=== PPO Continuous Example (YardEnv) ===");
331
332 let state_dim = 3usize;
333 let action_dim = 1usize;
334
335 // Hparams
336 let total_steps = std::env::var("PPO_STEPS")
337 .ok()
338 .and_then(|v| v.parse::<usize>().ok())
339 .unwrap_or(4000usize);
340 let horizon = 128usize; // rollout length per update
341 let epochs = 4usize; // PPO epochs per update
342 let mini_batch_size = 64usize; // minibatch from horizon
343 let gamma = 0.99f32;
344 let lam = 0.95f32; // GAE lambda
345 let clip_eps = 0.2f32;
346 let vf_coef = 0.5f32;
347 let ent_coef = 0.0f32;
348 let max_grad_norm = 1.0f32;
349
350 // Models
351 let mut actor = Actor::new(state_dim, action_dim, Some(101));
352 let mut critic = Critic::new(state_dim, Some(202));
353
354 // Opts
355 let mut actor_opt = Adam::with_learning_rate(3e-4);
356 for p in actor.parameters() {
357 actor_opt.add_parameter(p);
358 }
359 let mut critic_opt = Adam::with_learning_rate(3e-4);
360 for p in critic.parameters() {
361 critic_opt.add_parameter(p);
362 }
363
364 // Env and RNG
365 let mut env = YardEnv::new(42);
366 let mut rng = SmallRng::new(999);
367 let mut state = env.reset();
368
369 // Metrics
370 let mut episode_return = 0.0f32;
371 let mut episode = 0usize;
372 let mut ema_return: Option<f32> = None;
373 let ema_alpha = 0.05f32;
374 let mut best_return = f32::NEG_INFINITY;
375
376 let mut t = 0usize;
377 while t < total_steps {
378 // Collect a rollout
379 let mut batch = RolloutBatch::new(horizon, state_dim);
380 for _ in 0..horizon {
381 // Policy forward (sampling is detached so the rollout does not grow the graph; we use stored log_probs)
382 let (mean, log_std_row) = actor.forward(&state);
383 let mean_v = mean.data()[0];
384 let log_std_v = log_std_row.data()[0];
385 let std_v = log_std_v.exp();
386 let noise = rng.normal();
387 let action_v = (mean_v + std_v * noise).clamp(-1.0, 1.0);
388
389 // Build action tensor [1, A] for log_prob calculation with autograd
390 let action_t = Tensor::from_slice(&[action_v], vec![1, action_dim]).unwrap();
391 let log_prob_t = gaussian_log_prob(&action_t, &mean, &log_std_row);
392 let log_prob_v = log_prob_t.data()[0];
393
394 // Step env
395 let (next_state, reward, done) = env.step(action_v);
396 episode_return += reward;
397
398 // Value
399 let value_t = critic.forward(&state);
400 let value_v = value_t.data()[0];
401
402 // Push
403 batch.push(
404 state.data(),
405 action_v,
406 log_prob_v,
407 reward,
408 if done { 1.0 } else { 0.0 },
409 value_v,
410 next_state.data(),
411 );
412
413 // Reset
414 state = if done {
415 let st = env.reset();
416 ema_return = Some(match ema_return {
417 None => episode_return,
418 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
419 });
420 if episode_return > best_return {
421 best_return = episode_return;
422 }
423 println!(
424 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
425 t,
426 episode,
427 episode_return,
428 ema_return.unwrap_or(episode_return),
429 best_return
430 );
431 episode_return = 0.0;
432 episode += 1;
433 st
434 } else {
435 next_state
436 };
437
438 t += 1;
439 if t >= total_steps {
440 break;
441 }
442 }
443
444 // Bootstrap next values for GAE
445 let next_values: Vec<f32> = {
446 let mut out = Vec::with_capacity(batch.len());
447 for i in 0..batch.len() {
448 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
449 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
450 let v2 = critic.forward(&s2_t).data()[0];
451 out.push(v2);
452 }
453 out
454 };
455
456 // Compute returns and advantages
457 let mut returns = vec![0.0f32; batch.len()];
458 let mut adv = vec![0.0f32; batch.len()];
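        // compute_gae is assumed to follow the standard GAE recursion:
        //   delta_t = r_t + gamma * (1 - d_t) * V(s_{t+1}) - V(s_t)
        //   A_t = delta_t + gamma * lambda * (1 - d_t) * A_{t+1}, with returns_t = A_t + V(s_t)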
459 compute_gae(
460 &mut returns,
461 &mut adv,
462 &batch.rewards,
463 &batch.dones,
464 &batch.values,
465 &next_values,
466 gamma,
467 lam,
468 );
469 normalize_in_place(&mut adv, 1e-8);
470
471 // Prepare tensors for training
472 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
473 let actions_t = Tensor::from_slice(&batch.actions, vec![batch.len(), action_dim]).unwrap();
474 let old_logp_t = Tensor::from_slice(&batch.log_probs, vec![batch.len(), 1]).unwrap();
475 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
476 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
477
478 // PPO epochs over the rollout
479 let num_minibatches = batch.len().div_ceil(mini_batch_size);
480 for e in 0..epochs {
481 for mb in 0..num_minibatches {
482 let start = mb * mini_batch_size;
483 let end = (start + mini_batch_size).min(batch.len());
484 if start >= end {
485 break;
486 }
487
488 // Slice views
489 let s_mb = states_t.slice_view(start * state_dim, 1, (end - start) * state_dim);
490 let s_mb = s_mb.reshape(vec![(end - start) as i32, state_dim as i32]);
491 let a_mb = actions_t
492 .slice_view(start * action_dim, 1, (end - start) * action_dim)
493 .reshape(vec![(end - start) as i32, action_dim as i32]);
494 let oldlp_mb = old_logp_t
495 .slice_view(start, 1, end - start)
496 .reshape(vec![(end - start) as i32, 1]);
497 let ret_mb = returns_t
498 .slice_view(start, 1, end - start)
499 .reshape(vec![(end - start) as i32, 1]);
500 let adv_mb = adv_t
501 .slice_view(start, 1, end - start)
502 .reshape(vec![(end - start) as i32, 1]);
503
504 // Zero grads
505 {
506 let mut ps = actor.parameters();
507 actor_opt.zero_grad(&mut ps);
508 }
509 {
510 let mut ps = critic.parameters();
511 critic_opt.zero_grad(&mut ps);
512 }
513
514 // Forward actor and critic
515 let (mean_mb, log_std_row) = actor.forward(&s_mb);
516 let logp_mb = gaussian_log_prob(&a_mb, &mean_mb, &log_std_row);
517 let ratio = logp_mb.sub_tensor(&oldlp_mb).exp(); // exp(new-old)
518 let clip_low =
519 Tensor::from_slice(&vec![1.0 - clip_eps; end - start], vec![end - start, 1])
520 .unwrap();
521 let clip_high =
522 Tensor::from_slice(&vec![1.0 + clip_eps; end - start], vec![end - start, 1])
523 .unwrap();
524 // ratio_clipped = min(max(ratio, low), high) using ReLU identities
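                // Identities used below: max(x, low) = relu(x - low) + low and min(x, high) = high - relu(high - x).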
525 let ratio_ge_low = ratio.sub_tensor(&clip_low).relu().add_tensor(&clip_low);
526 let ratio_clipped =
527 clip_high.sub_tensor(&clip_high.sub_tensor(&ratio_ge_low).relu());
528 let pg1 = ratio.mul_tensor(&adv_mb);
529 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
530 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
531 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
532 let actor_loss = actor_min.mul_scalar(-1.0).mean();
533
534 let v_pred = critic.forward(&s_mb);
535 let v_loss = v_pred
536 .sub_tensor(&ret_mb)
537 .pow_scalar(2.0)
538 .mean()
539 .mul_scalar(vf_coef);
540
541 // Entropy (approx Gaussian entropy per action)
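                // For a Gaussian policy, H = 0.5 * ln(2 * pi * e * sigma^2) = log_std + 0.5 * ln(2 * pi * e) per action dimension.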
542 let entropy = log_std_row
543 .add_scalar(0.5 * (2.0 * std::f32::consts::PI * std::f32::consts::E).ln())
544 .sum_dims(&[1], true)
545 .mean()
546 .mul_scalar(ent_coef);
547
548 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&entropy);
549 loss.backward(None);
550
551 // Step actor
552 {
553 let params = actor.parameters();
554 let mut with_grads: Vec<&mut Tensor> = Vec::new();
555 for p in params {
556 if p.grad_owned().is_some() {
557 with_grads.push(p);
558 }
559 }
560 if !with_grads.is_empty() {
561 let _ = grad_global_norm(&mut with_grads);
562 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
563 actor_opt.step(&mut with_grads);
564 actor_opt.zero_grad(&mut with_grads);
565 }
566 }
567
568 // Step critic
569 {
570 let params = critic.parameters();
571 let mut with_grads: Vec<&mut Tensor> = Vec::new();
572 for p in params {
573 if p.grad_owned().is_some() {
574 with_grads.push(p);
575 }
576 }
577 if !with_grads.is_empty() {
578 let _ = grad_global_norm(&mut with_grads);
579 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
580 critic_opt.step(&mut with_grads);
581 critic_opt.zero_grad(&mut with_grads);
582 }
583 }
584
585 // Occasionally log
586 if e == 0 && mb == 0 {
587 println!(
588 "update@t={} | actor_loss={:.4} v_loss={:.4}",
589 t,
590 actor_loss.value(),
591 v_loss.value()
592 );
593 }
594
595 clear_all_graphs_known();
596 }
597 }
598 }
599
600 println!("=== PPO training finished ===");
601 Ok(())
602}

252fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
253 let mut total_sq = 0.0f32;
254 for p in parameters.iter() {
255 if let Some(g) = p.grad_owned() {
256 for &v in g.data() {
257 total_sq += v * v;
258 }
259 }
260 }
261 let norm = total_sq.sqrt();
262 if norm > max_norm {
263 let scale = max_norm / (norm + eps);
264 for p in parameters.iter_mut() {
265 if let Some(g) = p.grad_owned() {
266 p.set_grad(g.mul_scalar(scale));
267 }
268 }
269 }
270}
271
272// log-softmax for selected actions: given logits [B,A] and actions Vec<usize> -> log_prob [B,1]
273fn log_prob_actions(
274 logits: &Tensor,
275 actions: &[usize],
276 batch: usize,
277 _action_dim: usize,
278) -> Tensor {
279 let max_logits = logits.max_dims(&[1], true); // [B,1]
280 let shifted = logits.sub_tensor(&max_logits);
281 let exp = shifted.exp();
282 let sum_exp = exp.sum_dims(&[1], true); // [B,1]
283 let log_sum_exp = sum_exp.log(); // [B,1]
284 let log_softmax = shifted.sub_tensor(&log_sum_exp); // [B,A]
285 // gather selected action log-probs
286 log_softmax.gather(1, actions, &[batch, 1])
287}
288
289// probability ratio = exp(new_logp - old_logp)
290fn ratio_from_logps(new_logp: &Tensor, old_logp: &Tensor) -> Tensor {
291 new_logp.sub_tensor(old_logp).exp()
292}
293
294// Clamp ratio to [1-clip, 1+clip] using ReLU-based clamp (no custom ops)
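// Uses max(x, low) = relu(x - low) + low followed by min(x, high) = high - relu(high - x).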
295fn clamp_ratio(ratio: &Tensor, clip_eps: f32) -> Tensor {
296 let b = ratio.shape().dims()[0];
297 let low = Tensor::from_slice(&vec![1.0 - clip_eps; b], vec![b, 1]).unwrap();
298 let high = Tensor::from_slice(&vec![1.0 + clip_eps; b], vec![b, 1]).unwrap();
299 let ge_low = ratio.sub_tensor(&low).relu().add_tensor(&low);
300 high.sub_tensor(&high.sub_tensor(&ge_low).relu())
301}
302
303fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
304 let mut total_sq = 0.0f32;
305 for p in parameters.iter_mut() {
306 if let Some(g) = p.grad_owned() {
307 for &v in g.data() {
308 total_sq += v * v;
309 }
310 }
311 }
312 total_sq.sqrt()
313}
314
315// -------------------------------
316// Main
317// -------------------------------
318
319pub fn main() -> Result<(), Box<dyn std::error::Error>> {
320 println!("=== PPO Discrete Example (YardEnv) ===");
321
322 let state_dim = 3usize;
323 let action_dim = 3usize;
324 let total_steps = std::env::var("PPOD_STEPS")
325 .ok()
326 .and_then(|v| v.parse::<usize>().ok())
327 .unwrap_or(3500usize);
328 let horizon = 128usize;
329 let epochs = 4usize;
330 let mini_batch_size = 64usize;
331 let gamma = 0.99f32;
332 let lam = 0.95f32;
333 let clip_eps = 0.2f32;
334 let vf_coef = 0.5f32;
335 let ent_coef = 0.0f32;
336 let max_grad_norm = 1.0f32;
337
338 let mut actor = Actor::new(state_dim, action_dim, Some(111));
339 let mut critic = Critic::new(state_dim, Some(222));
340 let mut actor_opt = Adam::with_learning_rate(3e-4);
341 for p in actor.parameters() {
342 actor_opt.add_parameter(p);
343 }
344 let mut critic_opt = Adam::with_learning_rate(3e-4);
345 for p in critic.parameters() {
346 critic_opt.add_parameter(p);
347 }
348
349 let mut env = YardEnv::new(1234);
350 let mut rng = SmallRng::new(98765);
351 let mut state = env.reset();
352 let mut episode_return = 0.0f32;
353 let mut episode = 0usize;
354 let mut ema_return: Option<f32> = None;
355 let ema_alpha = 0.05f32;
356 let mut best_return = f32::NEG_INFINITY;
357
358 let mut t = 0usize;
359 while t < total_steps {
360 let mut batch = RolloutBatch::new(horizon, state_dim);
361 for _ in 0..horizon {
362 // Actor logits and categorical sampling
363 let logits = actor.forward(&state); // [1, A]
364 let probs = logits.softmax(1); // [1, A]
365 // sample action from probs (CPU sampling)
366 let p = probs.data();
367 let (p0, p1, _p2) = (p[0], p[1], p[2]);
368 let u = rng.next_f32();
369 let a_idx = if u < p0 {
370 0
371 } else if u < p0 + p1 {
372 1
373 } else {
374 2
375 };
376
377 let old_logp = {
378 let _ng = NoGradTrack::new();
379 let lp = log_prob_actions(&logits, &[a_idx], 1, action_dim);
380 lp.data()[0]
381 };
382
383 // Step env
384 let (next_state, reward, done) = env.step(a_idx);
385 episode_return += reward;
386
387 // Critic value
388 let value_t = critic.forward(&state);
389 let value_v = value_t.data()[0];
390
391 batch.push(
392 state.data(),
393 a_idx,
394 old_logp,
395 reward,
396 if done { 1.0 } else { 0.0 },
397 value_v,
398 next_state.data(),
399 );
400
401 state = if done {
402 let st = env.reset();
403 ema_return = Some(match ema_return {
404 None => episode_return,
405 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
406 });
407 if episode_return > best_return {
408 best_return = episode_return;
409 }
410 println!(
411 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
412 t,
413 episode,
414 episode_return,
415 ema_return.unwrap_or(episode_return),
416 best_return
417 );
418 episode_return = 0.0;
419 episode += 1;
420 st
421 } else {
422 next_state
423 };
424
425 t += 1;
426 if t >= total_steps {
427 break;
428 }
429 }
430
431 // Bootstrap values for GAE
432 let next_values: Vec<f32> = {
433 let mut out = Vec::with_capacity(batch.len());
434 for i in 0..batch.len() {
435 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
436 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
437 out.push(critic.forward(&s2_t).data()[0]);
438 }
439 out
440 };
441
442 let mut returns = vec![0.0f32; batch.len()];
443 let mut adv = vec![0.0f32; batch.len()];
444 compute_gae(
445 &mut returns,
446 &mut adv,
447 &batch.rewards,
448 &batch.dones,
449 &batch.values,
450 &next_values,
451 gamma,
452 lam,
453 );
454 normalize_in_place(&mut adv, 1e-8);
455
456 // Tensors for training
457 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
458 let actions_vec = batch.actions.clone();
459 let old_logp_t = Tensor::from_slice(&batch.old_logps, vec![batch.len(), 1]).unwrap();
460 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
461 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
462
463 // PPO epochs
464 let num_minibatches = batch.len().div_ceil(mini_batch_size);
465 for e in 0..epochs {
466 for mb in 0..num_minibatches {
467 let start = mb * mini_batch_size;
468 let end = (start + mini_batch_size).min(batch.len());
469 if start >= end {
470 break;
471 }
472
473 // Views
474 let s_mb = states_t
475 .slice_view(start * state_dim, 1, (end - start) * state_dim)
476 .reshape(vec![(end - start) as i32, state_dim as i32]);
477 let oldlp_mb = old_logp_t
478 .slice_view(start, 1, end - start)
479 .reshape(vec![(end - start) as i32, 1]);
480 let ret_mb = returns_t
481 .slice_view(start, 1, end - start)
482 .reshape(vec![(end - start) as i32, 1]);
483 let adv_mb = adv_t
484 .slice_view(start, 1, end - start)
485 .reshape(vec![(end - start) as i32, 1]);
486 let a_slice = &actions_vec[start..end];
487
488 // Zero grads
489 {
490 let mut ps = actor.parameters();
491 actor_opt.zero_grad(&mut ps);
492 }
493 {
494 let mut ps = critic.parameters();
495 critic_opt.zero_grad(&mut ps);
496 }
497
498 // Forward
499 let logits_mb = actor.forward(&s_mb); // [B,A]
500 let new_logp_mb = log_prob_actions(&logits_mb, a_slice, end - start, action_dim); // [B,1]
501 let ratio = ratio_from_logps(&new_logp_mb, &oldlp_mb);
502 let ratio_clipped = clamp_ratio(&ratio, clip_eps);
503 let pg1 = ratio.mul_tensor(&adv_mb);
504 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
505 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
506 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
507 let actor_loss = actor_min.mul_scalar(-1.0).mean();
508
509 let v_pred = critic.forward(&s_mb);
510 let v_loss = v_pred
511 .sub_tensor(&ret_mb)
512 .pow_scalar(2.0)
513 .mean()
514 .mul_scalar(vf_coef);
515
516 // Entropy bonus from logits (categorical entropy) ≈ -sum p*logp
517 let probs_mb = logits_mb.softmax(1);
518 let logp_all = probs_mb.add_scalar(1e-8).log();
519 let ent = probs_mb
520 .mul_tensor(&logp_all)
521 .sum_dims(&[1], true)
522 .mul_scalar(-1.0)
523 .mean()
524 .mul_scalar(ent_coef);
525
526 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&ent);
527 loss.backward(None);
528
529 // Step actor
530 {
531 let params = actor.parameters();
532 let mut with_grads: Vec<&mut Tensor> = Vec::new();
533 for p in params {
534 if p.grad_owned().is_some() {
535 with_grads.push(p);
536 }
537 }
538 if !with_grads.is_empty() {
539 let _ = grad_global_norm(&mut with_grads);
540 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
541 actor_opt.step(&mut with_grads);
542 actor_opt.zero_grad(&mut with_grads);
543 }
544 }
545
546 // Step critic
547 {
548 let params = critic.parameters();
549 let mut with_grads: Vec<&mut Tensor> = Vec::new();
550 for p in params {
551 if p.grad_owned().is_some() {
552 with_grads.push(p);
553 }
554 }
555 if !with_grads.is_empty() {
556 let _ = grad_global_norm(&mut with_grads);
557 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
558 critic_opt.step(&mut with_grads);
559 critic_opt.zero_grad(&mut with_grads);
560 }
561 }
562
563 if e == 0 && mb == 0 {
564 println!(
565 "update@t={} | actor_loss={:.4} v_loss={:.4}",
566 t,
567 actor_loss.value(),
568 v_loss.value()
569 );
570 }
571
572 clear_all_graphs_known();
573 }
574 }
575 }
576
577 println!("=== PPO discrete training finished ===");
578 Ok(())
579}

23fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
24 let mut total_sq = 0.0f32;
25 for p in parameters.iter() {
26 if let Some(g) = p.grad_owned() {
27 for &v in g.data() {
28 total_sq += v * v;
29 }
30 }
31 }
32 let norm = total_sq.sqrt();
33 if norm > max_norm {
34 let scale = max_norm / (norm + eps);
35 for p in parameters.iter_mut() {
36 if let Some(g) = p.grad_owned() {
37 p.set_grad(g.mul_scalar(scale));
38 }
39 }
40 }
41}
42
43fn accuracy(pred: &Tensor, targets: &Tensor) -> f32 {
44 // pred: [B,1] with sigmoid; threshold at 0.5
45 let p = pred.data();
46 let t = targets.data();
47 let mut correct = 0usize;
48 for i in 0..p.len() {
49 let yhat = if p[i] >= 0.5 { 1.0 } else { 0.0 };
50 if (yhat - t[i]).abs() < 1e-6 {
51 correct += 1;
52 }
53 }
54 correct as f32 / (p.len() as f32)
55}
56
57// Numerically stable BCE with logits:
58// L = mean( relu(z) - z*y + log(1 + exp(-|z|)) )
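// This is the standard stable form of -[y*ln(sigmoid(z)) + (1-y)*ln(1-sigmoid(z))]; working with |z| keeps exp() from overflowing for large logits.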
59fn bce_with_logits(logits: &Tensor, targets: &Tensor) -> Tensor {
60 let relu_z = logits.relu();
61 let zy = logits.mul_tensor(targets);
62 // |z| = relu(z) + relu(-z)
63 let abs_z = relu_z.add_tensor(&logits.mul_scalar(-1.0).relu());
64 let log_term = abs_z.mul_scalar(-1.0).exp().add_scalar(1.0).log();
65 relu_z.sub_tensor(&zy).add_tensor(&log_term).mean()
66}
67
68pub fn main() -> Result<(), Box<dyn std::error::Error>> {
69 println!("=== Supervised FFN Example (XOR) ===");
70
71 // Dataset: XOR (repeat to form a small batch)
72 let inputs: Vec<f32> = vec![
73 0.0, 0.0, // -> 0
74 0.0, 1.0, // -> 1
75 1.0, 0.0, // -> 1
76 1.0, 1.0, // -> 0
77 ];
78 let targets: Vec<f32> = vec![0.0, 1.0, 1.0, 0.0];
79
80 // Repeat the base patterns to stabilize training
81 let repeats = 64usize; // effective batch = 4 * repeats = 256
82 let mut xs = Vec::with_capacity(repeats * inputs.len());
83 let mut ys = Vec::with_capacity(repeats * targets.len());
84 for _ in 0..repeats {
85 xs.extend_from_slice(&inputs);
86 ys.extend_from_slice(&targets);
87 }
88
89 let batch = xs.len() / 2; // two features
90 let x_t = Tensor::from_slice(&xs, vec![batch, 2]).unwrap();
91 let y_t = Tensor::from_slice(&ys, vec![batch, 1]).unwrap();
92
93 // Model config: 2 -> 32 -> 32 -> 1, final sigmoid via loss path
94 let cfg = FeedForwardConfig {
95 input_size: 2,
96 hidden_sizes: vec![32, 32],
97 output_size: 1,
98 use_bias: true,
99 };
100 let mut net = FeedForwardNetwork::new(cfg, Some(777));
101
102 // Optimizer and parameter linking
103 let mut opt = Adam::with_learning_rate(1e-3);
104 for p in net.parameters() {
105 opt.add_parameter(p);
106 }
107
108 let epochs = 1000usize;
109 let max_grad_norm = 1.0f32;
110 let mut best_loss = f32::INFINITY;
111 let mut best_acc = 0.0f32;
112
113 for e in 0..epochs {
114 // Zero grads each iteration
115 {
116 let mut params = net.parameters();
117 opt.zero_grad(&mut params);
118 }
119
120 // Forward -> logits; use numerically stable BCE-with-logits for loss
121 let logits = net.forward(&x_t);
122 let mut loss = bce_with_logits(&logits, &y_t);
123 loss.backward(None);
124
125 // Step only params with grads
126 {
127 let params = net.parameters();
128 let mut with_grads: Vec<&mut Tensor> = Vec::new();
129 for p in params {
130 if p.grad_owned().is_some() {
131 with_grads.push(p);
132 }
133 }
134 if !with_grads.is_empty() {
135 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
136 opt.step(&mut with_grads);
137 opt.zero_grad(&mut with_grads);
138 }
139 }
140
141 // Metrics (use sigmoid only for reporting accuracy)
142 let preds = logits.sigmoid();
143 let acc = accuracy(&preds, &y_t);
144 if loss.value() < best_loss {
145 best_loss = loss.value();
146 }
147 if acc > best_acc {
148 best_acc = acc;
149 }
150 if e % 10 == 0 || e + 1 == epochs {
151 println!(
152 "epoch {:4} | loss={:.5} acc={:.3} | best_loss={:.5} best_acc={:.3}",
153 e,
154 loss.value(),
155 acc,
156 best_loss,
157 best_acc
158 );
159 }
160
161 // Clear graphs to avoid stale accumulation across epochs
162 clear_all_graphs_known();
163 }
164
165 // Quick sanity check predictions
166 let test = Tensor::from_slice(&inputs, vec![4, 2]).unwrap();
167 let out = net.forward(&test).sigmoid();
168 println!("predictions (approx): {:?}", out.data());
169
170 println!("=== Supervised training finished ===");
171 Ok(())
172}

23fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
24 let mut total_sq = 0.0f32;
25 for p in parameters.iter() {
26 if let Some(g) = p.grad_owned() {
27 for &v in g.data() {
28 total_sq += v * v;
29 }
30 }
31 }
32 let norm = total_sq.sqrt();
33 if norm > max_norm {
34 let scale = max_norm / (norm + eps);
35 for p in parameters.iter_mut() {
36 if let Some(g) = p.grad_owned() {
37 p.set_grad(g.mul_scalar(scale));
38 }
39 }
40 }
41}
42
43// Cross-entropy over logits: CE = -mean(log_softmax(logits)[range, labels])
44fn cross_entropy_logits(
45 logits: &Tensor,
46 labels: &[usize],
47 batch: usize,
48 _num_classes: usize,
49) -> Tensor {
50 // log_softmax = logits - logsumexp(logits, dim=1)
51 let max_logits = logits.max_dims(&[1], true);
52 let shifted = logits.sub_tensor(&max_logits);
53 let exp = shifted.exp();
54 let sum_exp = exp.sum_dims(&[1], true);
55 let log_sum_exp = sum_exp.log();
56 let log_softmax = shifted.sub_tensor(&log_sum_exp);
57 let ll = log_softmax.gather(1, labels, &[batch, 1]); // selected log-probs
58 ll.mul_scalar(-1.0).mean()
59}
60
61fn accuracy_from_logits(
62 logits: &Tensor,
63 labels: &[usize],
64 batch: usize,
65 num_classes: usize,
66) -> f32 {
67 let row = logits.data();
68 let mut correct = 0usize;
69 for (i, &label) in labels.iter().enumerate().take(batch) {
70 let base = i * num_classes;
71 let mut best_j = 0usize;
72 let mut best_v = row[base];
73 for j in 1..num_classes {
74 let v = row[base + j];
75 if v > best_v {
76 best_v = v;
77 best_j = j;
78 }
79 }
80 if best_j == label {
81 correct += 1;
82 }
83 }
84 correct as f32 / batch as f32
85}
86
87pub fn main() -> Result<(), Box<dyn std::error::Error>> {
88 println!("=== Supervised Classification Example (Cross-Entropy) ===");
89
90 // Synthetic 2D inputs, 3 classes with linear-ish separations
91 let n = 1200usize;
92 let classes = 3usize;
93 let mut xs: Vec<f32> = Vec::with_capacity(n * 2);
94 let mut ys: Vec<usize> = Vec::with_capacity(n);
95
96 // Simple RNG
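    // Tiny linear congruential generator, good enough for synthesizing this toy dataset.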
97 let mut state: u64 = 424242;
98 let mut rand_f32 = || {
99 state = state.wrapping_mul(1664525).wrapping_add(1013904223);
100 ((state >> 16) as u32) as f32 / (u32::MAX as f32)
101 };
102
103 for _ in 0..n {
104 let x1 = rand_f32() * 4.0 - 2.0;
105 let x2 = rand_f32() * 4.0 - 2.0;
106 // Class by quadrant-ish rule with noise
107 let mut c = if x1 + 0.5 * x2 > 0.5 {
108 0
109 } else if x1 - x2 < -0.5 {
110 1
111 } else {
112 2
113 };
114 if rand_f32() < 0.05 {
115 c = (c + 1) % classes;
116 }
117 xs.push(x1);
118 xs.push(x2);
119 ys.push(c);
120 }
121
122 // Normalize inputs per-feature to [-1, 1]
123 let mut min1 = f32::INFINITY;
124 let mut max1 = f32::NEG_INFINITY;
125 let mut min2 = f32::INFINITY;
126 let mut max2 = f32::NEG_INFINITY;
127 for i in (0..xs.len()).step_by(2) {
128 let a = xs[i];
129 let b = xs[i + 1];
130 if a < min1 {
131 min1 = a;
132 }
133 if a > max1 {
134 max1 = a;
135 }
136 if b < min2 {
137 min2 = b;
138 }
139 if b > max2 {
140 max2 = b;
141 }
142 }
143 let rng1 = (max1 - min1).max(1e-8);
144 let rng2 = (max2 - min2).max(1e-8);
145 for i in (0..xs.len()).step_by(2) {
146 let a = xs[i];
147 let b = xs[i + 1];
148 xs[i] = 2.0 * (a - min1) / rng1 - 1.0;
149 xs[i + 1] = 2.0 * (b - min2) / rng2 - 1.0;
150 }
151
152 // Train/Val split (80/20)
153 let n_train = (n as f32 * 0.8) as usize;
154 let x_train = Tensor::from_slice(&xs[..n_train * 2], vec![n_train, 2]).unwrap();
155 let y_train = ys[..n_train].to_vec();
156 let x_val = Tensor::from_slice(&xs[n_train * 2..], vec![n - n_train, 2]).unwrap();
157 let y_val = ys[n_train..].to_vec();
158
159 // Model: 2 -> 64 -> 64 -> 3 (logits)
160 let cfg = FeedForwardConfig {
161 input_size: 2,
162 hidden_sizes: vec![64, 64],
163 output_size: classes,
164 use_bias: true,
165 };
166 let mut net = FeedForwardNetwork::new(cfg, Some(303));
167
168 // Optimizer
169 let mut opt = Adam::with_learning_rate(1e-3);
170 for p in net.parameters() {
171 opt.add_parameter(p);
172 }
173
174 let epochs = 300usize;
175 let max_grad_norm = 1.0f32;
176 let mut best_val_acc = 0.0f32;
177 let mut best_val_loss = f32::INFINITY;
178
179 for e in 0..epochs {
180 // Zero grads
181 {
182 let mut params = net.parameters();
183 opt.zero_grad(&mut params);
184 }
185
186 // Forward logits
187 let logits = net.forward(&x_train);
188 let mut loss = cross_entropy_logits(&logits, &y_train, n_train, classes);
189 loss.backward(None);
190
191 // Step clipped
192 {
193 let params = net.parameters();
194 let mut with_grads: Vec<&mut Tensor> = Vec::new();
195 for p in params {
196 if p.grad_owned().is_some() {
197 with_grads.push(p);
198 }
199 }
200 if !with_grads.is_empty() {
201 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
202 opt.step(&mut with_grads);
203 opt.zero_grad(&mut with_grads);
204 }
205 }
206
207 // Metrics
208 let train_acc = accuracy_from_logits(&logits, &y_train, n_train, classes);
209 let val_logits = net.forward(&x_val);
210 let val_loss = cross_entropy_logits(&val_logits, &y_val, n - n_train, classes).value();
211 let val_acc = accuracy_from_logits(&val_logits, &y_val, n - n_train, classes);
212 if val_acc > best_val_acc {
213 best_val_acc = val_acc;
214 }
215 if val_loss < best_val_loss {
216 best_val_loss = val_loss;
217 }
218
219 if e % 10 == 0 || e + 1 == epochs {
220 println!(
221 "epoch {:4} | loss={:.4} acc={:.3} | val_loss={:.4} val_acc={:.3} | best_val_acc={:.3}",
222 e, loss.value(), train_acc, val_loss, val_acc, best_val_acc
223 );
224 }
225
226 clear_all_graphs_known();
227 }
228
229 // Quick sample preds via softmax
230 let samples = Tensor::from_slice(&[-1.0, -1.0, 0.0, 0.0, 1.0, 1.0], vec![3, 2]).unwrap();
231 let sm = net.forward(&samples).softmax(1);
232 println!("sample class probs: {:?}", sm.data());
233
234 println!("=== Supervised classification finished ===");
235 Ok(())
236}

- examples/supervised_training/supervised_regression.rs
- examples/getting_started/tensor_operators.rs
- examples/RL_training/../neural_networks/basic_linear_layer.rs
- examples/iterators/element_iteration.rs
- examples/getting_started/tensor_basics.rs
- examples/neural_networks/feedforward_network.rs
- examples/getting_started/serialization_basics.rs
- examples/getting_started/optimizer_basics.rs
- examples/neural_networks/multi_head_attention.rs
- examples/iterators/advanced_patterns.rs
- examples/iterators/performance_optimization.rs
pub fn data_mut(&mut self) -> &mut [f32]
Returns a mutable slice of the tensor’s underlying data
Provides safe mutable access to the tensor’s data without requiring unsafe pointer operations. Use this for in-place modifications of tensor values.
§Returns
A mutable slice containing all tensor elements in row-major order
§Performance
- Zero-Cost: Direct slice creation with no copying
- Cache-Friendly: Sequential memory access pattern
- Safe: No unsafe code required for basic data modification
§Examples
use train_station::Tensor;
let mut tensor = Tensor::new(vec![2, 2]);
let data = tensor.data_mut();
// Safe indexing for modification
data[0] = 1.0;
data[1] = 2.0;
assert_eq!(tensor.get(&[0, 0]), 1.0);
assert_eq!(tensor.get(&[0, 1]), 2.0);
Examples found in repository
108 fn copy_from(&mut self, other: &Self) {
109 for (t, s) in self.layers.iter_mut().zip(other.layers.iter()) {
110 {
111 let src = s.weight.data();
112 let dst = t.weight.data_mut();
113 dst.copy_from_slice(src);
114 }
115 {
116 let src = s.bias.data();
117 let dst = t.bias.data_mut();
118 dst.copy_from_slice(src);
119 }
120 t.weight.set_requires_grad(false);
121 t.bias.set_requires_grad(false);
122 }
123 }
More examples
114 fn copy_from(&mut self, other: &Self) {
115 for (t, s) in self.layers.iter_mut().zip(other.layers.iter()) {
116 {
117 let src = s.weight.data();
118 let dst = t.weight.data_mut();
119 dst.copy_from_slice(src);
120 }
121 {
122 let src = s.bias.data();
123 let dst = t.bias.data_mut();
124 dst.copy_from_slice(src);
125 }
126 t.weight.set_requires_grad(false);
127 t.bias.set_requires_grad(false);
128 }
129 }
130
131 fn soft_update_from(&mut self, source: &Self, tau: f32) {
132 let _ng = NoGradTrack::new();
133 for (t, s) in self.layers.iter_mut().zip(source.layers.iter()) {
134 // In-place Polyak update to preserve tensor IDs (no optimizer relink needed)
135 let new_w = t
136 .weight
137 .mul_scalar(1.0 - tau)
138 .add_tensor(&s.weight.mul_scalar(tau));
139 let new_b = t
140 .bias
141 .mul_scalar(1.0 - tau)
142 .add_tensor(&s.bias.mul_scalar(tau));
143 {
144 let src = new_w.data();
145 let dst = t.weight.data_mut();
146 dst.copy_from_slice(src);
147 }
148 {
149 let src = new_b.data();
150 let dst = t.bias.data_mut();
151 dst.copy_from_slice(src);
152 }
153 t.weight.set_requires_grad(false);
154 t.bias.set_requires_grad(false);
155 }
156 }

42fn demonstrate_tensor_creation() {
43 println!("--- Tensor Creation ---");
44
45 // Create tensors with different initializations
46 let zeros = Tensor::zeros(vec![2, 3]);
47 println!(
48 "Zeros tensor: shape {:?}, data: {:?}",
49 zeros.shape().dims(),
50 zeros.data()
51 );
52
53 let ones = Tensor::ones(vec![3, 2]);
54 println!(
55 "Ones tensor: shape {:?}, data: {:?}",
56 ones.shape().dims(),
57 ones.data()
58 );
59
60 // Create tensor from slice
61 let data = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0];
62 let from_slice = Tensor::from_slice(&data, vec![2, 3]).unwrap();
63 println!(
64 "From slice: shape {:?}, data: {:?}",
65 from_slice.shape().dims(),
66 from_slice.data()
67 );
68
69 // Create tensor with specific value
70 let mut filled = Tensor::new(vec![2, 2]);
71 {
72 let data = filled.data_mut();
73 for value in data.iter_mut() {
74 *value = 42.0;
75 }
76 }
77 println!("Filled with 42: {:?}", filled.data());
78
79 // Create tensor with random data
80 let random = Tensor::randn(vec![2, 2], Some(42));
81 println!(
82 "Random tensor: shape {:?}, data: {:?}",
83 random.shape().dims(),
84 random.data()
85 );
86}

76 pub fn infer_autoregressive(&self, src: &Tensor, max_steps: usize) -> Tensor {
77 let (b, _s, e) = Self::triple(src);
78 let mut memory = src.clone();
79 for enc in &self.encoders {
80 memory = enc.forward(&memory, None);
81 }
82
83 let mut out_seq: Vec<Tensor> = Vec::new();
84 // Start token: zeros
85 let mut current = Tensor::zeros(vec![b, 1, e]);
86 for _step in 0..max_steps {
87 // Build causal mask for length t
88 let t = current.shape().dims()[1];
89 let mut causal = Tensor::ones(vec![b, self.num_heads, t, t]);
90 // Upper triangle as false -> masked for all batches and heads
91 for bb in 0..b {
92 for hh in 0..self.num_heads {
93 for i in 0..t {
94 for j in (i + 1)..t {
95 let offset = causal.memory_offset(&[bb, hh, i, j]);
96 let data = causal.data_mut();
97 data[offset] = 0.0;
98 }
99 }
100 }
101 }
102 let mut step_out = current.clone();
103 for dec in &self.decoders {
104 step_out = dec.forward(&step_out, &memory, Some(&causal), None);
105 }
106 // (Toy) append placeholder token; real models would project last token
107 out_seq.push(step_out.clone());
108 // Append a zero token to grow sequence by 1 for next causal computation
109 current = Tensor::zeros(vec![b, t + 1, e]);
110 }
111 // Simple return of final sequence placeholder
112 current
113 }
114
115 /// Non auto-regressive inference: single forward pass
116 pub fn infer_non_autoregressive(&self, src: &Tensor, tgt_len: usize) -> Tensor {
117 let (b, _s, e) = Self::triple(src);
118 let mut memory = src.clone();
119 for enc in &self.encoders {
120 memory = enc.forward(&memory, None);
121 }
122 let tgt = Tensor::zeros(vec![b, tgt_len, e]);
123 let mut out = tgt.clone();
124 for dec in &self.decoders {
125 out = dec.forward(&out, &memory, None, None);
126 }
127 out
128 }
129
130 /// Helper: build boolean-like causal mask [b, heads, t, t] with 1.0 keep, 0.0 masked
131 fn build_causal_mask_static(batch: usize, heads: usize, t: usize) -> Tensor {
132 let mut mask = Tensor::ones(vec![batch, heads, t, t]);
133 for bb in 0..batch {
134 for hh in 0..heads {
135 for i in 0..t {
136 for j in (i + 1)..t {
137 let offset = mask.memory_offset(&[bb, hh, i, j]);
138 let data = mask.data_mut();
139 data[offset] = 0.0;
140 }
141 }
142 }
143 }
144 mask
145 }

165fn main() -> Result<(), Box<dyn std::error::Error>> {
166 println!("=== Multi-Head Attention Example ===");
167
168 let batch = 2usize;
169 let src_len = 5usize;
170 let tgt_len = 4usize;
171 let embed = 16usize;
172 let heads = 4usize;
173
174 let query = Tensor::randn(vec![batch, tgt_len, embed], Some(7));
175 let key = Tensor::randn(vec![batch, src_len, embed], Some(8));
176 let value = Tensor::randn(vec![batch, src_len, embed], Some(9));
177
178 let mut mha = MultiHeadAttention::new(embed, heads, Some(42));
179
180 // Simple causal mask for target self-attention shape [b, h, tq, tk]
181 let mut mask = Tensor::zeros(vec![batch, heads, tgt_len, src_len]);
182 // Disallow attending to future positions when tgt_len <= src_len by adding -1e9
183 // Here, just demonstrate mask broadcast/add mechanics with a light mask on the first head
184 if src_len >= tgt_len {
185 // set upper triangle to a large negative value for head 0
186 for i in 0..tgt_len {
187 for j in (i + 1)..src_len {
188 let idx = [0usize, 0usize, i, j];
189 // Quick set via data_mut using a slice view
190 let offset = mask.memory_offset(&idx);
191 let data = mask.data_mut();
192 data[offset] = -1e9;
193 }
194 }
195 }
196
197 let out = mha.forward(&query, &key, &value, Some(&mask));
198 println!("Output shape: {:?}", out.shape().dims());
199
200 // Tiny training step to confirm gradients are wired
201 let mut optimizer = Adam::with_learning_rate(0.01);
202 let mut params = mha.parameters();
203 for p in &params {
204 optimizer.add_parameter(p);
205 }
206
207 // Dummy loss = mean of output
208 let mut loss = out.mean();
209 loss.backward(None);
210 optimizer.step(&mut params);
211 optimizer.zero_grad(&mut params);
212
213 println!("Loss: {:.6}", loss.value());
214 println!("=== Done ===");
215 Ok(())
216}

pub fn value(&self) -> f32
Extracts the scalar value from a single-element tensor
This method provides a convenient way to extract the scalar value from tensors that contain exactly one element. This is commonly used with element iterator results and scalar tensor operations.
§Returns
The scalar value contained in this tensor
§Panics
Panics if the tensor does not contain exactly one element
§Examples
use train_station::Tensor;
// Single-element tensor
let scalar = Tensor::from_slice(&[42.0], vec![1]).unwrap();
assert_eq!(scalar.value(), 42.0);
Examples found in repository
46fn rmse(pred: &Tensor, target: &Tensor) -> f32 {
47 mse(pred, target).sqrt().value()
48}
49
50fn r2_score(pred: &Tensor, target: &Tensor) -> f32 {
51 // R^2 = 1 - SS_res / SS_tot
52 let y = target;
53 let y_mean = y.mean();
54 let ss_res = pred.sub_tensor(y).pow_scalar(2.0).sum();
55 let ss_tot = y.sub_tensor(&y_mean).pow_scalar(2.0).sum();
56 let ss_res_v = ss_res.value();
57 let ss_tot_v = ss_tot.value().max(1e-12); // avoid divide by zero
58 1.0 - (ss_res_v / ss_tot_v)
59}
More examples
73fn main() -> Result<(), Box<dyn std::error::Error>> {
74 println!("=== Basic Encoder Example ===");
75
76 let batch = 2usize;
77 let seq = 6usize;
78 let embed = 32usize;
79 let heads = 4usize;
80
81 let input = Tensor::randn(vec![batch, seq, embed], Some(11));
82 let mut enc = EncoderBlock::new(embed, heads, Some(123));
83
84 // Example: no mask (set Some(mask) to use masking)
85 let out = enc.forward(&input, None);
86 println!("Output shape: {:?}", out.shape().dims());
87
88 // Verify gradients/optimization
89 let mut opt = Adam::with_learning_rate(0.01);
90 let mut params = enc.parameters();
91 for p in &params {
92 opt.add_parameter(p);
93 }
94 let mut loss = out.mean();
95 loss.backward(None);
96 opt.step(&mut params);
97 opt.zero_grad(&mut params);
98 println!("Loss: {:.6}", loss.value());
99 println!("=== Done ===");
100 Ok(())
101}

179fn demonstrate_utility_functions() {
180 println!("\n--- Utility Functions ---");
181
182 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
183
184 // Basic properties
185 println!("Shape: {:?}", tensor.shape().dims());
186 println!("Size: {}", tensor.size());
187 println!("Is contiguous: {}", tensor.is_contiguous());
188 println!("Device: {:?}", tensor.device());
189
190 // Mathematical operations
191 let sum = tensor.sum();
192 println!("Sum: {}", sum.value());
193
194 let mean = tensor.mean();
195 println!("Mean: {}", mean.value());
196
197 let norm = tensor.norm();
198 println!("Norm: {}", norm.value());
199
200 // Device placement
201 let cpu_tensor = Tensor::zeros_on_device(vec![3, 3], train_station::Device::cpu());
202 println!(
203 "CPU tensor: shape {:?}, device: {:?}",
204 cpu_tensor.shape().dims(),
205 cpu_tensor.device()
206 );
207}

84fn main() -> Result<(), Box<dyn std::error::Error>> {
85 println!("=== Basic Decoder Example ===");
86
87 let batch = 2usize;
88 let src = 7usize;
89 let tgt = 5usize;
90 let embed = 32usize;
91 let heads = 4usize;
92
93 let memory = Tensor::randn(vec![batch, src, embed], Some(21));
94 let tgt_in = Tensor::randn(vec![batch, tgt, embed], Some(22));
95
96 let mut dec = DecoderBlock::new(embed, heads, Some(456));
97 let out = dec.forward(&tgt_in, &memory, None, None);
98 println!("Output shape: {:?}", out.shape().dims());
99
100 let mut opt = Adam::with_learning_rate(0.01);
101 let mut params = dec.parameters();
102 for p in &params {
103 opt.add_parameter(p);
104 }
105 let mut loss = out.mean();
106 loss.backward(None);
107 opt.step(&mut params);
108 opt.zero_grad(&mut params);
109 println!("Loss: {:.6}", loss.value());
110 println!("=== Done ===");
111 Ok(())
112}

148 pub fn train_non_autoregressive_steps(
149 &mut self,
150 src: &Tensor,
151 tgt: &Tensor,
152 steps: usize,
153 lr: f32,
154 ) {
155 let mut opt = Adam::with_learning_rate(lr);
156 {
157 let params_once = self.parameters();
158 for p in &params_once {
159 opt.add_parameter(p);
160 }
161 }
162 for step in 0..steps {
163 // forward + backward scope (immutable borrow)
164 {
165 let pred = self.forward(src, tgt);
166 let diff = pred.sub_tensor(tgt);
167 let mut loss = diff.pow_scalar(2.0).mean();
168 if step == 0 || step + 1 == steps {
169 println!("NAR train step {}: loss={:.6}", step, loss.value());
170 }
171 loss.backward(None);
172 }
173 // step + zero_grad scope (mutable borrow)
174 let mut params_step = self.parameters();
175 opt.step(&mut params_step);
176 opt.zero_grad(&mut params_step);
177 }
178 }
179
180 /// Auto-regressive training (teacher forcing): predict next token with causal mask
181 pub fn train_autoregressive_steps(
182 &mut self,
183 src: &Tensor,
184 tgt: &Tensor,
185 steps: usize,
186 lr: f32,
187 ) {
188 let mut opt = Adam::with_learning_rate(lr);
189 {
190 let params_once = self.parameters();
191 for p in &params_once {
192 opt.add_parameter(p);
193 }
194 }
195
196 // Build encoder memory once (static dataset demo)
197 let mut memory = src.clone();
198 for enc in &self.encoders {
199 memory = enc.forward(&memory, None);
200 }
201
202 let (b, t, _e) = Self::triple(tgt);
203 // Predict y[t] from y[:t] using causal mask; here we simply predict full seq with mask
204 let causal = Self::build_causal_mask_static(b, self.num_heads, t);
205 for step in 0..steps {
206 // forward + backward scope
207 {
208 let mut out = tgt.clone();
209 for dec in &self.decoders {
210 out = dec.forward(&out, &memory, Some(&causal), None);
211 }
212 let diff = out.sub_tensor(tgt);
213 let mut loss = diff.pow_scalar(2.0).mean();
214 if step == 0 || step + 1 == steps {
215 println!("AR train step {}: loss={:.6}", step, loss.value());
216 }
217 loss.backward(None);
218 }
219 let mut params_step = self.parameters();
220 opt.step(&mut params_step);
221 opt.zero_grad(&mut params_step);
222 }
223 }
224
225 fn triple(t: &Tensor) -> (usize, usize, usize) {
226 let d = t.shape().dims();
227 (d[0], d[1], d[2])
228 }
229}
230
231fn main() -> Result<(), Box<dyn std::error::Error>> {
232 println!("=== Basic Transformer Example ===");
233
234 let batch = 2usize;
235 let src_len = 8usize;
236 let tgt_len = 6usize;
237 let embed = 32usize;
238 let heads = 4usize;
239 let layers = 2usize;
240
241 let src = Tensor::randn(vec![batch, src_len, embed], Some(1001));
242 let tgt = Tensor::randn(vec![batch, tgt_len, embed], Some(1002));
243
244 let mut trf = BasicTransformer::new(embed, heads, layers, Some(999));
245 let out = trf.forward(&src, &tgt);
246 println!("Output shape: {:?}", out.shape().dims());
247
248 // Quick optimization step
249 let mut opt = Adam::with_learning_rate(0.005);
250 let mut params = trf.parameters();
251 for p in &params {
252 opt.add_parameter(p);
253 }
254 let mut loss = out.mean();
255 loss.backward(None);
256 opt.step(&mut params);
257 opt.zero_grad(&mut params);
258 println!("Loss: {:.6}", loss.value());
259
260 // Demo: non auto-regressive inference (single pass)
261 let nar = trf.infer_non_autoregressive(&src, tgt_len);
262 println!("NAR output shape: {:?}", nar.shape().dims());
263
264 // Demo: auto-regressive inference (toy)
265 let ar = trf.infer_autoregressive(&src, 3);
266 println!("AR output shape: {:?}", ar.shape().dims());
267
268 // NAR training demo
269 let nar_tgt = tgt.clone();
270 trf.train_non_autoregressive_steps(&src, &nar_tgt, 3, 0.01);
271
272 // AR training demo (teacher-forced)
273 let ar_tgt = tgt.clone();
274 trf.train_autoregressive_steps(&src, &ar_tgt, 3, 0.01);
275 println!("=== Done ===");
276 Ok(())
277}

93fn demonstrate_basic_iteration() -> Result<(), Box<dyn std::error::Error>> {
94 println!("\n--- Basic Element Iteration ---");
95
96 // Create a simple tensor for demonstration
97 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5])?;
98 println!("Original tensor: {:?}", tensor.data());
99
100 // Basic iteration with for loop
101 println!("\nBasic iteration with for loop:");
102 for (i, element) in tensor.iter().enumerate() {
103 println!(
104 " Element {}: value = {:.1}, shape = {:?}",
105 i,
106 element.value(),
107 element.shape().dims()
108 );
109 }
110
111 // Element-wise transformation
112 println!("\nElement-wise transformation (2x + 1):");
113 let transformed: Tensor = tensor
114 .iter()
115 .map(|elem| elem.mul_scalar(2.0).add_scalar(1.0))
116 .collect();
117 println!(" Result: {:?}", transformed.data());
118
119 // Filtering elements
120 println!("\nFiltering elements (values > 3.0):");
121 let filtered: Tensor = tensor.iter().filter(|elem| elem.value() > 3.0).collect();
122 println!(" Filtered: {:?}", filtered.data());
123
124 Ok(())
125}
126
127/// Demonstrate standard iterator trait methods
128///
129/// Shows compatibility with Rust's standard library iterator methods
130/// and demonstrates various functional programming patterns.
131fn demonstrate_standard_methods() -> Result<(), Box<dyn std::error::Error>> {
132 println!("\n--- Standard Iterator Methods ---");
133
134 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5])?;
135
136 // Using map for transformations
137 println!("\nMap transformation (square each element):");
138 let squared: Tensor = tensor.iter().map(|elem| elem.pow_scalar(2.0)).collect();
139 println!(" Squared: {:?}", squared.data());
140
141 // Using enumerate for indexed operations
142 println!("\nEnumerate with indexed operations:");
143 let indexed: Tensor = tensor
144 .iter()
145 .enumerate()
146 .map(|(i, elem)| elem.add_scalar(i as f32))
147 .collect();
148 println!(" Indexed: {:?}", indexed.data());
149
150 // Using fold for reduction
151 println!("\nFold for sum calculation:");
152 let sum: f32 = tensor.iter().fold(0.0, |acc, elem| acc + elem.value());
153 println!(" Sum: {:.1}", sum);
154
155 // Using find for element search
156 println!("\nFind specific element:");
157 if let Some(found) = tensor.iter().find(|elem| elem.value() == 3.0) {
158 println!(" Found element with value 3.0: {:.1}", found.value());
159 }
160
161 // Using any/all for condition checking
162 println!("\nCondition checking:");
163 let all_positive = tensor.iter().all(|elem| elem.value() > 0.0);
164 let any_large = tensor.iter().any(|elem| elem.value() > 4.0);
165 println!(" All positive: {}", all_positive);
166 println!(" Any > 4.0: {}", any_large);
167
168 Ok(())
169}
170
171/// Demonstrate gradient tracking through element operations
172///
173/// Shows how gradient tracking works seamlessly through iterator
174/// operations, maintaining the computational graph for backpropagation.
175fn demonstrate_gradient_tracking() -> Result<(), Box<dyn std::error::Error>> {
176 println!("\n--- Gradient Tracking ---");
177
178 // Create a tensor with gradient tracking enabled
179 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3])?.with_requires_grad();
180 println!("Input tensor (requires_grad): {:?}", tensor.data());
181
182 // Perform element-wise operations through iteration
183 let result: Tensor = tensor
184 .iter()
185 .map(|elem| {
186 // Apply a complex transformation: (x^2 + 1) * 2
187 elem.pow_scalar(2.0).add_scalar(1.0).mul_scalar(2.0)
188 })
189 .collect();
190
191 println!("Result tensor: {:?}", result.data());
192 println!("Result requires_grad: {}", result.requires_grad());
193
194 // Compute gradients
195 let mut loss = result.sum();
196 loss.backward(None);
197
198 println!("Loss: {:.6}", loss.value());
199 println!("Input gradients: {:?}", tensor.grad().map(|g| g.data()));
200
201 Ok(())
202}
203
204/// Demonstrate advanced iterator patterns
205///
206/// Shows complex iterator chains and advanced functional programming
207/// patterns for sophisticated data processing workflows.
208fn demonstrate_advanced_patterns() -> Result<(), Box<dyn std::error::Error>> {
209 println!("\n--- Advanced Iterator Patterns ---");
210
211 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![6])?;
212 println!("Input tensor: {:?}", tensor.data());
213
214 // Complex chain: enumerate -> filter -> map -> collect
215 println!("\nComplex chain (even indices only, add index to value):");
216 let result: Tensor = tensor
217 .iter()
218 .enumerate()
219 .filter(|(i, _)| i % 2 == 0) // Take even indices
220 .map(|(i, elem)| elem.add_scalar(i as f32)) // Add index to value
221 .collect();
222 println!(" Result: {:?}", result.data());
223
224 // Using take and skip for windowing
225 println!("\nWindowing with take and skip:");
226 let window1: Tensor = tensor.iter().take(3).collect();
227 let window2: Tensor = tensor.iter().skip(2).take(3).collect();
228 println!(" Window 1 (first 3): {:?}", window1.data());
229 println!(" Window 2 (middle 3): {:?}", window2.data());
230
231 // Using rev() for reverse iteration
232 println!("\nReverse iteration:");
233 let reversed: Tensor = tensor.iter().rev().collect();
234 println!(" Reversed: {:?}", reversed.data());
235
236 // Chaining with mathematical operations
237 println!("\nMathematical operation chain:");
238 let math_result: Tensor = tensor
239 .iter()
240 .map(|elem| elem.exp()) // e^x
241 .filter(|elem| elem.value() < 50.0) // Filter large values
242 .map(|elem| elem.log()) // ln(x)
243 .collect();
244 println!(" Math chain result: {:?}", math_result.data());
245
246 // Using zip for element-wise combinations
247 println!("\nElement-wise combination with zip:");
248 let tensor2 = Tensor::from_slice(&[10.0, 20.0, 30.0, 40.0, 50.0, 60.0], vec![6])?;
249 let combined: Tensor = tensor
250 .iter()
251 .zip(tensor2.iter())
252 .map(|(a, b)| a.mul_tensor(&b)) // Element-wise multiplication
253 .collect();
254 println!(" Combined: {:?}", combined.data());
255
256 Ok(())
257}
- examples/neural_networks/multi_head_attention.rs
- examples/optimizers/adam_configurations.rs
- examples/optimizers/learning_rate_scheduling.rs
- examples/getting_started/optimizer_basics.rs
- examples/neural_networks/feedforward_network.rs
- examples/iterators/advanced_patterns.rs
- examples/iterators/performance_optimization.rs
- examples/RL_training/../neural_networks/basic_linear_layer.rs
- examples/supervised_training/supervised_bce.rs
- examples/supervised_training/supervised_classification.rs
- examples/RL_training/dqn.rs
- examples/RL_training/ppo_discrete.rs
- examples/RL_training/ppo_continuous.rs
- examples/RL_training/td3.rs
Sourcepub fn view(&self, new_shape: Vec<i32>) -> Tensor
pub fn view(&self, new_shape: Vec<i32>) -> Tensor
Create a view with a new shape (requires contiguous memory)
Behaves like PyTorch view: tensor must be contiguous and the total
number of elements must remain the same. Supports -1 inference for one dimension.
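For example, a minimal sketch of -1 inference (the data and shapes here are illustrative):
use train_station::Tensor;
let x = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![6]).unwrap();
// The -1 dimension is inferred from the total element count (6 / 2 = 3)
let y = x.view(vec![2, -1]);
assert_eq!(y.shape().dims(), vec![2, 3]);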
§Arguments
new_shape - New shape for the tensor (can contain -1 for inference)
§Returns
A tensor viewing the same data with a new shape
§Examples
use train_station::Tensor;
let x = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![4]).unwrap();
let y = x.view(vec![2, 2]);
assert_eq!(y.shape().dims(), vec![2, 2]);
Examples found in repository
More examples
53 pub fn forward(&self, input: &Tensor, attn_mask: Option<&Tensor>) -> Tensor {
54 let attn = self.mha.forward(input, input, input, attn_mask);
55 let res1 = attn.add_tensor(input);
56
57 // Feed-forward network with ReLU and residual
58 let (b, t, e) = Self::triple(input);
59 let x2d = res1.contiguous().view(vec![(b * t) as i32, e as i32]);
60 let hidden = self.ffn_in.forward(&x2d).relu();
61 let out2d = self.ffn_out.forward(&hidden);
62 let out = out2d.view(vec![b as i32, t as i32, e as i32]);
63 out.add_tensor(&res1)
64 }
56 pub fn forward(
57 &self,
58 tgt: &Tensor,
59 memory: &Tensor,
60 causal_mask: Option<&Tensor>,
61 cross_mask: Option<&Tensor>,
62 ) -> Tensor {
63 let self_attn = self.self_attn.forward(tgt, tgt, tgt, causal_mask);
64 let res1 = self_attn.add_tensor(tgt);
65
66 let cross = self.cross_attn.forward(&res1, memory, memory, cross_mask);
67 let res2 = cross.add_tensor(&res1);
68
69 let (b, t, e) = Self::triple(tgt);
70 let x2d = res2.contiguous().view(vec![(b * t) as i32, e as i32]);
71 let hidden = self.ffn_in.forward(&x2d).relu();
72 let out2d = self.ffn_out.forward(&hidden);
73 let out = out2d.view(vec![b as i32, t as i32, e as i32]);
74 out.add_tensor(&res2)
75 }
120fn demonstrate_shape_operations() {
121 println!("\n--- Shape Operations ---");
122
123 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
124 println!(
125 "Original: shape {:?}, data: {:?}",
126 tensor.shape().dims(),
127 tensor.data()
128 );
129
130 // Reshape (view)
131 let reshaped = tensor.view(vec![3, 2]);
132 println!(
133 "Reshaped to [3, 2]: shape {:?}, data: {:?}",
134 reshaped.shape().dims(),
135 reshaped.data()
136 );
137
138 // Create a different shaped tensor for demonstration
139 let tensor_2d = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
140 println!(
141 "2D tensor: shape {:?}, data: {:?}",
142 tensor_2d.shape().dims(),
143 tensor_2d.data()
144 );
145
146 // Create a 1D tensor
147 let tensor_1d = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![4]).unwrap();
148 println!(
149 "1D tensor: shape {:?}, data: {:?}",
150 tensor_1d.shape().dims(),
151 tensor_1d.data()
152 );
153}
330fn demonstrate_single_vs_batch_inference() {
331 println!("\n--- Single vs Batch Inference ---");
332
333 let layer = LinearLayer::new(4, 3, Some(46));
334
335 // Single inference
336 println!("Single inference:");
337 let single_input = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![1, 4]).unwrap();
338 let single_output = layer.forward_no_grad(&single_input);
339 println!(" Input shape: {:?}", single_input.shape().dims());
340 println!(" Output shape: {:?}", single_output.shape().dims());
341 println!(" Output: {:?}", single_output.data());
342
343 // Batch inference
344 println!("Batch inference:");
345 let batch_input = Tensor::from_slice(
346 &[
347 1.0, 2.0, 3.0, 4.0, // Sample 1
348 5.0, 6.0, 7.0, 8.0, // Sample 2
349 9.0, 10.0, 11.0, 12.0, // Sample 3
350 ],
351 vec![3, 4],
352 )
353 .unwrap();
354 let batch_output = layer.forward_no_grad(&batch_input);
355 println!(" Input shape: {:?}", batch_input.shape().dims());
356 println!(" Output shape: {:?}", batch_output.shape().dims());
357
358 // Verify batch consistency - first sample should match single inference
359 let _first_batch_sample = batch_output.view(vec![3, 3]); // Reshape to access first sample
360 let first_sample_data = &batch_output.data()[0..3]; // First 3 elements
361 let single_sample_data = single_output.data();
362
363 println!("Consistency check:");
364 println!(" Single output: {:?}", single_sample_data);
365 println!(" First batch sample: {:?}", first_sample_data);
366 println!(
367 " Match: {}",
368 single_sample_data
369 .iter()
370 .zip(first_sample_data.iter())
371 .all(|(a, b)| (a - b).abs() < 1e-6)
372 );
373}
Sourcepub fn element_view(&self, index: usize) -> Tensor
pub fn element_view(&self, index: usize) -> Tensor
Create an element view for the specified index
Returns a scalar tensor (shape [1]) that views a single element of the source tensor. Maintains gradient tracking.
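A minimal sketch of gradient flow through element views (assuming gradients scatter back only to the viewed positions, as described for select):
use train_station::Tensor;
let t = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap().with_requires_grad();
// Combine two element views; the graph stays connected to the source tensor
let mut s = t.element_view(0).add_tensor(&t.element_view(2));
s.backward(None);
// Only the viewed positions receive gradient
let grad = t.grad_owned().expect("gradient missing");
assert_eq!(grad.get(&[0]), 1.0);
assert_eq!(grad.get(&[1]), 0.0);
assert_eq!(grad.get(&[2]), 1.0);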
§Arguments
index - Linear index of the element to view
§Returns
A scalar tensor viewing the specified element
§Examples
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let element = tensor.element_view(1);
assert_eq!(element.value(), 2.0);
Examples found in repository
87fn demonstrate_data_pipeline() -> Result<(), Box<dyn std::error::Error>> {
88 println!("\n--- Data Processing Pipeline ---");
89
90 // Simulate raw sensor data with noise
91 let raw_data: Vec<f32> = (0..20)
92 .map(|i| {
93 let base = i as f32 * 0.5;
94 let noise = (i % 3) as f32 * 0.1;
95 base + noise
96 })
97 .collect();
98
99 let tensor = Tensor::from_slice(&raw_data, vec![20])?;
100 println!("Raw sensor data: {:?}", tensor.data());
101
102 // Multi-stage processing pipeline
103 println!("\nProcessing pipeline:");
104 println!("1. Normalize data (z-score)");
105 println!("2. Apply smoothing filter");
106 println!("3. Detect outliers");
107 println!("4. Apply feature scaling");
108
109 // Stage 1: Normalization
110 let mean = tensor.mean().value();
111 let std = tensor.std().value();
112 let normalized: Tensor = tensor
113 .iter()
114 .map(|elem| elem.sub_scalar(mean).div_scalar(std))
115 .collect();
116 println!(
117 " Normalized (mean={:.3}, std={:.3}): {:?}",
118 mean,
119 std,
120 normalized.data()
121 );
122
123 // Stage 2: Smoothing (simple moving average)
124 let smoothed: Tensor = normalized
125 .iter()
126 .enumerate()
127 .map(|(i, elem)| {
128 if i == 0 || i == normalized.size() - 1 {
129 elem.clone()
130 } else {
131 // Simple 3-point average
132 let prev = normalized.element_view(i - 1);
133 let next = normalized.element_view(i + 1);
134 elem.add_tensor(&prev).add_tensor(&next).div_scalar(3.0)
135 }
136 })
137 .collect();
138 println!(" Smoothed: {:?}", smoothed.data());
139
140 // Stage 3: Outlier detection and removal
141 let outlier_threshold = 2.0;
142 let cleaned: Tensor = smoothed
143 .iter()
144 .filter(|elem| elem.value().abs() < outlier_threshold)
145 .collect();
146 println!(
147 " Outliers removed (threshold={}): {:?}",
148 outlier_threshold,
149 cleaned.data()
150 );
151
152 // Stage 4: Feature scaling to [0, 1] range
153 let min_val = cleaned
154 .iter()
155 .map(|e| e.value())
156 .fold(f32::INFINITY, f32::min);
157 let max_val = cleaned
158 .iter()
159 .map(|e| e.value())
160 .fold(f32::NEG_INFINITY, f32::max);
161 let scaled: Tensor = cleaned
162 .iter()
163 .map(|elem| elem.sub_scalar(min_val).div_scalar(max_val - min_val))
164 .collect();
165 println!(" Scaled to [0,1]: {:?}", scaled.data());
166
167 Ok(())
168}
169
170/// Demonstrate conditional processing patterns
171///
172/// Shows how to implement dynamic filtering and transformation
173/// based on data characteristics and conditions.
174fn demonstrate_conditional_processing() -> Result<(), Box<dyn std::error::Error>> {
175 println!("\n--- Conditional Processing ---");
176
177 // Create data with mixed characteristics
178 let data = vec![1.0, -2.0, 3.0, -4.0, 5.0, -6.0, 7.0, -8.0, 9.0, -10.0];
179 let tensor = Tensor::from_slice(&data, vec![10])?;
180 println!("Input data: {:?}", tensor.data());
181
182 // Conditional transformation based on sign
183 println!("\nConditional transformation (positive/negative handling):");
184 let processed: Tensor = tensor
185 .iter()
186 .map(|elem| {
187 let val = elem.value();
188 if val > 0.0 {
189 elem.pow_scalar(2.0) // Square positive values
190 } else {
191 elem.mul_scalar(-1.0).sqrt() // Square root of absolute negative values
192 }
193 })
194 .collect();
195 println!(" Processed: {:?}", processed.data());
196
197 // Adaptive filtering based on local statistics
198 println!("\nAdaptive filtering (remove values > 2 std from local mean):");
199 let window_size = 3;
200 let adaptive_filtered: Tensor = tensor
201 .iter()
202 .enumerate()
203 .filter(|(i, elem)| {
204 let start = i.saturating_sub(window_size / 2);
205 let end = (i + window_size / 2 + 1).min(tensor.size());
206
207 // Calculate local mean and std
208 let local_values: Vec<f32> = (start..end)
209 .map(|j| tensor.element_view(j).value())
210 .collect();
211
212 let local_mean = local_values.iter().sum::<f32>() / local_values.len() as f32;
213 let local_variance = local_values
214 .iter()
215 .map(|v| (v - local_mean).powi(2))
216 .sum::<f32>()
217 / local_values.len() as f32;
218 let local_std = local_variance.sqrt();
219
220 let threshold = local_mean + 2.0 * local_std;
221 elem.value() <= threshold
222 })
223 .map(|(_, elem)| elem)
224 .collect();
225 println!(" Adaptive filtered: {:?}", adaptive_filtered.data());
226
227 // Multi-condition processing
228 println!("\nMulti-condition processing:");
229 let multi_processed: Tensor = tensor
230 .iter()
231 .map(|elem| {
232 let val = elem.value();
233 match () {
234 _ if val > 5.0 => elem.mul_scalar(2.0), // Double large values
235 _ if val < -5.0 => elem.div_scalar(2.0), // Halve small values
236 _ if val.abs() < 2.0 => elem.add_scalar(1.0), // Add 1 to small values
237 _ => elem.clone(), // Keep others unchanged
238 }
239 })
240 .collect();
241 println!(" Multi-condition: {:?}", multi_processed.data());
242
243 Ok(())
244}
Sourcepub fn slice_view(&self, start: usize, step: usize, length: usize) -> Tensor
pub fn slice_view(&self, start: usize, step: usize, length: usize) -> Tensor
Create a slice view of the tensor
Returns a view of a contiguous or strided slice of the source tensor.
§Arguments
start - Starting index
step - Step size (1 for contiguous)
length - Number of elements
§Returns
A tensor viewing the specified slice
§Examples
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5]).unwrap();
let slice = tensor.slice_view(1, 2, 2); // [2.0, 4.0]
assert_eq!(slice.get(&[0]), 2.0);
assert_eq!(slice.get(&[1]), 4.0);
Examples found in repository
319pub fn main() -> Result<(), Box<dyn std::error::Error>> {
320 println!("=== PPO Discrete Example (YardEnv) ===");
321
322 let state_dim = 3usize;
323 let action_dim = 3usize;
324 let total_steps = std::env::var("PPOD_STEPS")
325 .ok()
326 .and_then(|v| v.parse::<usize>().ok())
327 .unwrap_or(3500usize);
328 let horizon = 128usize;
329 let epochs = 4usize;
330 let mini_batch_size = 64usize;
331 let gamma = 0.99f32;
332 let lam = 0.95f32;
333 let clip_eps = 0.2f32;
334 let vf_coef = 0.5f32;
335 let ent_coef = 0.0f32;
336 let max_grad_norm = 1.0f32;
337
338 let mut actor = Actor::new(state_dim, action_dim, Some(111));
339 let mut critic = Critic::new(state_dim, Some(222));
340 let mut actor_opt = Adam::with_learning_rate(3e-4);
341 for p in actor.parameters() {
342 actor_opt.add_parameter(p);
343 }
344 let mut critic_opt = Adam::with_learning_rate(3e-4);
345 for p in critic.parameters() {
346 critic_opt.add_parameter(p);
347 }
348
349 let mut env = YardEnv::new(1234);
350 let mut rng = SmallRng::new(98765);
351 let mut state = env.reset();
352 let mut episode_return = 0.0f32;
353 let mut episode = 0usize;
354 let mut ema_return: Option<f32> = None;
355 let ema_alpha = 0.05f32;
356 let mut best_return = f32::NEG_INFINITY;
357
358 let mut t = 0usize;
359 while t < total_steps {
360 let mut batch = RolloutBatch::new(horizon, state_dim);
361 for _ in 0..horizon {
362 // Actor logits and categorical sampling
363 let logits = actor.forward(&state); // [1, A]
364 let probs = logits.softmax(1); // [1, A]
365 // sample action from probs (CPU sampling)
366 let p = probs.data();
367 let (p0, p1, _p2) = (p[0], p[1], p[2]);
368 let u = rng.next_f32();
369 let a_idx = if u < p0 {
370 0
371 } else if u < p0 + p1 {
372 1
373 } else {
374 2
375 };
376
377 let old_logp = {
378 let _ng = NoGradTrack::new();
379 let lp = log_prob_actions(&logits, &[a_idx], 1, action_dim);
380 lp.data()[0]
381 };
382
383 // Step env
384 let (next_state, reward, done) = env.step(a_idx);
385 episode_return += reward;
386
387 // Critic value
388 let value_t = critic.forward(&state);
389 let value_v = value_t.data()[0];
390
391 batch.push(
392 state.data(),
393 a_idx,
394 old_logp,
395 reward,
396 if done { 1.0 } else { 0.0 },
397 value_v,
398 next_state.data(),
399 );
400
401 state = if done {
402 let st = env.reset();
403 ema_return = Some(match ema_return {
404 None => episode_return,
405 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
406 });
407 if episode_return > best_return {
408 best_return = episode_return;
409 }
410 println!(
411 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
412 t,
413 episode,
414 episode_return,
415 ema_return.unwrap_or(episode_return),
416 best_return
417 );
418 episode_return = 0.0;
419 episode += 1;
420 st
421 } else {
422 next_state
423 };
424
425 t += 1;
426 if t >= total_steps {
427 break;
428 }
429 }
430
431 // Bootstrap values for GAE
432 let next_values: Vec<f32> = {
433 let mut out = Vec::with_capacity(batch.len());
434 for i in 0..batch.len() {
435 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
436 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
437 out.push(critic.forward(&s2_t).data()[0]);
438 }
439 out
440 };
441
442 let mut returns = vec![0.0f32; batch.len()];
443 let mut adv = vec![0.0f32; batch.len()];
444 compute_gae(
445 &mut returns,
446 &mut adv,
447 &batch.rewards,
448 &batch.dones,
449 &batch.values,
450 &next_values,
451 gamma,
452 lam,
453 );
454 normalize_in_place(&mut adv, 1e-8);
455
456 // Tensors for training
457 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
458 let actions_vec = batch.actions.clone();
459 let old_logp_t = Tensor::from_slice(&batch.old_logps, vec![batch.len(), 1]).unwrap();
460 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
461 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
462
463 // PPO epochs
464 let num_minibatches = batch.len().div_ceil(mini_batch_size);
465 for e in 0..epochs {
466 for mb in 0..num_minibatches {
467 let start = mb * mini_batch_size;
468 let end = (start + mini_batch_size).min(batch.len());
469 if start >= end {
470 break;
471 }
472
473 // Views
474 let s_mb = states_t
475 .slice_view(start * state_dim, 1, (end - start) * state_dim)
476 .reshape(vec![(end - start) as i32, state_dim as i32]);
477 let oldlp_mb = old_logp_t
478 .slice_view(start, 1, end - start)
479 .reshape(vec![(end - start) as i32, 1]);
480 let ret_mb = returns_t
481 .slice_view(start, 1, end - start)
482 .reshape(vec![(end - start) as i32, 1]);
483 let adv_mb = adv_t
484 .slice_view(start, 1, end - start)
485 .reshape(vec![(end - start) as i32, 1]);
486 let a_slice = &actions_vec[start..end];
487
488 // Zero grads
489 {
490 let mut ps = actor.parameters();
491 actor_opt.zero_grad(&mut ps);
492 }
493 {
494 let mut ps = critic.parameters();
495 critic_opt.zero_grad(&mut ps);
496 }
497
498 // Forward
499 let logits_mb = actor.forward(&s_mb); // [B,A]
500 let new_logp_mb = log_prob_actions(&logits_mb, a_slice, end - start, action_dim); // [B,1]
501 let ratio = ratio_from_logps(&new_logp_mb, &oldlp_mb);
502 let ratio_clipped = clamp_ratio(&ratio, clip_eps);
503 let pg1 = ratio.mul_tensor(&adv_mb);
504 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
505 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
506 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
507 let actor_loss = actor_min.mul_scalar(-1.0).mean();
508
509 let v_pred = critic.forward(&s_mb);
510 let v_loss = v_pred
511 .sub_tensor(&ret_mb)
512 .pow_scalar(2.0)
513 .mean()
514 .mul_scalar(vf_coef);
515
516 // Entropy bonus from logits (categorical entropy) ≈ -sum p*logp
517 let probs_mb = logits_mb.softmax(1);
518 let logp_all = probs_mb.add_scalar(1e-8).log();
519 let ent = probs_mb
520 .mul_tensor(&logp_all)
521 .sum_dims(&[1], true)
522 .mul_scalar(-1.0)
523 .mean()
524 .mul_scalar(ent_coef);
525
526 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&ent);
527 loss.backward(None);
528
529 // Step actor
530 {
531 let params = actor.parameters();
532 let mut with_grads: Vec<&mut Tensor> = Vec::new();
533 for p in params {
534 if p.grad_owned().is_some() {
535 with_grads.push(p);
536 }
537 }
538 if !with_grads.is_empty() {
539 let _ = grad_global_norm(&mut with_grads);
540 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
541 actor_opt.step(&mut with_grads);
542 actor_opt.zero_grad(&mut with_grads);
543 }
544 }
545
546 // Step critic
547 {
548 let params = critic.parameters();
549 let mut with_grads: Vec<&mut Tensor> = Vec::new();
550 for p in params {
551 if p.grad_owned().is_some() {
552 with_grads.push(p);
553 }
554 }
555 if !with_grads.is_empty() {
556 let _ = grad_global_norm(&mut with_grads);
557 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
558 critic_opt.step(&mut with_grads);
559 critic_opt.zero_grad(&mut with_grads);
560 }
561 }
562
563 if e == 0 && mb == 0 {
564 println!(
565 "update@t={} | actor_loss={:.4} v_loss={:.4}",
566 t,
567 actor_loss.value(),
568 v_loss.value()
569 );
570 }
571
572 clear_all_graphs_known();
573 }
574 }
575 }
576
577 println!("=== PPO discrete training finished ===");
578 Ok(())
579}
More examples
329pub fn main() -> Result<(), Box<dyn std::error::Error>> {
330 println!("=== PPO Continuous Example (YardEnv) ===");
331
332 let state_dim = 3usize;
333 let action_dim = 1usize;
334
335 // Hparams
336 let total_steps = std::env::var("PPO_STEPS")
337 .ok()
338 .and_then(|v| v.parse::<usize>().ok())
339 .unwrap_or(4000usize);
340 let horizon = 128usize; // rollout length per update
341 let epochs = 4usize; // PPO epochs per update
342 let mini_batch_size = 64usize; // minibatch from horizon
343 let gamma = 0.99f32;
344 let lam = 0.95f32; // GAE lambda
345 let clip_eps = 0.2f32;
346 let vf_coef = 0.5f32;
347 let ent_coef = 0.0f32;
348 let max_grad_norm = 1.0f32;
349
350 // Models
351 let mut actor = Actor::new(state_dim, action_dim, Some(101));
352 let mut critic = Critic::new(state_dim, Some(202));
353
354 // Opts
355 let mut actor_opt = Adam::with_learning_rate(3e-4);
356 for p in actor.parameters() {
357 actor_opt.add_parameter(p);
358 }
359 let mut critic_opt = Adam::with_learning_rate(3e-4);
360 for p in critic.parameters() {
361 critic_opt.add_parameter(p);
362 }
363
364 // Env and RNG
365 let mut env = YardEnv::new(42);
366 let mut rng = SmallRng::new(999);
367 let mut state = env.reset();
368
369 // Metrics
370 let mut episode_return = 0.0f32;
371 let mut episode = 0usize;
372 let mut ema_return: Option<f32> = None;
373 let ema_alpha = 0.05f32;
374 let mut best_return = f32::NEG_INFINITY;
375
376 let mut t = 0usize;
377 while t < total_steps {
378 // Collect a rollout
379 let mut batch = RolloutBatch::new(horizon, state_dim);
380 for _ in 0..horizon {
381 // Policy forward (detached sampling to not blow graph; we use stored log_probs)
382 let (mean, log_std_row) = actor.forward(&state);
383 let mean_v = mean.data()[0];
384 let log_std_v = log_std_row.data()[0];
385 let std_v = log_std_v.exp();
386 let noise = rng.normal();
387 let action_v = (mean_v + std_v * noise).clamp(-1.0, 1.0);
388
389 // Build action tensor [1, A] for log_prob calculation with autograd
390 let action_t = Tensor::from_slice(&[action_v], vec![1, action_dim]).unwrap();
391 let log_prob_t = gaussian_log_prob(&action_t, &mean, &log_std_row);
392 let log_prob_v = log_prob_t.data()[0];
393
394 // Step env
395 let (next_state, reward, done) = env.step(action_v);
396 episode_return += reward;
397
398 // Value
399 let value_t = critic.forward(&state);
400 let value_v = value_t.data()[0];
401
402 // Push
403 batch.push(
404 state.data(),
405 action_v,
406 log_prob_v,
407 reward,
408 if done { 1.0 } else { 0.0 },
409 value_v,
410 next_state.data(),
411 );
412
413 // Reset
414 state = if done {
415 let st = env.reset();
416 ema_return = Some(match ema_return {
417 None => episode_return,
418 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
419 });
420 if episode_return > best_return {
421 best_return = episode_return;
422 }
423 println!(
424 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
425 t,
426 episode,
427 episode_return,
428 ema_return.unwrap_or(episode_return),
429 best_return
430 );
431 episode_return = 0.0;
432 episode += 1;
433 st
434 } else {
435 next_state
436 };
437
438 t += 1;
439 if t >= total_steps {
440 break;
441 }
442 }
443
444 // Bootstrap next values for GAE
445 let next_values: Vec<f32> = {
446 let mut out = Vec::with_capacity(batch.len());
447 for i in 0..batch.len() {
448 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
449 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
450 let v2 = critic.forward(&s2_t).data()[0];
451 out.push(v2);
452 }
453 out
454 };
455
456 // Compute returns and advantages
457 let mut returns = vec![0.0f32; batch.len()];
458 let mut adv = vec![0.0f32; batch.len()];
459 compute_gae(
460 &mut returns,
461 &mut adv,
462 &batch.rewards,
463 &batch.dones,
464 &batch.values,
465 &next_values,
466 gamma,
467 lam,
468 );
469 normalize_in_place(&mut adv, 1e-8);
470
471 // Prepare tensors for training
472 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
473 let actions_t = Tensor::from_slice(&batch.actions, vec![batch.len(), action_dim]).unwrap();
474 let old_logp_t = Tensor::from_slice(&batch.log_probs, vec![batch.len(), 1]).unwrap();
475 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
476 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
477
478 // PPO epochs over the rollout
479 let num_minibatches = batch.len().div_ceil(mini_batch_size);
480 for e in 0..epochs {
481 for mb in 0..num_minibatches {
482 let start = mb * mini_batch_size;
483 let end = (start + mini_batch_size).min(batch.len());
484 if start >= end {
485 break;
486 }
487
488 // Slice views
489 let s_mb = states_t.slice_view(start * state_dim, 1, (end - start) * state_dim);
490 let s_mb = s_mb.reshape(vec![(end - start) as i32, state_dim as i32]);
491 let a_mb = actions_t
492 .slice_view(start * action_dim, 1, (end - start) * action_dim)
493 .reshape(vec![(end - start) as i32, action_dim as i32]);
494 let oldlp_mb = old_logp_t
495 .slice_view(start, 1, end - start)
496 .reshape(vec![(end - start) as i32, 1]);
497 let ret_mb = returns_t
498 .slice_view(start, 1, end - start)
499 .reshape(vec![(end - start) as i32, 1]);
500 let adv_mb = adv_t
501 .slice_view(start, 1, end - start)
502 .reshape(vec![(end - start) as i32, 1]);
503
504 // Zero grads
505 {
506 let mut ps = actor.parameters();
507 actor_opt.zero_grad(&mut ps);
508 }
509 {
510 let mut ps = critic.parameters();
511 critic_opt.zero_grad(&mut ps);
512 }
513
514 // Forward actor and critic
515 let (mean_mb, log_std_row) = actor.forward(&s_mb);
516 let logp_mb = gaussian_log_prob(&a_mb, &mean_mb, &log_std_row);
517 let ratio = logp_mb.sub_tensor(&oldlp_mb).exp(); // exp(new-old)
518 let clip_low =
519 Tensor::from_slice(&vec![1.0 - clip_eps; end - start], vec![end - start, 1])
520 .unwrap();
521 let clip_high =
522 Tensor::from_slice(&vec![1.0 + clip_eps; end - start], vec![end - start, 1])
523 .unwrap();
524 // ratio_clipped = min(max(ratio, low), high) using ReLU identities
525 let ratio_ge_low = ratio.sub_tensor(&clip_low).relu().add_tensor(&clip_low);
526 let ratio_clipped =
527 clip_high.sub_tensor(&ratio_ge_low.sub_tensor(&clip_high).relu());
528 let pg1 = ratio.mul_tensor(&adv_mb);
529 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
530 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
531 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
532 let actor_loss = actor_min.mul_scalar(-1.0).mean();
533
534 let v_pred = critic.forward(&s_mb);
535 let v_loss = v_pred
536 .sub_tensor(&ret_mb)
537 .pow_scalar(2.0)
538 .mean()
539 .mul_scalar(vf_coef);
540
541 // Entropy (approx Gaussian entropy per action)
542 let entropy = log_std_row
543 .add_scalar(0.5 * (2.0 * std::f32::consts::PI * std::f32::consts::E).ln())
544 .sum_dims(&[1], true)
545 .mean()
546 .mul_scalar(ent_coef);
547
548 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&entropy);
549 loss.backward(None);
550
551 // Step actor
552 {
553 let params = actor.parameters();
554 let mut with_grads: Vec<&mut Tensor> = Vec::new();
555 for p in params {
556 if p.grad_owned().is_some() {
557 with_grads.push(p);
558 }
559 }
560 if !with_grads.is_empty() {
561 let _ = grad_global_norm(&mut with_grads);
562 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
563 actor_opt.step(&mut with_grads);
564 actor_opt.zero_grad(&mut with_grads);
565 }
566 }
567
568 // Step critic
569 {
570 let params = critic.parameters();
571 let mut with_grads: Vec<&mut Tensor> = Vec::new();
572 for p in params {
573 if p.grad_owned().is_some() {
574 with_grads.push(p);
575 }
576 }
577 if !with_grads.is_empty() {
578 let _ = grad_global_norm(&mut with_grads);
579 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
580 critic_opt.step(&mut with_grads);
581 critic_opt.zero_grad(&mut with_grads);
582 }
583 }
584
585 // Occasionally log
586 if e == 0 && mb == 0 {
587 println!(
588 "update@t={} | actor_loss={:.4} v_loss={:.4}",
589 t,
590 actor_loss.value(),
591 v_loss.value()
592 );
593 }
594
595 clear_all_graphs_known();
596 }
597 }
598 }
599
600 println!("=== PPO training finished ===");
601 Ok(())
602}
Sourcepub fn allocation_owner(&self) -> Option<&Arc<Allocation>>
pub fn allocation_owner(&self) -> Option<&Arc<Allocation>>
Get the allocation owner for this tensor
Returns the shared allocation owner if this tensor is a view, or None if this tensor owns its memory directly.
§Returns
Optional reference to the allocation owner
§Implementation Details
This method is used internally to manage memory lifecycle for tensor views. It helps determine whether a tensor shares memory with another tensor.
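§Example
A minimal sketch (assuming a reshape created with view shares its allocation with the source tensor, as described above):
use train_station::Tensor;
let base = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![4]).unwrap();
let reshaped = base.view(vec![2, 2]);
// The owning tensor reports None; the view reports the shared allocation owner
assert!(base.allocation_owner().is_none());
assert!(reshaped.allocation_owner().is_some());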
Sourcepub fn new_uninitialized(shape_dims: Vec<usize>) -> Self
pub fn new_uninitialized(shape_dims: Vec<usize>) -> Self
Create a new tensor with uninitialized memory
This method allocates memory for a tensor without initializing it to any value. This is useful for performance-critical operations where the memory will be immediately overwritten, such as matrix multiplication results.
§Safety
The caller must ensure that all memory is written before reading from the tensor. Reading from uninitialized memory is undefined behavior.
§Arguments
shape_dims - The dimensions of the tensor
§Returns
A tensor with uninitialized memory
§Performance
- Zero Initialization Cost: Skips memory initialization for maximum performance
- SIMD Ready: Properly aligned for vectorized operations
- Memory Efficient: Uses optimized alignment strategies
§Example
use train_station::Tensor;
// Create uninitialized tensor for matmul result
let mut result = Tensor::new_uninitialized(vec![100, 100]);
// Initialize the memory before use
for value in result.data_mut() {
*value = 0.0;
}
Sourcepub fn new_uninitialized_aligned(
shape_dims: Vec<usize>,
alignment_bytes: usize,
) -> Self
pub fn new_uninitialized_aligned( shape_dims: Vec<usize>, alignment_bytes: usize, ) -> Self
Create a new uninitialized tensor with an explicit alignment request (in bytes)
This is intended for internal high-performance paths (e.g., packed GEMM panels) where stronger alignment such as 64 bytes is desired even on AVX2 systems.
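§Example
A minimal sketch mirroring the new_uninitialized example above; the 64-byte alignment request and shape are illustrative:
use train_station::Tensor;
// Request 64-byte (cache-line) alignment for a packed panel buffer
let mut panel = Tensor::new_uninitialized_aligned(vec![64, 64], 64);
// The memory is uninitialized: write every element before reading it
for value in panel.data_mut() {
    *value = 0.0;
}
assert_eq!(panel.size(), 4096);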
Source§impl Tensor
impl Tensor
Sourcepub fn gather(
&self,
dim: usize,
indices: &[usize],
index_shape: &[usize],
) -> Tensor
pub fn gather( &self, dim: usize, indices: &[usize], index_shape: &[usize], ) -> Tensor
Gather values along a dimension using a tensor of indices
This operation extracts elements from the input tensor based on indices provided along a specified dimension. The output tensor has the same shape as the index tensor, with each element taken from the input tensor at the corresponding position with the index value substituted for the specified dimension.
The gather operation is commonly used in machine learning for operations like embedding lookups, attention mechanisms, and advanced indexing patterns.
§Arguments
dim - The dimension along which to gather values (must be < tensor rank)
indices - Flattened indices buffer containing the positions to gather from
index_shape - Shape of the indices tensor and output tensor
§Returns
A new tensor with shape index_shape containing the gathered values
§Examples
§Basic Gather Operation
use train_station::Tensor;
// Create a 2x3 tensor: [[0.0, 0.1, 0.2], [0.3, 0.4, 0.5]]
let tensor = Tensor::from_slice(&[0.0, 0.1, 0.2, 0.3, 0.4, 0.5], vec![2, 3]).unwrap();
// Gather along dimension 1 (columns) with indices [2, 0, 1, 1]
let indices = [2, 0, 1, 1];
let index_shape = [2, 2];
let result = tensor.gather(1, &indices, &index_shape);
// Result shape is [2, 2]
assert_eq!(result.shape().dims(), vec![2, 2]);
// Row 0: indices [2, 0] -> [0.2, 0.0]
assert!((result.get(&[0, 0]) - 0.2).abs() < 1e-6);
assert!((result.get(&[0, 1]) - 0.0).abs() < 1e-6);
// Row 1: indices [1, 1] -> [0.4, 0.4]
assert!((result.get(&[1, 0]) - 0.4).abs() < 1e-6);
assert!((result.get(&[1, 1]) - 0.4).abs() < 1e-6);
§Gather with Gradient Tracking
use train_station::Tensor;
let tensor = Tensor::from_slice(&[0.0, 0.1, 0.2, 0.3, 0.4, 0.5], vec![2, 3]).unwrap()
.with_requires_grad();
let indices = [1, 1, 0, 2];
let index_shape = [2, 2];
let mut result = tensor.gather(1, &indices, &index_shape);
// Compute gradients
result.backward(None);
let grad = tensor.grad_owned().expect("gradient missing");
// Verify gradient accumulation for repeated indices
assert!((grad.get(&[0, 1]) - 2.0).abs() < 1e-6); // Index 1 used twice in row 0
§Performance Characteristics
- Time Complexity: O(n) where n is the number of elements in the output
- Memory Usage: Creates a new tensor with the same size as the index tensor
- Optimization: Uses precomputed strides for efficient memory access
- GradTrack Overhead: Minimal overhead when gradient tracking is enabled
§Implementation Details
The gather operation works by:
- Validating input dimensions and index bounds
- Creating an output tensor with the specified index shape
- Iterating through all positions in the output tensor
- Computing source offsets using the input tensor’s strides
- Copying values from the input tensor to the output tensor
- Registering the operation for gradient computation if needed
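The steps above reduce to a plain-Rust reference loop for the 2-D case along dim = 1; this sketch is illustrative only and is not part of the Tensor API:
/// Reference semantics of gather along dim = 1 for a row-major [rows, cols] buffer.
fn gather_dim1_reference(
    input: &[f32],
    cols: usize,
    indices: &[usize],
    out_rows: usize,
    out_cols: usize,
) -> Vec<f32> {
    let mut out = vec![0.0; out_rows * out_cols];
    for r in 0..out_rows {
        for c in 0..out_cols {
            // Substitute the index value for the gathered dimension
            let src_col = indices[r * out_cols + c];
            out[r * out_cols + c] = input[r * cols + src_col];
        }
    }
    out
}
// For the 2x3 example above with indices [2, 0, 1, 1] and index_shape [2, 2]:
// row 0 -> [0.2, 0.0], row 1 -> [0.4, 0.4]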
§Safety
This function performs bounds checking to ensure:
- The specified dimension is within the tensor’s rank
- All indices are within bounds for the specified dimension
- The index shape is compatible with the input tensor shape
- The indices buffer length matches the product of index shape dimensions
§Panics
This function will panic if:
- dim is greater than or equal to the tensor’s rank
- Any index in indices is out of bounds for the specified dimension
- The index_shape rank doesn’t match the input tensor’s rank
- The index_shape dimensions don’t match the input tensor (except along dim)
- The indices length doesn’t equal the product of index_shape dimensions
Examples found in repository
44fn cross_entropy_logits(
45 logits: &Tensor,
46 labels: &[usize],
47 batch: usize,
48 _num_classes: usize,
49) -> Tensor {
50 // log_softmax = logits - logsumexp(logits, dim=1)
51 let max_logits = logits.max_dims(&[1], true);
52 let shifted = logits.sub_tensor(&max_logits);
53 let exp = shifted.exp();
54 let sum_exp = exp.sum_dims(&[1], true);
55 let log_sum_exp = sum_exp.log();
56 let log_softmax = shifted.sub_tensor(&log_sum_exp);
57 let ll = log_softmax.gather(1, labels, &[batch, 1]); // selected log-probs
58 ll.mul_scalar(-1.0).mean()
59}
More examples
273fn log_prob_actions(
274 logits: &Tensor,
275 actions: &[usize],
276 batch: usize,
277 _action_dim: usize,
278) -> Tensor {
279 let max_logits = logits.max_dims(&[1], true); // [B,1]
280 let shifted = logits.sub_tensor(&max_logits);
281 let exp = shifted.exp();
282 let sum_exp = exp.sum_dims(&[1], true); // [B,1]
283 let log_sum_exp = sum_exp.log(); // [B,1]
284 let log_softmax = shifted.sub_tensor(&log_sum_exp); // [B,A]
285 // gather selected action log-probs
286 log_softmax.gather(1, actions, &[batch, 1])
287}
333pub fn main() -> Result<(), Box<dyn std::error::Error>> {
334 println!("=== DQN Example (YardEnv discrete) ===");
335
336 // Dims
337 let state_dim = 3usize;
338 let action_dim = 3usize;
339
340 // Hparams
341 let gamma = 0.99f32;
342 let batch_size = 64usize;
343 let start_steps = 200usize;
344 let target_update_interval = 200usize; // hard update cadence
345 let max_grad_norm = 1.0f32;
346 let mut epsilon = 1.0f32;
347 let eps_min = 0.05f32;
348 let eps_decay_steps = 2_000usize; // linear decay
349 let total_steps = std::env::var("DQN_STEPS")
350 .ok()
351 .and_then(|v| v.parse::<usize>().ok())
352 .unwrap_or(3000usize);
353
354 // Models
355 let mut q_net = QNet::new(state_dim, action_dim, Some(7));
356 let mut q_targ = QNet::new(state_dim, action_dim, Some(8));
357 q_targ.net.copy_from(&q_net.net);
358 q_targ.set_requires_grad_all(false);
359
360 // Optimizer
361 let mut q_opt = Adam::with_learning_rate(3e-4);
362 for p in q_net.parameters() {
363 q_opt.add_parameter(p);
364 }
365
366 // Replay + env
367 let mut rb = ReplayBuffer::new(100_000, state_dim);
368 let mut env = YardEnv::new(12345);
369 let mut rng = SmallRng::new(999_111);
370
371 // Metrics
372 let mut state = env.reset();
373 let mut episode_return = 0.0f32;
374 let mut episode = 0usize;
375 let mut ema_return: Option<f32> = None;
376 let ema_alpha = 0.05f32;
377 let mut best_return = f32::NEG_INFINITY;
378
379 for t in 0..total_steps {
380 // Epsilon-greedy action
381 let action_index = if t < start_steps || rng.next_f32() < epsilon {
382 rng.sample_index(action_dim)
383 } else {
384 let _ng = NoGradTrack::new();
385 let q_vals = q_net.forward(&state);
386 let row = q_vals.data();
387 let mut best_i = 0usize;
388 let mut best_v = row[0];
389 for (i, &r) in row.iter().enumerate().take(action_dim).skip(1) {
390 if r > best_v {
391 best_v = r;
392 best_i = i;
393 }
394 }
395 best_i
396 };
397
398 // Env step
399 let (next_state, reward, done) = env.step(action_index);
400 episode_return += reward;
401
402 // Store
403 let s_slice = state.data().to_vec();
404 let s2_slice = next_state.data().to_vec();
405 rb.push(
406 &s_slice,
407 action_index,
408 reward,
409 if done { 1.0 } else { 0.0 },
410 &s2_slice,
411 );
412
413 // Reset on done
414 state = if done {
415 let st = env.reset();
416 ema_return = Some(match ema_return {
417 None => episode_return,
418 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
419 });
420 if episode_return > best_return {
421 best_return = episode_return;
422 }
423 println!(
424 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3} | rb_size={}",
425 t,
426 episode,
427 episode_return,
428 ema_return.unwrap_or(episode_return),
429 best_return,
430 rb.size
431 );
432 episode_return = 0.0;
433 episode += 1;
434 st
435 } else {
436 next_state
437 };
438
439 // Epsilon linear decay
440 if t < eps_decay_steps {
441 epsilon = (1.0 - (t as f32) / (eps_decay_steps as f32)) * (1.0 - eps_min) + eps_min;
442 }
443
444 // Train
445 if rb.can_sample(batch_size) {
446 let (s, a_idx, r, d, s2) = rb.sample(batch_size, &mut rng);
447
448 // Double DQN target: a* = argmax_a Q_online(s2,a); y = r + (1-d)*gamma*Q_target(s2, a*)
449 let target_q = {
450 let _ng = NoGradTrack::new();
451 let q_online_s2 = q_net.forward(&s2);
452 // argmax per row (manual on CPU)
453 let row_stride = action_dim;
454 let qd = q_online_s2.data();
455 let mut next_actions: Vec<usize> = Vec::with_capacity(batch_size);
456 for i in 0..batch_size {
457 let base = i * row_stride;
458 let mut bi = 0usize;
459 let mut bv = qd[base];
460 for j in 1..action_dim {
461 let v = qd[base + j];
462 if v > bv {
463 bv = v;
464 bi = j;
465 }
466 }
467 next_actions.push(bi);
468 }
469 let q_targ_s2 = q_targ.forward(&s2);
470 let q_targ_g = q_targ_s2.gather(1, &next_actions, &[batch_size, 1]);
471 let not_done = Tensor::ones(vec![batch_size, 1]).sub_tensor(&d);
472 r.add_tensor(¬_done.mul_scalar(gamma).mul_tensor(&q_targ_g))
473 };
474
475 // Q(s,a) for current actions
476 // Zero grads first
477 {
478 let mut params = q_net.parameters();
479 q_opt.zero_grad(&mut params);
480 }
481
482 let q_all = q_net.forward(&s);
483 let q_sa = q_all.gather(1, &a_idx, &[batch_size, 1]);
484 let diff = q_sa.sub_tensor(&target_q);
485 let mut loss = pseudo_huber_mean(&diff);
486 loss.backward(None);
487
488 // Step (filter only params with grads)
489 {
490 let params = q_net.parameters();
491 let mut with_grads: Vec<&mut Tensor> = Vec::new();
492 for p in params {
493 if p.grad_owned().is_some() {
494 with_grads.push(p);
495 }
496 }
497 if !with_grads.is_empty() {
498 let gn = grad_global_norm(&mut with_grads);
499 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
500 q_opt.step(&mut with_grads);
501 q_opt.zero_grad(&mut with_grads);
502 if t % 100 == 0 {
503 let mut pn = q_net.parameters();
504 let pn_l2 = params_l2_norm(&mut pn);
505 let q_mean = q_all.mean().value();
506 println!(
507 "t={:5} | loss={:.4} | q_mean={:.3} | grad_norm={:.3} | param_norm={:.3} | eps={:.3}",
508 t, loss.value(), q_mean, gn, pn_l2, epsilon
509 );
510 }
511 }
512 }
513
514 // Target hard update
515 if t % target_update_interval == 0 {
516 q_targ.net.copy_from(&q_net.net);
517 }
518
519 // Clear graphs
520 clear_all_graphs_known();
521 }
522 }
523
524 println!("=== DQN training finished ===");
525 Ok(())
526}
Source§impl Tensor
impl Tensor
Sourcepub fn index_select(&self, dim: usize, indices: &[usize]) -> Tensor
pub fn index_select(&self, dim: usize, indices: &[usize]) -> Tensor
Select elements along a dimension using a list of indices
This operation extracts elements from the input tensor along a specified dimension using the provided indices. The output tensor has the same shape as the input except along the specified dimension, where the size becomes the length of the indices array.
The index_select operation is commonly used for extracting specific rows, columns, or slices from tensors, and is particularly useful in machine learning for operations like embedding lookups and attention mechanisms.
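For example, a minimal embedding-lookup sketch (the table values and token ids below are illustrative):
use train_station::Tensor;
// Toy embedding table: 4 tokens, 3-dimensional embeddings
let table = Tensor::from_slice(
    &[
        0.0, 0.1, 0.2, // token 0
        1.0, 1.1, 1.2, // token 1
        2.0, 2.1, 2.2, // token 2
        3.0, 3.1, 3.2, // token 3
    ],
    vec![4, 3],
).unwrap();
// Select the rows for token ids [2, 0, 3]
let looked_up = table.index_select(0, &[2, 0, 3]);
assert_eq!(looked_up.shape().dims(), vec![3, 3]);
assert_eq!(looked_up.get(&[0, 0]), 2.0); // first selected row is token 2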
§Arguments
dim - The dimension along which to select elements (must be < tensor rank)
indices - Array of indices specifying which elements to select along dim
§Returns
A new tensor with the same shape as the input except along dim, where the
size is indices.len()
§Examples
§Basic Index Selection
use train_station::Tensor;
// Create a 2x3 tensor: [[0.0, 1.0, 2.0], [3.0, 4.0, 5.0]]
let tensor = Tensor::from_slice(&[0.0, 1.0, 2.0, 3.0, 4.0, 5.0], vec![2, 3]).unwrap();
// Select columns 2 and 0 from dimension 1
let result = tensor.index_select(1, &[2, 0]);
// Result shape is [2, 2] (same as input except dim 1 is now 2)
assert_eq!(result.shape().dims(), vec![2, 2]);
// Row 0: selected columns [2, 0] -> [2.0, 0.0]
assert_eq!(result.get(&[0, 0]), 2.0);
assert_eq!(result.get(&[0, 1]), 0.0);
// Row 1: selected columns [2, 0] -> [5.0, 3.0]
assert_eq!(result.get(&[1, 0]), 5.0);
assert_eq!(result.get(&[1, 1]), 3.0);
§Index Selection with Gradient Tracking
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap()
.with_requires_grad();
// Select specific elements with gradient tracking enabled
let mut result = tensor.index_select(1, &[1, 2]);
result.backward(None);
// Verify gradients are computed correctly
let grad = tensor.grad_owned().expect("gradient missing");
assert_eq!(grad.shape().dims(), vec![2, 3]);
§Selecting Rows from a Matrix
use train_station::Tensor;
// Create a 3x2 matrix
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![3, 2]).unwrap();
// Select rows 2 and 0 (dimension 0)
let result = tensor.index_select(0, &[2, 0]);
// Result shape is [2, 2]
assert_eq!(result.shape().dims(), vec![2, 2]);
// Selected rows: row 2 [5.0, 6.0], row 0 [1.0, 2.0]
assert_eq!(result.get(&[0, 0]), 5.0); // First row of result (was row 2)
assert_eq!(result.get(&[0, 1]), 6.0);
assert_eq!(result.get(&[1, 0]), 1.0); // Second row of result (was row 0)
assert_eq!(result.get(&[1, 1]), 2.0);
§Performance Characteristics
- Time Complexity: O(n) where n is the number of elements in the output tensor
- Memory Usage: Creates a new tensor with size equal to the output shape
- Optimization: Uses precomputed strides for efficient memory access
- GradTrack Overhead: Minimal overhead when gradient tracking is enabled
- Memory Layout: Output tensor is always contiguous for optimal performance
§Implementation Details
The index_select operation works by:
- Validating the dimension and index bounds
- Computing the output shape (same as input except along dim)
- Creating a new contiguous output tensor
- Iterating through all positions in the output tensor using nested loops:
  - Outer loop: iterate over dimensions before dim
  - Middle loop: iterate over the selected indices
  - Inner loop: iterate over dimensions after dim
- Computing source offsets using the input tensor’s strides
- Copying values from input to output tensor
- Registering the operation for gradient computation if needed
§Safety
This function performs comprehensive bounds checking to ensure:
- The specified dimension is within the tensor’s rank
- All indices are within bounds for the specified dimension
- Memory access is safe through proper offset calculations
§Panics
This function will panic if:
- dim is greater than or equal to the tensor’s rank
- Any index in indices is out of bounds for the specified dimension
§Thread Safety
This function is thread-safe and can be called concurrently on different tensors. The operation does not modify the input tensor and creates a new output tensor.
Source§impl Tensor
impl Tensor
Sourcepub fn masked_fill(&self, mask: &[bool], value: f32) -> Tensor
pub fn masked_fill(&self, mask: &[bool], value: f32) -> Tensor
Fill masked elements with a specified value
This operation returns a copy of the input tensor where elements are replaced by the specified value wherever the corresponding boolean mask is true. Elements where the mask is false retain their original values from the input tensor.
The masked_fill operation is commonly used in machine learning for operations like masking attention weights, zeroing out specific elements, and implementing dropout-like functionality.
§Arguments
mask - Boolean array with the same length as the number of tensor elements
value - The value to fill masked positions with
§Returns
A new tensor with the same shape as the input, where masked elements are
replaced by value and unmasked elements retain their original values
§Examples
§Basic Masked Fill
use train_station::Tensor;
// Create a 2x3 tensor: [[0.0, 1.0, 2.0], [3.0, 4.0, 5.0]]
let tensor = Tensor::from_slice(&[0.0, 1.0, 2.0, 3.0, 4.0, 5.0], vec![2, 3]).unwrap();
// Create a mask: [false, true, false, true, false, true]
let mask = [false, true, false, true, false, true];
let result = tensor.masked_fill(&mask, -1.0);
// Result: [[0.0, -1.0, 2.0], [-1.0, 4.0, -1.0]]
assert_eq!(result.shape().dims(), vec![2, 3]);
assert_eq!(result.get(&[0, 0]), 0.0); // Unmasked
assert_eq!(result.get(&[0, 1]), -1.0); // Masked
assert_eq!(result.get(&[0, 2]), 2.0); // Unmasked
assert_eq!(result.get(&[1, 0]), -1.0); // Masked
assert_eq!(result.get(&[1, 1]), 4.0); // Unmasked
assert_eq!(result.get(&[1, 2]), -1.0); // Masked
§Masked Fill with Gradient Tracking
use train_station::Tensor;
let tensor = Tensor::from_slice(&[0.0, 0.1, 0.2, 0.3, 0.4, 0.5], vec![2, 3]).unwrap()
.with_requires_grad();
// Create a mask with some true values
let mask = [false, true, false, true, false, false];
let mut result = tensor.masked_fill(&mask, 5.0);
// Compute gradients
result.backward(None);
let grad = tensor.grad_owned().expect("gradient missing");
// Gradients should be zero where mask is true, 1 elsewhere
assert_eq!(grad.shape().dims(), vec![2, 3]);
assert!((grad.get(&[0, 0]) - 1.0).abs() < 1e-6); // Unmasked: gradient flows
assert!((grad.get(&[0, 1]) - 0.0).abs() < 1e-6); // Masked: no gradient
assert!((grad.get(&[0, 2]) - 1.0).abs() < 1e-6); // Unmasked: gradient flows
§Zeroing Out Specific Elements
use train_station::Tensor;
// Create a tensor with some values
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
// Create a mask to zero out every other element
let mask = [true, false, true, false, true, false];
let result = tensor.masked_fill(&mask, 0.0);
// Result: [[0.0, 2.0, 0.0], [4.0, 0.0, 6.0]]
assert_eq!(result.get(&[0, 0]), 0.0); // Zeroed
assert_eq!(result.get(&[0, 1]), 2.0); // Kept
assert_eq!(result.get(&[0, 2]), 0.0); // Zeroed
assert_eq!(result.get(&[1, 0]), 4.0); // Kept
assert_eq!(result.get(&[1, 1]), 0.0); // Zeroed
assert_eq!(result.get(&[1, 2]), 6.0); // Kept
§Performance Characteristics
- Time Complexity: O(n) where n is the number of elements in the tensor
- Memory Usage: Creates a new tensor with the same size as the input
- Optimization: Uses efficient stride-based iteration for non-contiguous tensors
- GradTrack Overhead: Minimal overhead when gradient tracking is enabled
- Memory Layout: Output tensor is always contiguous for optimal performance
§Implementation Details
The masked_fill operation works by:
- Validating that the mask length equals the number of tensor elements
- Creating a new contiguous output tensor with the same shape
- Iterating through all elements in logical order
- For each element, checking the corresponding mask value:
- If mask is true: use the fill value
- If mask is false: copy the original value from input tensor
- Computing source offsets using the input tensor’s shape for non-contiguous tensors
- Registering the operation for gradient computation if needed
§Safety
This function performs bounds checking to ensure:
- The mask length equals the number of tensor elements
- Memory access is safe through proper offset calculations
- The operation handles both contiguous and non-contiguous tensors correctly
§Panics
This function will panic if:
- The mask length does not equal the number of tensor elements
§Thread Safety
This function is thread-safe and can be called concurrently on different tensors. The operation does not modify the input tensor and creates a new output tensor.
§GradTrack Behavior
When gradient tracking is enabled:
- Gradients do not flow through masked positions (they are zeroed)
- Gradients flow normally through unmasked positions
- This behavior is useful for implementing operations like dropout
Examples found in repository
72 pub fn forward(
73 &self,
74 query: &Tensor,
75 key: &Tensor,
76 value: &Tensor,
77 attn_mask: Option<&Tensor>,
78 ) -> Tensor {
79 let qkv = Self::project_qkv(query, key, value, &self.q_proj, &self.k_proj, &self.v_proj);
80 let (q, k, v) = qkv;
81
82 // Split heads: [b, t, e] -> [b, h, t, d]
83 let (b, tq, _e) = Self::triple(query);
84 let (_b2, tk, _e2) = Self::triple(key);
85 let q = Self::split_heads(&q, b, tq, self.num_heads, self.head_dim);
86 let k = Self::split_heads(&k, b, tk, self.num_heads, self.head_dim);
87 let v = Self::split_heads(&v, b, tk, self.num_heads, self.head_dim);
88
89 // Scaled dot-product attention
90 // logits: [b, h, tq, tk]
91 let k_t = k.transpose(2, 3);
92 let mut logits = q.matmul(&k_t).div_scalar((self.head_dim as f32).sqrt());
93 if let Some(mask) = attn_mask {
94 let dims = mask.shape().dims().to_vec();
95 // If boolean-like mask matching [b,h,tq,tk], apply masked_fill
96 if dims.len() == 4 && dims[0] == b && dims[1] == self.num_heads && dims[2] == tq {
97 // Interpret mask > 0.5 as keep; we invert to build masked positions
98 let cond: Vec<bool> = mask.data().iter().map(|&v| v < 0.5).collect();
99 // Apply masked fill on a flattened view, then reshape back
100 let flat_logits = logits.view(vec![(b * self.num_heads * tq * tk) as i32]);
101 let filled = flat_logits.masked_fill(&cond, f32::NEG_INFINITY);
102 logits = filled.view(vec![b as i32, self.num_heads as i32, tq as i32, tk as i32]);
103 } else {
104 // Fallback: additive mask
105 logits = logits.add_tensor(mask);
106 }
107 }
108 let attn = logits.softmax(3);
109
110 // context: [b, h, tq, d]
111 let context = attn.matmul(&v);
112 let context = context.permute(vec![0, 2, 1, 3]); // [b, tq, h, d]
113 let context = context.contiguous().view(vec![
114 b as i32,
115 tq as i32,
116 (self.num_heads * self.head_dim) as i32,
117 ]);
118
119 // Output projection (flatten to 2D, project, then restore 3D)
120 let flat = context.view(vec![(b * tq) as i32, self.embed_dim as i32]);
121 let out2d = self.out_proj.forward(&flat);
122 out2d.view(vec![b as i32, tq as i32, self.embed_dim as i32])
123}
Source§impl Tensor
impl Tensor
Sourcepub fn select(&self, dim: usize, index: usize) -> Tensor
pub fn select(&self, dim: usize, index: usize) -> Tensor
Select a slice along a given dimension at a specific index
This operation extracts a slice from the input tensor by fixing a specific dimension at a given index. The result is a tensor with one fewer dimension than the input, containing the selected slice.
The select operation returns a view (zero-copy) when the base offset is zero, otherwise it creates a contiguous copy to ensure correctness. This operation is commonly used for extracting specific rows, columns, or slices from tensors.
§Arguments
dim - The dimension along which to select (must be < tensor rank)
index - The index within the specified dimension to select (must be < dim size)
§Returns
A tensor with the selected slice. The result has the same shape as the input except with the specified dimension removed.
§Performance
Returns a view when possible (base offset is zero) to avoid copying. On non-zero offsets, falls back to a contiguous copy for correctness. Gradients propagate back to the selected slice when GradTrack is enabled.
§Examples
§Basic Row Selection
use train_station::Tensor;
// Create a 2x3 tensor: [[0.0, 1.0, 2.0], [3.0, 4.0, 5.0]]
let tensor = Tensor::from_slice(&[0.0, 1.0, 2.0, 3.0, 4.0, 5.0], vec![2, 3]).unwrap();
// Select row 1 (dimension 0, index 1)
let result = tensor.select(0, 1);
// Result shape is [3] (dimension 0 removed)
assert_eq!(result.shape().dims(), vec![3]);
assert_eq!(result.get(&[0]), 3.0); // First element of row 1
assert_eq!(result.get(&[1]), 4.0); // Second element of row 1
assert_eq!(result.get(&[2]), 5.0); // Third element of row 1
§Column Selection
use train_station::Tensor;
// Create a 2x3 tensor: [[0.0, 1.0, 2.0], [3.0, 4.0, 5.0]]
let tensor = Tensor::from_slice(&[0.0, 1.0, 2.0, 3.0, 4.0, 5.0], vec![2, 3]).unwrap();
// Select column 1 (dimension 1, index 1)
let result = tensor.select(1, 1);
// Result shape is [2] (dimension 1 removed)
assert_eq!(result.shape().dims(), vec![2]);
assert_eq!(result.get(&[0]), 1.0); // Column 1, row 0
assert_eq!(result.get(&[1]), 4.0); // Column 1, row 1
§Select with Gradient Tracking
use train_station::Tensor;
let tensor = Tensor::from_slice(&[0.0, 1.0, 2.0, 3.0], vec![2, 2]).unwrap()
.with_requires_grad();
// Select row 1 with gradient tracking enabled
let mut result = tensor.select(0, 1);
result.backward(None);
// Verify gradients are computed correctly
let grad = tensor.grad_owned().expect("gradient missing");
assert_eq!(grad.shape().dims(), vec![2, 2]);
// Only row 1 receives gradients
assert_eq!(grad.get(&[0, 0]), 0.0); // Row 0: no gradient
assert_eq!(grad.get(&[0, 1]), 0.0); // Row 0: no gradient
assert_eq!(grad.get(&[1, 0]), 1.0); // Row 1: gradient flows
assert_eq!(grad.get(&[1, 1]), 1.0); // Row 1: gradient flows
§Performance Characteristics
- Time Complexity: O(n) where n is the number of elements in the selected slice
- Memory Usage: Zero-copy view when base offset is zero, otherwise creates a copy
- Optimization: Uses efficient stride-based access for non-contiguous tensors
- GradTrack Overhead: Minimal overhead when gradient tracking is enabled
- Memory Layout: Result is contiguous when a copy is made, view otherwise
§Implementation Details
The select operation works by:
- Validating the dimension and index bounds
- Computing the new shape by removing the selected dimension
- Computing the new strides by removing the selected dimension’s stride
- Calculating the base offset for the selected slice
- If base offset is zero: creating a view with adjusted shape/strides
- If base offset is non-zero: creating a contiguous copy of the slice
- Registering the operation for gradient computation if needed
§Safety
This function performs comprehensive bounds checking to ensure:
- The tensor has non-zero rank
- The specified dimension is within the tensor’s rank
- The index is within bounds for the specified dimension
- Memory access is safe through proper offset calculations
§Panics
This function will panic if:
- The tensor has zero rank
- dim is greater than or equal to the tensor’s rank
- index is greater than or equal to the size of the specified dimension
§Thread Safety
This function is thread-safe and can be called concurrently on different tensors. The operation does not modify the input tensor and creates either a view or a new tensor.
§View vs Copy Behavior
- View (zero-copy): When the base offset is zero, returns a view that shares the same memory as the input tensor with adjusted shape and strides
- Copy: When the base offset is non-zero, creates a contiguous copy to ensure correctness across all operations
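Assuming the rule above, a minimal illustration using only the public API (the numerical results are identical either way; only the memory behavior differs):
use train_station::Tensor;
let t = Tensor::from_slice(&[0.0, 1.0, 2.0, 3.0, 4.0, 5.0], vec![2, 3]).unwrap();
// Index 0 along dimension 0 starts at memory offset 0, so this can be a zero-copy view
let first_row = t.select(0, 0);
// Index 1 along dimension 0 starts at offset 3, so a contiguous copy is made
let second_row = t.select(0, 1);
assert_eq!(first_row.get(&[0]), 0.0);
assert_eq!(second_row.get(&[0]), 3.0);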
§GradTrack Behavior
When gradient tracking is enabled:
- Gradients are scattered back to the selected slice in the input tensor
- Other positions in the input tensor receive zero gradients
- This behavior ensures correct gradient flow for the selected elements
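Conceptually, the backward pass scatters the incoming gradient into a zero-filled buffer of the input’s shape. The following is an illustrative, self-contained sketch of that mapping for a contiguous row-major input (scatter_select_grad and row_major_strides are hypothetical helpers, not the crate’s internals):
fn row_major_strides(shape: &[usize]) -> Vec<usize> {
    let mut strides = vec![1usize; shape.len()];
    for i in (0..shape.len().saturating_sub(1)).rev() {
        strides[i] = strides[i + 1] * shape[i + 1];
    }
    strides
}
fn scatter_select_grad(grad_out: &[f32], input_shape: &[usize], dim: usize, index: usize) -> Vec<f32> {
    let in_strides = row_major_strides(input_shape);
    let base = index * in_strides[dim];
    // The output layout is the input's with the selected dimension removed
    let mut out_shape = input_shape.to_vec();
    let mut out_strides = in_strides.clone();
    out_shape.remove(dim);
    out_strides.remove(dim);
    let out_decode = row_major_strides(&out_shape);
    let mut grad_in = vec![0.0f32; input_shape.iter().product()];
    for (flat, &g) in grad_out.iter().enumerate() {
        let mut offset = base;
        for k in 0..out_shape.len() {
            let idx_k = (flat / out_decode[k]) % out_shape[k];
            offset += idx_k * out_strides[k];
        }
        grad_in[offset] += g;
    }
    grad_in
}
// Gradient of ones flowing back through select(0, 1) on a [2, 2] input:
let grad_in = scatter_select_grad(&[1.0, 1.0], &[2, 2], 0, 1);
assert_eq!(grad_in, vec![0.0, 0.0, 1.0, 1.0]); // only row 1 receives gradients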
Source§impl Tensor
impl Tensor
Sourcepub fn zeros(shape_dims: Vec<usize>) -> Self
pub fn zeros(shape_dims: Vec<usize>) -> Self
Creates a new tensor filled with zeros
Convenience constructor that creates a tensor and initializes all elements to zero. Uses optimized SIMD operations for efficient zero initialization.
§Arguments
- shape_dims - Vector of dimension sizes defining the tensor shape
§Returns
A new tensor with all elements initialized to zero
§Performance
- Memory Allocation: Single allocation with optimized alignment
- Initialization: SIMD-optimized zero filling for large tensors
- Thread Safe: Atomic ID generation for gradtrack tracking
§Examples
use train_station::Tensor;
let tensor = Tensor::zeros(vec![2, 3]);
assert_eq!(tensor.size(), 6);
assert_eq!(tensor.shape().dims(), vec![2, 3]);
// Verify all elements are zero
assert_eq!(tensor.get(&[0, 0]), 0.0);
assert_eq!(tensor.get(&[1, 2]), 0.0);
Examples found in repository?
53 pub fn new(input_size: usize, output_size: usize, seed: Option<u64>) -> Self {
54 // Xavier/Glorot initialization: scale by sqrt(1/input_size)
55 let scale = (1.0 / input_size as f32).sqrt();
56
57 let weight = Tensor::randn(vec![input_size, output_size], seed)
58 .mul_scalar(scale)
59 .with_requires_grad();
60 let bias = Tensor::zeros(vec![output_size]).with_requires_grad();
61
62 Self {
63 weight,
64 bias,
65 input_size,
66 output_size,
67 }
68 }
More examples
42fn demonstrate_tensor_creation() {
43 println!("--- Tensor Creation ---");
44
45 // Create tensors with different initializations
46 let zeros = Tensor::zeros(vec![2, 3]);
47 println!(
48 "Zeros tensor: shape {:?}, data: {:?}",
49 zeros.shape().dims(),
50 zeros.data()
51 );
52
53 let ones = Tensor::ones(vec![3, 2]);
54 println!(
55 "Ones tensor: shape {:?}, data: {:?}",
56 ones.shape().dims(),
57 ones.data()
58 );
59
60 // Create tensor from slice
61 let data = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0];
62 let from_slice = Tensor::from_slice(&data, vec![2, 3]).unwrap();
63 println!(
64 "From slice: shape {:?}, data: {:?}",
65 from_slice.shape().dims(),
66 from_slice.data()
67 );
68
69 // Create tensor with specific value
70 let mut filled = Tensor::new(vec![2, 2]);
71 {
72 let data = filled.data_mut();
73 for value in data.iter_mut() {
74 *value = 42.0;
75 }
76 }
77 println!("Filled with 42: {:?}", filled.data());
78
79 // Create tensor with random data
80 let random = Tensor::randn(vec![2, 2], Some(42));
81 println!(
82 "Random tensor: shape {:?}, data: {:?}",
83 random.shape().dims(),
84 random.data()
85 );
86}
47fn demonstrate_basic_optimizer_setup() {
48 println!("--- Basic Optimizer Setup ---");
49
50 // Create parameters that require gradients
51 let weight = Tensor::randn(vec![3, 2], Some(42)).with_requires_grad();
52 let bias = Tensor::zeros(vec![2]).with_requires_grad();
53
54 println!("Created parameters:");
55 println!(
56 " Weight: shape {:?}, requires_grad: {}",
57 weight.shape().dims(),
58 weight.requires_grad()
59 );
60 println!(
61 " Bias: shape {:?}, requires_grad: {}",
62 bias.shape().dims(),
63 bias.requires_grad()
64 );
65
66 // Create Adam optimizer with default configuration
67 let mut optimizer = Adam::new();
68 println!(
69 "Created Adam optimizer with learning rate: {}",
70 optimizer.learning_rate()
71 );
72
73 // Add parameters to optimizer
74 optimizer.add_parameter(&weight);
75 optimizer.add_parameter(&bias);
76 println!(
77 "Added {} parameters to optimizer",
78 optimizer.parameter_count()
79 );
80
81 // Create optimizer with custom configuration
82 let config = AdamConfig {
83 learning_rate: 0.01,
84 beta1: 0.9,
85 beta2: 0.999,
86 eps: 1e-8,
87 weight_decay: 0.0,
88 amsgrad: false,
89 };
90
91 let mut custom_optimizer = Adam::with_config(config);
92 custom_optimizer.add_parameter(&weight);
93 custom_optimizer.add_parameter(&bias);
94
95 println!(
96 "Created custom optimizer with learning rate: {}",
97 custom_optimizer.learning_rate()
98 );
99
100 // Demonstrate parameter linking
101 println!("Parameter linking completed successfully");
102}
103
104/// Demonstrate simple linear regression training
105fn demonstrate_linear_regression() -> Result<(), Box<dyn std::error::Error>> {
106 println!("\n--- Linear Regression Training ---");
107
108 // Create model parameters
109 let mut weight = Tensor::randn(vec![1, 1], Some(43)).with_requires_grad();
110 let mut bias = Tensor::zeros(vec![1]).with_requires_grad();
111
112 // Create optimizer
113 let mut optimizer = Adam::with_learning_rate(0.01);
114 optimizer.add_parameter(&weight);
115 optimizer.add_parameter(&bias);
116
117 // Create simple training data: y = 2*x + 1
118 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5, 1]).unwrap();
119 let y_true = Tensor::from_slice(&[3.0, 5.0, 7.0, 9.0, 11.0], vec![5, 1]).unwrap();
120
121 println!("Training data:");
122 println!(" X: {:?}", x_data.data());
123 println!(" Y: {:?}", y_true.data());
124 println!(" Target: y = 2*x + 1");
125
126 // Training loop
127 let num_epochs = 100;
128 let mut losses = Vec::new();
129
130 for epoch in 0..num_epochs {
131 // Forward pass: y_pred = x * weight + bias
132 let y_pred = x_data.matmul(&weight) + &bias;
133
134 // Compute loss: MSE
135 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
136
137 // Backward pass
138 loss.backward(None);
139
140 // Optimizer step
141 optimizer.step(&mut [&mut weight, &mut bias]);
142 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
143
144 losses.push(loss.value());
145
146 // Print progress every 20 epochs
147 if epoch % 20 == 0 || epoch == num_epochs - 1 {
148 println!("Epoch {:3}: Loss = {:.6}", epoch, loss.value());
149 }
150 }
151
152 // Evaluate final model
153 let final_predictions = x_data.matmul(&weight) + &bias;
154 println!("\nFinal model evaluation:");
155 println!(" Learned weight: {:.6}", weight.value());
156 println!(" Learned bias: {:.6}", bias.value());
157 println!(" Predictions vs True:");
158
159 for i in 0..5 {
160 let x1 = x_data.data()[i];
161 let pred = final_predictions.data()[i];
162 let true_val = y_true.data()[i];
163 println!(
164 " x={:.1}: pred={:.3}, true={:.1}, error={:.3}",
165 x1,
166 pred,
167 true_val,
168 (pred - true_val).abs()
169 );
170 }
171
172 Ok(())
173}
174
175/// Demonstrate advanced training patterns
176fn demonstrate_advanced_training() -> Result<(), Box<dyn std::error::Error>> {
177 println!("\n--- Advanced Training Patterns ---");
178
179 // Create a more complex model
180 let mut weight = Tensor::randn(vec![1, 2], Some(44)).with_requires_grad();
181 let mut bias = Tensor::zeros(vec![2]).with_requires_grad();
182
183 // Create optimizer with different learning rate
184 let mut optimizer = Adam::with_learning_rate(0.005);
185 optimizer.add_parameter(&weight);
186 optimizer.add_parameter(&bias);
187
188 // Create training data: y = 2*x + [1, 3]
189 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5, 1]).unwrap();
190 let y_true = Tensor::from_slice(
191 &[3.0, 5.0, 7.0, 9.0, 11.0, 6.0, 8.0, 10.0, 12.0, 14.0],
192 vec![5, 2],
193 )
194 .unwrap();
195
196 println!("Advanced training with monitoring:");
197 println!(" Initial learning rate: {}", optimizer.learning_rate());
198
199 // Training loop with monitoring
200 let num_epochs = 50;
201 let mut losses = Vec::new();
202 let mut weight_norms = Vec::new();
203 let mut gradient_norms = Vec::new();
204
205 for epoch in 0..num_epochs {
206 // Forward pass
207 let y_pred = x_data.matmul(&weight) + &bias;
208 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
209
210 // Backward pass
211 loss.backward(None);
212
213 // Compute gradient norm before optimizer step
214 let gradient_norm = weight.grad_owned().unwrap().norm();
215
216 // Optimizer step
217 optimizer.step(&mut [&mut weight, &mut bias]);
218 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
219
220 // Learning rate scheduling: reduce every 10 epochs
221 if epoch > 0 && epoch % 10 == 0 {
222 let current_lr = optimizer.learning_rate();
223 let new_lr = current_lr * 0.5;
224 optimizer.set_learning_rate(new_lr);
225 println!(
226 "Epoch {:2}: Reduced learning rate from {:.3} to {:.3}",
227 epoch, current_lr, new_lr
228 );
229 }
230
231 // Record metrics
232 losses.push(loss.value());
233 weight_norms.push(weight.norm().value());
234 gradient_norms.push(gradient_norm.value());
235
236 // Print detailed progress
237 if epoch % 10 == 0 || epoch == num_epochs - 1 {
238 println!(
239 "Epoch {:2}: Loss = {:.6}, Weight Norm = {:.6}, Gradient Norm = {:.6}",
240 epoch,
241 loss.value(),
242 weight.norm().value(),
243 gradient_norm.value()
244 );
245 }
246 }
247
248 println!("Final learning rate: {}", optimizer.learning_rate());
249
250 // Analyze training progression
251 let initial_loss = losses[0];
252 let final_loss = losses[losses.len() - 1];
253 let loss_reduction = (initial_loss - final_loss) / initial_loss * 100.0;
254
255 println!("\nTraining Analysis:");
256 println!(" Initial loss: {:.6}", initial_loss);
257 println!(" Final loss: {:.6}", final_loss);
258 println!(" Loss reduction: {:.1}%", loss_reduction);
259 println!(" Final weight norm: {:.6}", weight.norm().value());
260 println!(" Final bias: {:?}", bias.data());
261
262 Ok(())
263}
264
265/// Demonstrate learning rate scheduling
266fn demonstrate_learning_rate_scheduling() -> Result<(), Box<dyn std::error::Error>> {
267 println!("\n--- Learning Rate Scheduling ---");
268
269 // Create simple model
270 let mut weight = Tensor::randn(vec![1, 1], Some(45)).with_requires_grad();
271 let mut bias = Tensor::zeros(vec![1]).with_requires_grad();
272
273 // Create optimizer with high initial learning rate
274 let mut optimizer = Adam::with_learning_rate(0.1);
275 optimizer.add_parameter(&weight);
276 optimizer.add_parameter(&bias);
277
278 // Simple data
279 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3, 1]).unwrap();
280 let y_true = Tensor::from_slice(&[2.0, 4.0, 6.0], vec![3, 1]).unwrap();
281
282 println!("Initial learning rate: {}", optimizer.learning_rate());
283
284 // Training loop with learning rate scheduling
285 let num_epochs = 50;
286 let mut losses = Vec::new();
287
288 for epoch in 0..num_epochs {
289 // Forward pass
290 let y_pred = x_data.matmul(&weight) + &bias;
291 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
292
293 // Backward pass
294 loss.backward(None);
295
296 // Optimizer step
297 optimizer.step(&mut [&mut weight, &mut bias]);
298 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
299
300 // Learning rate scheduling: reduce every 10 epochs
301 if epoch > 0 && epoch % 10 == 0 {
302 let current_lr = optimizer.learning_rate();
303 let new_lr = current_lr * 0.5;
304 optimizer.set_learning_rate(new_lr);
305 println!(
306 "Epoch {:2}: Reduced learning rate from {:.3} to {:.3}",
307 epoch, current_lr, new_lr
308 );
309 }
310
311 losses.push(loss.value());
312
313 // Print progress
314 if epoch % 10 == 0 || epoch == num_epochs - 1 {
315 println!(
316 "Epoch {:2}: Loss = {:.6}, LR = {:.3}",
317 epoch,
318 loss.value(),
319 optimizer.learning_rate()
320 );
321 }
322 }
323
324 println!("Final learning rate: {}", optimizer.learning_rate());
325
326 Ok(())
327}
328
329/// Demonstrate training monitoring and analysis
330fn demonstrate_training_monitoring() -> Result<(), Box<dyn std::error::Error>> {
331 println!("\n--- Training Monitoring ---");
332
333 // Create model
334 let mut weight = Tensor::randn(vec![1, 1], Some(46)).with_requires_grad();
335 let mut bias = Tensor::zeros(vec![1]).with_requires_grad();
336
337 // Create optimizer
338 let mut optimizer = Adam::with_learning_rate(0.01);
339 optimizer.add_parameter(&weight);
340 optimizer.add_parameter(&bias);
341
342 // Training data
343 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![4, 1]).unwrap();
344 let y_true = Tensor::from_slice(&[3.0, 5.0, 7.0, 9.0], vec![4, 1]).unwrap();
345
346 // Training loop with comprehensive monitoring
347 let num_epochs = 30;
348 let mut losses = Vec::new();
349 let mut weight_history = Vec::new();
350 let mut bias_history = Vec::new();
351
352 for epoch in 0..num_epochs {
353 // Forward pass
354 let y_pred = x_data.matmul(&weight) + &bias;
355 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
356
357 // Backward pass
358 loss.backward(None);
359
360 // Optimizer step
361 optimizer.step(&mut [&mut weight, &mut bias]);
362 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
363
364 // Record history
365 losses.push(loss.value());
366 weight_history.push(weight.value());
367 bias_history.push(bias.value());
368
369 // Print detailed monitoring
370 if epoch % 5 == 0 || epoch == num_epochs - 1 {
371 println!(
372 "Epoch {:2}: Loss = {:.6}, Weight = {:.6}, Bias = {:.6}",
373 epoch,
374 loss.value(),
375 weight.value(),
376 bias.value()
377 );
378 }
379 }
380
381 // Analyze training progression
382 println!("\nTraining Analysis:");
383 println!(" Initial loss: {:.6}", losses[0]);
384 println!(" Final loss: {:.6}", losses[losses.len() - 1]);
385 println!(
386 " Loss reduction: {:.1}%",
387 (losses[0] - losses[losses.len() - 1]) / losses[0] * 100.0
388 );
389
390 // Compute statistics
391 let loss_mean = compute_mean(&losses);
392 let loss_std = compute_std(&losses);
393 let weight_change = (weight_history[weight_history.len() - 1] - weight_history[0]).abs();
394 let bias_change = (bias_history[bias_history.len() - 1] - bias_history[0]).abs();
395
396 println!(" Average loss: {:.6} ± {:.6}", loss_mean, loss_std);
397 println!(" Weight change: {:.6}", weight_change);
398 println!(" Bias change: {:.6}", bias_change);
399 println!(" Final weight norm: {:.6}", weight.norm().value());
400 println!(" Final bias: {:.6}", bias.value());
401
402 Ok(())
403}
76 pub fn infer_autoregressive(&self, src: &Tensor, max_steps: usize) -> Tensor {
77 let (b, _s, e) = Self::triple(src);
78 let mut memory = src.clone();
79 for enc in &self.encoders {
80 memory = enc.forward(&memory, None);
81 }
82
83 let mut out_seq: Vec<Tensor> = Vec::new();
84 // Start token: zeros
85 let mut current = Tensor::zeros(vec![b, 1, e]);
86 for _step in 0..max_steps {
87 // Build causal mask for length t
88 let t = current.shape().dims()[1];
89 let mut causal = Tensor::ones(vec![b, self.num_heads, t, t]);
90 // Upper triangle as false -> masked for all batches and heads
91 for bb in 0..b {
92 for hh in 0..self.num_heads {
93 for i in 0..t {
94 for j in (i + 1)..t {
95 let offset = causal.memory_offset(&[bb, hh, i, j]);
96 let data = causal.data_mut();
97 data[offset] = 0.0;
98 }
99 }
100 }
101 }
102 let mut step_out = current.clone();
103 for dec in &self.decoders {
104 step_out = dec.forward(&step_out, &memory, Some(&causal), None);
105 }
106 // (Toy) append placeholder token; real models would project last token
107 out_seq.push(step_out.clone());
108 // Append a zero token to grow sequence by 1 for next causal computation
109 current = Tensor::zeros(vec![b, t + 1, e]);
110 }
111 // Simple return of final sequence placeholder
112 current
113 }
114
115 /// Non auto-regressive inference: single forward pass
116 pub fn infer_non_autoregressive(&self, src: &Tensor, tgt_len: usize) -> Tensor {
117 let (b, _s, e) = Self::triple(src);
118 let mut memory = src.clone();
119 for enc in &self.encoders {
120 memory = enc.forward(&memory, None);
121 }
122 let tgt = Tensor::zeros(vec![b, tgt_len, e]);
123 let mut out = tgt.clone();
124 for dec in &self.decoders {
125 out = dec.forward(&out, &memory, None, None);
126 }
127 out
128 }
165fn main() -> Result<(), Box<dyn std::error::Error>> {
166 println!("=== Multi-Head Attention Example ===");
167
168 let batch = 2usize;
169 let src_len = 5usize;
170 let tgt_len = 4usize;
171 let embed = 16usize;
172 let heads = 4usize;
173
174 let query = Tensor::randn(vec![batch, tgt_len, embed], Some(7));
175 let key = Tensor::randn(vec![batch, src_len, embed], Some(8));
176 let value = Tensor::randn(vec![batch, src_len, embed], Some(9));
177
178 let mut mha = MultiHeadAttention::new(embed, heads, Some(42));
179
180 // Simple causal mask for target self-attention shape [b, h, tq, tk]
181 let mut mask = Tensor::zeros(vec![batch, heads, tgt_len, src_len]);
182 // Disallow attending to future positions when tgt_len <= src_len by adding -1e9
183 // Here, just demonstrate mask broadcast/add mechanics with a light mask on last head
184 if src_len >= tgt_len {
185 // set upper triangle to a large negative value for head 0
186 for i in 0..tgt_len {
187 for j in (i + 1)..src_len {
188 let idx = [0usize, 0usize, i, j];
189 // Quick set via data_mut using a slice view
190 let offset = mask.memory_offset(&idx);
191 let data = mask.data_mut();
192 data[offset] = -1e9;
193 }
194 }
195 }
196
197 let out = mha.forward(&query, &key, &value, Some(&mask));
198 println!("Output shape: {:?}", out.shape().dims());
199
200 // Tiny training step to confirm gradients are wired
201 let mut optimizer = Adam::with_learning_rate(0.01);
202 let mut params = mha.parameters();
203 for p in &params {
204 optimizer.add_parameter(p);
205 }
206
207 // Dummy loss = mean of output
208 let mut loss = out.mean();
209 loss.backward(None);
210 optimizer.step(&mut params);
211 optimizer.zero_grad(&mut params);
212
213 println!("Loss: {:.6}", loss.value());
214 println!("=== Done ===");
215 Ok(())
216}
84fn demonstrate_default_adam() -> Result<(), Box<dyn std::error::Error>> {
85 println!("--- Default Adam Configuration ---");
86
87 // Create a simple regression problem: y = 2*x + 1
88 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5, 1]).unwrap();
89 let y_true = Tensor::from_slice(&[3.0, 5.0, 7.0, 9.0, 11.0], vec![5, 1]).unwrap();
90
91 // Create model parameters
92 let mut weight = Tensor::randn(vec![1, 1], Some(42)).with_requires_grad();
93 let mut bias = Tensor::zeros(vec![1]).with_requires_grad();
94
95 // Create Adam optimizer with default configuration
96 let mut optimizer = Adam::new();
97 optimizer.add_parameter(&weight);
98 optimizer.add_parameter(&bias);
99
100 println!("Default Adam configuration:");
101 println!(" Learning rate: {}", optimizer.learning_rate());
102 println!(" Initial weight: {:.6}", weight.value());
103 println!(" Initial bias: {:.6}", bias.value());
104
105 // Training loop
106 let num_epochs = 50;
107 let mut losses = Vec::new();
108
109 for epoch in 0..num_epochs {
110 // Forward pass
111 let y_pred = x_data.matmul(&weight) + &bias;
112 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
113
114 // Backward pass
115 loss.backward(None);
116
117 // Optimizer step
118 optimizer.step(&mut [&mut weight, &mut bias]);
119 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
120
121 losses.push(loss.value());
122
123 if epoch % 10 == 0 || epoch == num_epochs - 1 {
124 println!("Epoch {:3}: Loss = {:.6}", epoch, loss.value());
125 }
126 }
127
128 // Evaluate final model
129 let _final_predictions = x_data.matmul(&weight) + &bias;
130 println!("\nFinal model:");
131 println!(" Learned weight: {:.6} (target: 2.0)", weight.value());
132 println!(" Learned bias: {:.6} (target: 1.0)", bias.value());
133 println!(" Final loss: {:.6}", losses[losses.len() - 1]);
134
135 Ok(())
136}
137
138/// Demonstrate learning rate comparison
139fn demonstrate_learning_rate_comparison() -> Result<(), Box<dyn std::error::Error>> {
140 println!("\n--- Learning Rate Comparison ---");
141
142 let learning_rates = [0.001, 0.01, 0.1];
143 let mut results = Vec::new();
144
145 for &lr in &learning_rates {
146 println!("\nTesting learning rate: {}", lr);
147
148 let stats = train_with_config(TrainingConfig {
149 learning_rate: lr,
150 ..Default::default()
151 })?;
152
153 results.push((lr, stats.clone()));
154
155 println!(" Final loss: {:.6}", stats.final_loss);
156 println!(" Convergence epoch: {}", stats.convergence_epoch);
157 }
158
159 // Compare results
160 println!("\nLearning Rate Comparison Summary:");
161 for (lr, stats) in &results {
162 println!(
163 " LR={:6}: Loss={:.6}, Converged@{}",
164 lr, stats.final_loss, stats.convergence_epoch
165 );
166 }
167
168 Ok(())
169}
170
171/// Demonstrate weight decay comparison
172fn demonstrate_weight_decay_comparison() -> Result<(), Box<dyn std::error::Error>> {
173 println!("\n--- Weight Decay Comparison ---");
174
175 let weight_decays = [0.0, 0.001, 0.01];
176 let mut results = Vec::new();
177
178 for &wd in &weight_decays {
179 println!("\nTesting weight decay: {}", wd);
180
181 let stats = train_with_config(TrainingConfig {
182 weight_decay: wd,
183 ..Default::default()
184 })?;
185
186 results.push((wd, stats.clone()));
187
188 println!(" Final loss: {:.6}", stats.final_loss);
189 println!(" Final weight norm: {:.6}", stats.weight_norm);
190 }
191
192 // Compare results
193 println!("\nWeight Decay Comparison Summary:");
194 for (wd, stats) in &results {
195 println!(
196 " WD={:6}: Loss={:.6}, Weight Norm={:.6}",
197 wd, stats.final_loss, stats.weight_norm
198 );
199 }
200
201 Ok(())
202}
203
204/// Demonstrate beta parameter tuning
205fn demonstrate_beta_parameter_tuning() -> Result<(), Box<dyn std::error::Error>> {
206 println!("\n--- Beta Parameter Tuning ---");
207
208 let beta_configs = [
209 (0.9, 0.999), // Default
210 (0.8, 0.999), // More aggressive momentum
211 (0.95, 0.999), // Less aggressive momentum
212 (0.9, 0.99), // Faster second moment decay
213 ];
214
215 let mut results = Vec::new();
216
217 for (i, (beta1, beta2)) in beta_configs.iter().enumerate() {
218 println!(
219 "\nTesting beta configuration {}: beta1={}, beta2={}",
220 i + 1,
221 beta1,
222 beta2
223 );
224
225 let config = TrainingConfig {
226 beta1: *beta1,
227 beta2: *beta2,
228 ..Default::default()
229 };
230
231 let stats = train_with_config(config)?;
232 results.push(((*beta1, *beta2), stats.clone()));
233
234 println!(" Final loss: {:.6}", stats.final_loss);
235 println!(" Convergence epoch: {}", stats.convergence_epoch);
236 }
237
238 // Compare results
239 println!("\nBeta Parameter Comparison Summary:");
240 for ((beta1, beta2), stats) in &results {
241 println!(
242 " B1={:4}, B2={:5}: Loss={:.6}, Converged@{}",
243 beta1, beta2, stats.final_loss, stats.convergence_epoch
244 );
245 }
246
247 Ok(())
248}
249
250/// Demonstrate configuration benchmarking
251fn demonstrate_configuration_benchmarking() -> Result<(), Box<dyn std::error::Error>> {
252 println!("\n--- Configuration Benchmarking ---");
253
254 // Define configurations to benchmark
255 let configs = vec![
256 (
257 "Conservative",
258 TrainingConfig {
259 learning_rate: 0.001,
260 weight_decay: 0.001,
261 beta1: 0.95,
262 ..Default::default()
263 },
264 ),
265 (
266 "Balanced",
267 TrainingConfig {
268 learning_rate: 0.01,
269 weight_decay: 0.0,
270 beta1: 0.9,
271 ..Default::default()
272 },
273 ),
274 (
275 "Aggressive",
276 TrainingConfig {
277 learning_rate: 0.1,
278 weight_decay: 0.0,
279 beta1: 0.8,
280 ..Default::default()
281 },
282 ),
283 ];
284
285 let mut benchmark_results = Vec::new();
286
287 for (name, config) in configs {
288 println!("\nBenchmarking {} configuration:", name);
289
290 let start_time = std::time::Instant::now();
291 let stats = train_with_config(config.clone())?;
292 let elapsed = start_time.elapsed();
293
294 println!(" Training time: {:.2}ms", elapsed.as_millis());
295 println!(" Final loss: {:.6}", stats.final_loss);
296 println!(" Convergence: {} epochs", stats.convergence_epoch);
297
298 benchmark_results.push((name.to_string(), stats, elapsed));
299 }
300
301 // Summary
302 println!("\nBenchmarking Summary:");
303 for (name, stats, elapsed) in &benchmark_results {
304 println!(
305 " {:12}: Loss={:.6}, Time={:4}ms, Converged@{}",
306 name,
307 stats.final_loss,
308 elapsed.as_millis(),
309 stats.convergence_epoch
310 );
311 }
312
313 Ok(())
314}
315
316/// Helper function to train with specific configuration
317fn train_with_config(config: TrainingConfig) -> Result<TrainingStats, Box<dyn std::error::Error>> {
318 // Create training data
319 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5, 1]).unwrap();
320 let y_true = Tensor::from_slice(&[3.0, 5.0, 7.0, 9.0, 11.0], vec![5, 1]).unwrap();
321
322 // Create model parameters
323 let mut weight = Tensor::randn(vec![1, 1], Some(123)).with_requires_grad();
324 let mut bias = Tensor::zeros(vec![1]).with_requires_grad();
325
326 // Create optimizer with custom configuration
327 let adam_config = AdamConfig {
328 learning_rate: config.learning_rate,
329 beta1: config.beta1,
330 beta2: config.beta2,
331 eps: 1e-8,
332 weight_decay: config.weight_decay,
333 amsgrad: false,
334 };
335
336 let mut optimizer = Adam::with_config(adam_config);
337 optimizer.add_parameter(&weight);
338 optimizer.add_parameter(&bias);
339
340 // Training loop
341 let mut losses = Vec::new();
342 let mut convergence_epoch = config.epochs;
343
344 for epoch in 0..config.epochs {
345 // Forward pass
346 let y_pred = x_data.matmul(&weight) + &bias;
347 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
348
349 // Backward pass
350 loss.backward(None);
351
352 // Optimizer step
353 optimizer.step(&mut [&mut weight, &mut bias]);
354 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
355
356 let loss_value = loss.value();
357 losses.push(loss_value);
358
359 // Check for convergence (loss < 0.01)
360 if loss_value < 0.01 && convergence_epoch == config.epochs {
361 convergence_epoch = epoch;
362 }
363 }
364
365 Ok(TrainingStats {
366 config,
367 final_loss: losses[losses.len() - 1],
368 loss_history: losses,
369 convergence_epoch,
370 weight_norm: weight.norm().value(),
371 })
372}
Sourcepub fn ones(shape_dims: Vec<usize>) -> Self
pub fn ones(shape_dims: Vec<usize>) -> Self
Creates a new tensor filled with ones
Convenience constructor that creates a tensor and initializes all elements to one. Uses optimized SIMD operations for efficient initialization.
§Arguments
- shape_dims - Vector of dimension sizes defining the tensor shape
§Returns
A new tensor with all elements initialized to one
§Performance
- Memory Allocation: Single allocation with optimized alignment
- Initialization: SIMD-optimized one filling for large tensors
- Thread Safe: Atomic ID generation for gradtrack tracking
§Examples
use train_station::Tensor;
let tensor = Tensor::ones(vec![2, 3]);
assert_eq!(tensor.size(), 6);
assert_eq!(tensor.shape().dims(), vec![2, 3]);
// Verify all elements are one
assert_eq!(tensor.get(&[0, 0]), 1.0);
assert_eq!(tensor.get(&[1, 2]), 1.0);
Examples found in repository?
42fn demonstrate_tensor_creation() {
43 println!("--- Tensor Creation ---");
44
45 // Create tensors with different initializations
46 let zeros = Tensor::zeros(vec![2, 3]);
47 println!(
48 "Zeros tensor: shape {:?}, data: {:?}",
49 zeros.shape().dims(),
50 zeros.data()
51 );
52
53 let ones = Tensor::ones(vec![3, 2]);
54 println!(
55 "Ones tensor: shape {:?}, data: {:?}",
56 ones.shape().dims(),
57 ones.data()
58 );
59
60 // Create tensor from slice
61 let data = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0];
62 let from_slice = Tensor::from_slice(&data, vec![2, 3]).unwrap();
63 println!(
64 "From slice: shape {:?}, data: {:?}",
65 from_slice.shape().dims(),
66 from_slice.data()
67 );
68
69 // Create tensor with specific value
70 let mut filled = Tensor::new(vec![2, 2]);
71 {
72 let data = filled.data_mut();
73 for value in data.iter_mut() {
74 *value = 42.0;
75 }
76 }
77 println!("Filled with 42: {:?}", filled.data());
78
79 // Create tensor with random data
80 let random = Tensor::randn(vec![2, 2], Some(42));
81 println!(
82 "Random tensor: shape {:?}, data: {:?}",
83 random.shape().dims(),
84 random.data()
85 );
86}
More examples
76 pub fn infer_autoregressive(&self, src: &Tensor, max_steps: usize) -> Tensor {
77 let (b, _s, e) = Self::triple(src);
78 let mut memory = src.clone();
79 for enc in &self.encoders {
80 memory = enc.forward(&memory, None);
81 }
82
83 let mut out_seq: Vec<Tensor> = Vec::new();
84 // Start token: zeros
85 let mut current = Tensor::zeros(vec![b, 1, e]);
86 for _step in 0..max_steps {
87 // Build causal mask for length t
88 let t = current.shape().dims()[1];
89 let mut causal = Tensor::ones(vec![b, self.num_heads, t, t]);
90 // Upper triangle as false -> masked for all batches and heads
91 for bb in 0..b {
92 for hh in 0..self.num_heads {
93 for i in 0..t {
94 for j in (i + 1)..t {
95 let offset = causal.memory_offset(&[bb, hh, i, j]);
96 let data = causal.data_mut();
97 data[offset] = 0.0;
98 }
99 }
100 }
101 }
102 let mut step_out = current.clone();
103 for dec in &self.decoders {
104 step_out = dec.forward(&step_out, &memory, Some(&causal), None);
105 }
106 // (Toy) append placeholder token; real models would project last token
107 out_seq.push(step_out.clone());
108 // Append a zero token to grow sequence by 1 for next causal computation
109 current = Tensor::zeros(vec![b, t + 1, e]);
110 }
111 // Simple return of final sequence placeholder
112 current
113 }
114
115 /// Non auto-regressive inference: single forward pass
116 pub fn infer_non_autoregressive(&self, src: &Tensor, tgt_len: usize) -> Tensor {
117 let (b, _s, e) = Self::triple(src);
118 let mut memory = src.clone();
119 for enc in &self.encoders {
120 memory = enc.forward(&memory, None);
121 }
122 let tgt = Tensor::zeros(vec![b, tgt_len, e]);
123 let mut out = tgt.clone();
124 for dec in &self.decoders {
125 out = dec.forward(&out, &memory, None, None);
126 }
127 out
128 }
129
130 /// Helper: build boolean-like causal mask [b, heads, t, t] with 1.0 keep, 0.0 masked
131 fn build_causal_mask_static(batch: usize, heads: usize, t: usize) -> Tensor {
132 let mut mask = Tensor::ones(vec![batch, heads, t, t]);
133 for bb in 0..batch {
134 for hh in 0..heads {
135 for i in 0..t {
136 for j in (i + 1)..t {
137 let offset = mask.memory_offset(&[bb, hh, i, j]);
138 let data = mask.data_mut();
139 data[offset] = 0.0;
140 }
141 }
142 }
143 }
144 mask
145 }
333pub fn main() -> Result<(), Box<dyn std::error::Error>> {
334 println!("=== DQN Example (YardEnv discrete) ===");
335
336 // Dims
337 let state_dim = 3usize;
338 let action_dim = 3usize;
339
340 // Hparams
341 let gamma = 0.99f32;
342 let batch_size = 64usize;
343 let start_steps = 200usize;
344 let target_update_interval = 200usize; // hard update cadence
345 let max_grad_norm = 1.0f32;
346 let mut epsilon = 1.0f32;
347 let eps_min = 0.05f32;
348 let eps_decay_steps = 2_000usize; // linear decay
349 let total_steps = std::env::var("DQN_STEPS")
350 .ok()
351 .and_then(|v| v.parse::<usize>().ok())
352 .unwrap_or(3000usize);
353
354 // Models
355 let mut q_net = QNet::new(state_dim, action_dim, Some(7));
356 let mut q_targ = QNet::new(state_dim, action_dim, Some(8));
357 q_targ.net.copy_from(&q_net.net);
358 q_targ.set_requires_grad_all(false);
359
360 // Optimizer
361 let mut q_opt = Adam::with_learning_rate(3e-4);
362 for p in q_net.parameters() {
363 q_opt.add_parameter(p);
364 }
365
366 // Replay + env
367 let mut rb = ReplayBuffer::new(100_000, state_dim);
368 let mut env = YardEnv::new(12345);
369 let mut rng = SmallRng::new(999_111);
370
371 // Metrics
372 let mut state = env.reset();
373 let mut episode_return = 0.0f32;
374 let mut episode = 0usize;
375 let mut ema_return: Option<f32> = None;
376 let ema_alpha = 0.05f32;
377 let mut best_return = f32::NEG_INFINITY;
378
379 for t in 0..total_steps {
380 // Epsilon-greedy action
381 let action_index = if t < start_steps || rng.next_f32() < epsilon {
382 rng.sample_index(action_dim)
383 } else {
384 let _ng = NoGradTrack::new();
385 let q_vals = q_net.forward(&state);
386 let row = q_vals.data();
387 let mut best_i = 0usize;
388 let mut best_v = row[0];
389 for (i, &r) in row.iter().enumerate().take(action_dim).skip(1) {
390 if r > best_v {
391 best_v = r;
392 best_i = i;
393 }
394 }
395 best_i
396 };
397
398 // Env step
399 let (next_state, reward, done) = env.step(action_index);
400 episode_return += reward;
401
402 // Store
403 let s_slice = state.data().to_vec();
404 let s2_slice = next_state.data().to_vec();
405 rb.push(
406 &s_slice,
407 action_index,
408 reward,
409 if done { 1.0 } else { 0.0 },
410 &s2_slice,
411 );
412
413 // Reset on done
414 state = if done {
415 let st = env.reset();
416 ema_return = Some(match ema_return {
417 None => episode_return,
418 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
419 });
420 if episode_return > best_return {
421 best_return = episode_return;
422 }
423 println!(
424 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3} | rb_size={}",
425 t,
426 episode,
427 episode_return,
428 ema_return.unwrap_or(episode_return),
429 best_return,
430 rb.size
431 );
432 episode_return = 0.0;
433 episode += 1;
434 st
435 } else {
436 next_state
437 };
438
439 // Epsilon linear decay
440 if t < eps_decay_steps {
441 epsilon = (1.0 - (t as f32) / (eps_decay_steps as f32)) * (1.0 - eps_min) + eps_min;
442 }
443
444 // Train
445 if rb.can_sample(batch_size) {
446 let (s, a_idx, r, d, s2) = rb.sample(batch_size, &mut rng);
447
448 // Double DQN target: a* = argmax_a Q_online(s2,a); y = r + (1-d)*gamma*Q_target(s2, a*)
449 let target_q = {
450 let _ng = NoGradTrack::new();
451 let q_online_s2 = q_net.forward(&s2);
452 // argmax per row (manual on CPU)
453 let row_stride = action_dim;
454 let qd = q_online_s2.data();
455 let mut next_actions: Vec<usize> = Vec::with_capacity(batch_size);
456 for i in 0..batch_size {
457 let base = i * row_stride;
458 let mut bi = 0usize;
459 let mut bv = qd[base];
460 for j in 1..action_dim {
461 let v = qd[base + j];
462 if v > bv {
463 bv = v;
464 bi = j;
465 }
466 }
467 next_actions.push(bi);
468 }
469 let q_targ_s2 = q_targ.forward(&s2);
470 let q_targ_g = q_targ_s2.gather(1, &next_actions, &[batch_size, 1]);
471 let not_done = Tensor::ones(vec![batch_size, 1]).sub_tensor(&d);
472 r.add_tensor(&not_done.mul_scalar(gamma).mul_tensor(&q_targ_g))
473 };
474
475 // Q(s,a) for current actions
476 // Zero grads first
477 {
478 let mut params = q_net.parameters();
479 q_opt.zero_grad(&mut params);
480 }
481
482 let q_all = q_net.forward(&s);
483 let q_sa = q_all.gather(1, &a_idx, &[batch_size, 1]);
484 let diff = q_sa.sub_tensor(&target_q);
485 let mut loss = pseudo_huber_mean(&diff);
486 loss.backward(None);
487
488 // Step (filter only params with grads)
489 {
490 let params = q_net.parameters();
491 let mut with_grads: Vec<&mut Tensor> = Vec::new();
492 for p in params {
493 if p.grad_owned().is_some() {
494 with_grads.push(p);
495 }
496 }
497 if !with_grads.is_empty() {
498 let gn = grad_global_norm(&mut with_grads);
499 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
500 q_opt.step(&mut with_grads);
501 q_opt.zero_grad(&mut with_grads);
502 if t % 100 == 0 {
503 let mut pn = q_net.parameters();
504 let pn_l2 = params_l2_norm(&mut pn);
505 let q_mean = q_all.mean().value();
506 println!(
507 "t={:5} | loss={:.4} | q_mean={:.3} | grad_norm={:.3} | param_norm={:.3} | eps={:.3}",
508 t, loss.value(), q_mean, gn, pn_l2, epsilon
509 );
510 }
511 }
512 }
513
514 // Target hard update
515 if t % target_update_interval == 0 {
516 q_targ.net.copy_from(&q_net.net);
517 }
518
519 // Clear graphs
520 clear_all_graphs_known();
521 }
522 }
523
524 println!("=== DQN training finished ===");
525 Ok(())
526}
402pub fn main() -> Result<(), Box<dyn std::error::Error>> {
403 println!("=== TD3 Example (YardEnv) ===");
404
405 // Environment / problem dims
406 let state_dim = 3usize;
407 let action_dim = 1usize;
408
409 // Hyperparameters (small for demo)
410 let gamma = 0.99f32;
411 let tau = 0.005f32; // Polyak
412 let policy_noise = 0.2f32; // target smoothing noise stddev
413 let exploration_noise = 0.1f32; // behavior policy noise stddev
414 let policy_delay = 2usize;
415 let batch_size = 64usize;
416 let start_steps = 500usize; // random exploration steps
417 let total_steps = 1500usize;
418 let max_grad_norm = 1.0f32;
419
420 // Models
421 let mut actor = Actor::new(state_dim, action_dim, Some(11));
422 let mut actor_targ = Actor::new(state_dim, action_dim, Some(12));
423 actor_targ.net.copy_from(&actor.net);
424 actor_targ.set_requires_grad_all(false);
425
426 let mut critic1 = Critic::new(state_dim, action_dim, Some(21));
427 let mut critic2 = Critic::new(state_dim, action_dim, Some(22));
428 let mut critic1_targ = Critic::new(state_dim, action_dim, Some(23));
429 let mut critic2_targ = Critic::new(state_dim, action_dim, Some(24));
430 critic1_targ.net.copy_from(&critic1.net);
431 critic2_targ.net.copy_from(&critic2.net);
432 critic1_targ.set_requires_grad_all(false);
433 critic2_targ.set_requires_grad_all(false);
434
435 // Optimizers
436 let mut actor_opt = Adam::with_learning_rate(1e-3);
437 for p in actor.parameters() {
438 actor_opt.add_parameter(p);
439 }
440
441 let mut critic_opt = Adam::with_learning_rate(1e-4);
442 for p in critic1.parameters() {
443 critic_opt.add_parameter(p);
444 }
445 for p in critic2.parameters() {
446 critic_opt.add_parameter(p);
447 }
448
449 // Replay buffer and env
450 let mut rb = ReplayBuffer::new(100_000, state_dim, action_dim);
451 let mut env = YardEnv::new(1234);
452 let mut rng = SmallRng::new(987654321);
453
454 // Reset & metric trackers
455 let mut state = env.reset(); // [1, state_dim]
456 let mut episode_return = 0.0f32;
457 let mut episode = 0usize;
458 let mut ema_return: Option<f32> = None;
459 let ema_alpha = 0.05f32; // smooth short-term
460 let mut best_return = f32::NEG_INFINITY;
461 let mut policy_updates: usize = 0;
462
463 for t in 0..total_steps {
464 // Select action
465 let action_tensor = if t < start_steps {
466 let a = rng.uniform(-1.0, 1.0);
467 Tensor::from_slice(&[a], vec![1, action_dim]).unwrap()
468 } else {
469 // Behavior policy with exploration noise
470 let _ng = NoGradTrack::new();
471 let det = actor.forward(&state);
472 let noise = Tensor::randn(vec![1, action_dim], None).mul_scalar(exploration_noise);
473 tanh_bounded(&det.add_tensor(&noise))
474 };
475 let action_value = action_tensor.data()[0];
476
477 // Environment step
478 let (next_state, reward, done) = env.step(action_value);
479 episode_return += reward;
480
481 // Store transition
482 let s_slice = state.data().to_vec();
483 let a_slice = action_tensor.data().to_vec();
484 let s2_slice = next_state.data().to_vec();
485 rb.push(
486 &s_slice,
487 &a_slice,
488 reward,
489 if done { 1.0 } else { 0.0 },
490 &s2_slice,
491 );
492
493 state = if done {
494 let st = env.reset();
495 // Metrics: update EMA and best
496 ema_return = Some(match ema_return {
497 None => episode_return,
498 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
499 });
500 if episode_return > best_return {
501 best_return = episode_return;
502 }
503 println!(
504 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3} | rb_size={} | policy_updates={}",
505 t,
506 episode,
507 episode_return,
508 ema_return.unwrap_or(episode_return),
509 best_return,
510 rb.size,
511 policy_updates
512 );
513 episode_return = 0.0;
514 episode += 1;
515 st
516 } else {
517 next_state
518 };
519
520 // Training
521 if rb.can_sample(batch_size) {
522 // Sample batch
523 let (s, a, r, d, s2) = rb.sample(batch_size, &mut rng);
524
525 // Compute target values y = r + (1-d)*gamma*min(Q1', Q2') using target networks (no grad)
526 let target_q = {
527 let _ng = NoGradTrack::new();
528 // Target actions with smoothing noise (tanh bounds)
529 let noise =
530 Tensor::randn(vec![batch_size, action_dim], None).mul_scalar(policy_noise);
531 let a_targ = tanh_bounded(&actor_targ.forward(&s2).add_tensor(&noise));
532 let q1_t = critic1_targ.forward(&s2, &a_targ);
533 let q2_t = critic2_targ.forward(&s2, &a_targ);
534
535 // Elementwise min via data() since this path is no-grad
536 let q1d = q1_t.data();
537 let q2d = q2_t.data();
538 let mut min_vec = Vec::with_capacity(batch_size);
539 for i in 0..batch_size {
540 let v1 = q1d[i];
541 let v2 = q2d[i];
542 min_vec.push(v1.min(v2));
543 }
544 let min_q = Tensor::from_slice(&min_vec, vec![batch_size, 1]).unwrap();
545 let not_done = Tensor::ones(vec![batch_size, 1]).sub_tensor(&d);
546 r.add_tensor(&not_done.mul_scalar(gamma).mul_tensor(&min_q))
547 };
548
549 // Critic update (both critics)
550 // Zero grads in a short scope, then drop borrows before forward
551 {
552 let mut params = {
553 let c_params = critic1.parameters();
554 let c2_params = critic2.parameters();
555 let mut tmp: Vec<&mut Tensor> = Vec::new();
556 tmp.extend(c_params);
557 tmp.extend(c2_params);
558 tmp
559 };
560 critic_opt.zero_grad(&mut params);
561 }
562
563 // Forward current Q estimates
564 let q1 = critic1.forward(&s, &a);
565 let q2 = critic2.forward(&s, &a);
566 let diff1 = q1.sub_tensor(&target_q);
567 let diff2 = q2.sub_tensor(&target_q);
568 let mut critic_loss = diff1
569 .pow_scalar(2.0)
570 .mean()
571 .add_tensor(&diff2.pow_scalar(2.0).mean());
572
573 // Backward
574 critic_loss.backward(None);
575
576 // Optional gradient clipping + step (only for params that received grads)
577 {
578 let params = {
579 let c_params = critic1.parameters();
580 let c2_params = critic2.parameters();
581 let mut tmp: Vec<&mut Tensor> = Vec::new();
582 tmp.extend(c_params);
583 tmp.extend(c2_params);
584 tmp
585 };
586 let mut with_grads: Vec<&mut Tensor> = Vec::new();
587 for p in params {
588 if p.grad_owned().is_some() {
589 with_grads.push(p);
590 }
591 }
592 if !with_grads.is_empty() {
593 // Pre-step metrics
594 let grad_norm_before = grad_global_norm(&mut with_grads);
595 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
596 critic_opt.step(&mut with_grads);
597 critic_opt.zero_grad(&mut with_grads);
598
599 // Post-step metrics (param norm)
600 let mut for_norm_params = {
601 let c_params = critic1.parameters();
602 let c2_params = critic2.parameters();
603 let mut tmp: Vec<&mut Tensor> = Vec::new();
604 tmp.extend(c_params);
605 tmp.extend(c2_params);
606 tmp
607 };
608 let param_norm = params_l2_norm(&mut for_norm_params);
609
610 // Print compact critic metrics occasionally
611 if t % 100 == 0 {
612 let q1_mean = q1.mean().value();
613 let q2_mean = q2.mean().value();
614 let tq_mean = target_q.mean().value();
615 println!(
616 "t={:5} | critic_loss={:.4} | q1_mean={:.3} q2_mean={:.3} tq_mean={:.3} | grad_norm={:.3} | crit_param_norm={:.3}",
617 t,
618 critic_loss.value(),
619 q1_mean,
620 q2_mean,
621 tq_mean,
622 grad_norm_before,
623 param_norm
624 );
625 }
626 }
627 }
628
629 // Delayed policy update
630 if t % policy_delay == 0 {
631 // Actor update: maximize Q1(s, actor(s)) -> minimize -Q1
632 // Zero actor grads before backward
633 {
634 let mut a_params: Vec<&mut Tensor> = actor.parameters();
635 actor_opt.zero_grad(&mut a_params);
636 }
637
638 let a_pred = actor.forward(&s);
639 let q_for_actor = critic1.forward(&s, &a_pred);
640 let mut actor_loss = q_for_actor.mul_scalar(-1.0).mean();
641 actor_loss.backward(None);
642
643 {
644 let a_params: Vec<&mut Tensor> = actor.parameters();
645 let mut with_grads: Vec<&mut Tensor> = Vec::new();
646 for p in a_params {
647 if p.grad_owned().is_some() {
648 with_grads.push(p);
649 }
650 }
651 if !with_grads.is_empty() {
652 let grad_norm_before = grad_global_norm(&mut with_grads);
653 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
654 actor_opt.step(&mut with_grads);
655 actor_opt.zero_grad(&mut with_grads);
656
657 // Post-step param norm
658 let mut for_norm_params = actor.parameters();
659 let param_norm = params_l2_norm(&mut for_norm_params);
660
661 policy_updates += 1;
662 if t % 200 == 0 {
663 println!(
664 "t={:5} | actor_loss={:.4} | act_grad_norm={:.3} | act_param_norm={:.3} | lr_a={:.4e} lr_c={:.4e} | policy_updates={}",
665 t,
666 actor_loss.value(),
667 grad_norm_before,
668 param_norm,
669 actor_opt.learning_rate(),
670 critic_opt.learning_rate(),
671 policy_updates
672 );
673 }
674 }
675 }
676
677 // Target updates (Polyak averaging, no grad)
678 actor_targ.net.soft_update_from(&actor.net, tau);
679 critic1_targ.net.soft_update_from(&critic1.net, tau);
680 critic2_targ.net.soft_update_from(&critic2.net, tau);
681 }
682
683 // Clear entire graphs to avoid stale accumulation across iterations
684 clear_all_graphs_known();
685 }
686 }
687
688 println!("=== TD3 training finished ===");
689 Ok(())
690}
Sourcepub fn zeros_on_device(shape_dims: Vec<usize>, device: Device) -> Self
pub fn zeros_on_device(shape_dims: Vec<usize>, device: Device) -> Self
Creates a new tensor filled with zeros on a specific device
Convenience constructor that creates a tensor on the specified device and initializes all elements to zero. Uses optimized SIMD operations for efficient zero initialization.
§Arguments
- shape_dims - Vector of dimension sizes defining the tensor shape
- device - The device where the tensor should be allocated
§Returns
A new tensor with all elements initialized to zero
§Performance
- Memory Allocation: Device-specific allocation with optimized alignment
- Initialization: SIMD-optimized zero filling for large tensors
- Thread Safe: Atomic ID generation for gradtrack tracking
§Examples
use train_station::Tensor;
use train_station::Device;
let tensor = Tensor::zeros_on_device(vec![2, 2], Device::cpu());
assert_eq!(tensor.device(), Device::cpu());
assert_eq!(tensor.size(), 4);
// Verify all elements are zero
assert_eq!(tensor.get(&[0, 0]), 0.0);
assert_eq!(tensor.get(&[1, 1]), 0.0);
Examples found in repository?
179fn demonstrate_utility_functions() {
180 println!("\n--- Utility Functions ---");
181
182 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
183
184 // Basic properties
185 println!("Shape: {:?}", tensor.shape().dims());
186 println!("Size: {}", tensor.size());
187 println!("Is contiguous: {}", tensor.is_contiguous());
188 println!("Device: {:?}", tensor.device());
189
190 // Mathematical operations
191 let sum = tensor.sum();
192 println!("Sum: {}", sum.value());
193
194 let mean = tensor.mean();
195 println!("Mean: {}", mean.value());
196
197 let norm = tensor.norm();
198 println!("Norm: {}", norm.value());
199
200 // Device placement
201 let cpu_tensor = Tensor::zeros_on_device(vec![3, 3], train_station::Device::cpu());
202 println!(
203 "CPU tensor: shape {:?}, device: {:?}",
204 cpu_tensor.shape().dims(),
205 cpu_tensor.device()
206 );
207}
Sourcepub fn ones_on_device(shape_dims: Vec<usize>, device: Device) -> Self
pub fn ones_on_device(shape_dims: Vec<usize>, device: Device) -> Self
Creates a new tensor filled with ones on a specific device
Convenience constructor that creates a tensor on the specified device and initializes all elements to one. Uses optimized SIMD operations for efficient initialization.
§Arguments
- shape_dims - Vector of dimension sizes defining the tensor shape
- device - The device where the tensor should be allocated
§Returns
A new tensor with all elements initialized to one
§Performance
- Memory Allocation: Device-specific allocation with optimized alignment
- Initialization: SIMD-optimized one filling for large tensors
- Thread Safe: Atomic ID generation for gradtrack tracking
§Examples
use train_station::Tensor;
use train_station::Device;
let tensor = Tensor::ones_on_device(vec![2, 2], Device::cpu());
assert_eq!(tensor.device(), Device::cpu());
assert_eq!(tensor.size(), 4);
// Verify all elements are one
assert_eq!(tensor.get(&[0, 0]), 1.0);
assert_eq!(tensor.get(&[1, 1]), 1.0);
Sourcepub fn fill(&mut self, value: f32)
pub fn fill(&mut self, value: f32)
Fills the tensor with a constant value using SIMD optimization
Efficiently initializes all elements of the tensor to the specified value. Uses SIMD operations for large tensors to maximize performance.
§Arguments
- value - The value to fill the tensor with
§Performance
- SIMD Optimization: Uses AVX2 for large tensors when available
- Unrolled Loops: 4x unrolling for better instruction throughput
- Memory Bandwidth: Optimized for maximum memory bandwidth utilization
§Examples
use train_station::Tensor;
let mut tensor = Tensor::new(vec![2, 3]);
tensor.fill(42.0);
// Verify all elements are 42.0
assert_eq!(tensor.get(&[0, 0]), 42.0);
assert_eq!(tensor.get(&[1, 2]), 42.0);
§Zero-Sized Tensor Handling
use train_station::Tensor;
let mut empty_tensor = Tensor::new(vec![0]);
empty_tensor.fill(42.0); // Should not panic
assert_eq!(empty_tensor.size(), 0);
Source§impl Tensor
impl Tensor
Sourcepub fn from_slice(data: &[f32], shape_dims: Vec<usize>) -> Result<Self, String>
pub fn from_slice(data: &[f32], shape_dims: Vec<usize>) -> Result<Self, String>
Creates a tensor from a slice of data
Creates a new tensor with the specified shape and copies data from the provided slice. Validates that the data size matches the tensor shape before performing the copy operation.
This method provides an efficient way to create tensors from existing data sources while ensuring data integrity and proper memory management.
§Arguments
- data - Slice of f32 values to copy into the tensor
- shape_dims - Vector of dimension sizes defining the tensor shape
§Returns
- Ok(Tensor) - Successfully created tensor with copied data
- Err(String) - Error if data size doesn’t match shape
§Performance
- Memory Copy: Efficient non-overlapping copy using SIMD when possible
- Validation: Fast size validation before allocation
- Alignment: Proper memory alignment for optimal performance
- Large Data: Optimized handling of large datasets
§Examples
§Basic Usage
use train_station::Tensor;
let data = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0];
let tensor = Tensor::from_slice(&data, vec![2, 3]).unwrap();
assert_eq!(tensor.size(), 6);
assert_eq!(tensor.get(&[0, 0]), 1.0);
assert_eq!(tensor.get(&[1, 2]), 6.0);
§Multi-Dimensional Data
use train_station::Tensor;
// 1D tensor
let data_1d = [1.0, 2.0, 3.0];
let tensor_1d = Tensor::from_slice(&data_1d, vec![3]).unwrap();
assert_eq!(tensor_1d.shape().dims(), vec![3]);
assert_eq!(tensor_1d.get(&[1]), 2.0);
// 3D tensor
let data_3d = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0];
let tensor_3d = Tensor::from_slice(&data_3d, vec![2, 2, 2]).unwrap();
assert_eq!(tensor_3d.shape().dims(), vec![2, 2, 2]);
assert_eq!(tensor_3d.get(&[0, 0, 0]), 1.0);
assert_eq!(tensor_3d.get(&[1, 1, 1]), 8.0);
§Error Handling
use train_station::Tensor;
// Size mismatch error
let data = [1.0, 2.0, 3.0];
let result = Tensor::from_slice(&data, vec![2, 2]);
assert!(result.is_err());
let err = result.unwrap_err();
assert!(err.contains("Data size 3 doesn't match shape size 4"));
§Zero-Sized Tensors
use train_station::Tensor;
// Handle empty tensors gracefully
let data: [f32; 0] = [];
let tensor = Tensor::from_slice(&data, vec![0]).unwrap();
assert_eq!(tensor.size(), 0);
assert_eq!(tensor.shape().dims(), vec![0]);
§Large Data Sets
use train_station::Tensor;
// Efficient handling of large datasets
let size = 1000;
let data: Vec<f32> = (0..size).map(|i| i as f32).collect();
let tensor = Tensor::from_slice(&data, vec![size]).unwrap();
assert_eq!(tensor.size(), size);
assert_eq!(tensor.get(&[0]), 0.0);
assert_eq!(tensor.get(&[100]), 100.0);
assert_eq!(tensor.get(&[999]), 999.0);
§Implementation Details
This method performs the following steps:
- Shape Validation: Creates a Shape object and validates dimensions
- Size Check: Ensures data length matches the calculated tensor size
- Memory Allocation: Allocates tensor memory with proper alignment
- Data Copy: Uses efficient non-overlapping memory copy operation
- Return: Returns the created tensor or descriptive error message
The memory copy operation uses std::ptr::copy_nonoverlapping for
maximum performance and safety, ensuring no data corruption occurs
during the copy process.
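As a hedged illustration of the size check (validate below is a hypothetical helper, not the crate’s API), the rule reduces to comparing the slice length against the product of the requested dimensions:
fn validate(data_len: usize, shape_dims: &[usize]) -> Result<(), String> {
    let expected: usize = shape_dims.iter().product();
    if data_len == expected {
        Ok(())
    } else {
        Err(format!(
            "Data size {} doesn't match shape size {}",
            data_len, expected
        ))
    }
}
assert!(validate(6, &[2, 3]).is_ok());
assert_eq!(
    validate(3, &[2, 2]).unwrap_err(),
    "Data size 3 doesn't match shape size 4"
);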
Examples found in repository?
184 fn state_tensor(&self) -> Tensor {
185 Tensor::from_slice(&[self.pos, self.vel, 0.0], vec![1, 3]).unwrap()
186 }
187
188 fn step(&mut self, action_index: usize) -> (Tensor, f32, bool) {
189 let a = Self::ACTIONS[action_index.min(2)];
190 self.vel += 0.1 * a - 0.01 * self.pos;
191 self.pos += self.vel;
192 self.steps += 1;
193 let reward = -(self.pos * self.pos) - 0.05 * (a * a);
194 let done = self.pos.abs() > 3.0 || self.steps >= self.max_steps;
195 (self.state_tensor(), reward, done)
196 }
197}
198
199// -------------------------------
200// Replay Buffer
201// -------------------------------
202
203struct ReplayBuffer {
204 capacity: usize,
205 size: usize,
206 pos: usize,
207 state_dim: usize,
208 states: Vec<f32>,
209 actions: Vec<usize>,
210 rewards: Vec<f32>,
211 dones: Vec<f32>,
212 next_states: Vec<f32>,
213}
214
215impl ReplayBuffer {
216 fn new(capacity: usize, state_dim: usize) -> Self {
217 Self {
218 capacity,
219 size: 0,
220 pos: 0,
221 state_dim,
222 states: vec![0.0; capacity * state_dim],
223 actions: vec![0usize; capacity],
224 rewards: vec![0.0; capacity],
225 dones: vec![0.0; capacity],
226 next_states: vec![0.0; capacity * state_dim],
227 }
228 }
229
230 fn push(&mut self, s: &[f32], a_idx: usize, r: f32, d: f32, s2: &[f32]) {
231 let i = self.pos;
232 let so = i * self.state_dim;
233 self.states[so..so + self.state_dim].copy_from_slice(s);
234 self.actions[i] = a_idx;
235 self.rewards[i] = r;
236 self.dones[i] = d;
237 self.next_states[so..so + self.state_dim].copy_from_slice(s2);
238 self.pos = (self.pos + 1) % self.capacity;
239 self.size = self.size.saturating_add(1).min(self.capacity);
240 }
241
242 fn can_sample(&self, batch_size: usize) -> bool {
243 self.size >= batch_size
244 }
245
246 fn sample(
247 &self,
248 batch_size: usize,
249 rng: &mut SmallRng,
250 ) -> (Tensor, Vec<usize>, Tensor, Tensor, Tensor) {
251 let mut s_vec = Vec::with_capacity(batch_size * self.state_dim);
252 let mut a_idx = Vec::with_capacity(batch_size);
253 let mut r_vec = Vec::with_capacity(batch_size);
254 let mut d_vec = Vec::with_capacity(batch_size);
255 let mut s2_vec = Vec::with_capacity(batch_size * self.state_dim);
256 for _ in 0..batch_size {
257 let idx = rng.sample_index(self.size);
258 let so = idx * self.state_dim;
259 s_vec.extend_from_slice(&self.states[so..so + self.state_dim]);
260 a_idx.push(self.actions[idx]);
261 r_vec.push(self.rewards[idx]);
262 d_vec.push(self.dones[idx]);
263 s2_vec.extend_from_slice(&self.next_states[so..so + self.state_dim]);
264 }
265 let s = Tensor::from_slice(&s_vec, vec![batch_size, self.state_dim]).unwrap();
266 let r = Tensor::from_slice(&r_vec, vec![batch_size, 1]).unwrap();
267 let d = Tensor::from_slice(&d_vec, vec![batch_size, 1]).unwrap();
268 let s2 = Tensor::from_slice(&s2_vec, vec![batch_size, self.state_dim]).unwrap();
269 (s, a_idx, r, d, s2)
270 }
More examples
150 fn state_tensor(&self) -> Tensor {
151 Tensor::from_slice(&[self.pos, self.vel, 0.0], vec![1, 3]).unwrap()
152 }
153 fn step(&mut self, action_idx: usize) -> (Tensor, f32, bool) {
154 let a = Self::ACTIONS[action_idx.min(2)];
155 self.vel += 0.1 * a - 0.01 * self.pos;
156 self.pos += self.vel;
157 self.steps += 1;
158 let reward = -(self.pos * self.pos) - 0.05 * (a * a);
159 let done = self.pos.abs() > 3.0 || self.steps >= self.max_steps;
160 (self.state_tensor(), reward, done)
161 }
162}
163
164// -------------------------------
165// Rollout storage
166// -------------------------------
167
168struct RolloutBatch {
169 states: Vec<f32>,
170 actions: Vec<usize>,
171 old_logps: Vec<f32>,
172 rewards: Vec<f32>,
173 dones: Vec<f32>,
174 values: Vec<f32>,
175 next_states: Vec<f32>,
176 _state_dim: usize,
177}
178impl RolloutBatch {
179 fn new(cap: usize, sd: usize) -> Self {
180 Self {
181 states: Vec::with_capacity(cap * sd),
182 actions: Vec::with_capacity(cap),
183 old_logps: Vec::with_capacity(cap),
184 rewards: Vec::with_capacity(cap),
185 dones: Vec::with_capacity(cap),
186 values: Vec::with_capacity(cap),
187 next_states: Vec::with_capacity(cap * sd),
188 _state_dim: sd,
189 }
190 }
191 #[allow(clippy::too_many_arguments)]
192 fn push(&mut self, s: &[f32], a: usize, lp: f32, r: f32, d: f32, v: f32, s2: &[f32]) {
193 self.states.extend_from_slice(s);
194 self.actions.push(a);
195 self.old_logps.push(lp);
196 self.rewards.push(r);
197 self.dones.push(d);
198 self.values.push(v);
199 self.next_states.extend_from_slice(s2);
200 }
201 fn len(&self) -> usize {
202 self.actions.len()
203 }
204}
205
206// -------------------------------
207// Helpers
208// -------------------------------
209
210#[allow(clippy::too_many_arguments)]
211fn compute_gae(
212 returns_out: &mut [f32],
213 adv_out: &mut [f32],
214 rewards: &[f32],
215 dones: &[f32],
216 values: &[f32],
217 next_values: &[f32],
218 gamma: f32,
219 lam: f32,
220) {
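    // GAE recurrence, evaluated backward in time:
    //   delta_t = r_t + gamma * V(s_{t+1}) * (1 - done_t) - V(s_t)
    //   A_t     = delta_t + gamma * lambda * (1 - done_t) * A_{t+1}
    //   R_t     = A_t + V(s_t)   (returns used as the value-function target)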
221 let n = rewards.len();
222 let mut gae = 0.0f32;
223 for t in (0..n).rev() {
224 let not_done = 1.0 - dones[t];
225 let delta = rewards[t] + gamma * next_values[t] * not_done - values[t];
226 gae = delta + gamma * lam * not_done * gae;
227 adv_out[t] = gae;
228 returns_out[t] = gae + values[t];
229 }
230}
231
232fn normalize_in_place(x: &mut [f32], eps: f32) {
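    // Standardize in place: subtract the mean and divide by sqrt(var + eps);
    // used here to normalize advantages before the PPO update.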
233 let n = x.len() as f32;
234 if n <= 1.0 {
235 return;
236 }
237 let mean = x.iter().copied().sum::<f32>() / n;
238 let var = x
239 .iter()
240 .map(|v| {
241 let d = v - mean;
242 d * d
243 })
244 .sum::<f32>()
245 / n;
246 let std = (var + eps).sqrt();
247 for v in x.iter_mut() {
248 *v = (*v - mean) / std;
249 }
250}
251
252fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
253 let mut total_sq = 0.0f32;
254 for p in parameters.iter() {
255 if let Some(g) = p.grad_owned() {
256 for &v in g.data() {
257 total_sq += v * v;
258 }
259 }
260 }
261 let norm = total_sq.sqrt();
262 if norm > max_norm {
263 let scale = max_norm / (norm + eps);
264 for p in parameters.iter_mut() {
265 if let Some(g) = p.grad_owned() {
266 p.set_grad(g.mul_scalar(scale));
267 }
268 }
269 }
270}
271
272// log-softmax for selected actions: given logits [B,A] and actions Vec<usize> -> log_prob [B,1]
273fn log_prob_actions(
274 logits: &Tensor,
275 actions: &[usize],
276 batch: usize,
277 _action_dim: usize,
278) -> Tensor {
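    // Numerically stable log-softmax: subtract each row's max before exponentiating,
    //   log_softmax(x) = (x - max) - log(sum(exp(x - max)))
    // then gather the entry for each selected action.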
279 let max_logits = logits.max_dims(&[1], true); // [B,1]
280 let shifted = logits.sub_tensor(&max_logits);
281 let exp = shifted.exp();
282 let sum_exp = exp.sum_dims(&[1], true); // [B,1]
283 let log_sum_exp = sum_exp.log(); // [B,1]
284 let log_softmax = shifted.sub_tensor(&log_sum_exp); // [B,A]
285 // gather selected action log-probs
286 log_softmax.gather(1, actions, &[batch, 1])
287}
288
289// probability ratio = exp(new_logp - old_logp)
290fn ratio_from_logps(new_logp: &Tensor, old_logp: &Tensor) -> Tensor {
291 new_logp.sub_tensor(old_logp).exp()
292}
293
294// Clamp ratio to [1-clip, 1+clip] using ReLU-based clamp (no custom ops)
295fn clamp_ratio(ratio: &Tensor, clip_eps: f32) -> Tensor {
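    // Elementwise identities used below:
    //   max(x, low)  = relu(x - low) + low
    //   min(x, high) = high - relu(x - high)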
296 let b = ratio.shape().dims()[0];
297 let low = Tensor::from_slice(&vec![1.0 - clip_eps; b], vec![b, 1]).unwrap();
298 let high = Tensor::from_slice(&vec![1.0 + clip_eps; b], vec![b, 1]).unwrap();
299 let ge_low = ratio.sub_tensor(&low).relu().add_tensor(&low);
300 high.sub_tensor(&ge_low.sub_tensor(&high).relu())
301}
302
303fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
304 let mut total_sq = 0.0f32;
305 for p in parameters.iter_mut() {
306 if let Some(g) = p.grad_owned() {
307 for &v in g.data() {
308 total_sq += v * v;
309 }
310 }
311 }
312 total_sq.sqrt()
313}
314
315// -------------------------------
316// Main
317// -------------------------------
318
319pub fn main() -> Result<(), Box<dyn std::error::Error>> {
320 println!("=== PPO Discrete Example (YardEnv) ===");
321
322 let state_dim = 3usize;
323 let action_dim = 3usize;
324 let total_steps = std::env::var("PPOD_STEPS")
325 .ok()
326 .and_then(|v| v.parse::<usize>().ok())
327 .unwrap_or(3500usize);
328 let horizon = 128usize;
329 let epochs = 4usize;
330 let mini_batch_size = 64usize;
331 let gamma = 0.99f32;
332 let lam = 0.95f32;
333 let clip_eps = 0.2f32;
334 let vf_coef = 0.5f32;
335 let ent_coef = 0.0f32;
336 let max_grad_norm = 1.0f32;
337
338 let mut actor = Actor::new(state_dim, action_dim, Some(111));
339 let mut critic = Critic::new(state_dim, Some(222));
340 let mut actor_opt = Adam::with_learning_rate(3e-4);
341 for p in actor.parameters() {
342 actor_opt.add_parameter(p);
343 }
344 let mut critic_opt = Adam::with_learning_rate(3e-4);
345 for p in critic.parameters() {
346 critic_opt.add_parameter(p);
347 }
348
349 let mut env = YardEnv::new(1234);
350 let mut rng = SmallRng::new(98765);
351 let mut state = env.reset();
352 let mut episode_return = 0.0f32;
353 let mut episode = 0usize;
354 let mut ema_return: Option<f32> = None;
355 let ema_alpha = 0.05f32;
356 let mut best_return = f32::NEG_INFINITY;
357
358 let mut t = 0usize;
359 while t < total_steps {
360 let mut batch = RolloutBatch::new(horizon, state_dim);
361 for _ in 0..horizon {
362 // Actor logits and categorical sampling
363 let logits = actor.forward(&state); // [1, A]
364 let probs = logits.softmax(1); // [1, A]
365 // sample action from probs (CPU sampling)
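            // Inverse-CDF categorical sampling: draw u ~ U(0,1) and pick the first
            // action whose cumulative probability exceeds u.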
366 let p = probs.data();
367 let (p0, p1, _p2) = (p[0], p[1], p[2]);
368 let u = rng.next_f32();
369 let a_idx = if u < p0 {
370 0
371 } else if u < p0 + p1 {
372 1
373 } else {
374 2
375 };
376
377 let old_logp = {
378 let _ng = NoGradTrack::new();
379 let lp = log_prob_actions(&logits, &[a_idx], 1, action_dim);
380 lp.data()[0]
381 };
382
383 // Step env
384 let (next_state, reward, done) = env.step(a_idx);
385 episode_return += reward;
386
387 // Critic value
388 let value_t = critic.forward(&state);
389 let value_v = value_t.data()[0];
390
391 batch.push(
392 state.data(),
393 a_idx,
394 old_logp,
395 reward,
396 if done { 1.0 } else { 0.0 },
397 value_v,
398 next_state.data(),
399 );
400
401 state = if done {
402 let st = env.reset();
403 ema_return = Some(match ema_return {
404 None => episode_return,
405 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
406 });
407 if episode_return > best_return {
408 best_return = episode_return;
409 }
410 println!(
411 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
412 t,
413 episode,
414 episode_return,
415 ema_return.unwrap_or(episode_return),
416 best_return
417 );
418 episode_return = 0.0;
419 episode += 1;
420 st
421 } else {
422 next_state
423 };
424
425 t += 1;
426 if t >= total_steps {
427 break;
428 }
429 }
430
431 // Bootstrap values for GAE
432 let next_values: Vec<f32> = {
433 let mut out = Vec::with_capacity(batch.len());
434 for i in 0..batch.len() {
435 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
436 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
437 out.push(critic.forward(&s2_t).data()[0]);
438 }
439 out
440 };
441
442 let mut returns = vec![0.0f32; batch.len()];
443 let mut adv = vec![0.0f32; batch.len()];
444 compute_gae(
445 &mut returns,
446 &mut adv,
447 &batch.rewards,
448 &batch.dones,
449 &batch.values,
450 &next_values,
451 gamma,
452 lam,
453 );
454 normalize_in_place(&mut adv, 1e-8);
455
456 // Tensors for training
457 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
458 let actions_vec = batch.actions.clone();
459 let old_logp_t = Tensor::from_slice(&batch.old_logps, vec![batch.len(), 1]).unwrap();
460 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
461 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
462
463 // PPO epochs
464 let num_minibatches = batch.len().div_ceil(mini_batch_size);
465 for e in 0..epochs {
466 for mb in 0..num_minibatches {
467 let start = mb * mini_batch_size;
468 let end = (start + mini_batch_size).min(batch.len());
469 if start >= end {
470 break;
471 }
472
473 // Views
474 let s_mb = states_t
475 .slice_view(start * state_dim, 1, (end - start) * state_dim)
476 .reshape(vec![(end - start) as i32, state_dim as i32]);
477 let oldlp_mb = old_logp_t
478 .slice_view(start, 1, end - start)
479 .reshape(vec![(end - start) as i32, 1]);
480 let ret_mb = returns_t
481 .slice_view(start, 1, end - start)
482 .reshape(vec![(end - start) as i32, 1]);
483 let adv_mb = adv_t
484 .slice_view(start, 1, end - start)
485 .reshape(vec![(end - start) as i32, 1]);
486 let a_slice = &actions_vec[start..end];
487
488 // Zero grads
489 {
490 let mut ps = actor.parameters();
491 actor_opt.zero_grad(&mut ps);
492 }
493 {
494 let mut ps = critic.parameters();
495 critic_opt.zero_grad(&mut ps);
496 }
497
498 // Forward
499 let logits_mb = actor.forward(&s_mb); // [B,A]
500 let new_logp_mb = log_prob_actions(&logits_mb, a_slice, end - start, action_dim); // [B,1]
501 let ratio = ratio_from_logps(&new_logp_mb, &oldlp_mb);
502 let ratio_clipped = clamp_ratio(&ratio, clip_eps);
503 let pg1 = ratio.mul_tensor(&adv_mb);
504 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
505 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
506 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
507 let actor_loss = actor_min.mul_scalar(-1.0).mean();
508
509 let v_pred = critic.forward(&s_mb);
510 let v_loss = v_pred
511 .sub_tensor(&ret_mb)
512 .pow_scalar(2.0)
513 .mean()
514 .mul_scalar(vf_coef);
515
516 // Entropy bonus from logits (categorical entropy) ≈ -sum p*logp
517 let probs_mb = logits_mb.softmax(1);
518 let logp_all = probs_mb.add_scalar(1e-8).log();
519 let ent = probs_mb
520 .mul_tensor(&logp_all)
521 .sum_dims(&[1], true)
522 .mul_scalar(-1.0)
523 .mean()
524 .mul_scalar(ent_coef);
525
526 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&ent);
527 loss.backward(None);
528
529 // Step actor
530 {
531 let params = actor.parameters();
532 let mut with_grads: Vec<&mut Tensor> = Vec::new();
533 for p in params {
534 if p.grad_owned().is_some() {
535 with_grads.push(p);
536 }
537 }
538 if !with_grads.is_empty() {
539 let _ = grad_global_norm(&mut with_grads);
540 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
541 actor_opt.step(&mut with_grads);
542 actor_opt.zero_grad(&mut with_grads);
543 }
544 }
545
546 // Step critic
547 {
548 let params = critic.parameters();
549 let mut with_grads: Vec<&mut Tensor> = Vec::new();
550 for p in params {
551 if p.grad_owned().is_some() {
552 with_grads.push(p);
553 }
554 }
555 if !with_grads.is_empty() {
556 let _ = grad_global_norm(&mut with_grads);
557 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
558 critic_opt.step(&mut with_grads);
559 critic_opt.zero_grad(&mut with_grads);
560 }
561 }
562
563 if e == 0 && mb == 0 {
564 println!(
565 "update@t={} | actor_loss={:.4} v_loss={:.4}",
566 t,
567 actor_loss.value(),
568 v_loss.value()
569 );
570 }
571
572 clear_all_graphs_known();
573 }
574 }
575 }
576
577 println!("=== PPO discrete training finished ===");
578 Ok(())
579}
99 fn new(state_dim: usize, action_dim: usize, seed: Option<u64>) -> Self {
100 let net = Mlp::new(&[state_dim, 64, 64, action_dim], seed);
101 let log_std = Tensor::from_slice(&vec![0.0; action_dim], vec![action_dim])
102 .unwrap()
103 .with_requires_grad();
104 Self { net, log_std }
105 }
106 fn forward(&self, state: &Tensor) -> (Tensor, Tensor) {
107 // Returns (mean [B, A], log_std [A])
108 let mean = self.net.forward(state);
109 (
110 mean,
111 self.log_std
112 .view(vec![1, self.log_std.shape().dims()[0] as i32]),
113 )
114 }
115 fn parameters(&mut self) -> Vec<&mut Tensor> {
116 let mut ps = self.net.parameters();
117 ps.push(&mut self.log_std);
118 ps
119 }
120}
121
122// -------------------------------
123// Critic: value function V(s)
124// -------------------------------
125
126struct Critic {
127 net: Mlp,
128}
129impl Critic {
130 fn new(state_dim: usize, seed: Option<u64>) -> Self {
131 Self {
132 net: Mlp::new(&[state_dim, 64, 64, 1], seed),
133 }
134 }
135 fn forward(&self, state: &Tensor) -> Tensor {
136 self.net.forward(state)
137 }
138 fn parameters(&mut self) -> Vec<&mut Tensor> {
139 self.net.parameters()
140 }
141}
142
143// -------------------------------
144// Continuous YardEnv (same dynamics as TD3 env)
145// -------------------------------
146
147struct YardEnv {
148 pos: f32,
149 vel: f32,
150 steps: usize,
151 max_steps: usize,
152 rng: SmallRng,
153}
154impl YardEnv {
155 fn new(seed: u64) -> Self {
156 let mut e = Self {
157 pos: 0.0,
158 vel: 0.0,
159 steps: 0,
160 max_steps: 200,
161 rng: SmallRng::new(seed),
162 };
163 e.reset();
164 e
165 }
166 fn reset(&mut self) -> Tensor {
167 self.pos = (self.rng.next_f32() * 1.0) - 0.5;
168 self.vel = (self.rng.next_f32() * 0.2) - 0.1;
169 self.steps = 0;
170 self.state_tensor()
171 }
172 fn state_tensor(&self) -> Tensor {
173 Tensor::from_slice(&[self.pos, self.vel, 0.0], vec![1, 3]).unwrap()
174 }
175 fn step(&mut self, action_value: f32) -> (Tensor, f32, bool) {
176 let a = action_value.clamp(-1.0, 1.0);
177 self.vel += 0.1 * a - 0.01 * self.pos;
178 self.pos += self.vel;
179 self.steps += 1;
180 let reward = -(self.pos * self.pos) - 0.1 * (a * a);
181 let done = self.pos.abs() > 3.0 || self.steps >= self.max_steps;
182 (self.state_tensor(), reward, done)
183 }
184}
185
186// -------------------------------
187// Trajectory storage
188// -------------------------------
189
190struct RolloutBatch {
191 states: Vec<f32>,
192 actions: Vec<f32>,
193 log_probs: Vec<f32>,
194 rewards: Vec<f32>,
195 dones: Vec<f32>,
196 values: Vec<f32>,
197 next_states: Vec<f32>,
198 _state_dim: usize,
199}
200impl RolloutBatch {
201 fn new(capacity: usize, state_dim: usize) -> Self {
202 Self {
203 states: Vec::with_capacity(capacity * state_dim),
204 actions: Vec::with_capacity(capacity),
205 log_probs: Vec::with_capacity(capacity),
206 rewards: Vec::with_capacity(capacity),
207 dones: Vec::with_capacity(capacity),
208 values: Vec::with_capacity(capacity),
209 next_states: Vec::with_capacity(capacity * state_dim),
210 _state_dim: state_dim,
211 }
212 }
213
214 #[allow(clippy::too_many_arguments)]
215 fn push(&mut self, s: &[f32], a: f32, lp: f32, r: f32, d: f32, v: f32, s2: &[f32]) {
216 self.states.extend_from_slice(s);
217 self.actions.push(a);
218 self.log_probs.push(lp);
219 self.rewards.push(r);
220 self.dones.push(d);
221 self.values.push(v);
222 self.next_states.extend_from_slice(s2);
223 }
224
225 fn len(&self) -> usize {
226 self.actions.len()
227 }
228}
229
230// -------------------------------
231// Math helpers
232// -------------------------------
233
234fn gaussian_log_prob(action: &Tensor, mean: &Tensor, log_std: &Tensor) -> Tensor {
235 // All tensors shaped [B, A] (log_std is broadcastable)
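    // Per-dimension Gaussian log-density:
    //   log N(a | mean, std^2) = -0.5 * ((a - mean)^2 / var + 2 * log_std + ln(2*pi))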
236 let std = log_std.exp();
237 let var = std.pow_scalar(2.0);
238 let log_scale = log_std;
239 let diff = action.sub_tensor(mean);
240 let log_prob = diff
241 .pow_scalar(2.0)
242 .div_tensor(&var)
243 .add_scalar((2.0 * std::f32::consts::PI).ln())
244 .add_tensor(&log_scale.mul_scalar(2.0))
245 .mul_scalar(0.5)
246 .mul_scalar(-1.0);
247 // Sum across action dim (dim=1) -> [B,1]
248 log_prob.sum_dims(&[1], true)
249}
250
251#[allow(clippy::too_many_arguments)]
252fn compute_gae(
253 returns_out: &mut [f32],
254 adv_out: &mut [f32],
255 rewards: &[f32],
256 dones: &[f32],
257 values: &[f32],
258 next_values: &[f32],
259 gamma: f32,
260 lam: f32,
261) {
262 let n = rewards.len();
263 let mut gae = 0.0f32;
264 for t in (0..n).rev() {
265 let not_done = 1.0 - dones[t];
266 let delta = rewards[t] + gamma * next_values[t] * not_done - values[t];
267 gae = delta + gamma * lam * not_done * gae;
268 adv_out[t] = gae;
269 returns_out[t] = gae + values[t];
270 }
271}
272
273fn normalize_in_place(x: &mut [f32], eps: f32) {
274 let n = x.len() as f32;
275 if n <= 1.0 {
276 return;
277 }
278 let mean = x.iter().copied().sum::<f32>() / n;
279 let var = x
280 .iter()
281 .map(|v| {
282 let d = v - mean;
283 d * d
284 })
285 .sum::<f32>()
286 / n;
287 let std = (var + eps).sqrt();
288 for v in x.iter_mut() {
289 *v = (*v - mean) / std;
290 }
291}
292
293fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
294 let mut total_sq = 0.0f32;
295 for p in parameters.iter() {
296 if let Some(g) = p.grad_owned() {
297 for &v in g.data() {
298 total_sq += v * v;
299 }
300 }
301 }
302 let norm = total_sq.sqrt();
303 if norm > max_norm {
304 let scale = max_norm / (norm + eps);
305 for p in parameters.iter_mut() {
306 if let Some(g) = p.grad_owned() {
307 p.set_grad(g.mul_scalar(scale));
308 }
309 }
310 }
311}
312
313fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
314 let mut total_sq = 0.0f32;
315 for p in parameters.iter_mut() {
316 if let Some(g) = p.grad_owned() {
317 for &v in g.data() {
318 total_sq += v * v;
319 }
320 }
321 }
322 total_sq.sqrt()
323}
324
325// -------------------------------
326// Main
327// -------------------------------
328
329pub fn main() -> Result<(), Box<dyn std::error::Error>> {
330 println!("=== PPO Continuous Example (YardEnv) ===");
331
332 let state_dim = 3usize;
333 let action_dim = 1usize;
334
335 // Hparams
336 let total_steps = std::env::var("PPO_STEPS")
337 .ok()
338 .and_then(|v| v.parse::<usize>().ok())
339 .unwrap_or(4000usize);
340 let horizon = 128usize; // rollout length per update
341 let epochs = 4usize; // PPO epochs per update
342 let mini_batch_size = 64usize; // minibatch from horizon
343 let gamma = 0.99f32;
344 let lam = 0.95f32; // GAE lambda
345 let clip_eps = 0.2f32;
346 let vf_coef = 0.5f32;
347 let ent_coef = 0.0f32;
348 let max_grad_norm = 1.0f32;
349
350 // Models
351 let mut actor = Actor::new(state_dim, action_dim, Some(101));
352 let mut critic = Critic::new(state_dim, Some(202));
353
354 // Opts
355 let mut actor_opt = Adam::with_learning_rate(3e-4);
356 for p in actor.parameters() {
357 actor_opt.add_parameter(p);
358 }
359 let mut critic_opt = Adam::with_learning_rate(3e-4);
360 for p in critic.parameters() {
361 critic_opt.add_parameter(p);
362 }
363
364 // Env and RNG
365 let mut env = YardEnv::new(42);
366 let mut rng = SmallRng::new(999);
367 let mut state = env.reset();
368
369 // Metrics
370 let mut episode_return = 0.0f32;
371 let mut episode = 0usize;
372 let mut ema_return: Option<f32> = None;
373 let ema_alpha = 0.05f32;
374 let mut best_return = f32::NEG_INFINITY;
375
376 let mut t = 0usize;
377 while t < total_steps {
378 // Collect a rollout
379 let mut batch = RolloutBatch::new(horizon, state_dim);
380 for _ in 0..horizon {
381 // Policy forward; the action is sampled on the CPU so sampling itself does not grow the autograd graph (the stored log_probs are reused during updates)
382 let (mean, log_std_row) = actor.forward(&state);
383 let mean_v = mean.data()[0];
384 let log_std_v = log_std_row.data()[0];
385 let std_v = log_std_v.exp();
386 let noise = rng.normal();
387 let action_v = (mean_v + std_v * noise).clamp(-1.0, 1.0);
388
389 // Build action tensor [1, A] for log_prob calculation with autograd
390 let action_t = Tensor::from_slice(&[action_v], vec![1, action_dim]).unwrap();
391 let log_prob_t = gaussian_log_prob(&action_t, &mean, &log_std_row);
392 let log_prob_v = log_prob_t.data()[0];
393
394 // Step env
395 let (next_state, reward, done) = env.step(action_v);
396 episode_return += reward;
397
398 // Value
399 let value_t = critic.forward(&state);
400 let value_v = value_t.data()[0];
401
402 // Push
403 batch.push(
404 state.data(),
405 action_v,
406 log_prob_v,
407 reward,
408 if done { 1.0 } else { 0.0 },
409 value_v,
410 next_state.data(),
411 );
412
413 // Reset
414 state = if done {
415 let st = env.reset();
416 ema_return = Some(match ema_return {
417 None => episode_return,
418 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
419 });
420 if episode_return > best_return {
421 best_return = episode_return;
422 }
423 println!(
424 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
425 t,
426 episode,
427 episode_return,
428 ema_return.unwrap_or(episode_return),
429 best_return
430 );
431 episode_return = 0.0;
432 episode += 1;
433 st
434 } else {
435 next_state
436 };
437
438 t += 1;
439 if t >= total_steps {
440 break;
441 }
442 }
443
444 // Bootstrap next values for GAE
445 let next_values: Vec<f32> = {
446 let mut out = Vec::with_capacity(batch.len());
447 for i in 0..batch.len() {
448 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
449 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
450 let v2 = critic.forward(&s2_t).data()[0];
451 out.push(v2);
452 }
453 out
454 };
455
456 // Compute returns and advantages
457 let mut returns = vec![0.0f32; batch.len()];
458 let mut adv = vec![0.0f32; batch.len()];
459 compute_gae(
460 &mut returns,
461 &mut adv,
462 &batch.rewards,
463 &batch.dones,
464 &batch.values,
465 &next_values,
466 gamma,
467 lam,
468 );
469 normalize_in_place(&mut adv, 1e-8);
470
471 // Prepare tensors for training
472 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
473 let actions_t = Tensor::from_slice(&batch.actions, vec![batch.len(), action_dim]).unwrap();
474 let old_logp_t = Tensor::from_slice(&batch.log_probs, vec![batch.len(), 1]).unwrap();
475 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
476 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
477
478 // PPO epochs over the rollout
479 let num_minibatches = batch.len().div_ceil(mini_batch_size);
480 for e in 0..epochs {
481 for mb in 0..num_minibatches {
482 let start = mb * mini_batch_size;
483 let end = (start + mini_batch_size).min(batch.len());
484 if start >= end {
485 break;
486 }
487
488 // Slice views
489 let s_mb = states_t.slice_view(start * state_dim, 1, (end - start) * state_dim);
490 let s_mb = s_mb.reshape(vec![(end - start) as i32, state_dim as i32]);
491 let a_mb = actions_t
492 .slice_view(start * action_dim, 1, (end - start) * action_dim)
493 .reshape(vec![(end - start) as i32, action_dim as i32]);
494 let oldlp_mb = old_logp_t
495 .slice_view(start, 1, end - start)
496 .reshape(vec![(end - start) as i32, 1]);
497 let ret_mb = returns_t
498 .slice_view(start, 1, end - start)
499 .reshape(vec![(end - start) as i32, 1]);
500 let adv_mb = adv_t
501 .slice_view(start, 1, end - start)
502 .reshape(vec![(end - start) as i32, 1]);
503
504 // Zero grads
505 {
506 let mut ps = actor.parameters();
507 actor_opt.zero_grad(&mut ps);
508 }
509 {
510 let mut ps = critic.parameters();
511 critic_opt.zero_grad(&mut ps);
512 }
513
514 // Forward actor and critic
515 let (mean_mb, log_std_row) = actor.forward(&s_mb);
516 let logp_mb = gaussian_log_prob(&a_mb, &mean_mb, &log_std_row);
517 let ratio = logp_mb.sub_tensor(&oldlp_mb).exp(); // exp(new-old)
518 let clip_low =
519 Tensor::from_slice(&vec![1.0 - clip_eps; end - start], vec![end - start, 1])
520 .unwrap();
521 let clip_high =
522 Tensor::from_slice(&vec![1.0 + clip_eps; end - start], vec![end - start, 1])
523 .unwrap();
524 // ratio_clipped = min(max(ratio, low), high) using ReLU identities
525 let ratio_ge_low = ratio.sub_tensor(&clip_low).relu().add_tensor(&clip_low);
526 let ratio_clipped =
527 clip_high.sub_tensor(&ratio_ge_low.sub_tensor(&clip_high).relu());
528 let pg1 = ratio.mul_tensor(&adv_mb);
529 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
530 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
531 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
532 let actor_loss = actor_min.mul_scalar(-1.0).mean();
533
534 let v_pred = critic.forward(&s_mb);
535 let v_loss = v_pred
536 .sub_tensor(&ret_mb)
537 .pow_scalar(2.0)
538 .mean()
539 .mul_scalar(vf_coef);
540
541 // Entropy (approx Gaussian entropy per action)
542 let entropy = log_std_row
543 .add_scalar(0.5 * (2.0 * std::f32::consts::PI * std::f32::consts::E).ln())
544 .sum_dims(&[1], true)
545 .mean()
546 .mul_scalar(ent_coef);
547
548 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&entropy);
549 loss.backward(None);
550
551 // Step actor
552 {
553 let params = actor.parameters();
554 let mut with_grads: Vec<&mut Tensor> = Vec::new();
555 for p in params {
556 if p.grad_owned().is_some() {
557 with_grads.push(p);
558 }
559 }
560 if !with_grads.is_empty() {
561 let _ = grad_global_norm(&mut with_grads);
562 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
563 actor_opt.step(&mut with_grads);
564 actor_opt.zero_grad(&mut with_grads);
565 }
566 }
567
568 // Step critic
569 {
570 let params = critic.parameters();
571 let mut with_grads: Vec<&mut Tensor> = Vec::new();
572 for p in params {
573 if p.grad_owned().is_some() {
574 with_grads.push(p);
575 }
576 }
577 if !with_grads.is_empty() {
578 let _ = grad_global_norm(&mut with_grads);
579 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
580 critic_opt.step(&mut with_grads);
581 critic_opt.zero_grad(&mut with_grads);
582 }
583 }
584
585 // Occasionally log
586 if e == 0 && mb == 0 {
587 println!(
588 "update@t={} | actor_loss={:.4} v_loss={:.4}",
589 t,
590 actor_loss.value(),
591 v_loss.value()
592 );
593 }
594
595 clear_all_graphs_known();
596 }
597 }
598 }
599
600 println!("=== PPO training finished ===");
601 Ok(())
602}
244 fn state_tensor(&self) -> Tensor {
245 // Normalize to keep critic inputs bounded:
246 // - Position is bounded by termination at |pos|>3 → scale by 3 to [-1,1]
247 // - Velocity scaled by 1.0 and clamped to [-1,1]
248 let pos_n = self.pos / 3.0;
249 let vel_n = self.vel.clamp(-1.0, 1.0);
250 Tensor::from_slice(&[pos_n, vel_n, 0.0], vec![1, 3]).unwrap()
251 }
252
253 fn step(&mut self, action_value: f32) -> (Tensor, f32, bool) {
254 let a = action_value.clamp(-1.0, 1.0);
255 self.vel += 0.1 * a - 0.01 * self.pos;
256 self.pos += self.vel;
257 self.steps += 1;
258
259 let reward = -(self.pos * self.pos) - 0.1 * (a * a);
260 let done = self.pos.abs() > 3.0 || self.steps >= self.max_steps;
261 (self.state_tensor(), reward, done)
262 }
263}
264
265// -------------------------------
266// Replay Buffer
267// -------------------------------
268
269struct ReplayBuffer {
270 capacity: usize,
271 size: usize,
272 pos: usize,
273 state_dim: usize,
274 action_dim: usize,
275 states: Vec<f32>,
276 actions: Vec<f32>,
277 rewards: Vec<f32>,
278 dones: Vec<f32>,
279 next_states: Vec<f32>,
280}
281
282impl ReplayBuffer {
283 fn new(capacity: usize, state_dim: usize, action_dim: usize) -> Self {
284 Self {
285 capacity,
286 size: 0,
287 pos: 0,
288 state_dim,
289 action_dim,
290 states: vec![0.0; capacity * state_dim],
291 actions: vec![0.0; capacity * action_dim],
292 rewards: vec![0.0; capacity],
293 dones: vec![0.0; capacity],
294 next_states: vec![0.0; capacity * state_dim],
295 }
296 }
297
298 fn push(&mut self, s: &[f32], a: &[f32], r: f32, d: f32, s2: &[f32]) {
299 let i = self.pos;
300 let so = i * self.state_dim;
301 let ao = i * self.action_dim;
302 self.states[so..so + self.state_dim].copy_from_slice(s);
303 self.actions[ao..ao + self.action_dim].copy_from_slice(a);
304 self.rewards[i] = r;
305 self.dones[i] = d;
306 self.next_states[so..so + self.state_dim].copy_from_slice(s2);
307
308 self.pos = (self.pos + 1) % self.capacity;
309 self.size = self.size.saturating_add(1).min(self.capacity);
310 }
311
312 fn can_sample(&self, batch_size: usize) -> bool {
313 self.size >= batch_size
314 }
315
316 fn sample(
317 &self,
318 batch_size: usize,
319 rng: &mut SmallRng,
320 ) -> (Tensor, Tensor, Tensor, Tensor, Tensor) {
321 let mut s_vec = Vec::with_capacity(batch_size * self.state_dim);
322 let mut a_vec = Vec::with_capacity(batch_size * self.action_dim);
323 let mut r_vec = Vec::with_capacity(batch_size);
324 let mut d_vec = Vec::with_capacity(batch_size);
325 let mut s2_vec = Vec::with_capacity(batch_size * self.state_dim);
326
327 for _ in 0..batch_size {
328 let idx = rng.sample_index(self.size);
329 let so = idx * self.state_dim;
330 let ao = idx * self.action_dim;
331 s_vec.extend_from_slice(&self.states[so..so + self.state_dim]);
332 a_vec.extend_from_slice(&self.actions[ao..ao + self.action_dim]);
333 r_vec.push(self.rewards[idx]);
334 d_vec.push(self.dones[idx]);
335 s2_vec.extend_from_slice(&self.next_states[so..so + self.state_dim]);
336 }
337
338 let s = Tensor::from_slice(&s_vec, vec![batch_size, self.state_dim]).unwrap();
339 let a = Tensor::from_slice(&a_vec, vec![batch_size, self.action_dim]).unwrap();
340 let r = Tensor::from_slice(&r_vec, vec![batch_size, 1]).unwrap();
341 let d = Tensor::from_slice(&d_vec, vec![batch_size, 1]).unwrap();
342 let s2 = Tensor::from_slice(&s2_vec, vec![batch_size, self.state_dim]).unwrap();
343 (s, a, r, d, s2)
344 }
345}
346
347// -------------------------------
348// Helper: gradient clipping by global norm
349// -------------------------------
350
351fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
352 // Compute global L2 norm of all grads
353 let mut total_sq = 0.0f32;
354 for p in parameters.iter() {
355 if let Some(g) = p.grad_owned() {
356 for &v in g.data() {
357 total_sq += v * v;
358 }
359 }
360 }
361 let norm = total_sq.sqrt();
362 if norm > max_norm {
363 let scale = max_norm / (norm + eps);
364 for p in parameters.iter_mut() {
365 if let Some(g) = p.grad_owned() {
366 let scaled = g.mul_scalar(scale);
367 p.set_grad(scaled);
368 }
369 }
370 }
371}
372
373// Compute global L2 norm of gradients across a parameter list (read-only)
374fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
375 let mut total_sq = 0.0f32;
376 for p in parameters.iter_mut() {
377 if let Some(g) = p.grad_owned() {
378 for &v in g.data() {
379 total_sq += v * v;
380 }
381 }
382 }
383 total_sq.sqrt()
384}
385
386// Compute L2 norm of parameters (weights/biases) across a parameter list
387fn params_l2_norm(parameters: &mut [&mut Tensor]) -> f32 {
388 let _ng = NoGradTrack::new();
389 let mut total_sq = 0.0f32;
390 for p in parameters.iter_mut() {
391 for &v in p.data() {
392 total_sq += v * v;
393 }
394 }
395 total_sq.sqrt()
396}
397
398// -------------------------------
399// Main: TD3 training on YardEnv
400// -------------------------------
401
402pub fn main() -> Result<(), Box<dyn std::error::Error>> {
403 println!("=== TD3 Example (YardEnv) ===");
404
405 // Environment / problem dims
406 let state_dim = 3usize;
407 let action_dim = 1usize;
408
409 // Hyperparameters (small for demo)
410 let gamma = 0.99f32;
411 let tau = 0.005f32; // Polyak
412 let policy_noise = 0.2f32; // target smoothing noise stddev
413 let exploration_noise = 0.1f32; // behavior policy noise stddev
414 let policy_delay = 2usize;
415 let batch_size = 64usize;
416 let start_steps = 500usize; // random exploration steps
417 let total_steps = 1500usize;
418 let max_grad_norm = 1.0f32;
419
420 // Models
421 let mut actor = Actor::new(state_dim, action_dim, Some(11));
422 let mut actor_targ = Actor::new(state_dim, action_dim, Some(12));
423 actor_targ.net.copy_from(&actor.net);
424 actor_targ.set_requires_grad_all(false);
425
426 let mut critic1 = Critic::new(state_dim, action_dim, Some(21));
427 let mut critic2 = Critic::new(state_dim, action_dim, Some(22));
428 let mut critic1_targ = Critic::new(state_dim, action_dim, Some(23));
429 let mut critic2_targ = Critic::new(state_dim, action_dim, Some(24));
430 critic1_targ.net.copy_from(&critic1.net);
431 critic2_targ.net.copy_from(&critic2.net);
432 critic1_targ.set_requires_grad_all(false);
433 critic2_targ.set_requires_grad_all(false);
434
435 // Optimizers
436 let mut actor_opt = Adam::with_learning_rate(1e-3);
437 for p in actor.parameters() {
438 actor_opt.add_parameter(p);
439 }
440
441 let mut critic_opt = Adam::with_learning_rate(1e-4);
442 for p in critic1.parameters() {
443 critic_opt.add_parameter(p);
444 }
445 for p in critic2.parameters() {
446 critic_opt.add_parameter(p);
447 }
448
449 // Replay buffer and env
450 let mut rb = ReplayBuffer::new(100_000, state_dim, action_dim);
451 let mut env = YardEnv::new(1234);
452 let mut rng = SmallRng::new(987654321);
453
454 // Reset & metric trackers
455 let mut state = env.reset(); // [1, state_dim]
456 let mut episode_return = 0.0f32;
457 let mut episode = 0usize;
458 let mut ema_return: Option<f32> = None;
459 let ema_alpha = 0.05f32; // smooth short-term
460 let mut best_return = f32::NEG_INFINITY;
461 let mut policy_updates: usize = 0;
462
463 for t in 0..total_steps {
464 // Select action
465 let action_tensor = if t < start_steps {
466 let a = rng.uniform(-1.0, 1.0);
467 Tensor::from_slice(&[a], vec![1, action_dim]).unwrap()
468 } else {
469 // Behavior policy with exploration noise
470 let _ng = NoGradTrack::new();
471 let det = actor.forward(&state);
472 let noise = Tensor::randn(vec![1, action_dim], None).mul_scalar(exploration_noise);
473 tanh_bounded(&det.add_tensor(&noise))
474 };
475 let action_value = action_tensor.data()[0];
476
477 // Environment step
478 let (next_state, reward, done) = env.step(action_value);
479 episode_return += reward;
480
481 // Store transition
482 let s_slice = state.data().to_vec();
483 let a_slice = action_tensor.data().to_vec();
484 let s2_slice = next_state.data().to_vec();
485 rb.push(
486 &s_slice,
487 &a_slice,
488 reward,
489 if done { 1.0 } else { 0.0 },
490 &s2_slice,
491 );
492
493 state = if done {
494 let st = env.reset();
495 // Metrics: update EMA and best
496 ema_return = Some(match ema_return {
497 None => episode_return,
498 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
499 });
500 if episode_return > best_return {
501 best_return = episode_return;
502 }
503 println!(
504 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3} | rb_size={} | policy_updates={}",
505 t,
506 episode,
507 episode_return,
508 ema_return.unwrap_or(episode_return),
509 best_return,
510 rb.size,
511 policy_updates
512 );
513 episode_return = 0.0;
514 episode += 1;
515 st
516 } else {
517 next_state
518 };
519
520 // Training
521 if rb.can_sample(batch_size) {
522 // Sample batch
523 let (s, a, r, d, s2) = rb.sample(batch_size, &mut rng);
524
525 // Compute target values y = r + (1-d)*gamma*min(Q1', Q2') using target networks (no grad)
526 let target_q = {
527 let _ng = NoGradTrack::new();
528 // Target actions with smoothing noise (tanh bounds)
529 let noise =
530 Tensor::randn(vec![batch_size, action_dim], None).mul_scalar(policy_noise);
531 let a_targ = tanh_bounded(&actor_targ.forward(&s2).add_tensor(&noise));
532 let q1_t = critic1_targ.forward(&s2, &a_targ);
533 let q2_t = critic2_targ.forward(&s2, &a_targ);
534
535 // Elementwise min via data() since this path is no-grad
536 let q1d = q1_t.data();
537 let q2d = q2_t.data();
538 let mut min_vec = Vec::with_capacity(batch_size);
539 for i in 0..batch_size {
540 let v1 = q1d[i];
541 let v2 = q2d[i];
542 min_vec.push(v1.min(v2));
543 }
544 let min_q = Tensor::from_slice(&min_vec, vec![batch_size, 1]).unwrap();
545 let not_done = Tensor::ones(vec![batch_size, 1]).sub_tensor(&d);
546 r.add_tensor(&not_done.mul_scalar(gamma).mul_tensor(&min_q))
547 };
548
549 // Critic update (both critics)
550 // Zero grads in a short scope, then drop borrows before forward
551 {
552 let mut params = {
553 let c_params = critic1.parameters();
554 let c2_params = critic2.parameters();
555 let mut tmp: Vec<&mut Tensor> = Vec::new();
556 tmp.extend(c_params);
557 tmp.extend(c2_params);
558 tmp
559 };
560 critic_opt.zero_grad(&mut params);
561 }
562
563 // Forward current Q estimates
564 let q1 = critic1.forward(&s, &a);
565 let q2 = critic2.forward(&s, &a);
566 let diff1 = q1.sub_tensor(&target_q);
567 let diff2 = q2.sub_tensor(&target_q);
568 let mut critic_loss = diff1
569 .pow_scalar(2.0)
570 .mean()
571 .add_tensor(&diff2.pow_scalar(2.0).mean());
572
573 // Backward
574 critic_loss.backward(None);
575
576 // Optional gradient clipping + step (only for params that received grads)
577 {
578 let params = {
579 let c_params = critic1.parameters();
580 let c2_params = critic2.parameters();
581 let mut tmp: Vec<&mut Tensor> = Vec::new();
582 tmp.extend(c_params);
583 tmp.extend(c2_params);
584 tmp
585 };
586 let mut with_grads: Vec<&mut Tensor> = Vec::new();
587 for p in params {
588 if p.grad_owned().is_some() {
589 with_grads.push(p);
590 }
591 }
592 if !with_grads.is_empty() {
593 // Pre-step metrics
594 let grad_norm_before = grad_global_norm(&mut with_grads);
595 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
596 critic_opt.step(&mut with_grads);
597 critic_opt.zero_grad(&mut with_grads);
598
599 // Post-step metrics (param norm)
600 let mut for_norm_params = {
601 let c_params = critic1.parameters();
602 let c2_params = critic2.parameters();
603 let mut tmp: Vec<&mut Tensor> = Vec::new();
604 tmp.extend(c_params);
605 tmp.extend(c2_params);
606 tmp
607 };
608 let param_norm = params_l2_norm(&mut for_norm_params);
609
610 // Print compact critic metrics occasionally
611 if t % 100 == 0 {
612 let q1_mean = q1.mean().value();
613 let q2_mean = q2.mean().value();
614 let tq_mean = target_q.mean().value();
615 println!(
616 "t={:5} | critic_loss={:.4} | q1_mean={:.3} q2_mean={:.3} tq_mean={:.3} | grad_norm={:.3} | crit_param_norm={:.3}",
617 t,
618 critic_loss.value(),
619 q1_mean,
620 q2_mean,
621 tq_mean,
622 grad_norm_before,
623 param_norm
624 );
625 }
626 }
627 }
628
629 // Delayed policy update
630 if t % policy_delay == 0 {
631 // Actor update: maximize Q1(s, actor(s)) -> minimize -Q1
632 // Zero actor grads before backward
633 {
634 let mut a_params: Vec<&mut Tensor> = actor.parameters();
635 actor_opt.zero_grad(&mut a_params);
636 }
637
638 let a_pred = actor.forward(&s);
639 let q_for_actor = critic1.forward(&s, &a_pred);
640 let mut actor_loss = q_for_actor.mul_scalar(-1.0).mean();
641 actor_loss.backward(None);
642
643 {
644 let a_params: Vec<&mut Tensor> = actor.parameters();
645 let mut with_grads: Vec<&mut Tensor> = Vec::new();
646 for p in a_params {
647 if p.grad_owned().is_some() {
648 with_grads.push(p);
649 }
650 }
651 if !with_grads.is_empty() {
652 let grad_norm_before = grad_global_norm(&mut with_grads);
653 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
654 actor_opt.step(&mut with_grads);
655 actor_opt.zero_grad(&mut with_grads);
656
657 // Post-step param norm
658 let mut for_norm_params = actor.parameters();
659 let param_norm = params_l2_norm(&mut for_norm_params);
660
661 policy_updates += 1;
662 if t % 200 == 0 {
663 println!(
664 "t={:5} | actor_loss={:.4} | act_grad_norm={:.3} | act_param_norm={:.3} | lr_a={:.4e} lr_c={:.4e} | policy_updates={}",
665 t,
666 actor_loss.value(),
667 grad_norm_before,
668 param_norm,
669 actor_opt.learning_rate(),
670 critic_opt.learning_rate(),
671 policy_updates
672 );
673 }
674 }
675 }
676
677 // Target updates (Polyak averaging, no grad)
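                // Polyak update rule: theta_target <- (1 - tau) * theta_target + tau * theta_online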
678 actor_targ.net.soft_update_from(&actor.net, tau);
679 critic1_targ.net.soft_update_from(&critic1.net, tau);
680 critic2_targ.net.soft_update_from(&critic2.net, tau);
681 }
682
683 // Clear entire graphs to avoid stale accumulation across iterations
684 clear_all_graphs_known();
685 }
686 }
687
688 println!("=== TD3 training finished ===");
689 Ok(())
690}
46fn demonstrate_basic_operators() {
47 println!("--- Basic Tensor-Tensor Operators ---");
48
49 let a = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
50 let b = Tensor::from_slice(&[5.0, 6.0, 7.0, 8.0], vec![2, 2]).unwrap();
51
52 println!("Tensor A: {:?}", a.data());
53 println!("Tensor B: {:?}", b.data());
54
55 // Addition
56 let c = &a + &b;
57 println!("A + B: {:?}", c.data());
58
59 // Subtraction
60 let d = &a - &b;
61 println!("A - B: {:?}", d.data());
62
63 // Multiplication
64 let e = &a * &b;
65 println!("A * B: {:?}", e.data());
66
67 // Division
68 let f = &a / &b;
69 println!("A / B: {:?}", f.data());
70}
71
72/// Demonstrate tensor-scalar operators
73fn demonstrate_scalar_operators() {
74 println!("\n--- Tensor-Scalar Operators ---");
75
76 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
77 println!("Original tensor: {:?}", tensor.data());
78
79 // Tensor + scalar
80 let result1 = &tensor + 5.0;
81 println!("Tensor + 5.0: {:?}", result1.data());
82
83 // Scalar + tensor
84 let result2 = 5.0 + &tensor;
85 println!("5.0 + Tensor: {:?}", result2.data());
86
87 // Tensor - scalar
88 let result3 = &tensor - 2.0;
89 println!("Tensor - 2.0: {:?}", result3.data());
90
91 // Tensor * scalar
92 let result4 = &tensor * 3.0;
93 println!("Tensor * 3.0: {:?}", result4.data());
94
95 // Scalar * tensor
96 let result5 = 3.0 * &tensor;
97 println!("3.0 * Tensor: {:?}", result5.data());
98
99 // Tensor / scalar
100 let result6 = &tensor / 2.0;
101 println!("Tensor / 2.0: {:?}", result6.data());
102}
103
104/// Demonstrate assignment operators
105fn demonstrate_operator_assignment() {
106 println!("\n--- Assignment Operators ---");
107
108 let mut tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
109 println!("Original tensor: {:?}", tensor.data());
110
111 // In-place addition
112 tensor += 5.0;
113 println!("After += 5.0: {:?}", tensor.data());
114
115 // In-place subtraction
116 tensor -= 2.0;
117 println!("After -= 2.0: {:?}", tensor.data());
118
119 // In-place multiplication
120 tensor *= 3.0;
121 println!("After *= 3.0: {:?}", tensor.data());
122
123 // In-place division
124 tensor /= 2.0;
125 println!("After /= 2.0: {:?}", tensor.data());
126}
127
128/// Demonstrate operator chaining and complex expressions
129fn demonstrate_operator_chaining() {
130 println!("\n--- Operator Chaining ---");
131
132 let a = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
133 let b = Tensor::from_slice(&[5.0, 6.0, 7.0, 8.0], vec![2, 2]).unwrap();
134 let c = Tensor::from_slice(&[9.0, 10.0, 11.0, 12.0], vec![2, 2]).unwrap();
135
136 println!("Tensor A: {:?}", a.data());
137 println!("Tensor B: {:?}", b.data());
138 println!("Tensor C: {:?}", c.data());
139
140 // Complex expression: (A + B) * C - 5
141 let result = (&a + &b) * &c - 5.0;
142 println!("(A + B) * C - 5: {:?}", result.data());
143
144 // Another complex expression: A * 2 + B / 2
145 let result2 = &a * 2.0 + &b / 2.0;
146 println!("A * 2 + B / 2: {:?}", result2.data());
147
148 // Negation and addition: -A + B * C
149 let result3 = -&a + &b * &c;
150 println!("-A + B * C: {:?}", result3.data());
151
152 // Division with parentheses: (A + B) / (C - 1)
153 let result4 = (&a + &b) / (&c - 1.0);
154 println!("(A + B) / (C - 1): {:?}", result4.data());
155}
156
157/// Demonstrate broadcasting behavior
158fn demonstrate_broadcasting() {
159 println!("\n--- Broadcasting ---");
160
161 // 2D tensor
162 let tensor_2d = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
163 println!(
164 "2D tensor: shape {:?}, data: {:?}",
165 tensor_2d.shape().dims(),
166 tensor_2d.data()
167 );
168
169 // 1D tensor (will be broadcasted)
170 let tensor_1d = Tensor::from_slice(&[10.0, 20.0], vec![2]).unwrap();
171 println!(
172 "1D tensor: shape {:?}, data: {:?}",
173 tensor_1d.shape().dims(),
174 tensor_1d.data()
175 );
176
177 // Broadcasting addition
178 let broadcast_sum = &tensor_2d + &tensor_1d;
179 println!(
180 "Broadcast sum: shape {:?}, data: {:?}",
181 broadcast_sum.shape().dims(),
182 broadcast_sum.data()
183 );
184
185 // Broadcasting multiplication
186 let broadcast_mul = &tensor_2d * &tensor_1d;
187 println!(
188 "Broadcast multiplication: shape {:?}, data: {:?}",
189 broadcast_mul.shape().dims(),
190 broadcast_mul.data()
191 );
192
193 // Broadcasting with scalar
194 let broadcast_scalar = &tensor_2d + 100.0;
195 println!(
196 "Broadcast scalar: shape {:?}, data: {:?}",
197 broadcast_scalar.shape().dims(),
198 broadcast_scalar.data()
199 );
200}
201
202/// Demonstrate equivalence between operators and method calls
203fn demonstrate_method_equivalence() {
204 println!("\n--- Operator vs Method Call Equivalence ---");
205
206 let a = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
207 let b = Tensor::from_slice(&[5.0, 6.0, 7.0, 8.0], vec![2, 2]).unwrap();
208
209 // Addition: operator vs method
210 let operator_result = &a + &b;
211 let method_result = a.add_tensor(&b);
212
213 println!("A + B (operator): {:?}", operator_result.data());
214 println!("A.add_tensor(B): {:?}", method_result.data());
215 println!(
216 "Results are equal: {}",
217 operator_result.data() == method_result.data()
218 );
219
220 // Multiplication: operator vs method
221 let operator_result = &a * &b;
222 let method_result = a.mul_tensor(&b);
223
224 println!("A * B (operator): {:?}", operator_result.data());
225 println!("A.mul_tensor(B): {:?}", method_result.data());
226 println!(
227 "Results are equal: {}",
228 operator_result.data() == method_result.data()
229 );
230
231 // Scalar addition: operator vs method
232 let operator_result = &a + 5.0;
233 let method_result = a.add_scalar(5.0);
234
235 println!("A + 5.0 (operator): {:?}", operator_result.data());
236 println!("A.add_scalar(5.0): {:?}", method_result.data());
237 println!(
238 "Results are equal: {}",
239 operator_result.data() == method_result.data()
240 );
241}
169fn demonstrate_forward_pass() {
170 println!("\n--- Forward Pass (with gradients) ---");
171
172 let layer = LinearLayer::new(3, 2, Some(43));
173
174 // Single input
175 let input = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![1, 3]).unwrap();
176 let output = layer.forward(&input);
177
178 println!("Single input:");
179 println!(" Input: {:?}", input.data());
180 println!(" Output: {:?}", output.data());
181 println!(" Output requires grad: {}", output.requires_grad());
182
183 // Batch input
184 let batch_input = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
185 let batch_output = layer.forward(&batch_input);
186
187 println!("Batch input:");
188 println!(" Input shape: {:?}", batch_input.shape().dims());
189 println!(" Output shape: {:?}", batch_output.shape().dims());
190 println!(" Output requires grad: {}", batch_output.requires_grad());
191}
192
193/// Demonstrate forward pass without gradient tracking
194fn demonstrate_forward_pass_no_grad() {
195 println!("\n--- Forward Pass (no gradients) ---");
196
197 let layer = LinearLayer::new(3, 2, Some(44));
198
199 // Single input
200 let input = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![1, 3]).unwrap();
201 let output = layer.forward_no_grad(&input);
202
203 println!("Single input (no grad):");
204 println!(" Input: {:?}", input.data());
205 println!(" Output: {:?}", output.data());
206 println!(" Output requires grad: {}", output.requires_grad());
207
208 // Compare with grad version
209 let output_with_grad = layer.forward(&input);
210 println!("Comparison:");
211 println!(
212 " Same values: {}",
213 output.data() == output_with_grad.data()
214 );
215 println!(" No grad requires grad: {}", output.requires_grad());
216 println!(
217 " With grad requires grad: {}",
218 output_with_grad.requires_grad()
219 );
220}
221
222/// Demonstrate complete training loop
223fn demonstrate_training_loop() -> Result<(), Box<dyn std::error::Error>> {
224 println!("\n--- Training Loop ---");
225
226 // Create layer and training data
227 let mut layer = LinearLayer::new(2, 1, Some(45));
228
229 // Simple regression task: y = 2*x1 + 3*x2 + 1
230 let x_data = Tensor::from_slice(
231 &[
232 1.0, 1.0, // x1=1, x2=1 -> y=6
233 2.0, 1.0, // x1=2, x2=1 -> y=8
234 1.0, 2.0, // x1=1, x2=2 -> y=9
235 2.0, 2.0, // x1=2, x2=2 -> y=11
236 ],
237 vec![4, 2],
238 )
239 .unwrap();
240
241 let y_true = Tensor::from_slice(&[6.0, 8.0, 9.0, 11.0], vec![4, 1]).unwrap();
242
243 println!("Training data:");
244 println!(" X shape: {:?}", x_data.shape().dims());
245 println!(" Y shape: {:?}", y_true.shape().dims());
246 println!(" Target function: y = 2*x1 + 3*x2 + 1");
247
248 // Create optimizer
249 let config = AdamConfig {
250 learning_rate: 0.01,
251 beta1: 0.9,
252 beta2: 0.999,
253 eps: 1e-8,
254 weight_decay: 0.0,
255 amsgrad: false,
256 };
257
258 let mut optimizer = Adam::with_config(config);
259 let params = layer.parameters();
260 for param in &params {
261 optimizer.add_parameter(param);
262 }
263
264 println!("Optimizer setup complete. Starting training...");
265
266 // Training loop
267 let num_epochs = 100;
268 let mut losses = Vec::new();
269
270 for epoch in 0..num_epochs {
271 // Forward pass
272 let y_pred = layer.forward(&x_data);
273
274 // Compute loss: MSE
275 let diff = y_pred.sub_tensor(&y_true);
276 let mut loss = diff.pow_scalar(2.0).mean();
277
278 // Backward pass
279 loss.backward(None);
280
281 // Optimizer step
282 let mut params = layer.parameters();
283 optimizer.step(&mut params);
284 optimizer.zero_grad(&mut params);
285
286 losses.push(loss.value());
287
288 // Print progress
289 if epoch % 20 == 0 || epoch == num_epochs - 1 {
290 println!("Epoch {:3}: Loss = {:.6}", epoch, loss.value());
291 }
292 }
293
294 // Evaluate final model
295 let final_predictions = layer.forward_no_grad(&x_data);
296
297 println!("\nFinal model evaluation:");
298 println!(" Learned weights: {:?}", layer.weight.data());
299 println!(" Learned bias: {:?}", layer.bias.data());
300 println!(" Target weights: [2.0, 3.0]");
301 println!(" Target bias: [1.0]");
302
303 println!(" Predictions vs True:");
304 for i in 0..4 {
305 let pred = final_predictions.data()[i];
306 let true_val = y_true.data()[i];
307 println!(
308 " Sample {}: pred={:.3}, true={:.1}, error={:.3}",
309 i + 1,
310 pred,
311 true_val,
312 (pred - true_val).abs()
313 );
314 }
315
316 // Training analysis
317 let initial_loss = losses[0];
318 let final_loss = losses[losses.len() - 1];
319 let loss_reduction = (initial_loss - final_loss) / initial_loss * 100.0;
320
321 println!("\nTraining Analysis:");
322 println!(" Initial loss: {:.6}", initial_loss);
323 println!(" Final loss: {:.6}", final_loss);
324 println!(" Loss reduction: {:.1}%", loss_reduction);
325
326 Ok(())
327}
328
329/// Demonstrate single vs batch inference
330fn demonstrate_single_vs_batch_inference() {
331 println!("\n--- Single vs Batch Inference ---");
332
333 let layer = LinearLayer::new(4, 3, Some(46));
334
335 // Single inference
336 println!("Single inference:");
337 let single_input = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![1, 4]).unwrap();
338 let single_output = layer.forward_no_grad(&single_input);
339 println!(" Input shape: {:?}", single_input.shape().dims());
340 println!(" Output shape: {:?}", single_output.shape().dims());
341 println!(" Output: {:?}", single_output.data());
342
343 // Batch inference
344 println!("Batch inference:");
345 let batch_input = Tensor::from_slice(
346 &[
347 1.0, 2.0, 3.0, 4.0, // Sample 1
348 5.0, 6.0, 7.0, 8.0, // Sample 2
349 9.0, 10.0, 11.0, 12.0, // Sample 3
350 ],
351 vec![3, 4],
352 )
353 .unwrap();
354 let batch_output = layer.forward_no_grad(&batch_input);
355 println!(" Input shape: {:?}", batch_input.shape().dims());
356 println!(" Output shape: {:?}", batch_output.shape().dims());
357
358 // Verify batch consistency - first sample should match single inference
359 let _first_batch_sample = batch_output.view(vec![3, 3]); // Reshape to access first sample
360 let first_sample_data = &batch_output.data()[0..3]; // First 3 elements
361 let single_sample_data = single_output.data();
362
363 println!("Consistency check:");
364 println!(" Single output: {:?}", single_sample_data);
365 println!(" First batch sample: {:?}", first_sample_data);
366 println!(
367 " Match: {}",
368 single_sample_data
369 .iter()
370 .zip(first_sample_data.iter())
371 .all(|(a, b)| (a - b).abs() < 1e-6)
372 );
373}
374
375/// Demonstrate serialization and loading
376fn demonstrate_serialization() -> Result<(), Box<dyn std::error::Error>> {
377 println!("\n--- Serialization ---");
378
379 // Create and train a simple layer
380 let mut original_layer = LinearLayer::new(2, 1, Some(47));
381
382 // Simple training data
383 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
384 let y_true = Tensor::from_slice(&[5.0, 11.0], vec![2, 1]).unwrap();
385
386 let mut optimizer = Adam::with_learning_rate(0.01);
387 let params = original_layer.parameters();
388 for param in &params {
389 optimizer.add_parameter(param);
390 }
391
392 // Train for a few epochs
393 for _ in 0..10 {
394 let y_pred = original_layer.forward(&x_data);
395 let mut loss = (y_pred.sub_tensor(&y_true)).pow_scalar(2.0).mean();
396 loss.backward(None);
397
398 let mut params = original_layer.parameters();
399 optimizer.step(&mut params);
400 optimizer.zero_grad(&mut params);
401 }
402
403 println!("Original layer trained");
404 println!(" Weight: {:?}", original_layer.weight.data());
405 println!(" Bias: {:?}", original_layer.bias.data());
406
407 // Save layer
408 original_layer.save_json("temp_linear_layer")?;
409
410 // Load layer
411 let loaded_layer = LinearLayer::load_json("temp_linear_layer", 2, 1)?;
412
413 println!("Loaded layer");
414 println!(" Weight: {:?}", loaded_layer.weight.data());
415 println!(" Bias: {:?}", loaded_layer.bias.data());
416
417 // Verify consistency
418 let test_input = Tensor::from_slice(&[1.0, 1.0], vec![1, 2]).unwrap();
419 let original_output = original_layer.forward_no_grad(&test_input);
420 let loaded_output = loaded_layer.forward_no_grad(&test_input);
421
422 println!("Consistency check:");
423 println!(" Original output: {:?}", original_output.data());
424 println!(" Loaded output: {:?}", loaded_output.data());
425 println!(
426 " Match: {}",
427 original_output
428 .data()
429 .iter()
430 .zip(loaded_output.data().iter())
431 .all(|(a, b)| (a - b).abs() < 1e-6)
432 );
433
434 println!("Serialization verification: PASSED");
435
436 Ok(())
437}
- examples/iterators/element_iteration.rs
- examples/getting_started/tensor_basics.rs
- examples/neural_networks/feedforward_network.rs
- examples/getting_started/serialization_basics.rs
- examples/optimizers/adam_configurations.rs
- examples/optimizers/learning_rate_scheduling.rs
- examples/getting_started/optimizer_basics.rs
- examples/iterators/advanced_patterns.rs
- examples/iterators/performance_optimization.rs
- examples/supervised_training/supervised_bce.rs
- examples/supervised_training/supervised_classification.rs
- examples/supervised_training/supervised_regression.rs
Source§impl Tensor
pub fn randn(shape_dims: Vec<usize>, seed: Option<u64>) -> Self
Creates a tensor with normally distributed random values (mean=0, std=1)
Similar to PyTorch’s torch.randn(), creates a tensor filled with random
values drawn from a standard normal distribution (mean=0, standard deviation=1).
Uses Box-Muller transform for efficient normal distribution generation.
This method provides high-quality random number generation with optional reproducibility through seed-based generation. The generated values follow a standard normal distribution suitable for machine learning applications.
§Arguments
- shape_dims - Vector of dimension sizes defining the tensor shape
- seed - Optional seed for reproducible random generation
§Returns
A new tensor with normally distributed random values
§Performance
- Box-Muller Transform: Efficient normal distribution generation
- SIMD Optimization: Vectorized operations for large tensors
- Memory Efficient: Single-pass generation with optimized allocation
- Thread Safe: Uses thread-local random state
§Examples
§Basic Usage
use train_station::Tensor;
// Create a 2x3 tensor with random normal values
let tensor = Tensor::randn(vec![2, 3], None);
assert_eq!(tensor.size(), 6);
assert_eq!(tensor.shape().dims(), vec![2, 3]);
// Verify random values are generated
let first_value = tensor.get(&[0, 0]);
assert!(first_value != 0.0); // Should be random
§Reproducible Generation
use train_station::Tensor;
// Create with fixed seed for reproducible results
let tensor1 = Tensor::randn(vec![100], Some(42));
let tensor2 = Tensor::randn(vec![100], Some(42));
// tensor1 and tensor2 will have identical values
for i in 0..tensor1.size() {
assert!((tensor1.get(&[i]) - tensor2.get(&[i])).abs() < 1e-6);
}
§Statistical Properties
use train_station::Tensor;
// Generate large tensor for statistical analysis
let tensor = Tensor::randn(vec![1000], Some(42));
assert_eq!(tensor.size(), 1000);
// Check that values are reasonable (within 4 standard deviations)
let mut min_val = f32::INFINITY;
let mut max_val = f32::NEG_INFINITY;
let mut sum = 0.0;
for i in 0..tensor.size() {
let val = tensor.get(&[i]);
min_val = min_val.min(val);
max_val = max_val.max(val);
sum += val;
}
let mean = sum / tensor.size() as f32;
// Mean should be close to 0, values should be within reasonable bounds
assert!(mean.abs() < 0.1, "Mean should be close to 0, got {}", mean);
assert!(min_val > -4.0, "Values should not be too negative, min: {}", min_val);
assert!(max_val < 4.0, "Values should not be too positive, max: {}", max_val);
§Zero-Sized Tensors
use train_station::Tensor;
// Handle empty tensors gracefully
let tensor = Tensor::randn(vec![0], Some(42));
assert_eq!(tensor.size(), 0);
assert_eq!(tensor.shape().dims(), vec![0]);
§Implementation Details
This method uses the Box-Muller transform to generate normally distributed random variables from uniform random variables. The process involves:
- Random Number Generation: Uses Xorshift algorithm for uniform random numbers
- Box-Muller Transform: Converts uniform random variables to normal distribution
- SIMD Optimization: Vectorized operations for large tensors when available
- Numerical Stability: Robust handling of edge cases and potential NaN values
The Box-Muller transform ensures that the generated values follow a true normal distribution with mean=0 and standard deviation=1, making it suitable for machine learning applications requiring normally distributed random values.
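As a rough illustration of the transform (a standalone sketch, not the crate's internal implementation), two independent uniform samples u1, u2 in (0, 1] map to one standard-normal sample via z = sqrt(-2 ln u1) * cos(2 * pi * u2):
use std::f32::consts::PI;

/// Illustrative Box-Muller transform: maps two uniforms in (0, 1] to a
/// standard-normal sample (mean 0, std 1). Not the crate's internal code.
fn box_muller(u1: f32, u2: f32) -> f32 {
    (-2.0 * u1.ln()).sqrt() * (2.0 * PI * u2).cos()
}

fn main() {
    let z = box_muller(0.37, 0.81);
    println!("sample: {}", z);
}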
Examples found in repository?
53 pub fn new(input_size: usize, output_size: usize, seed: Option<u64>) -> Self {
54 // Xavier/Glorot initialization: scale by sqrt(1/input_size)
55 let scale = (1.0 / input_size as f32).sqrt();
56
57 let weight = Tensor::randn(vec![input_size, output_size], seed)
58 .mul_scalar(scale)
59 .with_requires_grad();
60 let bias = Tensor::zeros(vec![output_size]).with_requires_grad();
61
62 Self {
63 weight,
64 bias,
65 input_size,
66 output_size,
67 }
68 }
More examples
73fn main() -> Result<(), Box<dyn std::error::Error>> {
74 println!("=== Basic Encoder Example ===");
75
76 let batch = 2usize;
77 let seq = 6usize;
78 let embed = 32usize;
79 let heads = 4usize;
80
81 let input = Tensor::randn(vec![batch, seq, embed], Some(11));
82 let mut enc = EncoderBlock::new(embed, heads, Some(123));
83
84 // Example: no mask (set Some(mask) to use masking)
85 let out = enc.forward(&input, None);
86 println!("Output shape: {:?}", out.shape().dims());
87
88 // Verify gradients/optimization
89 let mut opt = Adam::with_learning_rate(0.01);
90 let mut params = enc.parameters();
91 for p in &params {
92 opt.add_parameter(p);
93 }
94 let mut loss = out.mean();
95 loss.backward(None);
96 opt.step(&mut params);
97 opt.zero_grad(&mut params);
98 println!("Loss: {:.6}", loss.value());
99 println!("=== Done ===");
100 Ok(())
101}
84fn main() -> Result<(), Box<dyn std::error::Error>> {
85 println!("=== Basic Decoder Example ===");
86
87 let batch = 2usize;
88 let src = 7usize;
89 let tgt = 5usize;
90 let embed = 32usize;
91 let heads = 4usize;
92
93 let memory = Tensor::randn(vec![batch, src, embed], Some(21));
94 let tgt_in = Tensor::randn(vec![batch, tgt, embed], Some(22));
95
96 let mut dec = DecoderBlock::new(embed, heads, Some(456));
97 let out = dec.forward(&tgt_in, &memory, None, None);
98 println!("Output shape: {:?}", out.shape().dims());
99
100 let mut opt = Adam::with_learning_rate(0.01);
101 let mut params = dec.parameters();
102 for p in &params {
103 opt.add_parameter(p);
104 }
105 let mut loss = out.mean();
106 loss.backward(None);
107 opt.step(&mut params);
108 opt.zero_grad(&mut params);
109 println!("Loss: {:.6}", loss.value());
110 println!("=== Done ===");
111 Ok(())
112}
42fn demonstrate_tensor_creation() {
43 println!("--- Tensor Creation ---");
44
45 // Create tensors with different initializations
46 let zeros = Tensor::zeros(vec![2, 3]);
47 println!(
48 "Zeros tensor: shape {:?}, data: {:?}",
49 zeros.shape().dims(),
50 zeros.data()
51 );
52
53 let ones = Tensor::ones(vec![3, 2]);
54 println!(
55 "Ones tensor: shape {:?}, data: {:?}",
56 ones.shape().dims(),
57 ones.data()
58 );
59
60 // Create tensor from slice
61 let data = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0];
62 let from_slice = Tensor::from_slice(&data, vec![2, 3]).unwrap();
63 println!(
64 "From slice: shape {:?}, data: {:?}",
65 from_slice.shape().dims(),
66 from_slice.data()
67 );
68
69 // Create tensor with specific value
70 let mut filled = Tensor::new(vec![2, 2]);
71 {
72 let data = filled.data_mut();
73 for value in data.iter_mut() {
74 *value = 42.0;
75 }
76 }
77 println!("Filled with 42: {:?}", filled.data());
78
79 // Create tensor with random data
80 let random = Tensor::randn(vec![2, 2], Some(42));
81 println!(
82 "Random tensor: shape {:?}, data: {:?}",
83 random.shape().dims(),
84 random.data()
85 );
86}
363fn demonstrate_configurable_architectures() {
364 println!("\n--- Configurable Architectures ---");
365
366 let architectures = vec![
367 ("Shallow", vec![8]),
368 ("Medium", vec![16, 8]),
369 ("Deep", vec![32, 16, 8, 4]),
370 ("Wide", vec![64, 32]),
371 ("Bottleneck", vec![16, 4, 16]),
372 ];
373
374 for (name, hidden_sizes) in architectures {
375 let config = FeedForwardConfig {
376 input_size: 10,
377 hidden_sizes,
378 output_size: 3,
379 use_bias: true,
380 };
381
382 let network = FeedForwardNetwork::new(config.clone(), Some(44));
383
384 // Test forward pass
385 let test_input = Tensor::randn(vec![5, 10], Some(45)); // Batch of 5
386 let output = network.forward_no_grad(&test_input);
387
388 println!("{} network:", name);
389 println!(" Architecture: 10 -> {:?} -> 3", config.hidden_sizes);
390 println!(" Parameters: {}", network.parameter_count());
391 println!(" Test output shape: {:?}", output.shape().dims());
392 println!(
393 " Output range: [{:.3}, {:.3}]",
394 output.data().iter().fold(f32::INFINITY, |a, &b| a.min(b)),
395 output
396 .data()
397 .iter()
398 .fold(f32::NEG_INFINITY, |a, &b| a.max(b))
399 );
400 }
401}
231fn main() -> Result<(), Box<dyn std::error::Error>> {
232 println!("=== Basic Transformer Example ===");
233
234 let batch = 2usize;
235 let src_len = 8usize;
236 let tgt_len = 6usize;
237 let embed = 32usize;
238 let heads = 4usize;
239 let layers = 2usize;
240
241 let src = Tensor::randn(vec![batch, src_len, embed], Some(1001));
242 let tgt = Tensor::randn(vec![batch, tgt_len, embed], Some(1002));
243
244 let mut trf = BasicTransformer::new(embed, heads, layers, Some(999));
245 let out = trf.forward(&src, &tgt);
246 println!("Output shape: {:?}", out.shape().dims());
247
248 // Quick optimization step
249 let mut opt = Adam::with_learning_rate(0.005);
250 let mut params = trf.parameters();
251 for p in &params {
252 opt.add_parameter(p);
253 }
254 let mut loss = out.mean();
255 loss.backward(None);
256 opt.step(&mut params);
257 opt.zero_grad(&mut params);
258 println!("Loss: {:.6}", loss.value());
259
260 // Demo: non auto-regressive inference (single pass)
261 let nar = trf.infer_non_autoregressive(&src, tgt_len);
262 println!("NAR output shape: {:?}", nar.shape().dims());
263
264 // Demo: auto-regressive inference (toy)
265 let ar = trf.infer_autoregressive(&src, 3);
266 println!("AR output shape: {:?}", ar.shape().dims());
267
268 // NAR training demo
269 let nar_tgt = tgt.clone();
270 trf.train_non_autoregressive_steps(&src, &nar_tgt, 3, 0.01);
271
272 // AR training demo (teacher-forced)
273 let ar_tgt = tgt.clone();
274 trf.train_autoregressive_steps(&src, &ar_tgt, 3, 0.01);
275 println!("=== Done ===");
276 Ok(())
277}
Sourcepub fn fill_randn(&mut self, seed: Option<u64>)
pub fn fill_randn(&mut self, seed: Option<u64>)
Fills the tensor with normally distributed random values
Internal method that fills an existing tensor with random values from a standard normal distribution. Uses Box-Muller transform for efficiency and provides SIMD optimization for large tensors.
This method is used internally by randn() and provides the core
random number generation functionality with optimized performance
characteristics.
§Arguments
seed- Optional seed for reproducible random generation
§Performance
- Box-Muller Transform: Generates pairs of normal random variables
- SIMD Optimization: Vectorized operations when possible
- Memory Efficient: Single-pass generation
- Unrolled Loops: 4x unrolling for better instruction throughput
§Implementation Details
The method performs the following steps:
- Zero-sized Check: Returns early for empty tensors
- RNG Initialization: Creates Xorshift RNG with seed or system time
- SIMD Detection: Checks for AVX2 availability for optimized path
- Generation: Uses SIMD or scalar path based on hardware support
- Completion: Fills all tensor elements with normal random values
The method automatically handles hardware capabilities and falls back to scalar operations when SIMD is not available, ensuring compatibility across different CPU architectures.
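§Examples
A minimal usage sketch (illustrative, not taken from the crate's doctests), showing in-place re-filling of an existing buffer; the reproducibility claim assumes the seeded Xorshift generation described above:
use train_station::Tensor;
let mut t = Tensor::zeros(vec![4, 4]);
t.fill_randn(Some(7));
let first = t.get(&[0, 0]);
// Re-filling with the same seed reproduces the same values
t.fill_randn(Some(7));
assert_eq!(t.get(&[0, 0]), first);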
Source§impl Tensor
impl Tensor
Sourcepub fn chunks(&self, chunk_size: usize) -> TensorChunksIterator<'_>
pub fn chunks(&self, chunk_size: usize) -> TensorChunksIterator<'_>
Standard slice-like chunks iterator. Use this instead of iter_chunks.
Iterates over contiguous or view-backed slices of the tensor with the specified chunk size. In no-grad fast mode, a single contiguous owner may be materialized to optimize subsequent views.
§Arguments
chunk_size- Number of elements per chunk (must be > 0)
§Examples
use train_station::tensor::TensorCollectExt;
use train_station::Tensor;
let t = Tensor::from_slice(&(1..=6).map(|i| i as f32).collect::<Vec<_>>(), vec![6]).unwrap();
let y = t.chunks(2).map(|c| c.mul_scalar(2.0)).collect_shape(vec![6]);
assert_eq!(y.data(), &[2.0, 4.0, 6.0, 8.0, 10.0, 12.0]);
Examples found in repository?
162fn demonstrate_memory_optimization() -> Result<(), Box<dyn std::error::Error>> {
163 println!("\n--- Memory Optimization ---");
164
165 // Create a large tensor for memory testing
166 let size = 10000;
167 let data: Vec<f32> = (0..size).map(|i| i as f32).collect();
168 let tensor = Tensor::from_slice(&data, vec![size])?;
169
170 println!("Processing tensor of size: {}", size);
171
172 // Pattern 1: Streaming processing with iterator chunks (process in blocks, collect with shape)
173 println!("\nPattern 1: Streaming Processing");
174 let chunk_size = 1000;
175 let start = Instant::now();
176 let flattened = tensor.view(vec![size as i32]);
177 let _streamed_result: Tensor = flattened
178 .chunks(chunk_size)
179 .map(|c| c.pow_scalar(2.0).sqrt())
180 .collect_shape(vec![size]);
181 let streamed_time = start.elapsed();
182
183 // Pattern 2: Full processing
184 let start = Instant::now();
185 let _full_result: Tensor = tensor
186 .iter_elements()
187 .map(|elem| elem.pow_scalar(2.0).sqrt())
188 .collect_shape(vec![size]);
189 let full_time = start.elapsed();
190
191 println!(" Streaming time: {:?}", streamed_time);
192 println!(" Full processing time: {:?}", full_time);
193 println!(
194 " Memory efficiency ratio: {:.2}x",
195 full_time.as_nanos() as f64 / streamed_time.as_nanos() as f64
196 );
197
198 // Pattern 3: Lazy evaluation with take
199 println!("\nPattern 2: Lazy Evaluation");
200 let start = Instant::now();
201 let lazy_result: Tensor = tensor
202 .iter_elements()
203 .take(1000) // Only process first 1000 elements
204 .map(|elem| elem.pow_scalar(2.0).sqrt())
205 .collect_shape(vec![1000]);
206 let lazy_time = start.elapsed();
207
208 println!(" Lazy processing (1000 elements): {:?}", lazy_time);
209 println!(" Lazy result size: {}", lazy_result.size());
210
211 // Pattern 4: Memory-efficient filtering
212 println!("\nPattern 3: Memory-Efficient Filtering");
213 let start = Instant::now();
214 let filtered_result: Tensor = tensor
215 .iter_elements()
216 .filter(|elem| elem.value() > size as f32 / 2.0) // Keep only large values
217 .map(|elem| elem.mul_scalar(2.0))
218 .collect();
219 let filtered_time = start.elapsed();
220
221 println!(" Filtered processing: {:?}", filtered_time);
222 println!(
223 " Filtered result size: {} (reduced from {})",
224 filtered_result.size(),
225 size
226 );
227
228 Ok(())
229}
Sourcepub fn chunks_exact(&self, chunk_size: usize) -> TensorChunksExactIterator<'_>
pub fn chunks_exact(&self, chunk_size: usize) -> TensorChunksExactIterator<'_>
Standard slice-like exact chunks iterator. Use this instead of iter_chunks_exact.
Yields only the exact chunks of size chunk_size, exposing any remainder
via remainder(). See chunks() for a variant that yields the remainder as
the last (smaller) chunk.
§Examples
use train_station::Tensor;
let t = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5]).unwrap();
let mut it = t.chunks_exact(2);
assert_eq!(it.next().unwrap().data(), &[1.0, 2.0]);
assert_eq!(it.next().unwrap().data(), &[3.0, 4.0]);
assert_eq!(it.remainder().data(), &[5.0]);
pub fn iter_chunks(&self, chunk_size: usize) -> TensorChunksIterator<'_>
pub fn iter_chunks_exact( &self, chunk_size: usize, ) -> TensorChunksExactIterator<'_>
Source§impl Tensor
impl Tensor
Sourcepub fn collect_into_shape<I: IntoIterator<Item = Tensor>>(
iter: I,
dims: Vec<usize>,
) -> Tensor
pub fn collect_into_shape<I: IntoIterator<Item = Tensor>>( iter: I, dims: Vec<usize>, ) -> Tensor
Collects the yielded tensors into a single tensor with the target shape, copying data in iterator order. The copy is SIMD-optimized when available; the total element count must match the target shape (asserted).
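§Examples
A minimal usage sketch (illustrative, not from the crate's doctests), assembling two row tensors into one [2, 3] tensor in iterator order:
use train_station::Tensor;
let rows = vec![
    Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap(),
    Tensor::from_slice(&[4.0, 5.0, 6.0], vec![3]).unwrap(),
];
// Total element count (6) must match the target shape [2, 3]
let out = Tensor::collect_into_shape(rows, vec![2, 3]);
assert_eq!(out.shape().dims(), vec![2, 3]);
assert_eq!(out.data(), &[1.0, 2.0, 3.0, 4.0, 5.0, 6.0]);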
Source§impl Tensor
impl Tensor
Sourcepub fn collect_values_shape<I: IntoIterator<Item = f32>>(
iter: I,
dims: Vec<usize>,
) -> Tensor
pub fn collect_values_shape<I: IntoIterator<Item = f32>>( iter: I, dims: Vec<usize>, ) -> Tensor
Inherent helper to collect any iterator of f32 into a shaped tensor.
This mirrors the ValuesCollectExt::collect_shape functionality but does
not require importing the extension trait.
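§Examples
A minimal usage sketch (illustrative, not from the crate's doctests):
use train_station::Tensor;
// Collect a plain f32 iterator into a [2, 3] tensor without importing ValuesCollectExt
let t = Tensor::collect_values_shape((0..6).map(|i| i as f32), vec![2, 3]);
assert_eq!(t.shape().dims(), vec![2, 3]);
assert_eq!(t.data(), &[0.0, 1.0, 2.0, 3.0, 4.0, 5.0]);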
Source§impl Tensor
impl Tensor
Sourcepub fn iter_elements(&self) -> TensorElementIterator<'_>
pub fn iter_elements(&self) -> TensorElementIterator<'_>
Create an iterator over scalar elements (flattened view)
Each yielded item is a [1]-shaped Tensor view that shares storage with
the source. This iterator is GradTrack-aware; element operations propagate
gradients to the original tensor when gradients are enabled.
§Returns
An iterator producing scalar view tensors in row-major order.
§Examples
Collect transformed elements back to the original shape using collect_shape:
use train_station::tensor::TensorCollectExt;
use train_station::Tensor;
let x = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let y = x
.iter_elements()
.map(|e| e.mul_scalar(2.0))
.collect_shape(vec![2, 2]);
assert_eq!(y.data(), &[2.0, 4.0, 6.0, 8.0]);
Examples found in repository?
87fn demonstrate_performance_benchmarking() -> Result<(), Box<dyn std::error::Error>> {
88 println!("\n--- Performance Benchmarking ---");
89
90 // Create test data of different sizes
91 let sizes = vec![100, 1000, 10000];
92
93 for size in sizes {
94 println!("\nBenchmarking with tensor size: {}", size);
95
96 // Generate test data
97 let data: Vec<f32> = (0..size).map(|i| i as f32).collect();
98 let tensor = Tensor::from_slice(&data, vec![size])?;
99
100 // Benchmark 1: Direct tensor operations
101 let start = Instant::now();
102 let direct_result = tensor.mul_scalar(2.0).add_scalar(1.0);
103 let direct_time = start.elapsed();
104
105 // Benchmark 2: Iterator-based operations (grad-enabled views, flatten + collect_shape)
106 let start = Instant::now();
107 let iterator_result: Tensor = tensor
108 .iter_elements()
109 .map(|elem| elem.mul_scalar(2.0).add_scalar(1.0))
110 .collect_shape(vec![size]);
111 let iterator_time = start.elapsed();
112
113 // Benchmark 3: Chained iterator operations
114 let start = Instant::now();
115 let _chained_result: Tensor = tensor
116 .iter_elements()
117 .map(|elem| elem.mul_scalar(2.0))
118 .filter(|elem| elem.value() > size as f32)
119 .map(|elem| elem.add_scalar(1.0))
120 .collect();
121 let chained_time = start.elapsed();
122
123 // Benchmark 4: NoGrad raw data streaming + collect_shape
124 let start = Instant::now();
125 let _streamed: Tensor = with_no_grad(|| {
126 tensor
127 .data()
128 .iter()
129 .copied()
130 .map(|x| 2.0 * x + 1.0)
131 .collect_shape(vec![size])
132 });
133 let streaming_time = start.elapsed();
134
135 // Report results
136 println!(" Direct operations: {:?}", direct_time);
137 println!(" Iterator operations: {:?}", iterator_time);
138 println!(" Chained operations: {:?}", chained_time);
139 println!(" NoGrad streaming (data.iter): {:?}", streaming_time);
140
141 // Verify correctness
142 assert_eq!(direct_result.data(), iterator_result.data());
143 println!(
144 " Results match: {}",
145 direct_result.data() == iterator_result.data()
146 );
147
148 // Performance ratios
149 let ratio = iterator_time.as_nanos() as f64 / direct_time.as_nanos() as f64;
150 let ratio_stream = streaming_time.as_nanos() as f64 / direct_time.as_nanos() as f64;
151 println!(" Iterator/Direct ratio: {:.2}x", ratio);
152 println!(" Streaming/Direct ratio: {:.2}x", ratio_stream);
153 }
154
155 Ok(())
156}
157
158/// Demonstrate memory optimization patterns
159///
160/// Shows memory-efficient processing patterns and techniques
161/// for minimizing memory usage while maintaining performance.
162fn demonstrate_memory_optimization() -> Result<(), Box<dyn std::error::Error>> {
163 println!("\n--- Memory Optimization ---");
164
165 // Create a large tensor for memory testing
166 let size = 10000;
167 let data: Vec<f32> = (0..size).map(|i| i as f32).collect();
168 let tensor = Tensor::from_slice(&data, vec![size])?;
169
170 println!("Processing tensor of size: {}", size);
171
172 // Pattern 1: Streaming processing with iterator chunks (process in blocks, collect with shape)
173 println!("\nPattern 1: Streaming Processing");
174 let chunk_size = 1000;
175 let start = Instant::now();
176 let flattened = tensor.view(vec![size as i32]);
177 let _streamed_result: Tensor = flattened
178 .chunks(chunk_size)
179 .map(|c| c.pow_scalar(2.0).sqrt())
180 .collect_shape(vec![size]);
181 let streamed_time = start.elapsed();
182
183 // Pattern 2: Full processing
184 let start = Instant::now();
185 let _full_result: Tensor = tensor
186 .iter_elements()
187 .map(|elem| elem.pow_scalar(2.0).sqrt())
188 .collect_shape(vec![size]);
189 let full_time = start.elapsed();
190
191 println!(" Streaming time: {:?}", streamed_time);
192 println!(" Full processing time: {:?}", full_time);
193 println!(
194 " Memory efficiency ratio: {:.2}x",
195 full_time.as_nanos() as f64 / streamed_time.as_nanos() as f64
196 );
197
198 // Pattern 3: Lazy evaluation with take
199 println!("\nPattern 2: Lazy Evaluation");
200 let start = Instant::now();
201 let lazy_result: Tensor = tensor
202 .iter_elements()
203 .take(1000) // Only process first 1000 elements
204 .map(|elem| elem.pow_scalar(2.0).sqrt())
205 .collect_shape(vec![1000]);
206 let lazy_time = start.elapsed();
207
208 println!(" Lazy processing (1000 elements): {:?}", lazy_time);
209 println!(" Lazy result size: {}", lazy_result.size());
210
211 // Pattern 4: Memory-efficient filtering
212 println!("\nPattern 3: Memory-Efficient Filtering");
213 let start = Instant::now();
214 let filtered_result: Tensor = tensor
215 .iter_elements()
216 .filter(|elem| elem.value() > size as f32 / 2.0) // Keep only large values
217 .map(|elem| elem.mul_scalar(2.0))
218 .collect();
219 let filtered_time = start.elapsed();
220
221 println!(" Filtered processing: {:?}", filtered_time);
222 println!(
223 " Filtered result size: {} (reduced from {})",
224 filtered_result.size(),
225 size
226 );
227
228 Ok(())
229}
Sourcepub fn iter_range(&self, start: usize, end: usize) -> TensorElementIterator<'_>
pub fn iter_range(&self, start: usize, end: usize) -> TensorElementIterator<'_>
Create an iterator over a clamped range of elements
Produces scalar view tensors from start..end (clamped to [0, size]).
§Arguments
start- Start index (inclusive)
end- End index (exclusive)
§Examples
use train_station::Tensor;
let x = Tensor::from_slice(&(0..6).map(|i| i as f32).collect::<Vec<_>>(), vec![6]).unwrap();
let vals: Vec<f32> = x.iter_range(2, 5).map(|e| e.value()).collect();
assert_eq!(vals, vec![2.0, 3.0, 4.0]);
Examples found in repository?
235fn demonstrate_large_scale_processing() -> Result<(), Box<dyn std::error::Error>> {
236 println!("\n--- Large-Scale Processing ---");
237
238 // Simulate large dataset processing
239 let sizes = vec![10000, 50000, 100000];
240
241 for size in sizes {
242 println!("\nProcessing dataset of size: {}", size);
243
244 // Generate large dataset
245 let data: Vec<f32> = (0..size)
246 .map(|i| {
247 let x = i as f32 / size as f32;
248 x * x + 0.1 * (i % 10) as f32 // Quadratic with noise
249 })
250 .collect();
251
252 let tensor = Tensor::from_slice(&data, vec![size])?;
253
254 // Technique 1: Batch processing
255 let batch_size = 1000;
256 let start = Instant::now();
257
258 let mut batch_results = Vec::new();
259 for batch_start in (0..size).step_by(batch_size) {
260 let batch_end = (batch_start + batch_size).min(size);
261 let batch: Tensor = tensor
262 .iter_range(batch_start, batch_end)
263 .map(|elem| elem.pow_scalar(2.0).add_scalar(1.0))
264 .collect();
265 batch_results.push(batch);
266 }
267 let batch_time = start.elapsed();
268
269 // Technique 2: Parallel-like processing with stride
270 let start = Instant::now();
271 let stride = 4;
272 let strided_result: Tensor = tensor
273 .iter()
274 .enumerate()
275 .filter(|(i, _)| i % stride == 0)
276 .map(|(_, elem)| elem.pow_scalar(2.0).add_scalar(1.0))
277 .collect();
278 let strided_time = start.elapsed();
279
280 // Technique 3: Hierarchical processing
281 let start = Instant::now();
282 let coarse: Tensor = tensor
283 .iter()
284 .enumerate()
285 .filter(|(i, _)| i % 10 == 0) // Every 10th element
286 .map(|(_, elem)| elem.pow_scalar(2.0).add_scalar(1.0))
287 .collect();
288 let fine: Tensor = tensor
289 .iter()
290 .enumerate()
291 .filter(|(i, _)| i % 10 != 0) // Rest of elements
292 .map(|(_, elem)| elem.pow_scalar(1.5).add_scalar(0.5))
293 .collect();
294 let hierarchical_time = start.elapsed();
295
296 // Report performance
297 println!(" Batch processing: {:?}", batch_time);
298 println!(" Strided processing: {:?}", strided_time);
299 println!(" Hierarchical processing: {:?}", hierarchical_time);
300
301 // Memory usage analysis
302 let total_batches = size.div_ceil(batch_size);
303 println!(" Batch count: {}", total_batches);
304 println!(" Strided result size: {}", strided_result.size());
305 println!(
306 " Hierarchical: coarse={}, fine={}",
307 coarse.size(),
308 fine.size()
309 );
310 }
311
312 Ok(())
313}
More examples
340fn demonstrate_real_world_scenarios() -> Result<(), Box<dyn std::error::Error>> {
341 println!("\n--- Real-world Scenarios ---");
342
343 // Scenario 1: Time series analysis
344 println!("\nScenario 1: Time Series Analysis");
345 let time_series: Vec<f32> = (0..24)
346 .map(|hour| {
347 let base = 20.0 + 10.0 * (hour as f32 * std::f32::consts::PI / 12.0).sin();
348 base + (hour % 3) as f32 * 2.0 // Add some noise
349 })
350 .collect();
351
352 let series = Tensor::from_slice(&time_series, vec![24])?;
353 println!(" Time series (24 hours): {:?}", series.data());
354
355 // Calculate moving average with view-based iteration
356 let window_size = 3;
357 let moving_avg: Tensor = series
358 .iter()
359 .enumerate()
360 .map(|(i, _)| {
361 let start = i.saturating_sub(window_size / 2);
362 let end = (i + window_size / 2 + 1).min(series.size());
363 let window = series.iter_range(start, end);
364 window.fold(0.0, |acc, elem| acc + elem.value()) / (end - start) as f32
365 })
366 .map(|val| Tensor::from_slice(&[val], vec![1]).unwrap())
367 .collect();
368 println!(
369 " Moving average (window={}): {:?}",
370 window_size,
371 moving_avg.data()
372 );
373
374 // Inference pipeline with NoGrad + streaming
375 println!("\nInference pipeline (NoGrad + streaming)");
376 let features = Tensor::from_slice(
377 &(0..48).map(|i| i as f32 * 0.125).collect::<Vec<_>>(),
378 vec![6, 8],
379 )?;
380 let fast = with_no_grad(|| {
381 // Stream values directly, apply light affine, and collect back to same shape
382 features
383 .data()
384 .iter()
385 .copied()
386 .map(|x| 0.75 * x + 0.1)
387 .collect_shape(vec![6, 8])
388 });
389 println!(
390 " NoGrad streamed transform shape: {:?}",
391 fast.shape().dims()
392 );
393
394 // Row-wise iteration with shape-preserving collection (GradTrack-friendly)
395 let per_row: Tensor = features
396 .iter()
397 .map(|row| row.mul_scalar(0.5).add_scalar(2.0))
398 .collect_shape(vec![6, 8]);
399 println!(" Row-wise mapped shape: {:?}", per_row.shape().dims());
400
401 // Scenario 2: Feature engineering
402 println!("\nScenario 2: Feature Engineering");
403 let features = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5])?;
404 println!(" Original features: {:?}", features.data());
405
406 // Create polynomial features
407 let poly_features: Tensor = features
408 .iter()
409 .flat_map(|elem| {
410 vec![
411 elem.clone(), // x^1
412 elem.pow_scalar(2.0), // x^2
413 elem.pow_scalar(3.0), // x^3
414 ]
415 })
416 .collect();
417 println!(
418 " Polynomial features (x, x^2, x^3): {:?}",
419 poly_features.data()
420 );
421
422 // Scenario 3: Data augmentation
423 println!("\nScenario 3: Data Augmentation");
424 let original = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3])?;
425 println!(" Original data: {:?}", original.data());
426
427 // Augment with noise and scaling
428 let augmented: Tensor = original
429 .iter()
430 .flat_map(|elem| {
431 vec![
432 elem.clone(), // Original
433 elem.add_scalar(0.1), // Add noise
434 elem.sub_scalar(0.1), // Subtract noise
435 elem.mul_scalar(1.1), // Scale up
436 elem.mul_scalar(0.9), // Scale down
437 ]
438 })
439 .collect();
440 println!(" Augmented data: {:?}", augmented.data());
441
442 // Scenario 4: Statistical analysis
443 println!("\nScenario 4: Statistical Analysis");
444 let sample_data = Tensor::from_slice(&[1.1, 2.3, 1.8, 2.1, 1.9, 2.0, 1.7, 2.2], vec![8])?;
445 println!(" Sample data: {:?}", sample_data.data());
446
447 // Calculate various statistics
448 let mean = sample_data.mean().value();
449 let std = sample_data.std().value();
450 let min = sample_data
451 .iter()
452 .map(|e| e.value())
453 .fold(f32::INFINITY, f32::min);
454 let max = sample_data
455 .iter()
456 .map(|e| e.value())
457 .fold(f32::NEG_INFINITY, f32::max);
458
459 // Z-score normalization
460 let z_scores: Tensor = sample_data
461 .iter()
462 .map(|elem| elem.sub_scalar(mean).div_scalar(std))
463 .collect();
464
465 println!(
466 " Statistics: mean={:.3}, std={:.3}, min={:.3}, max={:.3}",
467 mean, std, min, max
468 );
469 println!(" Z-scores: {:?}", z_scores.data());
470
471 Ok(())
472}
Source§impl Tensor
impl Tensor
Sourcepub fn iter_dim(&self, dim: usize) -> TensorDimIterator<'_>
pub fn iter_dim(&self, dim: usize) -> TensorDimIterator<'_>
Iterate over sub-tensors along a specific dimension.
Produces view tensors by slicing along the given dimension; each item has that dimension removed (rank - 1). Views share storage and preserve gradient tracking semantics.
§Arguments
dim- Dimension to iterate over
§Examples
use train_station::tensor::TensorCollectExt;
use train_station::Tensor;
let t = Tensor::from_slice(&(1..=6).map(|i| i as f32).collect::<Vec<_>>(), vec![2, 3]).unwrap();
let out = t.iter_dim(0).map(|row| row.add_scalar(1.0)).collect_shape(vec![2, 3]);
assert_eq!(out.data(), &[2.0, 3.0, 4.0, 5.0, 6.0, 7.0]);
Sourcepub fn iter(&self) -> TensorDimIterator<'_>
pub fn iter(&self) -> TensorDimIterator<'_>
Default iterator over the outermost dimension, yielding sub-tensors (N-D) or scalar views (1-D).
This is equivalent to iter_dim(0) with an important optimization:
- For 1-D tensors, it yields scalar element views of shape [1] (same as iter_flat()) to maximize GradTrack cooperation and collection performance.
- For N-D tensors (rank > 1), it yields sub-tensors with the outermost dimension removed (rank − 1), suitable for row/batch-wise processing.
Views share storage with the source tensor and preserve gradient tracking semantics.
Use collect_shape([..]) to reconstruct shape efficiently after per-item transforms.
§Returns
An iterator producing view tensors for each slice along the outermost dimension (or scalar views for 1-D).
§Examples
1-D: element views (shape [1]) and shape-preserving collection
use train_station::tensor::TensorCollectExt;
use train_station::Tensor;
let v = Tensor::from_slice(&(0..6).map(|i| i as f32).collect::<Vec<_>>(), vec![6]).unwrap();
let out = v.iter().map(|e| e.add_scalar(1.0)).collect_shape(vec![6]);
assert_eq!(out.data(), &[1.0, 2.0, 3.0, 4.0, 5.0, 6.0]);
2-D: row-wise transforms and shape-preserving collection
use train_station::tensor::TensorCollectExt;
use train_station::Tensor;
let m = Tensor::from_slice(&(1..=6).map(|i| i as f32).collect::<Vec<_>>(), vec![2, 3]).unwrap();
let y = m.iter().map(|row| row.mul_scalar(2.0)).collect_shape(vec![2, 3]);
assert_eq!(y.data(), &[2.0, 4.0, 6.0, 8.0, 10.0, 12.0]);
Examples found in repository?
93fn demonstrate_basic_iteration() -> Result<(), Box<dyn std::error::Error>> {
94 println!("\n--- Basic Element Iteration ---");
95
96 // Create a simple tensor for demonstration
97 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5])?;
98 println!("Original tensor: {:?}", tensor.data());
99
100 // Basic iteration with for loop
101 println!("\nBasic iteration with for loop:");
102 for (i, element) in tensor.iter().enumerate() {
103 println!(
104 " Element {}: value = {:.1}, shape = {:?}",
105 i,
106 element.value(),
107 element.shape().dims()
108 );
109 }
110
111 // Element-wise transformation
112 println!("\nElement-wise transformation (2x + 1):");
113 let transformed: Tensor = tensor
114 .iter()
115 .map(|elem| elem.mul_scalar(2.0).add_scalar(1.0))
116 .collect();
117 println!(" Result: {:?}", transformed.data());
118
119 // Filtering elements
120 println!("\nFiltering elements (values > 3.0):");
121 let filtered: Tensor = tensor.iter().filter(|elem| elem.value() > 3.0).collect();
122 println!(" Filtered: {:?}", filtered.data());
123
124 Ok(())
125}
126
127/// Demonstrate standard iterator trait methods
128///
129/// Shows compatibility with Rust's standard library iterator methods
130/// and demonstrates various functional programming patterns.
131fn demonstrate_standard_methods() -> Result<(), Box<dyn std::error::Error>> {
132 println!("\n--- Standard Iterator Methods ---");
133
134 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5])?;
135
136 // Using map for transformations
137 println!("\nMap transformation (square each element):");
138 let squared: Tensor = tensor.iter().map(|elem| elem.pow_scalar(2.0)).collect();
139 println!(" Squared: {:?}", squared.data());
140
141 // Using enumerate for indexed operations
142 println!("\nEnumerate with indexed operations:");
143 let indexed: Tensor = tensor
144 .iter()
145 .enumerate()
146 .map(|(i, elem)| elem.add_scalar(i as f32))
147 .collect();
148 println!(" Indexed: {:?}", indexed.data());
149
150 // Using fold for reduction
151 println!("\nFold for sum calculation:");
152 let sum: f32 = tensor.iter().fold(0.0, |acc, elem| acc + elem.value());
153 println!(" Sum: {:.1}", sum);
154
155 // Using find for element search
156 println!("\nFind specific element:");
157 if let Some(found) = tensor.iter().find(|elem| elem.value() == 3.0) {
158 println!(" Found element with value 3.0: {:.1}", found.value());
159 }
160
161 // Using any/all for condition checking
162 println!("\nCondition checking:");
163 let all_positive = tensor.iter().all(|elem| elem.value() > 0.0);
164 let any_large = tensor.iter().any(|elem| elem.value() > 4.0);
165 println!(" All positive: {}", all_positive);
166 println!(" Any > 4.0: {}", any_large);
167
168 Ok(())
169}
170
171/// Demonstrate gradient tracking through element operations
172///
173/// Shows how gradient tracking works seamlessly through iterator
174/// operations, maintaining the computational graph for backpropagation.
175fn demonstrate_gradient_tracking() -> Result<(), Box<dyn std::error::Error>> {
176 println!("\n--- Gradient Tracking ---");
177
178 // Create a tensor with gradient tracking enabled
179 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3])?.with_requires_grad();
180 println!("Input tensor (requires_grad): {:?}", tensor.data());
181
182 // Perform element-wise operations through iteration
183 let result: Tensor = tensor
184 .iter()
185 .map(|elem| {
186 // Apply a complex transformation: (x^2 + 1) * 2
187 elem.pow_scalar(2.0).add_scalar(1.0).mul_scalar(2.0)
188 })
189 .collect();
190
191 println!("Result tensor: {:?}", result.data());
192 println!("Result requires_grad: {}", result.requires_grad());
193
194 // Compute gradients
195 let mut loss = result.sum();
196 loss.backward(None);
197
198 println!("Loss: {:.6}", loss.value());
199 println!("Input gradients: {:?}", tensor.grad().map(|g| g.data()));
200
201 Ok(())
202}
203
204/// Demonstrate advanced iterator patterns
205///
206/// Shows complex iterator chains and advanced functional programming
207/// patterns for sophisticated data processing workflows.
208fn demonstrate_advanced_patterns() -> Result<(), Box<dyn std::error::Error>> {
209 println!("\n--- Advanced Iterator Patterns ---");
210
211 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![6])?;
212 println!("Input tensor: {:?}", tensor.data());
213
214 // Complex chain: enumerate -> filter -> map -> collect
215 println!("\nComplex chain (even indices only, add index to value):");
216 let result: Tensor = tensor
217 .iter()
218 .enumerate()
219 .filter(|(i, _)| i % 2 == 0) // Take even indices
220 .map(|(i, elem)| elem.add_scalar(i as f32)) // Add index to value
221 .collect();
222 println!(" Result: {:?}", result.data());
223
224 // Using take and skip for windowing
225 println!("\nWindowing with take and skip:");
226 let window1: Tensor = tensor.iter().take(3).collect();
227 let window2: Tensor = tensor.iter().skip(2).take(3).collect();
228 println!(" Window 1 (first 3): {:?}", window1.data());
229 println!(" Window 2 (middle 3): {:?}", window2.data());
230
231 // Using rev() for reverse iteration
232 println!("\nReverse iteration:");
233 let reversed: Tensor = tensor.iter().rev().collect();
234 println!(" Reversed: {:?}", reversed.data());
235
236 // Chaining with mathematical operations
237 println!("\nMathematical operation chain:");
238 let math_result: Tensor = tensor
239 .iter()
240 .map(|elem| elem.exp()) // e^x
241 .filter(|elem| elem.value() < 50.0) // Filter large values
242 .map(|elem| elem.log()) // ln(x)
243 .collect();
244 println!(" Math chain result: {:?}", math_result.data());
245
246 // Using zip for element-wise combinations
247 println!("\nElement-wise combination with zip:");
248 let tensor2 = Tensor::from_slice(&[10.0, 20.0, 30.0, 40.0, 50.0, 60.0], vec![6])?;
249 let combined: Tensor = tensor
250 .iter()
251 .zip(tensor2.iter())
252 .map(|(a, b)| a.mul_tensor(&b)) // Element-wise multiplication
253 .collect();
254 println!(" Combined: {:?}", combined.data());
255
256 Ok(())
257}
258
259/// Demonstrate per-row transforms with shape-preserving collection
260///
261/// Shows how to use `iter()` over the outer dimension on a 2D tensor and
262/// `collect_shape([..])` to maintain the original shape after mapping.
263fn demonstrate_row_wise_collect_shape() -> Result<(), Box<dyn std::error::Error>> {
264 println!("\n--- Row-wise iteration with collect_shape ---");
265 let mat = Tensor::from_slice(&(1..=12).map(|x| x as f32).collect::<Vec<_>>(), vec![3, 4])?;
266 println!("Input shape: {:?}", mat.shape().dims());
267
268 // Map each row: 1.1*x + 0.5, then collect back to [3,4]
269 let out: Tensor = mat
270 .iter()
271 .map(|row| row.mul_scalar(1.1).add_scalar(0.5))
272 .collect_shape(vec![3, 4]);
273 println!(" Output shape: {:?}", out.shape().dims());
274
275 Ok(())
276}
277
278/// Demonstrate NoGrad fast paths and raw data streaming
279///
280/// Highlights how to get maximum performance in inference by:
281/// - Disabling gradient tracking with `with_no_grad`
282/// - Iterating raw values via `tensor.data().iter().copied()`
283/// - Using `collect_shape` to stream directly into destination tensors
284fn demonstrate_nograd_and_streaming() -> Result<(), Box<dyn std::error::Error>> {
285 println!("\n--- NoGrad & Streaming (Inference Fast Paths) ---");
286
287 let input = Tensor::from_slice(
288 &(0..24).map(|i| i as f32 * 0.25).collect::<Vec<_>>(),
289 vec![4, 6],
290 )?;
291 println!("Input shape: {:?}", input.shape().dims());
292
293 // NoGrad: stream values directly and reshape
294 let out = with_no_grad(|| {
295 input
296 .data()
297 .iter()
298 .copied()
299 .map(|x| 1.2 * x - 0.3)
300 .collect_shape(vec![4, 6])
301 });
302 println!(
303 " NoGrad streamed map (1.2x-0.3) -> shape {:?}",
304 out.shape().dims()
305 );
306
307 // Compare to view-based element iteration in NoGrad
308 let out_view: Tensor = with_no_grad(|| {
309 input
310 .iter()
311 .map(|e| e.mul_scalar(1.2).add_scalar(-0.3))
312 .collect_shape(vec![4, 6])
313 });
314 println!(
315 " NoGrad view-based map shape {:?}",
316 out_view.shape().dims()
317 );
318
319 // Quick parity check
320 assert_eq!(out.data(), out_view.data());
321 println!(" Parity check passed.");
322
323 // Show simple flatten + collect back to a different shape
324 let reshaped = with_no_grad(|| input.data().iter().copied().collect_shape(vec![6, 4]));
325 println!(
326 " Reshaped via streaming collect_shape: {:?}",
327 reshaped.shape().dims()
328 );
329
330 Ok(())
331}
More examples
87fn demonstrate_data_pipeline() -> Result<(), Box<dyn std::error::Error>> {
88 println!("\n--- Data Processing Pipeline ---");
89
90 // Simulate raw sensor data with noise
91 let raw_data: Vec<f32> = (0..20)
92 .map(|i| {
93 let base = i as f32 * 0.5;
94 let noise = (i % 3) as f32 * 0.1;
95 base + noise
96 })
97 .collect();
98
99 let tensor = Tensor::from_slice(&raw_data, vec![20])?;
100 println!("Raw sensor data: {:?}", tensor.data());
101
102 // Multi-stage processing pipeline
103 println!("\nProcessing pipeline:");
104 println!("1. Normalize data (z-score)");
105 println!("2. Apply smoothing filter");
106 println!("3. Detect outliers");
107 println!("4. Apply feature scaling");
108
109 // Stage 1: Normalization
110 let mean = tensor.mean().value();
111 let std = tensor.std().value();
112 let normalized: Tensor = tensor
113 .iter()
114 .map(|elem| elem.sub_scalar(mean).div_scalar(std))
115 .collect();
116 println!(
117 " Normalized (mean={:.3}, std={:.3}): {:?}",
118 mean,
119 std,
120 normalized.data()
121 );
122
123 // Stage 2: Smoothing (simple moving average)
124 let smoothed: Tensor = normalized
125 .iter()
126 .enumerate()
127 .map(|(i, elem)| {
128 if i == 0 || i == normalized.size() - 1 {
129 elem.clone()
130 } else {
131 // Simple 3-point average
132 let prev = normalized.element_view(i - 1);
133 let next = normalized.element_view(i + 1);
134 elem.add_tensor(&prev).add_tensor(&next).div_scalar(3.0)
135 }
136 })
137 .collect();
138 println!(" Smoothed: {:?}", smoothed.data());
139
140 // Stage 3: Outlier detection and removal
141 let outlier_threshold = 2.0;
142 let cleaned: Tensor = smoothed
143 .iter()
144 .filter(|elem| elem.value().abs() < outlier_threshold)
145 .collect();
146 println!(
147 " Outliers removed (threshold={}): {:?}",
148 outlier_threshold,
149 cleaned.data()
150 );
151
152 // Stage 4: Feature scaling to [0, 1] range
153 let min_val = cleaned
154 .iter()
155 .map(|e| e.value())
156 .fold(f32::INFINITY, f32::min);
157 let max_val = cleaned
158 .iter()
159 .map(|e| e.value())
160 .fold(f32::NEG_INFINITY, f32::max);
161 let scaled: Tensor = cleaned
162 .iter()
163 .map(|elem| elem.sub_scalar(min_val).div_scalar(max_val - min_val))
164 .collect();
165 println!(" Scaled to [0,1]: {:?}", scaled.data());
166
167 Ok(())
168}
169
170/// Demonstrate conditional processing patterns
171///
172/// Shows how to implement dynamic filtering and transformation
173/// based on data characteristics and conditions.
174fn demonstrate_conditional_processing() -> Result<(), Box<dyn std::error::Error>> {
175 println!("\n--- Conditional Processing ---");
176
177 // Create data with mixed characteristics
178 let data = vec![1.0, -2.0, 3.0, -4.0, 5.0, -6.0, 7.0, -8.0, 9.0, -10.0];
179 let tensor = Tensor::from_slice(&data, vec![10])?;
180 println!("Input data: {:?}", tensor.data());
181
182 // Conditional transformation based on sign
183 println!("\nConditional transformation (positive/negative handling):");
184 let processed: Tensor = tensor
185 .iter()
186 .map(|elem| {
187 let val = elem.value();
188 if val > 0.0 {
189 elem.pow_scalar(2.0) // Square positive values
190 } else {
191 elem.mul_scalar(-1.0).sqrt() // Square root of absolute negative values
192 }
193 })
194 .collect();
195 println!(" Processed: {:?}", processed.data());
196
197 // Adaptive filtering based on local statistics
198 println!("\nAdaptive filtering (remove values > 2 std from local mean):");
199 let window_size = 3;
200 let adaptive_filtered: Tensor = tensor
201 .iter()
202 .enumerate()
203 .filter(|(i, elem)| {
204 let start = i.saturating_sub(window_size / 2);
205 let end = (i + window_size / 2 + 1).min(tensor.size());
206
207 // Calculate local mean and std
208 let local_values: Vec<f32> = (start..end)
209 .map(|j| tensor.element_view(j).value())
210 .collect();
211
212 let local_mean = local_values.iter().sum::<f32>() / local_values.len() as f32;
213 let local_variance = local_values
214 .iter()
215 .map(|v| (v - local_mean).powi(2))
216 .sum::<f32>()
217 / local_values.len() as f32;
218 let local_std = local_variance.sqrt();
219
220 let threshold = local_mean + 2.0 * local_std;
221 elem.value() <= threshold
222 })
223 .map(|(_, elem)| elem)
224 .collect();
225 println!(" Adaptive filtered: {:?}", adaptive_filtered.data());
226
227 // Multi-condition processing
228 println!("\nMulti-condition processing:");
229 let multi_processed: Tensor = tensor
230 .iter()
231 .map(|elem| {
232 let val = elem.value();
233 match () {
234 _ if val > 5.0 => elem.mul_scalar(2.0), // Double large values
235 _ if val < -5.0 => elem.div_scalar(2.0), // Halve small values
236 _ if val.abs() < 2.0 => elem.add_scalar(1.0), // Add 1 to small values
237 _ => elem.clone(), // Keep others unchanged
238 }
239 })
240 .collect();
241 println!(" Multi-condition: {:?}", multi_processed.data());
242
243 Ok(())
244}
245
246/// Demonstrate batch processing operations
247///
248/// Shows efficient processing of large datasets using iterator
249/// patterns and batch operations for performance optimization.
250fn demonstrate_batch_operations() -> Result<(), Box<dyn std::error::Error>> {
251 println!("\n--- Batch Operations ---");
252
253 // Create a larger dataset for batch processing
254 let size = 100;
255 let data: Vec<f32> = (0..size)
256 .map(|i| {
257 let x = i as f32 / size as f32;
258 x * x + 0.1 * (i % 7) as f32 // Quadratic with some noise
259 })
260 .collect();
261
262 let tensor = Tensor::from_slice(&data, vec![size])?;
263 println!("Dataset size: {}", tensor.size());
264
265 // Batch processing with windowing (iterator views)
266 println!("\nBatch processing with sliding windows:");
267 let batch_size = 10;
268 let batches: Vec<Tensor> = tensor
269 .iter()
270 .collect::<Vec<_>>()
271 .chunks(batch_size)
272 .map(|chunk| {
273 // Process each batch independently
274 chunk
275 .iter()
276 .map(|elem| elem.pow_scalar(2.0).add_scalar(1.0))
277 .collect()
278 })
279 .collect();
280
281 println!(
282 " Processed {} batches of size {}",
283 batches.len(),
284 batch_size
285 );
286 for (i, batch) in batches.iter().enumerate() {
287 println!(
288 " Batch {}: mean={:.3}, std={:.3}",
289 i,
290 batch.mean().value(),
291 batch.std().value()
292 );
293 }
294
295 // Parallel-like processing with stride
296 println!("\nStrided processing (every nth element):");
297 let stride = 5;
298 let strided: Tensor = tensor
299 .iter()
300 .enumerate()
301 .filter(|(i, _)| i % stride == 0)
302 .map(|(_, elem)| elem)
303 .collect();
304 println!(" Strided (every {}th): {:?}", stride, strided.data());
305
306 // Hierarchical processing
307 println!("\nHierarchical processing (coarse to fine):");
308 let coarse: Tensor = tensor
309 .iter()
310 .enumerate()
311 .filter(|(i, _)| i % 4 == 0) // Take every 4th element
312 .map(|(_, elem)| elem)
313 .collect();
314
315 let fine: Tensor = tensor
316 .iter()
317 .enumerate()
318 .filter(|(i, _)| i % 4 != 0) // Take the rest
319 .map(|(_, elem)| elem)
320 .collect();
321
322 println!(" Coarse (every 4th): {:?}", coarse.data());
323 println!(" Fine (rest): {:?}", fine.data());
324
325 // Combine coarse and fine with different processing
326 let combined: Tensor = coarse
327 .iter()
328 .map(|elem| elem.mul_scalar(2.0)) // Scale coarse
329 .chain(fine.iter().map(|elem| elem.div_scalar(2.0))) // Scale fine
330 .collect();
331 println!(" Combined: {:?}", combined.data());
332
333 Ok(())
334}
335
336/// Demonstrate real-world processing scenarios
337///
338/// Shows practical applications of iterator patterns for
339/// common data processing tasks in machine learning and analytics.
340fn demonstrate_real_world_scenarios() -> Result<(), Box<dyn std::error::Error>> {
341 println!("\n--- Real-world Scenarios ---");
342
343 // Scenario 1: Time series analysis
344 println!("\nScenario 1: Time Series Analysis");
345 let time_series: Vec<f32> = (0..24)
346 .map(|hour| {
347 let base = 20.0 + 10.0 * (hour as f32 * std::f32::consts::PI / 12.0).sin();
348 base + (hour % 3) as f32 * 2.0 // Add some noise
349 })
350 .collect();
351
352 let series = Tensor::from_slice(&time_series, vec![24])?;
353 println!(" Time series (24 hours): {:?}", series.data());
354
355 // Calculate moving average with view-based iteration
356 let window_size = 3;
357 let moving_avg: Tensor = series
358 .iter()
359 .enumerate()
360 .map(|(i, _)| {
361 let start = i.saturating_sub(window_size / 2);
362 let end = (i + window_size / 2 + 1).min(series.size());
363 let window = series.iter_range(start, end);
364 window.fold(0.0, |acc, elem| acc + elem.value()) / (end - start) as f32
365 })
366 .map(|val| Tensor::from_slice(&[val], vec![1]).unwrap())
367 .collect();
368 println!(
369 " Moving average (window={}): {:?}",
370 window_size,
371 moving_avg.data()
372 );
373
374 // Inference pipeline with NoGrad + streaming
375 println!("\nInference pipeline (NoGrad + streaming)");
376 let features = Tensor::from_slice(
377 &(0..48).map(|i| i as f32 * 0.125).collect::<Vec<_>>(),
378 vec![6, 8],
379 )?;
380 let fast = with_no_grad(|| {
381 // Stream values directly, apply light affine, and collect back to same shape
382 features
383 .data()
384 .iter()
385 .copied()
386 .map(|x| 0.75 * x + 0.1)
387 .collect_shape(vec![6, 8])
388 });
389 println!(
390 " NoGrad streamed transform shape: {:?}",
391 fast.shape().dims()
392 );
393
394 // Row-wise iteration with shape-preserving collection (GradTrack-friendly)
395 let per_row: Tensor = features
396 .iter()
397 .map(|row| row.mul_scalar(0.5).add_scalar(2.0))
398 .collect_shape(vec![6, 8]);
399 println!(" Row-wise mapped shape: {:?}", per_row.shape().dims());
400
401 // Scenario 2: Feature engineering
402 println!("\nScenario 2: Feature Engineering");
403 let features = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5])?;
404 println!(" Original features: {:?}", features.data());
405
406 // Create polynomial features
407 let poly_features: Tensor = features
408 .iter()
409 .flat_map(|elem| {
410 vec![
411 elem.clone(), // x^1
412 elem.pow_scalar(2.0), // x^2
413 elem.pow_scalar(3.0), // x^3
414 ]
415 })
416 .collect();
417 println!(
418 " Polynomial features (x, x^2, x^3): {:?}",
419 poly_features.data()
420 );
421
422 // Scenario 3: Data augmentation
423 println!("\nScenario 3: Data Augmentation");
424 let original = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3])?;
425 println!(" Original data: {:?}", original.data());
426
427 // Augment with noise and scaling
428 let augmented: Tensor = original
429 .iter()
430 .flat_map(|elem| {
431 vec![
432 elem.clone(), // Original
433 elem.add_scalar(0.1), // Add noise
434 elem.sub_scalar(0.1), // Subtract noise
435 elem.mul_scalar(1.1), // Scale up
436 elem.mul_scalar(0.9), // Scale down
437 ]
438 })
439 .collect();
440 println!(" Augmented data: {:?}", augmented.data());
441
442 // Scenario 4: Statistical analysis
443 println!("\nScenario 4: Statistical Analysis");
444 let sample_data = Tensor::from_slice(&[1.1, 2.3, 1.8, 2.1, 1.9, 2.0, 1.7, 2.2], vec![8])?;
445 println!(" Sample data: {:?}", sample_data.data());
446
447 // Calculate various statistics
448 let mean = sample_data.mean().value();
449 let std = sample_data.std().value();
450 let min = sample_data
451 .iter()
452 .map(|e| e.value())
453 .fold(f32::INFINITY, f32::min);
454 let max = sample_data
455 .iter()
456 .map(|e| e.value())
457 .fold(f32::NEG_INFINITY, f32::max);
458
459 // Z-score normalization
460 let z_scores: Tensor = sample_data
461 .iter()
462 .map(|elem| elem.sub_scalar(mean).div_scalar(std))
463 .collect();
464
465 println!(
466 " Statistics: mean={:.3}, std={:.3}, min={:.3}, max={:.3}",
467 mean, std, min, max
468 );
469 println!(" Z-scores: {:?}", z_scores.data());
470
471 Ok(())
472}
235fn demonstrate_large_scale_processing() -> Result<(), Box<dyn std::error::Error>> {
236 println!("\n--- Large-Scale Processing ---");
237
238 // Simulate large dataset processing
239 let sizes = vec![10000, 50000, 100000];
240
241 for size in sizes {
242 println!("\nProcessing dataset of size: {}", size);
243
244 // Generate large dataset
245 let data: Vec<f32> = (0..size)
246 .map(|i| {
247 let x = i as f32 / size as f32;
248 x * x + 0.1 * (i % 10) as f32 // Quadratic with noise
249 })
250 .collect();
251
252 let tensor = Tensor::from_slice(&data, vec![size])?;
253
254 // Technique 1: Batch processing
255 let batch_size = 1000;
256 let start = Instant::now();
257
258 let mut batch_results = Vec::new();
259 for batch_start in (0..size).step_by(batch_size) {
260 let batch_end = (batch_start + batch_size).min(size);
261 let batch: Tensor = tensor
262 .iter_range(batch_start, batch_end)
263 .map(|elem| elem.pow_scalar(2.0).add_scalar(1.0))
264 .collect();
265 batch_results.push(batch);
266 }
267 let batch_time = start.elapsed();
268
269 // Technique 2: Parallel-like processing with stride
270 let start = Instant::now();
271 let stride = 4;
272 let strided_result: Tensor = tensor
273 .iter()
274 .enumerate()
275 .filter(|(i, _)| i % stride == 0)
276 .map(|(_, elem)| elem.pow_scalar(2.0).add_scalar(1.0))
277 .collect();
278 let strided_time = start.elapsed();
279
280 // Technique 3: Hierarchical processing
281 let start = Instant::now();
282 let coarse: Tensor = tensor
283 .iter()
284 .enumerate()
285 .filter(|(i, _)| i % 10 == 0) // Every 10th element
286 .map(|(_, elem)| elem.pow_scalar(2.0).add_scalar(1.0))
287 .collect();
288 let fine: Tensor = tensor
289 .iter()
290 .enumerate()
291 .filter(|(i, _)| i % 10 != 0) // Rest of elements
292 .map(|(_, elem)| elem.pow_scalar(1.5).add_scalar(0.5))
293 .collect();
294 let hierarchical_time = start.elapsed();
295
296 // Report performance
297 println!(" Batch processing: {:?}", batch_time);
298 println!(" Strided processing: {:?}", strided_time);
299 println!(" Hierarchical processing: {:?}", hierarchical_time);
300
301 // Memory usage analysis
302 let total_batches = size.div_ceil(batch_size);
303 println!(" Batch count: {}", total_batches);
304 println!(" Strided result size: {}", strided_result.size());
305 println!(
306 " Hierarchical: coarse={}, fine={}",
307 coarse.size(),
308 fine.size()
309 );
310 }
311
312 Ok(())
313}
314
315/// Demonstrate advanced optimization techniques
316///
317/// Shows sophisticated optimization strategies and techniques
318/// for maximizing performance in tensor iterator operations.
319fn demonstrate_optimization_techniques() -> Result<(), Box<dyn std::error::Error>> {
320 println!("\n--- Optimization Techniques ---");
321
322 let size = 50000;
323 let data: Vec<f32> = (0..size).map(|i| i as f32).collect();
324 let tensor = Tensor::from_slice(&data, vec![size])?;
325
326 println!("Optimizing processing for size: {}", size);
327
328 // Technique 1: Operation fusion
329 println!("\nTechnique 1: Operation Fusion");
330 let start = Instant::now();
331 let fused_result: Tensor = tensor
332 .iter()
333 .map(|elem| {
334 // Fuse multiple operations into single chain
335 elem.mul_scalar(2.0).add_scalar(1.0).pow_scalar(2.0).sqrt()
336 })
337 .collect();
338 let fused_time = start.elapsed();
339
340 // Technique 2: Conditional optimization
341 println!("\nTechnique 2: Conditional Optimization");
342 let start = Instant::now();
343 let conditional_result: Tensor = tensor
344 .iter()
345 .map(|elem| {
346 let val = elem.value();
347 if val < size as f32 / 2.0 {
348 elem.mul_scalar(2.0) // Simple operation for small values
349 } else {
350 elem.pow_scalar(2.0).sqrt() // Complex operation for large values
351 }
352 })
353 .collect();
354 let conditional_time = start.elapsed();
355
356 // Technique 3: Cache-friendly processing
357 println!("\nTechnique 3: Cache-Friendly Processing");
358 let start = Instant::now();
359 let cache_friendly_result: Tensor = tensor
360 .iter()
361 .take(1000) // Process in cache-friendly chunks
362 .map(|elem| elem.mul_scalar(2.0))
363 .collect();
364 let cache_friendly_time = start.elapsed();
365
366 // Technique 4: Memory pooling simulation
367 println!("\nTechnique 4: Memory Pooling Simulation");
368 let start = Instant::now();
369 let pooled_result: Tensor = tensor
370 .iter()
371 .enumerate()
372 .filter(|(i, _)| i % 100 == 0) // Process every 100th element
373 .map(|(_, elem)| elem.pow_scalar(2.0))
374 .collect();
375 let pooled_time = start.elapsed();
376
377 // Report optimization results
378 println!(" Fused operations: {:?}", fused_time);
379 println!(" Conditional optimization: {:?}", conditional_time);
380 println!(" Cache-friendly processing: {:?}", cache_friendly_time);
381 println!(" Memory pooling simulation: {:?}", pooled_time);
382
383 // Performance analysis
384 let fastest = fused_time
385 .min(conditional_time)
386 .min(cache_friendly_time)
387 .min(pooled_time);
388 println!(" Fastest technique: {:?}", fastest);
389
390 // Memory efficiency analysis
391 println!(" Fused result size: {}", fused_result.size());
392 println!(" Conditional result size: {}", conditional_result.size());
393 println!(
394 " Cache-friendly result size: {}",
395 cache_friendly_result.size()
396 );
397 println!(" Pooled result size: {}", pooled_result.size());
398
399 // Technique 5: Gradient optimization
400 println!("\nTechnique 5: Gradient Optimization");
401 let grad_tensor = tensor.with_requires_grad();
402 let start = Instant::now();
403
404 let grad_result: Tensor = grad_tensor
405 .iter()
406 .map(|elem| elem.pow_scalar(2.0).add_scalar(1.0))
407 .collect();
408
409 let mut loss = grad_result.sum();
410 loss.backward(None);
411 let grad_time = start.elapsed();
412
413 println!(" Gradient computation: {:?}", grad_time);
414 println!(
415 " Gradient tracking enabled: {}",
416 grad_result.requires_grad()
417 );
418
419 Ok(())
420}
Sourcepub fn outer_iter(&self) -> TensorDimIterator<'_>
pub fn outer_iter(&self) -> TensorDimIterator<'_>
Explicit alias for outermost-dimension iteration of sub-tensors.
Equivalent to iter_dim(0).
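§Examples
A minimal usage sketch mirroring the iter_dim(0) example above (illustrative only):
use train_station::tensor::TensorCollectExt;
use train_station::Tensor;
let m = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let y = m.outer_iter().map(|row| row.mul_scalar(2.0)).collect_shape(vec![2, 2]);
assert_eq!(y.data(), &[2.0, 4.0, 6.0, 8.0]);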
Source§impl Tensor
impl Tensor
Sourcepub fn windows(&self, window_size: usize) -> TensorWindowsIterator<'_>
pub fn windows(&self, window_size: usize) -> TensorWindowsIterator<'_>
Overlapping windows iterator with step=1. Use this instead of iter_windows.
Produces overlapping linear windows as view tensors. In no-grad fast mode, a contiguous owner may be materialized once for faster subsequent views.
§Arguments
window_size- Length of each window (> 0)
§Examples
use train_station::Tensor;
let t = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![4]).unwrap();
let v: Vec<f32> = t.windows(3).map(|w| w.sum().value()).collect();
assert_eq!(v, vec![6.0, 9.0]);
Sourcepub fn windows_step(
&self,
window_size: usize,
step: usize,
) -> TensorWindowsIterator<'_>
pub fn windows_step( &self, window_size: usize, step: usize, ) -> TensorWindowsIterator<'_>
Overlapping windows iterator with custom step. Use this instead of iter_windows_step.
Produces windows starting at positions 0, step, 2*step, ... up to the last
valid start. Reverse iteration yields the same sequence in reverse.
§Arguments
window_size- Length of each window (> 0)
step- Step between consecutive window starts (> 0)
§Examples
use train_station::Tensor;
let t = Tensor::from_slice(&(1..=8).map(|i| i as f32).collect::<Vec<_>>(), vec![8]).unwrap();
let wins: Vec<Tensor> = t.windows_step(3, 2).collect();
assert_eq!(wins[0].data(), &[1.0, 2.0, 3.0]);
assert_eq!(wins[1].data(), &[3.0, 4.0, 5.0]);
assert_eq!(wins[2].data(), &[5.0, 6.0, 7.0]);
pub fn iter_windows(&self, window_size: usize) -> TensorWindowsIterator<'_>
pub fn iter_windows_step( &self, window_size: usize, step: usize, ) -> TensorWindowsIterator<'_>
Source§impl Tensor
impl Tensor
Sourcepub fn add_tensor(&self, other: &Tensor) -> Tensor
pub fn add_tensor(&self, other: &Tensor) -> Tensor
Element-wise addition with another tensor with broadcasting support.
Performs element-wise addition with automatic broadcasting: output[i] = self[i] + other[i]
Broadcasting enables addition between tensors of different but compatible shapes. Compatible shapes follow NumPy broadcasting rules:
- Dimensions are aligned from the rightmost dimension
- Dimensions are compatible if they are equal, or one of them is 1
- Missing dimensions are treated as 1
§Arguments
other- Tensor to add. Shapes must be broadcast-compatible.
§Returns
A new tensor containing the element-wise sum with broadcast result shape
§Examples
§Same Shape Addition
use train_station::Tensor;
let a = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let b = Tensor::from_slice(&[4.0, 5.0, 6.0], vec![3]).unwrap();
let c = a.add_tensor(&b);
assert_eq!(c.shape().dims(), vec![3]);
assert_eq!(c.get(&[0]), 5.0);
assert_eq!(c.get(&[1]), 7.0);
assert_eq!(c.get(&[2]), 9.0);
§Broadcasting Addition
use train_station::Tensor;
// Broadcasting: [2, 1] + [1, 3] -> [2, 3]
let a = Tensor::from_slice(&[1.0, 2.0], vec![2, 1]).unwrap();
let b = Tensor::from_slice(&[10.0, 20.0, 30.0], vec![1, 3]).unwrap();
let c = a.add_tensor(&b);
assert_eq!(c.shape().dims(), vec![2, 3]);
assert_eq!(c.get(&[0, 0]), 11.0);
assert_eq!(c.get(&[0, 1]), 21.0);
assert_eq!(c.get(&[1, 0]), 12.0);
assert_eq!(c.get(&[1, 1]), 22.0);
§Scalar Broadcasting
use train_station::Tensor;
// Scalar broadcasting: [2, 3] + scalar -> [2, 3]
let a = Tensor::ones(vec![2, 3]);
let b = Tensor::from_slice(&[5.0], vec![1]).unwrap();
let c = a.add_tensor(&b);
assert_eq!(c.shape().dims(), vec![2, 3]);
assert_eq!(c.get(&[0, 0]), 6.0);
assert_eq!(c.get(&[1, 2]), 6.0);
§Panics
Panics if tensor shapes are not broadcast-compatible
Examples found in repository?
More examples
295fn clamp_ratio(ratio: &Tensor, clip_eps: f32) -> Tensor {
296 let b = ratio.shape().dims()[0];
297 let low = Tensor::from_slice(&vec![1.0 - clip_eps; b], vec![b, 1]).unwrap();
298 let high = Tensor::from_slice(&vec![1.0 + clip_eps; b], vec![b, 1]).unwrap();
299 let ge_low = ratio.sub_tensor(&low).relu().add_tensor(&low);
300 high.sub_tensor(&ge_low.sub_tensor(&high).relu())
301}
302
303fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
304 let mut total_sq = 0.0f32;
305 for p in parameters.iter_mut() {
306 if let Some(g) = p.grad_owned() {
307 for &v in g.data() {
308 total_sq += v * v;
309 }
310 }
311 }
312 total_sq.sqrt()
313}
314
315// -------------------------------
316// Main
317// -------------------------------
318
319pub fn main() -> Result<(), Box<dyn std::error::Error>> {
320 println!("=== PPO Discrete Example (YardEnv) ===");
321
322 let state_dim = 3usize;
323 let action_dim = 3usize;
324 let total_steps = std::env::var("PPOD_STEPS")
325 .ok()
326 .and_then(|v| v.parse::<usize>().ok())
327 .unwrap_or(3500usize);
328 let horizon = 128usize;
329 let epochs = 4usize;
330 let mini_batch_size = 64usize;
331 let gamma = 0.99f32;
332 let lam = 0.95f32;
333 let clip_eps = 0.2f32;
334 let vf_coef = 0.5f32;
335 let ent_coef = 0.0f32;
336 let max_grad_norm = 1.0f32;
337
338 let mut actor = Actor::new(state_dim, action_dim, Some(111));
339 let mut critic = Critic::new(state_dim, Some(222));
340 let mut actor_opt = Adam::with_learning_rate(3e-4);
341 for p in actor.parameters() {
342 actor_opt.add_parameter(p);
343 }
344 let mut critic_opt = Adam::with_learning_rate(3e-4);
345 for p in critic.parameters() {
346 critic_opt.add_parameter(p);
347 }
348
349 let mut env = YardEnv::new(1234);
350 let mut rng = SmallRng::new(98765);
351 let mut state = env.reset();
352 let mut episode_return = 0.0f32;
353 let mut episode = 0usize;
354 let mut ema_return: Option<f32> = None;
355 let ema_alpha = 0.05f32;
356 let mut best_return = f32::NEG_INFINITY;
357
358 let mut t = 0usize;
359 while t < total_steps {
360 let mut batch = RolloutBatch::new(horizon, state_dim);
361 for _ in 0..horizon {
362 // Actor logits and categorical sampling
363 let logits = actor.forward(&state); // [1, A]
364 let probs = logits.softmax(1); // [1, A]
365 // sample action from probs (CPU sampling)
366 let p = probs.data();
367 let (p0, p1, _p2) = (p[0], p[1], p[2]);
368 let u = rng.next_f32();
369 let a_idx = if u < p0 {
370 0
371 } else if u < p0 + p1 {
372 1
373 } else {
374 2
375 };
376
377 let old_logp = {
378 let _ng = NoGradTrack::new();
379 let lp = log_prob_actions(&logits, &[a_idx], 1, action_dim);
380 lp.data()[0]
381 };
382
383 // Step env
384 let (next_state, reward, done) = env.step(a_idx);
385 episode_return += reward;
386
387 // Critic value
388 let value_t = critic.forward(&state);
389 let value_v = value_t.data()[0];
390
391 batch.push(
392 state.data(),
393 a_idx,
394 old_logp,
395 reward,
396 if done { 1.0 } else { 0.0 },
397 value_v,
398 next_state.data(),
399 );
400
401 state = if done {
402 let st = env.reset();
403 ema_return = Some(match ema_return {
404 None => episode_return,
405 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
406 });
407 if episode_return > best_return {
408 best_return = episode_return;
409 }
410 println!(
411 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
412 t,
413 episode,
414 episode_return,
415 ema_return.unwrap_or(episode_return),
416 best_return
417 );
418 episode_return = 0.0;
419 episode += 1;
420 st
421 } else {
422 next_state
423 };
424
425 t += 1;
426 if t >= total_steps {
427 break;
428 }
429 }
430
431 // Bootstrap values for GAE
432 let next_values: Vec<f32> = {
433 let mut out = Vec::with_capacity(batch.len());
434 for i in 0..batch.len() {
435 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
436 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
437 out.push(critic.forward(&s2_t).data()[0]);
438 }
439 out
440 };
441
442 let mut returns = vec![0.0f32; batch.len()];
443 let mut adv = vec![0.0f32; batch.len()];
444 compute_gae(
445 &mut returns,
446 &mut adv,
447 &batch.rewards,
448 &batch.dones,
449 &batch.values,
450 &next_values,
451 gamma,
452 lam,
453 );
454 normalize_in_place(&mut adv, 1e-8);
455
456 // Tensors for training
457 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
458 let actions_vec = batch.actions.clone();
459 let old_logp_t = Tensor::from_slice(&batch.old_logps, vec![batch.len(), 1]).unwrap();
460 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
461 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
462
463 // PPO epochs
464 let num_minibatches = batch.len().div_ceil(mini_batch_size);
465 for e in 0..epochs {
466 for mb in 0..num_minibatches {
467 let start = mb * mini_batch_size;
468 let end = (start + mini_batch_size).min(batch.len());
469 if start >= end {
470 break;
471 }
472
473 // Views
474 let s_mb = states_t
475 .slice_view(start * state_dim, 1, (end - start) * state_dim)
476 .reshape(vec![(end - start) as i32, state_dim as i32]);
477 let oldlp_mb = old_logp_t
478 .slice_view(start, 1, end - start)
479 .reshape(vec![(end - start) as i32, 1]);
480 let ret_mb = returns_t
481 .slice_view(start, 1, end - start)
482 .reshape(vec![(end - start) as i32, 1]);
483 let adv_mb = adv_t
484 .slice_view(start, 1, end - start)
485 .reshape(vec![(end - start) as i32, 1]);
486 let a_slice = &actions_vec[start..end];
487
488 // Zero grads
489 {
490 let mut ps = actor.parameters();
491 actor_opt.zero_grad(&mut ps);
492 }
493 {
494 let mut ps = critic.parameters();
495 critic_opt.zero_grad(&mut ps);
496 }
497
498 // Forward
499 let logits_mb = actor.forward(&s_mb); // [B,A]
500 let new_logp_mb = log_prob_actions(&logits_mb, a_slice, end - start, action_dim); // [B,1]
501 let ratio = ratio_from_logps(&new_logp_mb, &oldlp_mb);
502 let ratio_clipped = clamp_ratio(&ratio, clip_eps);
503 let pg1 = ratio.mul_tensor(&adv_mb);
504 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
505 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
506 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
507 let actor_loss = actor_min.mul_scalar(-1.0).mean();
508
509 let v_pred = critic.forward(&s_mb);
510 let v_loss = v_pred
511 .sub_tensor(&ret_mb)
512 .pow_scalar(2.0)
513 .mean()
514 .mul_scalar(vf_coef);
515
516 // Entropy bonus from logits (categorical entropy) ≈ -sum p*logp
517 let probs_mb = logits_mb.softmax(1);
518 let logp_all = probs_mb.add_scalar(1e-8).log();
519 let ent = probs_mb
520 .mul_tensor(&logp_all)
521 .sum_dims(&[1], true)
522 .mul_scalar(-1.0)
523 .mean()
524 .mul_scalar(ent_coef);
525
526 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&ent);
527 loss.backward(None);
528
529 // Step actor
530 {
531 let params = actor.parameters();
532 let mut with_grads: Vec<&mut Tensor> = Vec::new();
533 for p in params {
534 if p.grad_owned().is_some() {
535 with_grads.push(p);
536 }
537 }
538 if !with_grads.is_empty() {
539 let _ = grad_global_norm(&mut with_grads);
540 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
541 actor_opt.step(&mut with_grads);
542 actor_opt.zero_grad(&mut with_grads);
543 }
544 }
545
546 // Step critic
547 {
548 let params = critic.parameters();
549 let mut with_grads: Vec<&mut Tensor> = Vec::new();
550 for p in params {
551 if p.grad_owned().is_some() {
552 with_grads.push(p);
553 }
554 }
555 if !with_grads.is_empty() {
556 let _ = grad_global_norm(&mut with_grads);
557 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
558 critic_opt.step(&mut with_grads);
559 critic_opt.zero_grad(&mut with_grads);
560 }
561 }
562
563 if e == 0 && mb == 0 {
564 println!(
565 "update@t={} | actor_loss={:.4} v_loss={:.4}",
566 t,
567 actor_loss.value(),
568 v_loss.value()
569 );
570 }
571
572 clear_all_graphs_known();
573 }
574 }
575 }
576
577 println!("=== PPO discrete training finished ===");
578 Ok(())
579}
53 pub fn forward(&self, input: &Tensor, attn_mask: Option<&Tensor>) -> Tensor {
54 let attn = self.mha.forward(input, input, input, attn_mask);
55 let res1 = attn.add_tensor(input);
56
57 // Feed-forward network with ReLU and residual
58 let (b, t, e) = Self::triple(input);
59 let x2d = res1.contiguous().view(vec![(b * t) as i32, e as i32]);
60 let hidden = self.ffn_in.forward(&x2d).relu();
61 let out2d = self.ffn_out.forward(&hidden);
62 let out = out2d.view(vec![b as i32, t as i32, e as i32]);
63 out.add_tensor(&res1)
64 }
234fn gaussian_log_prob(action: &Tensor, mean: &Tensor, log_std: &Tensor) -> Tensor {
235 // All tensors shaped [B, A] (log_std is broadcastable)
236 let std = log_std.exp();
237 let var = std.pow_scalar(2.0);
238 let log_scale = log_std;
239 let diff = action.sub_tensor(mean);
240 let log_prob = diff
241 .pow_scalar(2.0)
242 .div_tensor(&var)
243 .add_scalar(std::f32::consts::LN_2 + std::f32::consts::PI)
244 .add_tensor(&log_scale.mul_scalar(2.0))
245 .mul_scalar(0.5)
246 .mul_scalar(-1.0);
247 // Sum across action dim (dim=1) -> [B,1]
248 log_prob.sum_dims(&[1], true)
249}
250
251#[allow(clippy::too_many_arguments)]
252fn compute_gae(
253 returns_out: &mut [f32],
254 adv_out: &mut [f32],
255 rewards: &[f32],
256 dones: &[f32],
257 values: &[f32],
258 next_values: &[f32],
259 gamma: f32,
260 lam: f32,
261) {
262 let n = rewards.len();
263 let mut gae = 0.0f32;
264 for t in (0..n).rev() {
265 let not_done = 1.0 - dones[t];
266 let delta = rewards[t] + gamma * next_values[t] * not_done - values[t];
267 gae = delta + gamma * lam * not_done * gae;
268 adv_out[t] = gae;
269 returns_out[t] = gae + values[t];
270 }
271}
272
273fn normalize_in_place(x: &mut [f32], eps: f32) {
274 let n = x.len() as f32;
275 if n <= 1.0 {
276 return;
277 }
278 let mean = x.iter().copied().sum::<f32>() / n;
279 let var = x
280 .iter()
281 .map(|v| {
282 let d = v - mean;
283 d * d
284 })
285 .sum::<f32>()
286 / n;
287 let std = (var + eps).sqrt();
288 for v in x.iter_mut() {
289 *v = (*v - mean) / std;
290 }
291}
292
293fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
294 let mut total_sq = 0.0f32;
295 for p in parameters.iter() {
296 if let Some(g) = p.grad_owned() {
297 for &v in g.data() {
298 total_sq += v * v;
299 }
300 }
301 }
302 let norm = total_sq.sqrt();
303 if norm > max_norm {
304 let scale = max_norm / (norm + eps);
305 for p in parameters.iter_mut() {
306 if let Some(g) = p.grad_owned() {
307 p.set_grad(g.mul_scalar(scale));
308 }
309 }
310 }
311}
312
313fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
314 let mut total_sq = 0.0f32;
315 for p in parameters.iter_mut() {
316 if let Some(g) = p.grad_owned() {
317 for &v in g.data() {
318 total_sq += v * v;
319 }
320 }
321 }
322 total_sq.sqrt()
323}
324
325// -------------------------------
326// Main
327// -------------------------------
328
329pub fn main() -> Result<(), Box<dyn std::error::Error>> {
330 println!("=== PPO Continuous Example (YardEnv) ===");
331
332 let state_dim = 3usize;
333 let action_dim = 1usize;
334
335 // Hparams
336 let total_steps = std::env::var("PPO_STEPS")
337 .ok()
338 .and_then(|v| v.parse::<usize>().ok())
339 .unwrap_or(4000usize);
340 let horizon = 128usize; // rollout length per update
341 let epochs = 4usize; // PPO epochs per update
342 let mini_batch_size = 64usize; // minibatch from horizon
343 let gamma = 0.99f32;
344 let lam = 0.95f32; // GAE lambda
345 let clip_eps = 0.2f32;
346 let vf_coef = 0.5f32;
347 let ent_coef = 0.0f32;
348 let max_grad_norm = 1.0f32;
349
350 // Models
351 let mut actor = Actor::new(state_dim, action_dim, Some(101));
352 let mut critic = Critic::new(state_dim, Some(202));
353
354 // Opts
355 let mut actor_opt = Adam::with_learning_rate(3e-4);
356 for p in actor.parameters() {
357 actor_opt.add_parameter(p);
358 }
359 let mut critic_opt = Adam::with_learning_rate(3e-4);
360 for p in critic.parameters() {
361 critic_opt.add_parameter(p);
362 }
363
364 // Env and RNG
365 let mut env = YardEnv::new(42);
366 let mut rng = SmallRng::new(999);
367 let mut state = env.reset();
368
369 // Metrics
370 let mut episode_return = 0.0f32;
371 let mut episode = 0usize;
372 let mut ema_return: Option<f32> = None;
373 let ema_alpha = 0.05f32;
374 let mut best_return = f32::NEG_INFINITY;
375
376 let mut t = 0usize;
377 while t < total_steps {
378 // Collect a rollout
379 let mut batch = RolloutBatch::new(horizon, state_dim);
380 for _ in 0..horizon {
381 // Policy forward (detached sampling to not blow graph; we use stored log_probs)
382 let (mean, log_std_row) = actor.forward(&state);
383 let mean_v = mean.data()[0];
384 let log_std_v = log_std_row.data()[0];
385 let std_v = log_std_v.exp();
386 let noise = rng.normal();
387 let action_v = (mean_v + std_v * noise).clamp(-1.0, 1.0);
388
389 // Build action tensor [1, A] for log_prob calculation with autograd
390 let action_t = Tensor::from_slice(&[action_v], vec![1, action_dim]).unwrap();
391 let log_prob_t = gaussian_log_prob(&action_t, &mean, &log_std_row);
392 let log_prob_v = log_prob_t.data()[0];
393
394 // Step env
395 let (next_state, reward, done) = env.step(action_v);
396 episode_return += reward;
397
398 // Value
399 let value_t = critic.forward(&state);
400 let value_v = value_t.data()[0];
401
402 // Push
403 batch.push(
404 state.data(),
405 action_v,
406 log_prob_v,
407 reward,
408 if done { 1.0 } else { 0.0 },
409 value_v,
410 next_state.data(),
411 );
412
413 // Reset
414 state = if done {
415 let st = env.reset();
416 ema_return = Some(match ema_return {
417 None => episode_return,
418 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
419 });
420 if episode_return > best_return {
421 best_return = episode_return;
422 }
423 println!(
424 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
425 t,
426 episode,
427 episode_return,
428 ema_return.unwrap_or(episode_return),
429 best_return
430 );
431 episode_return = 0.0;
432 episode += 1;
433 st
434 } else {
435 next_state
436 };
437
438 t += 1;
439 if t >= total_steps {
440 break;
441 }
442 }
443
444 // Bootstrap next values for GAE
445 let next_values: Vec<f32> = {
446 let mut out = Vec::with_capacity(batch.len());
447 for i in 0..batch.len() {
448 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
449 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
450 let v2 = critic.forward(&s2_t).data()[0];
451 out.push(v2);
452 }
453 out
454 };
455
456 // Compute returns and advantages
457 let mut returns = vec![0.0f32; batch.len()];
458 let mut adv = vec![0.0f32; batch.len()];
459 compute_gae(
460 &mut returns,
461 &mut adv,
462 &batch.rewards,
463 &batch.dones,
464 &batch.values,
465 &next_values,
466 gamma,
467 lam,
468 );
469 normalize_in_place(&mut adv, 1e-8);
470
471 // Prepare tensors for training
472 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
473 let actions_t = Tensor::from_slice(&batch.actions, vec![batch.len(), action_dim]).unwrap();
474 let old_logp_t = Tensor::from_slice(&batch.log_probs, vec![batch.len(), 1]).unwrap();
475 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
476 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
477
478 // PPO epochs over the rollout
479 let num_minibatches = batch.len().div_ceil(mini_batch_size);
480 for e in 0..epochs {
481 for mb in 0..num_minibatches {
482 let start = mb * mini_batch_size;
483 let end = (start + mini_batch_size).min(batch.len());
484 if start >= end {
485 break;
486 }
487
488 // Slice views
489 let s_mb = states_t.slice_view(start * state_dim, 1, (end - start) * state_dim);
490 let s_mb = s_mb.reshape(vec![(end - start) as i32, state_dim as i32]);
491 let a_mb = actions_t
492 .slice_view(start * action_dim, 1, (end - start) * action_dim)
493 .reshape(vec![(end - start) as i32, action_dim as i32]);
494 let oldlp_mb = old_logp_t
495 .slice_view(start, 1, end - start)
496 .reshape(vec![(end - start) as i32, 1]);
497 let ret_mb = returns_t
498 .slice_view(start, 1, end - start)
499 .reshape(vec![(end - start) as i32, 1]);
500 let adv_mb = adv_t
501 .slice_view(start, 1, end - start)
502 .reshape(vec![(end - start) as i32, 1]);
503
504 // Zero grads
505 {
506 let mut ps = actor.parameters();
507 actor_opt.zero_grad(&mut ps);
508 }
509 {
510 let mut ps = critic.parameters();
511 critic_opt.zero_grad(&mut ps);
512 }
513
514 // Forward actor and critic
515 let (mean_mb, log_std_row) = actor.forward(&s_mb);
516 let logp_mb = gaussian_log_prob(&a_mb, &mean_mb, &log_std_row);
517 let ratio = logp_mb.sub_tensor(&oldlp_mb).exp(); // exp(new-old)
518 let clip_low =
519 Tensor::from_slice(&vec![1.0 - clip_eps; end - start], vec![end - start, 1])
520 .unwrap();
521 let clip_high =
522 Tensor::from_slice(&vec![1.0 + clip_eps; end - start], vec![end - start, 1])
523 .unwrap();
524 // ratio_clipped = min(max(ratio, low), high) using ReLU identities
525 let ratio_ge_low = ratio.sub_tensor(&clip_low).relu().add_tensor(&clip_low);
526 let ratio_clipped =
527 clip_high.sub_tensor(&ratio_ge_low.sub_tensor(&clip_high).relu());
528 let pg1 = ratio.mul_tensor(&adv_mb);
529 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
530 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
531 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
532 let actor_loss = actor_min.mul_scalar(-1.0).mean();
533
534 let v_pred = critic.forward(&s_mb);
535 let v_loss = v_pred
536 .sub_tensor(&ret_mb)
537 .pow_scalar(2.0)
538 .mean()
539 .mul_scalar(vf_coef);
540
541 // Entropy (approx Gaussian entropy per action)
542 let entropy = log_std_row
543 .add_scalar(0.5 * (2.0 * std::f32::consts::PI * std::f32::consts::E).ln())
544 .sum_dims(&[1], true)
545 .mean()
546 .mul_scalar(ent_coef);
547
548 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&entropy);
549 loss.backward(None);
550
551 // Step actor
552 {
553 let params = actor.parameters();
554 let mut with_grads: Vec<&mut Tensor> = Vec::new();
555 for p in params {
556 if p.grad_owned().is_some() {
557 with_grads.push(p);
558 }
559 }
560 if !with_grads.is_empty() {
561 let _ = grad_global_norm(&mut with_grads);
562 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
563 actor_opt.step(&mut with_grads);
564 actor_opt.zero_grad(&mut with_grads);
565 }
566 }
567
568 // Step critic
569 {
570 let params = critic.parameters();
571 let mut with_grads: Vec<&mut Tensor> = Vec::new();
572 for p in params {
573 if p.grad_owned().is_some() {
574 with_grads.push(p);
575 }
576 }
577 if !with_grads.is_empty() {
578 let _ = grad_global_norm(&mut with_grads);
579 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
580 critic_opt.step(&mut with_grads);
581 critic_opt.zero_grad(&mut with_grads);
582 }
583 }
584
585 // Occasionally log
586 if e == 0 && mb == 0 {
587 println!(
588 "update@t={} | actor_loss={:.4} v_loss={:.4}",
589 t,
590 actor_loss.value(),
591 v_loss.value()
592 );
593 }
594
595 clear_all_graphs_known();
596 }
597 }
598 }
599
600 println!("=== PPO training finished ===");
601 Ok(())
602}
56 pub fn forward(
57 &self,
58 tgt: &Tensor,
59 memory: &Tensor,
60 causal_mask: Option<&Tensor>,
61 cross_mask: Option<&Tensor>,
62 ) -> Tensor {
63 let self_attn = self.self_attn.forward(tgt, tgt, tgt, causal_mask);
64 let res1 = self_attn.add_tensor(tgt);
65
66 let cross = self.cross_attn.forward(&res1, memory, memory, cross_mask);
67 let res2 = cross.add_tensor(&res1);
68
69 let (b, t, e) = Self::triple(tgt);
70 let x2d = res2.contiguous().view(vec![(b * t) as i32, e as i32]);
71 let hidden = self.ffn_in.forward(&x2d).relu();
72 let out2d = self.ffn_out.forward(&hidden);
73 let out = out2d.view(vec![b as i32, t as i32, e as i32]);
74 out.add_tensor(&res2)
75 }
Sourcepub fn add_scalar(&self, scalar: f32) -> Tensor
pub fn add_scalar(&self, scalar: f32) -> Tensor
Broadcast addition with a scalar value.
Adds the scalar to every element: output[i] = self[i] + scalar
§Arguments
scalar- Value to add to each element
§Returns
A new tensor with the scalar added to each element
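§Examples
A short usage sketch, consistent with the scalar addition shown in the other examples on this page:
use train_station::Tensor;
let a = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let b = a.add_scalar(5.0);
assert_eq!(b.shape().dims(), vec![3]);
assert_eq!(b.get(&[0]), 6.0);
assert_eq!(b.get(&[1]), 7.0);
assert_eq!(b.get(&[2]), 8.0);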
Examples found in repository?
More examples
89fn demonstrate_basic_operations() {
90 println!("\n--- Basic Operations ---");
91
92 let a = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
93 let b = Tensor::from_slice(&[5.0, 6.0, 7.0, 8.0], vec![2, 2]).unwrap();
94
95 // Addition
96 let sum = a.add_tensor(&b);
97 println!("A + B: {:?}", sum.data());
98
99 // Subtraction
100 let diff = a.sub_tensor(&b);
101 println!("A - B: {:?}", diff.data());
102
103 // Multiplication
104 let product = a.mul_tensor(&b);
105 println!("A * B: {:?}", product.data());
106
107 // Division
108 let quotient = a.div_tensor(&b);
109 println!("A / B: {:?}", quotient.data());
110
111 // Scalar operations
112 let scalar_add = a.add_scalar(5.0);
113 println!("A + 5.0: {:?}", scalar_add.data());
114
115 let scalar_mul = a.mul_scalar(2.0);
116 println!("A * 2.0: {:?}", scalar_mul.data());
117}
93fn demonstrate_basic_iteration() -> Result<(), Box<dyn std::error::Error>> {
94 println!("\n--- Basic Element Iteration ---");
95
96 // Create a simple tensor for demonstration
97 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5])?;
98 println!("Original tensor: {:?}", tensor.data());
99
100 // Basic iteration with for loop
101 println!("\nBasic iteration with for loop:");
102 for (i, element) in tensor.iter().enumerate() {
103 println!(
104 " Element {}: value = {:.1}, shape = {:?}",
105 i,
106 element.value(),
107 element.shape().dims()
108 );
109 }
110
111 // Element-wise transformation
112 println!("\nElement-wise transformation (2x + 1):");
113 let transformed: Tensor = tensor
114 .iter()
115 .map(|elem| elem.mul_scalar(2.0).add_scalar(1.0))
116 .collect();
117 println!(" Result: {:?}", transformed.data());
118
119 // Filtering elements
120 println!("\nFiltering elements (values > 3.0):");
121 let filtered: Tensor = tensor.iter().filter(|elem| elem.value() > 3.0).collect();
122 println!(" Filtered: {:?}", filtered.data());
123
124 Ok(())
125}
126
127/// Demonstrate standard iterator trait methods
128///
129/// Shows compatibility with Rust's standard library iterator methods
130/// and demonstrates various functional programming patterns.
131fn demonstrate_standard_methods() -> Result<(), Box<dyn std::error::Error>> {
132 println!("\n--- Standard Iterator Methods ---");
133
134 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5])?;
135
136 // Using map for transformations
137 println!("\nMap transformation (square each element):");
138 let squared: Tensor = tensor.iter().map(|elem| elem.pow_scalar(2.0)).collect();
139 println!(" Squared: {:?}", squared.data());
140
141 // Using enumerate for indexed operations
142 println!("\nEnumerate with indexed operations:");
143 let indexed: Tensor = tensor
144 .iter()
145 .enumerate()
146 .map(|(i, elem)| elem.add_scalar(i as f32))
147 .collect();
148 println!(" Indexed: {:?}", indexed.data());
149
150 // Using fold for reduction
151 println!("\nFold for sum calculation:");
152 let sum: f32 = tensor.iter().fold(0.0, |acc, elem| acc + elem.value());
153 println!(" Sum: {:.1}", sum);
154
155 // Using find for element search
156 println!("\nFind specific element:");
157 if let Some(found) = tensor.iter().find(|elem| elem.value() == 3.0) {
158 println!(" Found element with value 3.0: {:.1}", found.value());
159 }
160
161 // Using any/all for condition checking
162 println!("\nCondition checking:");
163 let all_positive = tensor.iter().all(|elem| elem.value() > 0.0);
164 let any_large = tensor.iter().any(|elem| elem.value() > 4.0);
165 println!(" All positive: {}", all_positive);
166 println!(" Any > 4.0: {}", any_large);
167
168 Ok(())
169}
170
171/// Demonstrate gradient tracking through element operations
172///
173/// Shows how gradient tracking works seamlessly through iterator
174/// operations, maintaining the computational graph for backpropagation.
175fn demonstrate_gradient_tracking() -> Result<(), Box<dyn std::error::Error>> {
176 println!("\n--- Gradient Tracking ---");
177
178 // Create a tensor with gradient tracking enabled
179 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3])?.with_requires_grad();
180 println!("Input tensor (requires_grad): {:?}", tensor.data());
181
182 // Perform element-wise operations through iteration
183 let result: Tensor = tensor
184 .iter()
185 .map(|elem| {
186 // Apply a complex transformation: (x^2 + 1) * 2
187 elem.pow_scalar(2.0).add_scalar(1.0).mul_scalar(2.0)
188 })
189 .collect();
190
191 println!("Result tensor: {:?}", result.data());
192 println!("Result requires_grad: {}", result.requires_grad());
193
194 // Compute gradients
195 let mut loss = result.sum();
196 loss.backward(None);
197
198 println!("Loss: {:.6}", loss.value());
199 println!("Input gradients: {:?}", tensor.grad().map(|g| g.data()));
200
201 Ok(())
202}
203
204/// Demonstrate advanced iterator patterns
205///
206/// Shows complex iterator chains and advanced functional programming
207/// patterns for sophisticated data processing workflows.
208fn demonstrate_advanced_patterns() -> Result<(), Box<dyn std::error::Error>> {
209 println!("\n--- Advanced Iterator Patterns ---");
210
211 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![6])?;
212 println!("Input tensor: {:?}", tensor.data());
213
214 // Complex chain: enumerate -> filter -> map -> collect
215 println!("\nComplex chain (even indices only, add index to value):");
216 let result: Tensor = tensor
217 .iter()
218 .enumerate()
219 .filter(|(i, _)| i % 2 == 0) // Take even indices
220 .map(|(i, elem)| elem.add_scalar(i as f32)) // Add index to value
221 .collect();
222 println!(" Result: {:?}", result.data());
223
224 // Using take and skip for windowing
225 println!("\nWindowing with take and skip:");
226 let window1: Tensor = tensor.iter().take(3).collect();
227 let window2: Tensor = tensor.iter().skip(2).take(3).collect();
228 println!(" Window 1 (first 3): {:?}", window1.data());
229 println!(" Window 2 (middle 3): {:?}", window2.data());
230
231 // Using rev() for reverse iteration
232 println!("\nReverse iteration:");
233 let reversed: Tensor = tensor.iter().rev().collect();
234 println!(" Reversed: {:?}", reversed.data());
235
236 // Chaining with mathematical operations
237 println!("\nMathematical operation chain:");
238 let math_result: Tensor = tensor
239 .iter()
240 .map(|elem| elem.exp()) // e^x
241 .filter(|elem| elem.value() < 50.0) // Filter large values
242 .map(|elem| elem.log()) // ln(x)
243 .collect();
244 println!(" Math chain result: {:?}", math_result.data());
245
246 // Using zip for element-wise combinations
247 println!("\nElement-wise combination with zip:");
248 let tensor2 = Tensor::from_slice(&[10.0, 20.0, 30.0, 40.0, 50.0, 60.0], vec![6])?;
249 let combined: Tensor = tensor
250 .iter()
251 .zip(tensor2.iter())
252 .map(|(a, b)| a.mul_tensor(&b)) // Element-wise multiplication
253 .collect();
254 println!(" Combined: {:?}", combined.data());
255
256 Ok(())
257}
258
259/// Demonstrate per-row transforms with shape-preserving collection
260///
261/// Shows how to use `iter()` over the outer dimension on a 2D tensor and
262/// `collect_shape([..])` to maintain the original shape after mapping.
263fn demonstrate_row_wise_collect_shape() -> Result<(), Box<dyn std::error::Error>> {
264 println!("\n--- Row-wise iteration with collect_shape ---");
265 let mat = Tensor::from_slice(&(1..=12).map(|x| x as f32).collect::<Vec<_>>(), vec![3, 4])?;
266 println!("Input shape: {:?}", mat.shape().dims());
267
268 // Map each row: 1.1*x + 0.5, then collect back to [3,4]
269 let out: Tensor = mat
270 .iter()
271 .map(|row| row.mul_scalar(1.1).add_scalar(0.5))
272 .collect_shape(vec![3, 4]);
273 println!(" Output shape: {:?}", out.shape().dims());
274
275 Ok(())
276}
277
278/// Demonstrate NoGrad fast paths and raw data streaming
279///
280/// Highlights how to get maximum performance in inference by:
281/// - Disabling gradient tracking with `with_no_grad`
282/// - Iterating raw values via `tensor.data().iter().copied()`
283/// - Using `collect_shape` to stream directly into destination tensors
284fn demonstrate_nograd_and_streaming() -> Result<(), Box<dyn std::error::Error>> {
285 println!("\n--- NoGrad & Streaming (Inference Fast Paths) ---");
286
287 let input = Tensor::from_slice(
288 &(0..24).map(|i| i as f32 * 0.25).collect::<Vec<_>>(),
289 vec![4, 6],
290 )?;
291 println!("Input shape: {:?}", input.shape().dims());
292
293 // NoGrad: stream values directly and reshape
294 let out = with_no_grad(|| {
295 input
296 .data()
297 .iter()
298 .copied()
299 .map(|x| 1.2 * x - 0.3)
300 .collect_shape(vec![4, 6])
301 });
302 println!(
303 " NoGrad streamed map (1.2x-0.3) -> shape {:?}",
304 out.shape().dims()
305 );
306
307 // Compare to view-based element iteration in NoGrad
308 let out_view: Tensor = with_no_grad(|| {
309 input
310 .iter()
311 .map(|e| e.mul_scalar(1.2).add_scalar(-0.3))
312 .collect_shape(vec![4, 6])
313 });
314 println!(
315 " NoGrad view-based map shape {:?}",
316 out_view.shape().dims()
317 );
318
319 // Quick parity check
320 assert_eq!(out.data(), out_view.data());
321 println!(" Parity check passed.");
322
323 // Show simple flatten + collect back to a different shape
324 let reshaped = with_no_grad(|| input.data().iter().copied().collect_shape(vec![6, 4]));
325 println!(
326 " Reshaped via streaming collect_shape: {:?}",
327 reshaped.shape().dims()
328 );
329
330 Ok(())
331}
203fn demonstrate_method_equivalence() {
204 println!("\n--- Operator vs Method Call Equivalence ---");
205
206 let a = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
207 let b = Tensor::from_slice(&[5.0, 6.0, 7.0, 8.0], vec![2, 2]).unwrap();
208
209 // Addition: operator vs method
210 let operator_result = &a + &b;
211 let method_result = a.add_tensor(&b);
212
213 println!("A + B (operator): {:?}", operator_result.data());
214 println!("A.add_tensor(B): {:?}", method_result.data());
215 println!(
216 "Results are equal: {}",
217 operator_result.data() == method_result.data()
218 );
219
220 // Multiplication: operator vs method
221 let operator_result = &a * &b;
222 let method_result = a.mul_tensor(&b);
223
224 println!("A * B (operator): {:?}", operator_result.data());
225 println!("A.mul_tensor(B): {:?}", method_result.data());
226 println!(
227 "Results are equal: {}",
228 operator_result.data() == method_result.data()
229 );
230
231 // Scalar addition: operator vs method
232 let operator_result = &a + 5.0;
233 let method_result = a.add_scalar(5.0);
234
235 println!("A + 5.0 (operator): {:?}", operator_result.data());
236 println!("A.add_scalar(5.0): {:?}", method_result.data());
237 println!(
238 "Results are equal: {}",
239 operator_result.data() == method_result.data()
240 );
241}
Source§impl Tensor
impl Tensor
Sourcepub fn div_tensor(&self, other: &Tensor) -> Tensor
pub fn div_tensor(&self, other: &Tensor) -> Tensor
Element-wise division by another tensor, supporting broadcasting.
Performs element-wise division with automatic broadcasting: output[i] = self[i] / other[i]
Broadcasting enables division between tensors of different but compatible shapes. Compatible shapes follow NumPy broadcasting rules:
- Dimensions are aligned from the rightmost dimension
- Dimensions are compatible if they are equal, or one of them is 1
- Missing dimensions are treated as 1
§Arguments
other- Tensor to divide by. Shapes must be broadcast-compatible.
§Returns
A new tensor containing the element-wise quotient with broadcast result shape
§Examples
§Same Shape Division
use train_station::Tensor;
let a = Tensor::from_slice(&[10.0, 20.0, 30.0], vec![3]).unwrap();
let b = Tensor::from_slice(&[2.0, 4.0, 5.0], vec![3]).unwrap();
let c = a.div_tensor(&b);
assert_eq!(c.shape().dims(), vec![3]);
assert_eq!(c.get(&[0]), 5.0);
assert_eq!(c.get(&[1]), 5.0);
assert_eq!(c.get(&[2]), 6.0);
§Broadcasting Division
use train_station::Tensor;
// Broadcasting: [2, 1] / [1, 3] -> [2, 3]
let a = Tensor::from_slice(&[10.0, 20.0], vec![2, 1]).unwrap();
let b = Tensor::from_slice(&[1.0, 2.0, 5.0], vec![1, 3]).unwrap();
let c = a.div_tensor(&b);
assert_eq!(c.shape().dims(), vec![2, 3]);
assert_eq!(c.get(&[0, 0]), 10.0);
assert_eq!(c.get(&[0, 1]), 5.0);
assert_eq!(c.get(&[1, 0]), 20.0);
assert_eq!(c.get(&[1, 1]), 10.0);
§Scalar Division
use train_station::Tensor;
// Scalar division: [2, 3] / scalar -> [2, 3]
let a = Tensor::ones(vec![2, 3]);
let b = Tensor::from_slice(&[2.0], vec![1]).unwrap();
let c = a.div_tensor(&b);
assert_eq!(c.shape().dims(), vec![2, 3]);
assert_eq!(c.get(&[0, 0]), 0.5);
assert_eq!(c.get(&[1, 2]), 0.5);
§Panics
Panics if tensor shapes are not broadcast-compatible or if dividing by zero
Examples found in repository?
234fn gaussian_log_prob(action: &Tensor, mean: &Tensor, log_std: &Tensor) -> Tensor {
235 // All tensors shaped [B, A] (log_std is broadcastable)
236 let std = log_std.exp();
237 let var = std.pow_scalar(2.0);
238 let log_scale = log_std;
239 let diff = action.sub_tensor(mean);
240 let log_prob = diff
241 .pow_scalar(2.0)
242 .div_tensor(&var)
243 .add_scalar(std::f32::consts::LN_2 + std::f32::consts::PI)
244 .add_tensor(&log_scale.mul_scalar(2.0))
245 .mul_scalar(0.5)
246 .mul_scalar(-1.0);
247 // Sum across action dim (dim=1) -> [B,1]
248 log_prob.sum_dims(&[1], true)
249}
More examples
89fn demonstrate_basic_operations() {
90 println!("\n--- Basic Operations ---");
91
92 let a = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
93 let b = Tensor::from_slice(&[5.0, 6.0, 7.0, 8.0], vec![2, 2]).unwrap();
94
95 // Addition
96 let sum = a.add_tensor(&b);
97 println!("A + B: {:?}", sum.data());
98
99 // Subtraction
100 let diff = a.sub_tensor(&b);
101 println!("A - B: {:?}", diff.data());
102
103 // Multiplication
104 let product = a.mul_tensor(&b);
105 println!("A * B: {:?}", product.data());
106
107 // Division
108 let quotient = a.div_tensor(&b);
109 println!("A / B: {:?}", quotient.data());
110
111 // Scalar operations
112 let scalar_add = a.add_scalar(5.0);
113 println!("A + 5.0: {:?}", scalar_add.data());
114
115 let scalar_mul = a.mul_scalar(2.0);
116 println!("A * 2.0: {:?}", scalar_mul.data());
117}
Sourcepub fn div_scalar(&self, scalar: f32) -> Tensor
pub fn div_scalar(&self, scalar: f32) -> Tensor
Broadcast division with a scalar value.
Divides every element by the scalar: output[i] = self[i] / scalar
§Arguments
scalar- Value to divide each element by (must not be zero)
§Returns
A new tensor with each element divided by the scalar
§Examples
§Basic Scalar Division
use train_station::Tensor;
let a = Tensor::from_slice(&[10.0, 20.0, 30.0], vec![3]).unwrap();
let b = a.div_scalar(10.0);
assert_eq!(b.shape().dims(), vec![3]);
assert_eq!(b.get(&[0]), 1.0);
assert_eq!(b.get(&[1]), 2.0);
assert_eq!(b.get(&[2]), 3.0);
§Multi-dimensional Scalar Division
use train_station::Tensor;
let a = Tensor::ones(vec![2, 3]);
let b = a.div_scalar(2.0);
assert_eq!(b.shape().dims(), vec![2, 3]);
assert_eq!(b.get(&[0, 0]), 0.5);
assert_eq!(b.get(&[1, 2]), 0.5);
§Panics
Panics if scalar is zero
Examples found in repository?
72 pub fn forward(
73 &self,
74 query: &Tensor,
75 key: &Tensor,
76 value: &Tensor,
77 attn_mask: Option<&Tensor>,
78 ) -> Tensor {
79 let qkv = Self::project_qkv(query, key, value, &self.q_proj, &self.k_proj, &self.v_proj);
80 let (q, k, v) = qkv;
81
82 // Split heads: [b, t, e] -> [b, h, t, d]
83 let (b, tq, _e) = Self::triple(query);
84 let (_b2, tk, _e2) = Self::triple(key);
85 let q = Self::split_heads(&q, b, tq, self.num_heads, self.head_dim);
86 let k = Self::split_heads(&k, b, tk, self.num_heads, self.head_dim);
87 let v = Self::split_heads(&v, b, tk, self.num_heads, self.head_dim);
88
89 // Scaled dot-product attention
90 // logits: [b, h, tq, tk]
91 let k_t = k.transpose(2, 3);
92 let mut logits = q.matmul(&k_t).div_scalar((self.head_dim as f32).sqrt());
93 if let Some(mask) = attn_mask {
94 let dims = mask.shape().dims().to_vec();
95 // If boolean-like mask matching [b,h,tq,tk], apply masked_fill
96 if dims.len() == 4 && dims[0] == b && dims[1] == self.num_heads && dims[2] == tq {
97 // Interpret mask > 0.5 as keep; we invert to build masked positions
98 let cond: Vec<bool> = mask.data().iter().map(|&v| v < 0.5).collect();
99 // Apply masked fill on a flattened view, then reshape back
100 let flat_logits = logits.view(vec![(b * self.num_heads * tq * tk) as i32]);
101 let filled = flat_logits.masked_fill(&cond, f32::NEG_INFINITY);
102 logits = filled.view(vec![b as i32, self.num_heads as i32, tq as i32, tk as i32]);
103 } else {
104 // Fallback: additive mask
105 logits = logits.add_tensor(mask);
106 }
107 }
108 let attn = logits.softmax(3);
109
110 // context: [b, h, tq, d]
111 let context = attn.matmul(&v);
112 let context = context.permute(vec![0, 2, 1, 3]); // [b, tq, h, d]
113 let context = context.contiguous().view(vec![
114 b as i32,
115 tq as i32,
116 (self.num_heads * self.head_dim) as i32,
117 ]);
118
119 // Output projection (flatten to 2D, project, then restore 3D)
120 let flat = context.view(vec![(b * tq) as i32, self.embed_dim as i32]);
121 let out2d = self.out_proj.forward(&flat);
122 out2d.view(vec![b as i32, tq as i32, self.embed_dim as i32])
123 }
More examples
87fn demonstrate_data_pipeline() -> Result<(), Box<dyn std::error::Error>> {
88 println!("\n--- Data Processing Pipeline ---");
89
90 // Simulate raw sensor data with noise
91 let raw_data: Vec<f32> = (0..20)
92 .map(|i| {
93 let base = i as f32 * 0.5;
94 let noise = (i % 3) as f32 * 0.1;
95 base + noise
96 })
97 .collect();
98
99 let tensor = Tensor::from_slice(&raw_data, vec![20])?;
100 println!("Raw sensor data: {:?}", tensor.data());
101
102 // Multi-stage processing pipeline
103 println!("\nProcessing pipeline:");
104 println!("1. Normalize data (z-score)");
105 println!("2. Apply smoothing filter");
106 println!("3. Detect outliers");
107 println!("4. Apply feature scaling");
108
109 // Stage 1: Normalization
110 let mean = tensor.mean().value();
111 let std = tensor.std().value();
112 let normalized: Tensor = tensor
113 .iter()
114 .map(|elem| elem.sub_scalar(mean).div_scalar(std))
115 .collect();
116 println!(
117 " Normalized (mean={:.3}, std={:.3}): {:?}",
118 mean,
119 std,
120 normalized.data()
121 );
122
123 // Stage 2: Smoothing (simple moving average)
124 let smoothed: Tensor = normalized
125 .iter()
126 .enumerate()
127 .map(|(i, elem)| {
128 if i == 0 || i == normalized.size() - 1 {
129 elem.clone()
130 } else {
131 // Simple 3-point average
132 let prev = normalized.element_view(i - 1);
133 let next = normalized.element_view(i + 1);
134 elem.add_tensor(&prev).add_tensor(&next).div_scalar(3.0)
135 }
136 })
137 .collect();
138 println!(" Smoothed: {:?}", smoothed.data());
139
140 // Stage 3: Outlier detection and removal
141 let outlier_threshold = 2.0;
142 let cleaned: Tensor = smoothed
143 .iter()
144 .filter(|elem| elem.value().abs() < outlier_threshold)
145 .collect();
146 println!(
147 " Outliers removed (threshold={}): {:?}",
148 outlier_threshold,
149 cleaned.data()
150 );
151
152 // Stage 4: Feature scaling to [0, 1] range
153 let min_val = cleaned
154 .iter()
155 .map(|e| e.value())
156 .fold(f32::INFINITY, f32::min);
157 let max_val = cleaned
158 .iter()
159 .map(|e| e.value())
160 .fold(f32::NEG_INFINITY, f32::max);
161 let scaled: Tensor = cleaned
162 .iter()
163 .map(|elem| elem.sub_scalar(min_val).div_scalar(max_val - min_val))
164 .collect();
165 println!(" Scaled to [0,1]: {:?}", scaled.data());
166
167 Ok(())
168}
169
170/// Demonstrate conditional processing patterns
171///
172/// Shows how to implement dynamic filtering and transformation
173/// based on data characteristics and conditions.
174fn demonstrate_conditional_processing() -> Result<(), Box<dyn std::error::Error>> {
175 println!("\n--- Conditional Processing ---");
176
177 // Create data with mixed characteristics
178 let data = vec![1.0, -2.0, 3.0, -4.0, 5.0, -6.0, 7.0, -8.0, 9.0, -10.0];
179 let tensor = Tensor::from_slice(&data, vec![10])?;
180 println!("Input data: {:?}", tensor.data());
181
182 // Conditional transformation based on sign
183 println!("\nConditional transformation (positive/negative handling):");
184 let processed: Tensor = tensor
185 .iter()
186 .map(|elem| {
187 let val = elem.value();
188 if val > 0.0 {
189 elem.pow_scalar(2.0) // Square positive values
190 } else {
191 elem.mul_scalar(-1.0).sqrt() // Square root of absolute negative values
192 }
193 })
194 .collect();
195 println!(" Processed: {:?}", processed.data());
196
197 // Adaptive filtering based on local statistics
198 println!("\nAdaptive filtering (remove values > 2 std from local mean):");
199 let window_size = 3;
200 let adaptive_filtered: Tensor = tensor
201 .iter()
202 .enumerate()
203 .filter(|(i, elem)| {
204 let start = i.saturating_sub(window_size / 2);
205 let end = (i + window_size / 2 + 1).min(tensor.size());
206
207 // Calculate local mean and std
208 let local_values: Vec<f32> = (start..end)
209 .map(|j| tensor.element_view(j).value())
210 .collect();
211
212 let local_mean = local_values.iter().sum::<f32>() / local_values.len() as f32;
213 let local_variance = local_values
214 .iter()
215 .map(|v| (v - local_mean).powi(2))
216 .sum::<f32>()
217 / local_values.len() as f32;
218 let local_std = local_variance.sqrt();
219
220 let threshold = local_mean + 2.0 * local_std;
221 elem.value() <= threshold
222 })
223 .map(|(_, elem)| elem)
224 .collect();
225 println!(" Adaptive filtered: {:?}", adaptive_filtered.data());
226
227 // Multi-condition processing
228 println!("\nMulti-condition processing:");
229 let multi_processed: Tensor = tensor
230 .iter()
231 .map(|elem| {
232 let val = elem.value();
233 match () {
234 _ if val > 5.0 => elem.mul_scalar(2.0), // Double large values
235 _ if val < -5.0 => elem.div_scalar(2.0), // Halve small values
236 _ if val.abs() < 2.0 => elem.add_scalar(1.0), // Add 1 to small values
237 _ => elem.clone(), // Keep others unchanged
238 }
239 })
240 .collect();
241 println!(" Multi-condition: {:?}", multi_processed.data());
242
243 Ok(())
244}
245
246/// Demonstrate batch processing operations
247///
248/// Shows efficient processing of large datasets using iterator
249/// patterns and batch operations for performance optimization.
250fn demonstrate_batch_operations() -> Result<(), Box<dyn std::error::Error>> {
251 println!("\n--- Batch Operations ---");
252
253 // Create a larger dataset for batch processing
254 let size = 100;
255 let data: Vec<f32> = (0..size)
256 .map(|i| {
257 let x = i as f32 / size as f32;
258 x * x + 0.1 * (i % 7) as f32 // Quadratic with some noise
259 })
260 .collect();
261
262 let tensor = Tensor::from_slice(&data, vec![size])?;
263 println!("Dataset size: {}", tensor.size());
264
265 // Batch processing with windowing (iterator views)
266 println!("\nBatch processing with sliding windows:");
267 let batch_size = 10;
268 let batches: Vec<Tensor> = tensor
269 .iter()
270 .collect::<Vec<_>>()
271 .chunks(batch_size)
272 .map(|chunk| {
273 // Process each batch independently
274 chunk
275 .iter()
276 .map(|elem| elem.pow_scalar(2.0).add_scalar(1.0))
277 .collect()
278 })
279 .collect();
280
281 println!(
282 " Processed {} batches of size {}",
283 batches.len(),
284 batch_size
285 );
286 for (i, batch) in batches.iter().enumerate() {
287 println!(
288 " Batch {}: mean={:.3}, std={:.3}",
289 i,
290 batch.mean().value(),
291 batch.std().value()
292 );
293 }
294
295 // Parallel-like processing with stride
296 println!("\nStrided processing (every nth element):");
297 let stride = 5;
298 let strided: Tensor = tensor
299 .iter()
300 .enumerate()
301 .filter(|(i, _)| i % stride == 0)
302 .map(|(_, elem)| elem)
303 .collect();
304 println!(" Strided (every {}th): {:?}", stride, strided.data());
305
306 // Hierarchical processing
307 println!("\nHierarchical processing (coarse to fine):");
308 let coarse: Tensor = tensor
309 .iter()
310 .enumerate()
311 .filter(|(i, _)| i % 4 == 0) // Take every 4th element
312 .map(|(_, elem)| elem)
313 .collect();
314
315 let fine: Tensor = tensor
316 .iter()
317 .enumerate()
318 .filter(|(i, _)| i % 4 != 0) // Take the rest
319 .map(|(_, elem)| elem)
320 .collect();
321
322 println!(" Coarse (every 4th): {:?}", coarse.data());
323 println!(" Fine (rest): {:?}", fine.data());
324
325 // Combine coarse and fine with different processing
326 let combined: Tensor = coarse
327 .iter()
328 .map(|elem| elem.mul_scalar(2.0)) // Scale coarse
329 .chain(fine.iter().map(|elem| elem.div_scalar(2.0))) // Scale fine
330 .collect();
331 println!(" Combined: {:?}", combined.data());
332
333 Ok(())
334}
335
336/// Demonstrate real-world processing scenarios
337///
338/// Shows practical applications of iterator patterns for
339/// common data processing tasks in machine learning and analytics.
340fn demonstrate_real_world_scenarios() -> Result<(), Box<dyn std::error::Error>> {
341 println!("\n--- Real-world Scenarios ---");
342
343 // Scenario 1: Time series analysis
344 println!("\nScenario 1: Time Series Analysis");
345 let time_series: Vec<f32> = (0..24)
346 .map(|hour| {
347 let base = 20.0 + 10.0 * (hour as f32 * std::f32::consts::PI / 12.0).sin();
348 base + (hour % 3) as f32 * 2.0 // Add some noise
349 })
350 .collect();
351
352 let series = Tensor::from_slice(&time_series, vec![24])?;
353 println!(" Time series (24 hours): {:?}", series.data());
354
355 // Calculate moving average with view-based iteration
356 let window_size = 3;
357 let moving_avg: Tensor = series
358 .iter()
359 .enumerate()
360 .map(|(i, _)| {
361 let start = i.saturating_sub(window_size / 2);
362 let end = (i + window_size / 2 + 1).min(series.size());
363 let window = series.iter_range(start, end);
364 window.fold(0.0, |acc, elem| acc + elem.value()) / (end - start) as f32
365 })
366 .map(|val| Tensor::from_slice(&[val], vec![1]).unwrap())
367 .collect();
368 println!(
369 " Moving average (window={}): {:?}",
370 window_size,
371 moving_avg.data()
372 );
373
374 // Inference pipeline with NoGrad + streaming
375 println!("\nInference pipeline (NoGrad + streaming)");
376 let features = Tensor::from_slice(
377 &(0..48).map(|i| i as f32 * 0.125).collect::<Vec<_>>(),
378 vec![6, 8],
379 )?;
380 let fast = with_no_grad(|| {
381 // Stream values directly, apply light affine, and collect back to same shape
382 features
383 .data()
384 .iter()
385 .copied()
386 .map(|x| 0.75 * x + 0.1)
387 .collect_shape(vec![6, 8])
388 });
389 println!(
390 " NoGrad streamed transform shape: {:?}",
391 fast.shape().dims()
392 );
393
394 // Row-wise iteration with shape-preserving collection (GradTrack-friendly)
395 let per_row: Tensor = features
396 .iter()
397 .map(|row| row.mul_scalar(0.5).add_scalar(2.0))
398 .collect_shape(vec![6, 8]);
399 println!(" Row-wise mapped shape: {:?}", per_row.shape().dims());
400
401 // Scenario 2: Feature engineering
402 println!("\nScenario 2: Feature Engineering");
403 let features = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5])?;
404 println!(" Original features: {:?}", features.data());
405
406 // Create polynomial features
407 let poly_features: Tensor = features
408 .iter()
409 .flat_map(|elem| {
410 vec![
411 elem.clone(), // x^1
412 elem.pow_scalar(2.0), // x^2
413 elem.pow_scalar(3.0), // x^3
414 ]
415 })
416 .collect();
417 println!(
418 " Polynomial features (x, x^2, x^3): {:?}",
419 poly_features.data()
420 );
421
422 // Scenario 3: Data augmentation
423 println!("\nScenario 3: Data Augmentation");
424 let original = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3])?;
425 println!(" Original data: {:?}", original.data());
426
427 // Augment with noise and scaling
428 let augmented: Tensor = original
429 .iter()
430 .flat_map(|elem| {
431 vec![
432 elem.clone(), // Original
433 elem.add_scalar(0.1), // Add noise
434 elem.sub_scalar(0.1), // Subtract noise
435 elem.mul_scalar(1.1), // Scale up
436 elem.mul_scalar(0.9), // Scale down
437 ]
438 })
439 .collect();
440 println!(" Augmented data: {:?}", augmented.data());
441
442 // Scenario 4: Statistical analysis
443 println!("\nScenario 4: Statistical Analysis");
444 let sample_data = Tensor::from_slice(&[1.1, 2.3, 1.8, 2.1, 1.9, 2.0, 1.7, 2.2], vec![8])?;
445 println!(" Sample data: {:?}", sample_data.data());
446
447 // Calculate various statistics
448 let mean = sample_data.mean().value();
449 let std = sample_data.std().value();
450 let min = sample_data
451 .iter()
452 .map(|e| e.value())
453 .fold(f32::INFINITY, f32::min);
454 let max = sample_data
455 .iter()
456 .map(|e| e.value())
457 .fold(f32::NEG_INFINITY, f32::max);
458
459 // Z-score normalization
460 let z_scores: Tensor = sample_data
461 .iter()
462 .map(|elem| elem.sub_scalar(mean).div_scalar(std))
463 .collect();
464
465 println!(
466 " Statistics: mean={:.3}, std={:.3}, min={:.3}, max={:.3}",
467 mean, std, min, max
468 );
469 println!(" Z-scores: {:?}", z_scores.data());
470
471 Ok(())
472}
Source§impl Tensor
impl Tensor
Sourcepub fn exp(&self) -> Tensor
pub fn exp(&self) -> Tensor
Element-wise exponential function.
Computes e^x for each element: output[i] = e^(self[i])
§Returns
A new tensor with the exponential of each element
§Examples
§Basic Exponential
use train_station::Tensor;
let a = Tensor::from_slice(&[0.0, 1.0, 2.0], vec![3]).unwrap();
let b = a.exp();
assert_eq!(b.shape().dims(), vec![3]);
assert_eq!(b.get(&[0]), 1.0); // e^0 = 1
assert!((b.get(&[1]) - 2.71828).abs() < 1e-5); // e^1 ≈ 2.71828
assert!((b.get(&[2]) - 7.38906).abs() < 1e-5); // e^2 ≈ 7.38906
§Negative Values
use train_station::Tensor;
let a = Tensor::from_slice(&[-1.0, 0.0, 1.0], vec![3]).unwrap();
let b = a.exp();
assert_eq!(b.shape().dims(), vec![3]);
assert!((b.get(&[0]) - 0.36788).abs() < 1e-5); // e^(-1) ≈ 0.36788
assert_eq!(b.get(&[1]), 1.0); // e^0 = 1
assert!((b.get(&[2]) - 2.71828).abs() < 1e-5); // e^1 ≈ 2.71828
Examples found in repository?
More examples
44fn cross_entropy_logits(
45 logits: &Tensor,
46 labels: &[usize],
47 batch: usize,
48 _num_classes: usize,
49) -> Tensor {
50 // log_softmax = logits - logsumexp(logits, dim=1)
51 let max_logits = logits.max_dims(&[1], true);
52 let shifted = logits.sub_tensor(&max_logits);
53 let exp = shifted.exp();
54 let sum_exp = exp.sum_dims(&[1], true);
55 let log_sum_exp = sum_exp.log();
56 let log_softmax = shifted.sub_tensor(&log_sum_exp);
57 let ll = log_softmax.gather(1, labels, &[batch, 1]); // selected log-probs
58 ll.mul_scalar(-1.0).mean()
59}
273fn log_prob_actions(
274 logits: &Tensor,
275 actions: &[usize],
276 batch: usize,
277 _action_dim: usize,
278) -> Tensor {
279 let max_logits = logits.max_dims(&[1], true); // [B,1]
280 let shifted = logits.sub_tensor(&max_logits);
281 let exp = shifted.exp();
282 let sum_exp = exp.sum_dims(&[1], true); // [B,1]
283 let log_sum_exp = sum_exp.log(); // [B,1]
284 let log_softmax = shifted.sub_tensor(&log_sum_exp); // [B,A]
285 // gather selected action log-probs
286 log_softmax.gather(1, actions, &[batch, 1])
287}
288
289// probability ratio = exp(new_logp - old_logp)
290fn ratio_from_logps(new_logp: &Tensor, old_logp: &Tensor) -> Tensor {
291 new_logp.sub_tensor(old_logp).exp()
292}
234fn gaussian_log_prob(action: &Tensor, mean: &Tensor, log_std: &Tensor) -> Tensor {
235 // All tensors shaped [B, A] (log_std is broadcastable)
236 let std = log_std.exp();
237 let var = std.pow_scalar(2.0);
238 let log_scale = log_std;
239 let diff = action.sub_tensor(mean);
240 let log_prob = diff
241 .pow_scalar(2.0)
242 .div_tensor(&var)
243         .add_scalar((2.0 * std::f32::consts::PI).ln()) // ln(2π) constant of the Gaussian log-density
244 .add_tensor(&log_scale.mul_scalar(2.0))
245 .mul_scalar(0.5)
246 .mul_scalar(-1.0);
247 // Sum across action dim (dim=1) -> [B,1]
248 log_prob.sum_dims(&[1], true)
249}
250
251#[allow(clippy::too_many_arguments)]
252fn compute_gae(
253 returns_out: &mut [f32],
254 adv_out: &mut [f32],
255 rewards: &[f32],
256 dones: &[f32],
257 values: &[f32],
258 next_values: &[f32],
259 gamma: f32,
260 lam: f32,
261) {
262 let n = rewards.len();
263 let mut gae = 0.0f32;
264 for t in (0..n).rev() {
265 let not_done = 1.0 - dones[t];
266 let delta = rewards[t] + gamma * next_values[t] * not_done - values[t];
267 gae = delta + gamma * lam * not_done * gae;
268 adv_out[t] = gae;
269 returns_out[t] = gae + values[t];
270 }
271}
272
273fn normalize_in_place(x: &mut [f32], eps: f32) {
274 let n = x.len() as f32;
275 if n <= 1.0 {
276 return;
277 }
278 let mean = x.iter().copied().sum::<f32>() / n;
279 let var = x
280 .iter()
281 .map(|v| {
282 let d = v - mean;
283 d * d
284 })
285 .sum::<f32>()
286 / n;
287 let std = (var + eps).sqrt();
288 for v in x.iter_mut() {
289 *v = (*v - mean) / std;
290 }
291}
292
293fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
294 let mut total_sq = 0.0f32;
295 for p in parameters.iter() {
296 if let Some(g) = p.grad_owned() {
297 for &v in g.data() {
298 total_sq += v * v;
299 }
300 }
301 }
302 let norm = total_sq.sqrt();
303 if norm > max_norm {
304 let scale = max_norm / (norm + eps);
305 for p in parameters.iter_mut() {
306 if let Some(g) = p.grad_owned() {
307 p.set_grad(g.mul_scalar(scale));
308 }
309 }
310 }
311}
312
313fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
314 let mut total_sq = 0.0f32;
315 for p in parameters.iter_mut() {
316 if let Some(g) = p.grad_owned() {
317 for &v in g.data() {
318 total_sq += v * v;
319 }
320 }
321 }
322 total_sq.sqrt()
323}
324
325// -------------------------------
326// Main
327// -------------------------------
328
329pub fn main() -> Result<(), Box<dyn std::error::Error>> {
330 println!("=== PPO Continuous Example (YardEnv) ===");
331
332 let state_dim = 3usize;
333 let action_dim = 1usize;
334
335 // Hparams
336 let total_steps = std::env::var("PPO_STEPS")
337 .ok()
338 .and_then(|v| v.parse::<usize>().ok())
339 .unwrap_or(4000usize);
340 let horizon = 128usize; // rollout length per update
341 let epochs = 4usize; // PPO epochs per update
342 let mini_batch_size = 64usize; // minibatch from horizon
343 let gamma = 0.99f32;
344 let lam = 0.95f32; // GAE lambda
345 let clip_eps = 0.2f32;
346 let vf_coef = 0.5f32;
347 let ent_coef = 0.0f32;
348 let max_grad_norm = 1.0f32;
349
350 // Models
351 let mut actor = Actor::new(state_dim, action_dim, Some(101));
352 let mut critic = Critic::new(state_dim, Some(202));
353
354 // Opts
355 let mut actor_opt = Adam::with_learning_rate(3e-4);
356 for p in actor.parameters() {
357 actor_opt.add_parameter(p);
358 }
359 let mut critic_opt = Adam::with_learning_rate(3e-4);
360 for p in critic.parameters() {
361 critic_opt.add_parameter(p);
362 }
363
364 // Env and RNG
365 let mut env = YardEnv::new(42);
366 let mut rng = SmallRng::new(999);
367 let mut state = env.reset();
368
369 // Metrics
370 let mut episode_return = 0.0f32;
371 let mut episode = 0usize;
372 let mut ema_return: Option<f32> = None;
373 let ema_alpha = 0.05f32;
374 let mut best_return = f32::NEG_INFINITY;
375
376 let mut t = 0usize;
377 while t < total_steps {
378 // Collect a rollout
379 let mut batch = RolloutBatch::new(horizon, state_dim);
380 for _ in 0..horizon {
381 // Policy forward (detached sampling to not blow graph; we use stored log_probs)
382 let (mean, log_std_row) = actor.forward(&state);
383 let mean_v = mean.data()[0];
384 let log_std_v = log_std_row.data()[0];
385 let std_v = log_std_v.exp();
386 let noise = rng.normal();
387 let action_v = (mean_v + std_v * noise).clamp(-1.0, 1.0);
388
389 // Build action tensor [1, A] for log_prob calculation with autograd
390 let action_t = Tensor::from_slice(&[action_v], vec![1, action_dim]).unwrap();
391 let log_prob_t = gaussian_log_prob(&action_t, &mean, &log_std_row);
392 let log_prob_v = log_prob_t.data()[0];
393
394 // Step env
395 let (next_state, reward, done) = env.step(action_v);
396 episode_return += reward;
397
398 // Value
399 let value_t = critic.forward(&state);
400 let value_v = value_t.data()[0];
401
402 // Push
403 batch.push(
404 state.data(),
405 action_v,
406 log_prob_v,
407 reward,
408 if done { 1.0 } else { 0.0 },
409 value_v,
410 next_state.data(),
411 );
412
413 // Reset
414 state = if done {
415 let st = env.reset();
416 ema_return = Some(match ema_return {
417 None => episode_return,
418 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
419 });
420 if episode_return > best_return {
421 best_return = episode_return;
422 }
423 println!(
424 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
425 t,
426 episode,
427 episode_return,
428 ema_return.unwrap_or(episode_return),
429 best_return
430 );
431 episode_return = 0.0;
432 episode += 1;
433 st
434 } else {
435 next_state
436 };
437
438 t += 1;
439 if t >= total_steps {
440 break;
441 }
442 }
443
444 // Bootstrap next values for GAE
445 let next_values: Vec<f32> = {
446 let mut out = Vec::with_capacity(batch.len());
447 for i in 0..batch.len() {
448 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
449 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
450 let v2 = critic.forward(&s2_t).data()[0];
451 out.push(v2);
452 }
453 out
454 };
455
456 // Compute returns and advantages
457 let mut returns = vec![0.0f32; batch.len()];
458 let mut adv = vec![0.0f32; batch.len()];
459 compute_gae(
460 &mut returns,
461 &mut adv,
462 &batch.rewards,
463 &batch.dones,
464 &batch.values,
465 &next_values,
466 gamma,
467 lam,
468 );
469 normalize_in_place(&mut adv, 1e-8);
470
471 // Prepare tensors for training
472 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
473 let actions_t = Tensor::from_slice(&batch.actions, vec![batch.len(), action_dim]).unwrap();
474 let old_logp_t = Tensor::from_slice(&batch.log_probs, vec![batch.len(), 1]).unwrap();
475 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
476 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
477
478 // PPO epochs over the rollout
479 let num_minibatches = batch.len().div_ceil(mini_batch_size);
480 for e in 0..epochs {
481 for mb in 0..num_minibatches {
482 let start = mb * mini_batch_size;
483 let end = (start + mini_batch_size).min(batch.len());
484 if start >= end {
485 break;
486 }
487
488 // Slice views
489 let s_mb = states_t.slice_view(start * state_dim, 1, (end - start) * state_dim);
490 let s_mb = s_mb.reshape(vec![(end - start) as i32, state_dim as i32]);
491 let a_mb = actions_t
492 .slice_view(start * action_dim, 1, (end - start) * action_dim)
493 .reshape(vec![(end - start) as i32, action_dim as i32]);
494 let oldlp_mb = old_logp_t
495 .slice_view(start, 1, end - start)
496 .reshape(vec![(end - start) as i32, 1]);
497 let ret_mb = returns_t
498 .slice_view(start, 1, end - start)
499 .reshape(vec![(end - start) as i32, 1]);
500 let adv_mb = adv_t
501 .slice_view(start, 1, end - start)
502 .reshape(vec![(end - start) as i32, 1]);
503
504 // Zero grads
505 {
506 let mut ps = actor.parameters();
507 actor_opt.zero_grad(&mut ps);
508 }
509 {
510 let mut ps = critic.parameters();
511 critic_opt.zero_grad(&mut ps);
512 }
513
514 // Forward actor and critic
515 let (mean_mb, log_std_row) = actor.forward(&s_mb);
516 let logp_mb = gaussian_log_prob(&a_mb, &mean_mb, &log_std_row);
517 let ratio = logp_mb.sub_tensor(&oldlp_mb).exp(); // exp(new-old)
518 let clip_low =
519 Tensor::from_slice(&vec![1.0 - clip_eps; end - start], vec![end - start, 1])
520 .unwrap();
521 let clip_high =
522 Tensor::from_slice(&vec![1.0 + clip_eps; end - start], vec![end - start, 1])
523 .unwrap();
524 // ratio_clipped = min(max(ratio, low), high) using ReLU identities
525 let ratio_ge_low = ratio.sub_tensor(&clip_low).relu().add_tensor(&clip_low);
526 let ratio_clipped =
527 clip_high.sub_tensor(&ratio_ge_low.sub_tensor(&clip_high).relu());
528 let pg1 = ratio.mul_tensor(&adv_mb);
529 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
530 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
531 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
532 let actor_loss = actor_min.mul_scalar(-1.0).mean();
533
534 let v_pred = critic.forward(&s_mb);
535 let v_loss = v_pred
536 .sub_tensor(&ret_mb)
537 .pow_scalar(2.0)
538 .mean()
539 .mul_scalar(vf_coef);
540
541 // Entropy (approx Gaussian entropy per action)
542 let entropy = log_std_row
543 .add_scalar(0.5 * (2.0 * std::f32::consts::PI * std::f32::consts::E).ln())
544 .sum_dims(&[1], true)
545 .mean()
546 .mul_scalar(ent_coef);
547
548 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&entropy);
549 loss.backward(None);
550
551 // Step actor
552 {
553 let params = actor.parameters();
554 let mut with_grads: Vec<&mut Tensor> = Vec::new();
555 for p in params {
556 if p.grad_owned().is_some() {
557 with_grads.push(p);
558 }
559 }
560 if !with_grads.is_empty() {
561 let _ = grad_global_norm(&mut with_grads);
562 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
563 actor_opt.step(&mut with_grads);
564 actor_opt.zero_grad(&mut with_grads);
565 }
566 }
567
568 // Step critic
569 {
570 let params = critic.parameters();
571 let mut with_grads: Vec<&mut Tensor> = Vec::new();
572 for p in params {
573 if p.grad_owned().is_some() {
574 with_grads.push(p);
575 }
576 }
577 if !with_grads.is_empty() {
578 let _ = grad_global_norm(&mut with_grads);
579 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
580 critic_opt.step(&mut with_grads);
581 critic_opt.zero_grad(&mut with_grads);
582 }
583 }
584
585 // Occasionally log
586 if e == 0 && mb == 0 {
587 println!(
588 "update@t={} | actor_loss={:.4} v_loss={:.4}",
589 t,
590 actor_loss.value(),
591 v_loss.value()
592 );
593 }
594
595 clear_all_graphs_known();
596 }
597 }
598 }
599
600 println!("=== PPO training finished ===");
601 Ok(())
602}
208fn demonstrate_advanced_patterns() -> Result<(), Box<dyn std::error::Error>> {
209 println!("\n--- Advanced Iterator Patterns ---");
210
211 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![6])?;
212 println!("Input tensor: {:?}", tensor.data());
213
214 // Complex chain: enumerate -> filter -> map -> collect
215 println!("\nComplex chain (even indices only, add index to value):");
216 let result: Tensor = tensor
217 .iter()
218 .enumerate()
219 .filter(|(i, _)| i % 2 == 0) // Take even indices
220 .map(|(i, elem)| elem.add_scalar(i as f32)) // Add index to value
221 .collect();
222 println!(" Result: {:?}", result.data());
223
224 // Using take and skip for windowing
225 println!("\nWindowing with take and skip:");
226 let window1: Tensor = tensor.iter().take(3).collect();
227 let window2: Tensor = tensor.iter().skip(2).take(3).collect();
228 println!(" Window 1 (first 3): {:?}", window1.data());
229 println!(" Window 2 (middle 3): {:?}", window2.data());
230
231 // Using rev() for reverse iteration
232 println!("\nReverse iteration:");
233 let reversed: Tensor = tensor.iter().rev().collect();
234 println!(" Reversed: {:?}", reversed.data());
235
236 // Chaining with mathematical operations
237 println!("\nMathematical operation chain:");
238 let math_result: Tensor = tensor
239 .iter()
240 .map(|elem| elem.exp()) // e^x
241 .filter(|elem| elem.value() < 50.0) // Filter large values
242 .map(|elem| elem.log()) // ln(x)
243 .collect();
244 println!(" Math chain result: {:?}", math_result.data());
245
246 // Using zip for element-wise combinations
247 println!("\nElement-wise combination with zip:");
248 let tensor2 = Tensor::from_slice(&[10.0, 20.0, 30.0, 40.0, 50.0, 60.0], vec![6])?;
249 let combined: Tensor = tensor
250 .iter()
251 .zip(tensor2.iter())
252 .map(|(a, b)| a.mul_tensor(&b)) // Element-wise multiplication
253 .collect();
254 println!(" Combined: {:?}", combined.data());
255
256 Ok(())
257}
Source§impl Tensor
impl Tensor
Sourcepub fn leaky_relu(&self, negative_slope: f32) -> Tensor
pub fn leaky_relu(&self, negative_slope: f32) -> Tensor
Element-wise Leaky ReLU activation.
Applies Leaky ReLU to each element: output[i] = max(0, self[i]) + negative_slope * min(0, self[i])
Unlike standard ReLU, it allows a small, non-zero gradient when the unit is not active.
§Arguments
negative_slope - Slope for negative values (typically small, e.g., 0.01 or 0.1)
§Returns
A new tensor with Leaky ReLU applied to each element
§Examples
§Basic Leaky ReLU
use train_station::Tensor;
let a = Tensor::from_slice(&[-2.0, -1.0, 0.0, 1.0], vec![4]).unwrap();
let b = a.leaky_relu(0.1);
assert_eq!(b.shape().dims(), vec![4]);
assert!((b.get(&[0]) - (-0.2)).abs() < 1e-6); // -2.0 * 0.1 = -0.2
assert!((b.get(&[1]) - (-0.1)).abs() < 1e-6); // -1.0 * 0.1 = -0.1
assert_eq!(b.get(&[2]), 0.0); // max(0, 0) = 0
assert_eq!(b.get(&[3]), 1.0); // max(0, 1) = 1
§Different Negative Slopes
use train_station::Tensor;
let a = Tensor::from_slice(&[-1.0, 0.0, 1.0], vec![3]).unwrap();
let b = a.leaky_relu(0.01); // Smaller negative slope
assert_eq!(b.shape().dims(), vec![3]);
assert!((b.get(&[0]) - (-0.01)).abs() < 1e-6); // -1.0 * 0.01 = -0.01
assert_eq!(b.get(&[1]), 0.0); // max(0, 0) = 0
assert_eq!(b.get(&[2]), 1.0); // max(0, 1) = 1
Source§impl Tensor
impl Tensor
Sourcepub fn log(&self) -> Tensor
pub fn log(&self) -> Tensor
Element-wise natural logarithm.
Computes the natural logarithm for each element: output[i] = ln(self[i])
§Returns
A new tensor with the natural logarithm of each element
§Examples
§Basic Natural Logarithm
use train_station::Tensor;
let a = Tensor::from_slice(&[1.0, 2.71828, 7.38906], vec![3]).unwrap();
let b = a.log();
assert_eq!(b.shape().dims(), vec![3]);
assert_eq!(b.get(&[0]), 0.0); // ln(1) = 0
assert!((b.get(&[1]) - 1.0).abs() < 1e-5); // ln(e) ≈ 1
assert!((b.get(&[2]) - 2.0).abs() < 1e-5); // ln(e^2) ≈ 2
§Mathematical Properties
use train_station::Tensor;
let a = Tensor::from_slice(&[4.0, 8.0, 16.0], vec![3]).unwrap();
let b = a.log();
assert_eq!(b.shape().dims(), vec![3]);
assert!((b.get(&[0]) - 1.38629).abs() < 1e-5); // ln(4) ≈ 1.38629
assert!((b.get(&[1]) - 2.07944).abs() < 1e-5); // ln(8) ≈ 2.07944
assert!((b.get(&[2]) - 2.77259).abs() < 1e-5); // ln(16) ≈ 2.77259
§Panics
Panics if any element is non-positive (x <= 0)
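A common way to avoid this panic, used by the PPO discrete example below, is to offset values by a small epsilon before taking the logarithm. A minimal sketch of that guard (the 1e-8 offset is illustrative):
use train_station::Tensor;
// Probabilities may contain exact zeros, so shift by a tiny epsilon before log()
let probs = Tensor::from_slice(&[0.7, 0.3, 0.0], vec![3]).unwrap();
let safe_log = probs.add_scalar(1e-8).log();
assert_eq!(safe_log.shape().dims(), vec![3]);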
Examples found in repository?
More examples
44fn cross_entropy_logits(
45 logits: &Tensor,
46 labels: &[usize],
47 batch: usize,
48 _num_classes: usize,
49) -> Tensor {
50 // log_softmax = logits - logsumexp(logits, dim=1)
51 let max_logits = logits.max_dims(&[1], true);
52 let shifted = logits.sub_tensor(&max_logits);
53 let exp = shifted.exp();
54 let sum_exp = exp.sum_dims(&[1], true);
55 let log_sum_exp = sum_exp.log();
56 let log_softmax = shifted.sub_tensor(&log_sum_exp);
57 let ll = log_softmax.gather(1, labels, &[batch, 1]); // selected log-probs
58 ll.mul_scalar(-1.0).mean()
59}
273fn log_prob_actions(
274 logits: &Tensor,
275 actions: &[usize],
276 batch: usize,
277 _action_dim: usize,
278) -> Tensor {
279 let max_logits = logits.max_dims(&[1], true); // [B,1]
280 let shifted = logits.sub_tensor(&max_logits);
281 let exp = shifted.exp();
282 let sum_exp = exp.sum_dims(&[1], true); // [B,1]
283 let log_sum_exp = sum_exp.log(); // [B,1]
284 let log_softmax = shifted.sub_tensor(&log_sum_exp); // [B,A]
285 // gather selected action log-probs
286 log_softmax.gather(1, actions, &[batch, 1])
287}
288
289// probability ratio = exp(new_logp - old_logp)
290fn ratio_from_logps(new_logp: &Tensor, old_logp: &Tensor) -> Tensor {
291 new_logp.sub_tensor(old_logp).exp()
292}
293
294// Clamp ratio to [1-clip, 1+clip] using ReLU-based clamp (no custom ops)
295fn clamp_ratio(ratio: &Tensor, clip_eps: f32) -> Tensor {
296 let b = ratio.shape().dims()[0];
297 let low = Tensor::from_slice(&vec![1.0 - clip_eps; b], vec![b, 1]).unwrap();
298 let high = Tensor::from_slice(&vec![1.0 + clip_eps; b], vec![b, 1]).unwrap();
299 let ge_low = ratio.sub_tensor(&low).relu().add_tensor(&low);
300 high.sub_tensor(&ge_low.sub_tensor(&high).relu())
301}
302
303fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
304 let mut total_sq = 0.0f32;
305 for p in parameters.iter_mut() {
306 if let Some(g) = p.grad_owned() {
307 for &v in g.data() {
308 total_sq += v * v;
309 }
310 }
311 }
312 total_sq.sqrt()
313}
314
315// -------------------------------
316// Main
317// -------------------------------
318
319pub fn main() -> Result<(), Box<dyn std::error::Error>> {
320 println!("=== PPO Discrete Example (YardEnv) ===");
321
322 let state_dim = 3usize;
323 let action_dim = 3usize;
324 let total_steps = std::env::var("PPOD_STEPS")
325 .ok()
326 .and_then(|v| v.parse::<usize>().ok())
327 .unwrap_or(3500usize);
328 let horizon = 128usize;
329 let epochs = 4usize;
330 let mini_batch_size = 64usize;
331 let gamma = 0.99f32;
332 let lam = 0.95f32;
333 let clip_eps = 0.2f32;
334 let vf_coef = 0.5f32;
335 let ent_coef = 0.0f32;
336 let max_grad_norm = 1.0f32;
337
338 let mut actor = Actor::new(state_dim, action_dim, Some(111));
339 let mut critic = Critic::new(state_dim, Some(222));
340 let mut actor_opt = Adam::with_learning_rate(3e-4);
341 for p in actor.parameters() {
342 actor_opt.add_parameter(p);
343 }
344 let mut critic_opt = Adam::with_learning_rate(3e-4);
345 for p in critic.parameters() {
346 critic_opt.add_parameter(p);
347 }
348
349 let mut env = YardEnv::new(1234);
350 let mut rng = SmallRng::new(98765);
351 let mut state = env.reset();
352 let mut episode_return = 0.0f32;
353 let mut episode = 0usize;
354 let mut ema_return: Option<f32> = None;
355 let ema_alpha = 0.05f32;
356 let mut best_return = f32::NEG_INFINITY;
357
358 let mut t = 0usize;
359 while t < total_steps {
360 let mut batch = RolloutBatch::new(horizon, state_dim);
361 for _ in 0..horizon {
362 // Actor logits and categorical sampling
363 let logits = actor.forward(&state); // [1, A]
364 let probs = logits.softmax(1); // [1, A]
365 // sample action from probs (CPU sampling)
366 let p = probs.data();
367 let (p0, p1, _p2) = (p[0], p[1], p[2]);
368 let u = rng.next_f32();
369 let a_idx = if u < p0 {
370 0
371 } else if u < p0 + p1 {
372 1
373 } else {
374 2
375 };
376
377 let old_logp = {
378 let _ng = NoGradTrack::new();
379 let lp = log_prob_actions(&logits, &[a_idx], 1, action_dim);
380 lp.data()[0]
381 };
382
383 // Step env
384 let (next_state, reward, done) = env.step(a_idx);
385 episode_return += reward;
386
387 // Critic value
388 let value_t = critic.forward(&state);
389 let value_v = value_t.data()[0];
390
391 batch.push(
392 state.data(),
393 a_idx,
394 old_logp,
395 reward,
396 if done { 1.0 } else { 0.0 },
397 value_v,
398 next_state.data(),
399 );
400
401 state = if done {
402 let st = env.reset();
403 ema_return = Some(match ema_return {
404 None => episode_return,
405 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
406 });
407 if episode_return > best_return {
408 best_return = episode_return;
409 }
410 println!(
411 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
412 t,
413 episode,
414 episode_return,
415 ema_return.unwrap_or(episode_return),
416 best_return
417 );
418 episode_return = 0.0;
419 episode += 1;
420 st
421 } else {
422 next_state
423 };
424
425 t += 1;
426 if t >= total_steps {
427 break;
428 }
429 }
430
431 // Bootstrap values for GAE
432 let next_values: Vec<f32> = {
433 let mut out = Vec::with_capacity(batch.len());
434 for i in 0..batch.len() {
435 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
436 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
437 out.push(critic.forward(&s2_t).data()[0]);
438 }
439 out
440 };
441
442 let mut returns = vec![0.0f32; batch.len()];
443 let mut adv = vec![0.0f32; batch.len()];
444 compute_gae(
445 &mut returns,
446 &mut adv,
447 &batch.rewards,
448 &batch.dones,
449 &batch.values,
450 &next_values,
451 gamma,
452 lam,
453 );
454 normalize_in_place(&mut adv, 1e-8);
455
456 // Tensors for training
457 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
458 let actions_vec = batch.actions.clone();
459 let old_logp_t = Tensor::from_slice(&batch.old_logps, vec![batch.len(), 1]).unwrap();
460 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
461 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
462
463 // PPO epochs
464 let num_minibatches = batch.len().div_ceil(mini_batch_size);
465 for e in 0..epochs {
466 for mb in 0..num_minibatches {
467 let start = mb * mini_batch_size;
468 let end = (start + mini_batch_size).min(batch.len());
469 if start >= end {
470 break;
471 }
472
473 // Views
474 let s_mb = states_t
475 .slice_view(start * state_dim, 1, (end - start) * state_dim)
476 .reshape(vec![(end - start) as i32, state_dim as i32]);
477 let oldlp_mb = old_logp_t
478 .slice_view(start, 1, end - start)
479 .reshape(vec![(end - start) as i32, 1]);
480 let ret_mb = returns_t
481 .slice_view(start, 1, end - start)
482 .reshape(vec![(end - start) as i32, 1]);
483 let adv_mb = adv_t
484 .slice_view(start, 1, end - start)
485 .reshape(vec![(end - start) as i32, 1]);
486 let a_slice = &actions_vec[start..end];
487
488 // Zero grads
489 {
490 let mut ps = actor.parameters();
491 actor_opt.zero_grad(&mut ps);
492 }
493 {
494 let mut ps = critic.parameters();
495 critic_opt.zero_grad(&mut ps);
496 }
497
498 // Forward
499 let logits_mb = actor.forward(&s_mb); // [B,A]
500 let new_logp_mb = log_prob_actions(&logits_mb, a_slice, end - start, action_dim); // [B,1]
501 let ratio = ratio_from_logps(&new_logp_mb, &oldlp_mb);
502 let ratio_clipped = clamp_ratio(&ratio, clip_eps);
503 let pg1 = ratio.mul_tensor(&adv_mb);
504 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
505 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
506 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
507 let actor_loss = actor_min.mul_scalar(-1.0).mean();
508
509 let v_pred = critic.forward(&s_mb);
510 let v_loss = v_pred
511 .sub_tensor(&ret_mb)
512 .pow_scalar(2.0)
513 .mean()
514 .mul_scalar(vf_coef);
515
516 // Entropy bonus from logits (categorical entropy) ≈ -sum p*logp
517 let probs_mb = logits_mb.softmax(1);
518 let logp_all = probs_mb.add_scalar(1e-8).log();
519 let ent = probs_mb
520 .mul_tensor(&logp_all)
521 .sum_dims(&[1], true)
522 .mul_scalar(-1.0)
523 .mean()
524 .mul_scalar(ent_coef);
525
526 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&ent);
527 loss.backward(None);
528
529 // Step actor
530 {
531 let params = actor.parameters();
532 let mut with_grads: Vec<&mut Tensor> = Vec::new();
533 for p in params {
534 if p.grad_owned().is_some() {
535 with_grads.push(p);
536 }
537 }
538 if !with_grads.is_empty() {
539 let _ = grad_global_norm(&mut with_grads);
540 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
541 actor_opt.step(&mut with_grads);
542 actor_opt.zero_grad(&mut with_grads);
543 }
544 }
545
546 // Step critic
547 {
548 let params = critic.parameters();
549 let mut with_grads: Vec<&mut Tensor> = Vec::new();
550 for p in params {
551 if p.grad_owned().is_some() {
552 with_grads.push(p);
553 }
554 }
555 if !with_grads.is_empty() {
556 let _ = grad_global_norm(&mut with_grads);
557 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
558 critic_opt.step(&mut with_grads);
559 critic_opt.zero_grad(&mut with_grads);
560 }
561 }
562
563 if e == 0 && mb == 0 {
564 println!(
565 "update@t={} | actor_loss={:.4} v_loss={:.4}",
566 t,
567 actor_loss.value(),
568 v_loss.value()
569 );
570 }
571
572 clear_all_graphs_known();
573 }
574 }
575 }
576
577 println!("=== PPO discrete training finished ===");
578 Ok(())
579}
208fn demonstrate_advanced_patterns() -> Result<(), Box<dyn std::error::Error>> {
209 println!("\n--- Advanced Iterator Patterns ---");
210
211 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![6])?;
212 println!("Input tensor: {:?}", tensor.data());
213
214 // Complex chain: enumerate -> filter -> map -> collect
215 println!("\nComplex chain (even indices only, add index to value):");
216 let result: Tensor = tensor
217 .iter()
218 .enumerate()
219 .filter(|(i, _)| i % 2 == 0) // Take even indices
220 .map(|(i, elem)| elem.add_scalar(i as f32)) // Add index to value
221 .collect();
222 println!(" Result: {:?}", result.data());
223
224 // Using take and skip for windowing
225 println!("\nWindowing with take and skip:");
226 let window1: Tensor = tensor.iter().take(3).collect();
227 let window2: Tensor = tensor.iter().skip(2).take(3).collect();
228 println!(" Window 1 (first 3): {:?}", window1.data());
229 println!(" Window 2 (middle 3): {:?}", window2.data());
230
231 // Using rev() for reverse iteration
232 println!("\nReverse iteration:");
233 let reversed: Tensor = tensor.iter().rev().collect();
234 println!(" Reversed: {:?}", reversed.data());
235
236 // Chaining with mathematical operations
237 println!("\nMathematical operation chain:");
238 let math_result: Tensor = tensor
239 .iter()
240 .map(|elem| elem.exp()) // e^x
241 .filter(|elem| elem.value() < 50.0) // Filter large values
242 .map(|elem| elem.log()) // ln(x)
243 .collect();
244 println!(" Math chain result: {:?}", math_result.data());
245
246 // Using zip for element-wise combinations
247 println!("\nElement-wise combination with zip:");
248 let tensor2 = Tensor::from_slice(&[10.0, 20.0, 30.0, 40.0, 50.0, 60.0], vec![6])?;
249 let combined: Tensor = tensor
250 .iter()
251 .zip(tensor2.iter())
252 .map(|(a, b)| a.mul_tensor(&b)) // Element-wise multiplication
253 .collect();
254 println!(" Combined: {:?}", combined.data());
255
256 Ok(())
257}
Source§impl Tensor
impl Tensor
Sourcepub fn matmul(&self, other: &Tensor) -> Tensor
pub fn matmul(&self, other: &Tensor) -> Tensor
Matrix multiplication with intelligent kernel dispatch
Performs matrix multiplication using optimized SIMD kernels selected based on:
- Runtime SIMD capability (AVX512/AVX2/SSE2/Scalar)
- Matrix operation type (1D@1D, 1D@2D, 2D@1D, 2D@2D, ND@ND)
- Matrix size classification (Small/Medium/Large)
- Memory alignment characteristics
§Arguments
other - Right-hand side tensor for multiplication
§Returns
Result tensor whose shape depends on the operation type (for example, 2D @ 2D yields a matrix, while 1D @ 1D yields a scalar)
§Panics
Panics if tensor shapes are incompatible for matrix multiplication
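Shapes are compatible when the inner dimensions agree: [m, k] @ [k, n] produces [m, n], so [2, 3] @ [3, 4] is valid while [2, 3] @ [4, 4] panics. A minimal sketch of the valid case:
use train_station::Tensor;
// Shared inner dimension (3) matches, so the product has shape [2, 4]
let a = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
let b = Tensor::from_slice(&[1.0; 12], vec![3, 4]).unwrap();
let c = a.matmul(&b);
assert_eq!(c.shape().dims(), vec![2, 4]);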
§Examples
use train_station::Tensor;
// 2D matrix multiplication
let a = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let b = Tensor::from_slice(&[5.0, 6.0, 7.0, 8.0], vec![2, 2]).unwrap();
let result = a.matmul(&b);
assert_eq!(result.shape().dims(), vec![2, 2]);
// 1D dot product
let a = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let b = Tensor::from_slice(&[4.0, 5.0, 6.0], vec![3]).unwrap();
let result = a.matmul(&b);
assert_eq!(result.shape().dims(), vec![]); // Scalar
Examples found in repository?
More examples
84fn demonstrate_default_adam() -> Result<(), Box<dyn std::error::Error>> {
85 println!("--- Default Adam Configuration ---");
86
87 // Create a simple regression problem: y = 2*x + 1
88 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5, 1]).unwrap();
89 let y_true = Tensor::from_slice(&[3.0, 5.0, 7.0, 9.0, 11.0], vec![5, 1]).unwrap();
90
91 // Create model parameters
92 let mut weight = Tensor::randn(vec![1, 1], Some(42)).with_requires_grad();
93 let mut bias = Tensor::zeros(vec![1]).with_requires_grad();
94
95 // Create Adam optimizer with default configuration
96 let mut optimizer = Adam::new();
97 optimizer.add_parameter(&weight);
98 optimizer.add_parameter(&bias);
99
100 println!("Default Adam configuration:");
101 println!(" Learning rate: {}", optimizer.learning_rate());
102 println!(" Initial weight: {:.6}", weight.value());
103 println!(" Initial bias: {:.6}", bias.value());
104
105 // Training loop
106 let num_epochs = 50;
107 let mut losses = Vec::new();
108
109 for epoch in 0..num_epochs {
110 // Forward pass
111 let y_pred = x_data.matmul(&weight) + &bias;
112 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
113
114 // Backward pass
115 loss.backward(None);
116
117 // Optimizer step
118 optimizer.step(&mut [&mut weight, &mut bias]);
119 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
120
121 losses.push(loss.value());
122
123 if epoch % 10 == 0 || epoch == num_epochs - 1 {
124 println!("Epoch {:3}: Loss = {:.6}", epoch, loss.value());
125 }
126 }
127
128 // Evaluate final model
129 let _final_predictions = x_data.matmul(&weight) + &bias;
130 println!("\nFinal model:");
131 println!(" Learned weight: {:.6} (target: 2.0)", weight.value());
132 println!(" Learned bias: {:.6} (target: 1.0)", bias.value());
133 println!(" Final loss: {:.6}", losses[losses.len() - 1]);
134
135 Ok(())
136}
137
138/// Demonstrate learning rate comparison
139fn demonstrate_learning_rate_comparison() -> Result<(), Box<dyn std::error::Error>> {
140 println!("\n--- Learning Rate Comparison ---");
141
142 let learning_rates = [0.001, 0.01, 0.1];
143 let mut results = Vec::new();
144
145 for &lr in &learning_rates {
146 println!("\nTesting learning rate: {}", lr);
147
148 let stats = train_with_config(TrainingConfig {
149 learning_rate: lr,
150 ..Default::default()
151 })?;
152
153 results.push((lr, stats.clone()));
154
155 println!(" Final loss: {:.6}", stats.final_loss);
156 println!(" Convergence epoch: {}", stats.convergence_epoch);
157 }
158
159 // Compare results
160 println!("\nLearning Rate Comparison Summary:");
161 for (lr, stats) in &results {
162 println!(
163 " LR={:6}: Loss={:.6}, Converged@{}",
164 lr, stats.final_loss, stats.convergence_epoch
165 );
166 }
167
168 Ok(())
169}
170
171/// Demonstrate weight decay comparison
172fn demonstrate_weight_decay_comparison() -> Result<(), Box<dyn std::error::Error>> {
173 println!("\n--- Weight Decay Comparison ---");
174
175 let weight_decays = [0.0, 0.001, 0.01];
176 let mut results = Vec::new();
177
178 for &wd in &weight_decays {
179 println!("\nTesting weight decay: {}", wd);
180
181 let stats = train_with_config(TrainingConfig {
182 weight_decay: wd,
183 ..Default::default()
184 })?;
185
186 results.push((wd, stats.clone()));
187
188 println!(" Final loss: {:.6}", stats.final_loss);
189 println!(" Final weight norm: {:.6}", stats.weight_norm);
190 }
191
192 // Compare results
193 println!("\nWeight Decay Comparison Summary:");
194 for (wd, stats) in &results {
195 println!(
196 " WD={:6}: Loss={:.6}, Weight Norm={:.6}",
197 wd, stats.final_loss, stats.weight_norm
198 );
199 }
200
201 Ok(())
202}
203
204/// Demonstrate beta parameter tuning
205fn demonstrate_beta_parameter_tuning() -> Result<(), Box<dyn std::error::Error>> {
206 println!("\n--- Beta Parameter Tuning ---");
207
208 let beta_configs = [
209 (0.9, 0.999), // Default
210 (0.8, 0.999), // More aggressive momentum
211 (0.95, 0.999), // Less aggressive momentum
212 (0.9, 0.99), // Faster second moment decay
213 ];
214
215 let mut results = Vec::new();
216
217 for (i, (beta1, beta2)) in beta_configs.iter().enumerate() {
218 println!(
219 "\nTesting beta configuration {}: beta1={}, beta2={}",
220 i + 1,
221 beta1,
222 beta2
223 );
224
225 let config = TrainingConfig {
226 beta1: *beta1,
227 beta2: *beta2,
228 ..Default::default()
229 };
230
231 let stats = train_with_config(config)?;
232 results.push(((*beta1, *beta2), stats.clone()));
233
234 println!(" Final loss: {:.6}", stats.final_loss);
235 println!(" Convergence epoch: {}", stats.convergence_epoch);
236 }
237
238 // Compare results
239 println!("\nBeta Parameter Comparison Summary:");
240 for ((beta1, beta2), stats) in &results {
241 println!(
242 " B1={:4}, B2={:5}: Loss={:.6}, Converged@{}",
243 beta1, beta2, stats.final_loss, stats.convergence_epoch
244 );
245 }
246
247 Ok(())
248}
249
250/// Demonstrate configuration benchmarking
251fn demonstrate_configuration_benchmarking() -> Result<(), Box<dyn std::error::Error>> {
252 println!("\n--- Configuration Benchmarking ---");
253
254 // Define configurations to benchmark
255 let configs = vec![
256 (
257 "Conservative",
258 TrainingConfig {
259 learning_rate: 0.001,
260 weight_decay: 0.001,
261 beta1: 0.95,
262 ..Default::default()
263 },
264 ),
265 (
266 "Balanced",
267 TrainingConfig {
268 learning_rate: 0.01,
269 weight_decay: 0.0,
270 beta1: 0.9,
271 ..Default::default()
272 },
273 ),
274 (
275 "Aggressive",
276 TrainingConfig {
277 learning_rate: 0.1,
278 weight_decay: 0.0,
279 beta1: 0.8,
280 ..Default::default()
281 },
282 ),
283 ];
284
285 let mut benchmark_results = Vec::new();
286
287 for (name, config) in configs {
288 println!("\nBenchmarking {} configuration:", name);
289
290 let start_time = std::time::Instant::now();
291 let stats = train_with_config(config.clone())?;
292 let elapsed = start_time.elapsed();
293
294 println!(" Training time: {:.2}ms", elapsed.as_millis());
295 println!(" Final loss: {:.6}", stats.final_loss);
296 println!(" Convergence: {} epochs", stats.convergence_epoch);
297
298 benchmark_results.push((name.to_string(), stats, elapsed));
299 }
300
301 // Summary
302 println!("\nBenchmarking Summary:");
303 for (name, stats, elapsed) in &benchmark_results {
304 println!(
305 " {:12}: Loss={:.6}, Time={:4}ms, Converged@{}",
306 name,
307 stats.final_loss,
308 elapsed.as_millis(),
309 stats.convergence_epoch
310 );
311 }
312
313 Ok(())
314}
315
316/// Helper function to train with specific configuration
317fn train_with_config(config: TrainingConfig) -> Result<TrainingStats, Box<dyn std::error::Error>> {
318 // Create training data
319 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5, 1]).unwrap();
320 let y_true = Tensor::from_slice(&[3.0, 5.0, 7.0, 9.0, 11.0], vec![5, 1]).unwrap();
321
322 // Create model parameters
323 let mut weight = Tensor::randn(vec![1, 1], Some(123)).with_requires_grad();
324 let mut bias = Tensor::zeros(vec![1]).with_requires_grad();
325
326 // Create optimizer with custom configuration
327 let adam_config = AdamConfig {
328 learning_rate: config.learning_rate,
329 beta1: config.beta1,
330 beta2: config.beta2,
331 eps: 1e-8,
332 weight_decay: config.weight_decay,
333 amsgrad: false,
334 };
335
336 let mut optimizer = Adam::with_config(adam_config);
337 optimizer.add_parameter(&weight);
338 optimizer.add_parameter(&bias);
339
340 // Training loop
341 let mut losses = Vec::new();
342 let mut convergence_epoch = config.epochs;
343
344 for epoch in 0..config.epochs {
345 // Forward pass
346 let y_pred = x_data.matmul(&weight) + &bias;
347 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
348
349 // Backward pass
350 loss.backward(None);
351
352 // Optimizer step
353 optimizer.step(&mut [&mut weight, &mut bias]);
354 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
355
356 let loss_value = loss.value();
357 losses.push(loss_value);
358
359 // Check for convergence (loss < 0.01)
360 if loss_value < 0.01 && convergence_epoch == config.epochs {
361 convergence_epoch = epoch;
362 }
363 }
364
365 Ok(TrainingStats {
366 config,
367 final_loss: losses[losses.len() - 1],
368 loss_history: losses,
369 convergence_epoch,
370 weight_norm: weight.norm().value(),
371 })
372}
319fn train_with_scheduler(
320 scheduler: &mut dyn LearningRateScheduler,
321 num_epochs: usize,
322) -> Result<TrainingStats, Box<dyn std::error::Error>> {
323 // Create training data: y = 2*x + 1
324 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5, 1]).unwrap();
325 let y_true = Tensor::from_slice(&[3.0, 5.0, 7.0, 9.0, 11.0], vec![5, 1]).unwrap();
326
327 // Create model parameters
328 let mut weight = Tensor::randn(vec![1, 1], Some(456)).with_requires_grad();
329 let mut bias = Tensor::zeros(vec![1]).with_requires_grad();
330
331 // Create optimizer with initial learning rate
332 let mut optimizer = Adam::with_learning_rate(0.05);
333 optimizer.add_parameter(&weight);
334 optimizer.add_parameter(&bias);
335
336 // Training loop
337 let mut losses = Vec::new();
338 let mut lr_history = Vec::new();
339 let mut convergence_epoch = num_epochs;
340
341 for epoch in 0..num_epochs {
342 // Forward pass
343 let y_pred = x_data.matmul(&weight) + &bias;
344 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
345
346 // Backward pass
347 loss.backward(None);
348
349 // Update learning rate using scheduler
350 let current_lr = optimizer.learning_rate();
351 let new_lr = scheduler.step(current_lr, epoch, loss.value());
352
353 if (new_lr - current_lr).abs() > 1e-8 {
354 optimizer.set_learning_rate(new_lr);
355 }
356
357 // Optimizer step
358 optimizer.step(&mut [&mut weight, &mut bias]);
359 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
360
361 let loss_value = loss.value();
362 losses.push(loss_value);
363 lr_history.push(new_lr);
364
365 // Check for convergence
366 if loss_value < 0.01 && convergence_epoch == num_epochs {
367 convergence_epoch = epoch;
368 }
369 }
370
371 Ok(TrainingStats {
372 scheduler_name: scheduler.name().to_string(),
373 final_loss: losses[losses.len() - 1],
374 lr_history,
375 loss_history: losses,
376 convergence_epoch,
377 })
378}
105fn demonstrate_linear_regression() -> Result<(), Box<dyn std::error::Error>> {
106 println!("\n--- Linear Regression Training ---");
107
108 // Create model parameters
109 let mut weight = Tensor::randn(vec![1, 1], Some(43)).with_requires_grad();
110 let mut bias = Tensor::zeros(vec![1]).with_requires_grad();
111
112 // Create optimizer
113 let mut optimizer = Adam::with_learning_rate(0.01);
114 optimizer.add_parameter(&weight);
115 optimizer.add_parameter(&bias);
116
117 // Create simple training data: y = 2*x + 1
118 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5, 1]).unwrap();
119 let y_true = Tensor::from_slice(&[3.0, 5.0, 7.0, 9.0, 11.0], vec![5, 1]).unwrap();
120
121 println!("Training data:");
122 println!(" X: {:?}", x_data.data());
123 println!(" Y: {:?}", y_true.data());
124 println!(" Target: y = 2*x + 1");
125
126 // Training loop
127 let num_epochs = 100;
128 let mut losses = Vec::new();
129
130 for epoch in 0..num_epochs {
131 // Forward pass: y_pred = x * weight + bias
132 let y_pred = x_data.matmul(&weight) + &bias;
133
134 // Compute loss: MSE
135 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
136
137 // Backward pass
138 loss.backward(None);
139
140 // Optimizer step
141 optimizer.step(&mut [&mut weight, &mut bias]);
142 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
143
144 losses.push(loss.value());
145
146 // Print progress every 20 epochs
147 if epoch % 20 == 0 || epoch == num_epochs - 1 {
148 println!("Epoch {:3}: Loss = {:.6}", epoch, loss.value());
149 }
150 }
151
152 // Evaluate final model
153 let final_predictions = x_data.matmul(&weight) + &bias;
154 println!("\nFinal model evaluation:");
155 println!(" Learned weight: {:.6}", weight.value());
156 println!(" Learned bias: {:.6}", bias.value());
157 println!(" Predictions vs True:");
158
159 for i in 0..5 {
160 let x1 = x_data.data()[i];
161 let pred = final_predictions.data()[i];
162 let true_val = y_true.data()[i];
163 println!(
164 " x={:.1}: pred={:.3}, true={:.1}, error={:.3}",
165 x1,
166 pred,
167 true_val,
168 (pred - true_val).abs()
169 );
170 }
171
172 Ok(())
173}
174
175/// Demonstrate advanced training patterns
176fn demonstrate_advanced_training() -> Result<(), Box<dyn std::error::Error>> {
177 println!("\n--- Advanced Training Patterns ---");
178
179 // Create a more complex model
180 let mut weight = Tensor::randn(vec![1, 2], Some(44)).with_requires_grad();
181 let mut bias = Tensor::zeros(vec![2]).with_requires_grad();
182
183 // Create optimizer with different learning rate
184 let mut optimizer = Adam::with_learning_rate(0.005);
185 optimizer.add_parameter(&weight);
186 optimizer.add_parameter(&bias);
187
188 // Create training data: y = 2*x + [1, 3]
189 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5, 1]).unwrap();
190 let y_true = Tensor::from_slice(
191 &[3.0, 5.0, 7.0, 9.0, 11.0, 6.0, 8.0, 10.0, 12.0, 14.0],
192 vec![5, 2],
193 )
194 .unwrap();
195
196 println!("Advanced training with monitoring:");
197 println!(" Initial learning rate: {}", optimizer.learning_rate());
198
199 // Training loop with monitoring
200 let num_epochs = 50;
201 let mut losses = Vec::new();
202 let mut weight_norms = Vec::new();
203 let mut gradient_norms = Vec::new();
204
205 for epoch in 0..num_epochs {
206 // Forward pass
207 let y_pred = x_data.matmul(&weight) + &bias;
208 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
209
210 // Backward pass
211 loss.backward(None);
212
213 // Compute gradient norm before optimizer step
214 let gradient_norm = weight.grad_owned().unwrap().norm();
215
216 // Optimizer step
217 optimizer.step(&mut [&mut weight, &mut bias]);
218 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
219
220 // Learning rate scheduling: reduce every 10 epochs
221 if epoch > 0 && epoch % 10 == 0 {
222 let current_lr = optimizer.learning_rate();
223 let new_lr = current_lr * 0.5;
224 optimizer.set_learning_rate(new_lr);
225 println!(
226 "Epoch {:2}: Reduced learning rate from {:.3} to {:.3}",
227 epoch, current_lr, new_lr
228 );
229 }
230
231 // Record metrics
232 losses.push(loss.value());
233 weight_norms.push(weight.norm().value());
234 gradient_norms.push(gradient_norm.value());
235
236 // Print detailed progress
237 if epoch % 10 == 0 || epoch == num_epochs - 1 {
238 println!(
239 "Epoch {:2}: Loss = {:.6}, Weight Norm = {:.6}, Gradient Norm = {:.6}",
240 epoch,
241 loss.value(),
242 weight.norm().value(),
243 gradient_norm.value()
244 );
245 }
246 }
247
248 println!("Final learning rate: {}", optimizer.learning_rate());
249
250 // Analyze training progression
251 let initial_loss = losses[0];
252 let final_loss = losses[losses.len() - 1];
253 let loss_reduction = (initial_loss - final_loss) / initial_loss * 100.0;
254
255 println!("\nTraining Analysis:");
256 println!(" Initial loss: {:.6}", initial_loss);
257 println!(" Final loss: {:.6}", final_loss);
258 println!(" Loss reduction: {:.1}%", loss_reduction);
259 println!(" Final weight norm: {:.6}", weight.norm().value());
260 println!(" Final bias: {:?}", bias.data());
261
262 Ok(())
263}
264
265/// Demonstrate learning rate scheduling
266fn demonstrate_learning_rate_scheduling() -> Result<(), Box<dyn std::error::Error>> {
267 println!("\n--- Learning Rate Scheduling ---");
268
269 // Create simple model
270 let mut weight = Tensor::randn(vec![1, 1], Some(45)).with_requires_grad();
271 let mut bias = Tensor::zeros(vec![1]).with_requires_grad();
272
273 // Create optimizer with high initial learning rate
274 let mut optimizer = Adam::with_learning_rate(0.1);
275 optimizer.add_parameter(&weight);
276 optimizer.add_parameter(&bias);
277
278 // Simple data
279 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3, 1]).unwrap();
280 let y_true = Tensor::from_slice(&[2.0, 4.0, 6.0], vec![3, 1]).unwrap();
281
282 println!("Initial learning rate: {}", optimizer.learning_rate());
283
284 // Training loop with learning rate scheduling
285 let num_epochs = 50;
286 let mut losses = Vec::new();
287
288 for epoch in 0..num_epochs {
289 // Forward pass
290 let y_pred = x_data.matmul(&weight) + &bias;
291 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
292
293 // Backward pass
294 loss.backward(None);
295
296 // Optimizer step
297 optimizer.step(&mut [&mut weight, &mut bias]);
298 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
299
300 // Learning rate scheduling: reduce every 10 epochs
301 if epoch > 0 && epoch % 10 == 0 {
302 let current_lr = optimizer.learning_rate();
303 let new_lr = current_lr * 0.5;
304 optimizer.set_learning_rate(new_lr);
305 println!(
306 "Epoch {:2}: Reduced learning rate from {:.3} to {:.3}",
307 epoch, current_lr, new_lr
308 );
309 }
310
311 losses.push(loss.value());
312
313 // Print progress
314 if epoch % 10 == 0 || epoch == num_epochs - 1 {
315 println!(
316 "Epoch {:2}: Loss = {:.6}, LR = {:.3}",
317 epoch,
318 loss.value(),
319 optimizer.learning_rate()
320 );
321 }
322 }
323
324 println!("Final learning rate: {}", optimizer.learning_rate());
325
326 Ok(())
327}
328
329/// Demonstrate training monitoring and analysis
330fn demonstrate_training_monitoring() -> Result<(), Box<dyn std::error::Error>> {
331 println!("\n--- Training Monitoring ---");
332
333 // Create model
334 let mut weight = Tensor::randn(vec![1, 1], Some(46)).with_requires_grad();
335 let mut bias = Tensor::zeros(vec![1]).with_requires_grad();
336
337 // Create optimizer
338 let mut optimizer = Adam::with_learning_rate(0.01);
339 optimizer.add_parameter(&weight);
340 optimizer.add_parameter(&bias);
341
342 // Training data
343 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![4, 1]).unwrap();
344 let y_true = Tensor::from_slice(&[3.0, 5.0, 7.0, 9.0], vec![4, 1]).unwrap();
345
346 // Training loop with comprehensive monitoring
347 let num_epochs = 30;
348 let mut losses = Vec::new();
349 let mut weight_history = Vec::new();
350 let mut bias_history = Vec::new();
351
352 for epoch in 0..num_epochs {
353 // Forward pass
354 let y_pred = x_data.matmul(&weight) + &bias;
355 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
356
357 // Backward pass
358 loss.backward(None);
359
360 // Optimizer step
361 optimizer.step(&mut [&mut weight, &mut bias]);
362 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
363
364 // Record history
365 losses.push(loss.value());
366 weight_history.push(weight.value());
367 bias_history.push(bias.value());
368
369 // Print detailed monitoring
370 if epoch % 5 == 0 || epoch == num_epochs - 1 {
371 println!(
372 "Epoch {:2}: Loss = {:.6}, Weight = {:.6}, Bias = {:.6}",
373 epoch,
374 loss.value(),
375 weight.value(),
376 bias.value()
377 );
378 }
379 }
380
381 // Analyze training progression
382 println!("\nTraining Analysis:");
383 println!(" Initial loss: {:.6}", losses[0]);
384 println!(" Final loss: {:.6}", losses[losses.len() - 1]);
385 println!(
386 " Loss reduction: {:.1}%",
387 (losses[0] - losses[losses.len() - 1]) / losses[0] * 100.0
388 );
389
390 // Compute statistics
391 let loss_mean = compute_mean(&losses);
392 let loss_std = compute_std(&losses);
393 let weight_change = (weight_history[weight_history.len() - 1] - weight_history[0]).abs();
394 let bias_change = (bias_history[bias_history.len() - 1] - bias_history[0]).abs();
395
396 println!(" Average loss: {:.6} ± {:.6}", loss_mean, loss_std);
397 println!(" Weight change: {:.6}", weight_change);
398 println!(" Bias change: {:.6}", bias_change);
399 println!(" Final weight norm: {:.6}", weight.norm().value());
400 println!(" Final bias: {:.6}", bias.value());
401
402 Ok(())
403}
72 pub fn forward(
73 &self,
74 query: &Tensor,
75 key: &Tensor,
76 value: &Tensor,
77 attn_mask: Option<&Tensor>,
78 ) -> Tensor {
79 let qkv = Self::project_qkv(query, key, value, &self.q_proj, &self.k_proj, &self.v_proj);
80 let (q, k, v) = qkv;
81
82 // Split heads: [b, t, e] -> [b, h, t, d]
83 let (b, tq, _e) = Self::triple(query);
84 let (_b2, tk, _e2) = Self::triple(key);
85 let q = Self::split_heads(&q, b, tq, self.num_heads, self.head_dim);
86 let k = Self::split_heads(&k, b, tk, self.num_heads, self.head_dim);
87 let v = Self::split_heads(&v, b, tk, self.num_heads, self.head_dim);
88
89 // Scaled dot-product attention
90 // logits: [b, h, tq, tk]
91 let k_t = k.transpose(2, 3);
92 let mut logits = q.matmul(&k_t).div_scalar((self.head_dim as f32).sqrt());
93 if let Some(mask) = attn_mask {
94 let dims = mask.shape().dims().to_vec();
95 // If boolean-like mask matching [b,h,tq,tk], apply masked_fill
96 if dims.len() == 4 && dims[0] == b && dims[1] == self.num_heads && dims[2] == tq {
97 // Interpret mask > 0.5 as keep; we invert to build masked positions
98 let cond: Vec<bool> = mask.data().iter().map(|&v| v < 0.5).collect();
99 // Apply masked fill on a flattened view, then reshape back
100 let flat_logits = logits.view(vec![(b * self.num_heads * tq * tk) as i32]);
101 let filled = flat_logits.masked_fill(&cond, f32::NEG_INFINITY);
102 logits = filled.view(vec![b as i32, self.num_heads as i32, tq as i32, tk as i32]);
103 } else {
104 // Fallback: additive mask
105 logits = logits.add_tensor(mask);
106 }
107 }
108 let attn = logits.softmax(3);
109
110 // context: [b, h, tq, d]
111 let context = attn.matmul(&v);
112 let context = context.permute(vec![0, 2, 1, 3]); // [b, tq, h, d]
113 let context = context.contiguous().view(vec![
114 b as i32,
115 tq as i32,
116 (self.num_heads * self.head_dim) as i32,
117 ]);
118
119 // Output projection (flatten to 2D, project, then restore 3D)
120 let flat = context.view(vec![(b * tq) as i32, self.embed_dim as i32]);
121 let out2d = self.out_proj.forward(&flat);
122 out2d.view(vec![b as i32, tq as i32, self.embed_dim as i32])
123}
Source§impl Tensor
impl Tensor
Sourcepub fn mul_tensor(&self, other: &Tensor) -> Tensor
pub fn mul_tensor(&self, other: &Tensor) -> Tensor
Element-wise multiplication with another tensor with broadcasting support.
Performs element-wise multiplication with automatic broadcasting: output[i] = self[i] * other[i]
Broadcasting enables multiplication between tensors of different but compatible shapes. Compatible shapes follow NumPy broadcasting rules:
- Dimensions are aligned from the rightmost dimension
- Dimensions are compatible if they are equal, or one of them is 1
- Missing dimensions are treated as 1
§Arguments
other - Tensor to multiply. Shapes must be broadcast-compatible.
§Returns
A new tensor containing the element-wise product with broadcast result shape
§Examples
§Same Shape Multiplication
use train_station::Tensor;
let a = Tensor::from_slice(&[2.0, 3.0, 4.0], vec![3]).unwrap();
let b = Tensor::from_slice(&[5.0, 6.0, 7.0], vec![3]).unwrap();
let c = a.mul_tensor(&b);
assert_eq!(c.shape().dims(), vec![3]);
assert_eq!(c.get(&[0]), 10.0); // 2.0 * 5.0
assert_eq!(c.get(&[1]), 18.0); // 3.0 * 6.0
assert_eq!(c.get(&[2]), 28.0); // 4.0 * 7.0
§Broadcasting Multiplication
use train_station::Tensor;
let a = Tensor::from_slice(&[2.0, 3.0], vec![2, 1]).unwrap();
let b = Tensor::from_slice(&[10.0, 20.0, 30.0], vec![1, 3]).unwrap();
let c = a.mul_tensor(&b);
assert_eq!(c.shape().dims(), vec![2, 3]);
// Result: [[20.0, 40.0, 60.0], [30.0, 60.0, 90.0]]
assert_eq!(c.get(&[0, 0]), 20.0); // 2.0 * 10.0
assert_eq!(c.get(&[0, 1]), 40.0); // 2.0 * 20.0
assert_eq!(c.get(&[1, 0]), 30.0); // 3.0 * 10.0
§Panics
Panics if tensor shapes are not broadcast-compatible
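Under the rightmost-alignment rule above, a [3] tensor broadcasts across each row of a [2, 3] tensor, while a pairing such as [2, 3] with [4] panics. A minimal sketch of the compatible case:
use train_station::Tensor;
// [2, 3] * [3]: trailing dimensions match, so the [3] values multiply both rows
let a = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
let b = Tensor::from_slice(&[10.0, 100.0, 1000.0], vec![3]).unwrap();
let c = a.mul_tensor(&b);
assert_eq!(c.shape().dims(), vec![2, 3]);
assert_eq!(c.get(&[1, 2]), 6000.0); // 6.0 * 1000.0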
Examples found in repository?
More examples
89fn demonstrate_basic_operations() {
90 println!("\n--- Basic Operations ---");
91
92 let a = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
93 let b = Tensor::from_slice(&[5.0, 6.0, 7.0, 8.0], vec![2, 2]).unwrap();
94
95 // Addition
96 let sum = a.add_tensor(&b);
97 println!("A + B: {:?}", sum.data());
98
99 // Subtraction
100 let diff = a.sub_tensor(&b);
101 println!("A - B: {:?}", diff.data());
102
103 // Multiplication
104 let product = a.mul_tensor(&b);
105 println!("A * B: {:?}", product.data());
106
107 // Division
108 let quotient = a.div_tensor(&b);
109 println!("A / B: {:?}", quotient.data());
110
111 // Scalar operations
112 let scalar_add = a.add_scalar(5.0);
113 println!("A + 5.0: {:?}", scalar_add.data());
114
115 let scalar_mul = a.mul_scalar(2.0);
116 println!("A * 2.0: {:?}", scalar_mul.data());
117}
203fn demonstrate_method_equivalence() {
204 println!("\n--- Operator vs Method Call Equivalence ---");
205
206 let a = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
207 let b = Tensor::from_slice(&[5.0, 6.0, 7.0, 8.0], vec![2, 2]).unwrap();
208
209 // Addition: operator vs method
210 let operator_result = &a + &b;
211 let method_result = a.add_tensor(&b);
212
213 println!("A + B (operator): {:?}", operator_result.data());
214 println!("A.add_tensor(B): {:?}", method_result.data());
215 println!(
216 "Results are equal: {}",
217 operator_result.data() == method_result.data()
218 );
219
220 // Multiplication: operator vs method
221 let operator_result = &a * &b;
222 let method_result = a.mul_tensor(&b);
223
224 println!("A * B (operator): {:?}", operator_result.data());
225 println!("A.mul_tensor(B): {:?}", method_result.data());
226 println!(
227 "Results are equal: {}",
228 operator_result.data() == method_result.data()
229 );
230
231 // Scalar addition: operator vs method
232 let operator_result = &a + 5.0;
233 let method_result = a.add_scalar(5.0);
234
235 println!("A + 5.0 (operator): {:?}", operator_result.data());
236 println!("A.add_scalar(5.0): {:?}", method_result.data());
237 println!(
238 "Results are equal: {}",
239 operator_result.data() == method_result.data()
240 );
241}
208fn demonstrate_advanced_patterns() -> Result<(), Box<dyn std::error::Error>> {
209 println!("\n--- Advanced Iterator Patterns ---");
210
211 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![6])?;
212 println!("Input tensor: {:?}", tensor.data());
213
214 // Complex chain: enumerate -> filter -> map -> collect
215 println!("\nComplex chain (even indices only, add index to value):");
216 let result: Tensor = tensor
217 .iter()
218 .enumerate()
219 .filter(|(i, _)| i % 2 == 0) // Take even indices
220 .map(|(i, elem)| elem.add_scalar(i as f32)) // Add index to value
221 .collect();
222 println!(" Result: {:?}", result.data());
223
224 // Using take and skip for windowing
225 println!("\nWindowing with take and skip:");
226 let window1: Tensor = tensor.iter().take(3).collect();
227 let window2: Tensor = tensor.iter().skip(2).take(3).collect();
228 println!(" Window 1 (first 3): {:?}", window1.data());
229 println!(" Window 2 (middle 3): {:?}", window2.data());
230
231 // Using rev() for reverse iteration
232 println!("\nReverse iteration:");
233 let reversed: Tensor = tensor.iter().rev().collect();
234 println!(" Reversed: {:?}", reversed.data());
235
236 // Chaining with mathematical operations
237 println!("\nMathematical operation chain:");
238 let math_result: Tensor = tensor
239 .iter()
240 .map(|elem| elem.exp()) // e^x
241 .filter(|elem| elem.value() < 50.0) // Filter large values
242 .map(|elem| elem.log()) // ln(x)
243 .collect();
244 println!(" Math chain result: {:?}", math_result.data());
245
246 // Using zip for element-wise combinations
247 println!("\nElement-wise combination with zip:");
248 let tensor2 = Tensor::from_slice(&[10.0, 20.0, 30.0, 40.0, 50.0, 60.0], vec![6])?;
249 let combined: Tensor = tensor
250 .iter()
251 .zip(tensor2.iter())
252 .map(|(a, b)| a.mul_tensor(&b)) // Element-wise multiplication
253 .collect();
254 println!(" Combined: {:?}", combined.data());
255
256 Ok(())
257}
333pub fn main() -> Result<(), Box<dyn std::error::Error>> {
334 println!("=== DQN Example (YardEnv discrete) ===");
335
336 // Dims
337 let state_dim = 3usize;
338 let action_dim = 3usize;
339
340 // Hparams
341 let gamma = 0.99f32;
342 let batch_size = 64usize;
343 let start_steps = 200usize;
344 let target_update_interval = 200usize; // hard update cadence
345 let max_grad_norm = 1.0f32;
346 let mut epsilon = 1.0f32;
347 let eps_min = 0.05f32;
348 let eps_decay_steps = 2_000usize; // linear decay
349 let total_steps = std::env::var("DQN_STEPS")
350 .ok()
351 .and_then(|v| v.parse::<usize>().ok())
352 .unwrap_or(3000usize);
353
354 // Models
355 let mut q_net = QNet::new(state_dim, action_dim, Some(7));
356 let mut q_targ = QNet::new(state_dim, action_dim, Some(8));
357 q_targ.net.copy_from(&q_net.net);
358 q_targ.set_requires_grad_all(false);
359
360 // Optimizer
361 let mut q_opt = Adam::with_learning_rate(3e-4);
362 for p in q_net.parameters() {
363 q_opt.add_parameter(p);
364 }
365
366 // Replay + env
367 let mut rb = ReplayBuffer::new(100_000, state_dim);
368 let mut env = YardEnv::new(12345);
369 let mut rng = SmallRng::new(999_111);
370
371 // Metrics
372 let mut state = env.reset();
373 let mut episode_return = 0.0f32;
374 let mut episode = 0usize;
375 let mut ema_return: Option<f32> = None;
376 let ema_alpha = 0.05f32;
377 let mut best_return = f32::NEG_INFINITY;
378
379 for t in 0..total_steps {
380 // Epsilon-greedy action
381 let action_index = if t < start_steps || rng.next_f32() < epsilon {
382 rng.sample_index(action_dim)
383 } else {
384 let _ng = NoGradTrack::new();
385 let q_vals = q_net.forward(&state);
386 let row = q_vals.data();
387 let mut best_i = 0usize;
388 let mut best_v = row[0];
389 for (i, &r) in row.iter().enumerate().take(action_dim).skip(1) {
390 if r > best_v {
391 best_v = r;
392 best_i = i;
393 }
394 }
395 best_i
396 };
397
398 // Env step
399 let (next_state, reward, done) = env.step(action_index);
400 episode_return += reward;
401
402 // Store
403 let s_slice = state.data().to_vec();
404 let s2_slice = next_state.data().to_vec();
405 rb.push(
406 &s_slice,
407 action_index,
408 reward,
409 if done { 1.0 } else { 0.0 },
410 &s2_slice,
411 );
412
413 // Reset on done
414 state = if done {
415 let st = env.reset();
416 ema_return = Some(match ema_return {
417 None => episode_return,
418 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
419 });
420 if episode_return > best_return {
421 best_return = episode_return;
422 }
423 println!(
424 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3} | rb_size={}",
425 t,
426 episode,
427 episode_return,
428 ema_return.unwrap_or(episode_return),
429 best_return,
430 rb.size
431 );
432 episode_return = 0.0;
433 episode += 1;
434 st
435 } else {
436 next_state
437 };
438
439 // Epsilon linear decay
440 if t < eps_decay_steps {
441 epsilon = (1.0 - (t as f32) / (eps_decay_steps as f32)) * (1.0 - eps_min) + eps_min;
442 }
443
444 // Train
445 if rb.can_sample(batch_size) {
446 let (s, a_idx, r, d, s2) = rb.sample(batch_size, &mut rng);
447
448 // Double DQN target: a* = argmax_a Q_online(s2,a); y = r + (1-d)*gamma*Q_target(s2, a*)
449 let target_q = {
450 let _ng = NoGradTrack::new();
451 let q_online_s2 = q_net.forward(&s2);
452 // argmax per row (manual on CPU)
453 let row_stride = action_dim;
454 let qd = q_online_s2.data();
455 let mut next_actions: Vec<usize> = Vec::with_capacity(batch_size);
456 for i in 0..batch_size {
457 let base = i * row_stride;
458 let mut bi = 0usize;
459 let mut bv = qd[base];
460 for j in 1..action_dim {
461 let v = qd[base + j];
462 if v > bv {
463 bv = v;
464 bi = j;
465 }
466 }
467 next_actions.push(bi);
468 }
469 let q_targ_s2 = q_targ.forward(&s2);
470 let q_targ_g = q_targ_s2.gather(1, &next_actions, &[batch_size, 1]);
471 let not_done = Tensor::ones(vec![batch_size, 1]).sub_tensor(&d);
472 r.add_tensor(&not_done.mul_scalar(gamma).mul_tensor(&q_targ_g))
473 };
474
475 // Q(s,a) for current actions
476 // Zero grads first
477 {
478 let mut params = q_net.parameters();
479 q_opt.zero_grad(&mut params);
480 }
481
482 let q_all = q_net.forward(&s);
483 let q_sa = q_all.gather(1, &a_idx, &[batch_size, 1]);
484 let diff = q_sa.sub_tensor(&target_q);
485 let mut loss = pseudo_huber_mean(&diff);
486 loss.backward(None);
487
488 // Step (filter only params with grads)
489 {
490 let params = q_net.parameters();
491 let mut with_grads: Vec<&mut Tensor> = Vec::new();
492 for p in params {
493 if p.grad_owned().is_some() {
494 with_grads.push(p);
495 }
496 }
497 if !with_grads.is_empty() {
498 let gn = grad_global_norm(&mut with_grads);
499 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
500 q_opt.step(&mut with_grads);
501 q_opt.zero_grad(&mut with_grads);
502 if t % 100 == 0 {
503 let mut pn = q_net.parameters();
504 let pn_l2 = params_l2_norm(&mut pn);
505 let q_mean = q_all.mean().value();
506 println!(
507 "t={:5} | loss={:.4} | q_mean={:.3} | grad_norm={:.3} | param_norm={:.3} | eps={:.3}",
508 t, loss.value(), q_mean, gn, pn_l2, epsilon
509 );
510 }
511 }
512 }
513
514 // Target hard update
515 if t % target_update_interval == 0 {
516 q_targ.net.copy_from(&q_net.net);
517 }
518
519 // Clear graphs
520 clear_all_graphs_known();
521 }
522 }
523
524 println!("=== DQN training finished ===");
525 Ok(())
526}
319pub fn main() -> Result<(), Box<dyn std::error::Error>> {
320 println!("=== PPO Discrete Example (YardEnv) ===");
321
322 let state_dim = 3usize;
323 let action_dim = 3usize;
324 let total_steps = std::env::var("PPOD_STEPS")
325 .ok()
326 .and_then(|v| v.parse::<usize>().ok())
327 .unwrap_or(3500usize);
328 let horizon = 128usize;
329 let epochs = 4usize;
330 let mini_batch_size = 64usize;
331 let gamma = 0.99f32;
332 let lam = 0.95f32;
333 let clip_eps = 0.2f32;
334 let vf_coef = 0.5f32;
335 let ent_coef = 0.0f32;
336 let max_grad_norm = 1.0f32;
337
338 let mut actor = Actor::new(state_dim, action_dim, Some(111));
339 let mut critic = Critic::new(state_dim, Some(222));
340 let mut actor_opt = Adam::with_learning_rate(3e-4);
341 for p in actor.parameters() {
342 actor_opt.add_parameter(p);
343 }
344 let mut critic_opt = Adam::with_learning_rate(3e-4);
345 for p in critic.parameters() {
346 critic_opt.add_parameter(p);
347 }
348
349 let mut env = YardEnv::new(1234);
350 let mut rng = SmallRng::new(98765);
351 let mut state = env.reset();
352 let mut episode_return = 0.0f32;
353 let mut episode = 0usize;
354 let mut ema_return: Option<f32> = None;
355 let ema_alpha = 0.05f32;
356 let mut best_return = f32::NEG_INFINITY;
357
358 let mut t = 0usize;
359 while t < total_steps {
360 let mut batch = RolloutBatch::new(horizon, state_dim);
361 for _ in 0..horizon {
362 // Actor logits and categorical sampling
363 let logits = actor.forward(&state); // [1, A]
364 let probs = logits.softmax(1); // [1, A]
365 // sample action from probs (CPU sampling)
366 let p = probs.data();
367 let (p0, p1, _p2) = (p[0], p[1], p[2]);
368 let u = rng.next_f32();
369 let a_idx = if u < p0 {
370 0
371 } else if u < p0 + p1 {
372 1
373 } else {
374 2
375 };
376
377 let old_logp = {
378 let _ng = NoGradTrack::new();
379 let lp = log_prob_actions(&logits, &[a_idx], 1, action_dim);
380 lp.data()[0]
381 };
382
383 // Step env
384 let (next_state, reward, done) = env.step(a_idx);
385 episode_return += reward;
386
387 // Critic value
388 let value_t = critic.forward(&state);
389 let value_v = value_t.data()[0];
390
391 batch.push(
392 state.data(),
393 a_idx,
394 old_logp,
395 reward,
396 if done { 1.0 } else { 0.0 },
397 value_v,
398 next_state.data(),
399 );
400
401 state = if done {
402 let st = env.reset();
403 ema_return = Some(match ema_return {
404 None => episode_return,
405 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
406 });
407 if episode_return > best_return {
408 best_return = episode_return;
409 }
410 println!(
411 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
412 t,
413 episode,
414 episode_return,
415 ema_return.unwrap_or(episode_return),
416 best_return
417 );
418 episode_return = 0.0;
419 episode += 1;
420 st
421 } else {
422 next_state
423 };
424
425 t += 1;
426 if t >= total_steps {
427 break;
428 }
429 }
430
431 // Bootstrap values for GAE
432 let next_values: Vec<f32> = {
433 let mut out = Vec::with_capacity(batch.len());
434 for i in 0..batch.len() {
435 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
436 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
437 out.push(critic.forward(&s2_t).data()[0]);
438 }
439 out
440 };
441
442 let mut returns = vec![0.0f32; batch.len()];
443 let mut adv = vec![0.0f32; batch.len()];
444 compute_gae(
445 &mut returns,
446 &mut adv,
447 &batch.rewards,
448 &batch.dones,
449 &batch.values,
450 &next_values,
451 gamma,
452 lam,
453 );
454 normalize_in_place(&mut adv, 1e-8);
455
456 // Tensors for training
457 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
458 let actions_vec = batch.actions.clone();
459 let old_logp_t = Tensor::from_slice(&batch.old_logps, vec![batch.len(), 1]).unwrap();
460 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
461 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
462
463 // PPO epochs
464 let num_minibatches = batch.len().div_ceil(mini_batch_size);
465 for e in 0..epochs {
466 for mb in 0..num_minibatches {
467 let start = mb * mini_batch_size;
468 let end = (start + mini_batch_size).min(batch.len());
469 if start >= end {
470 break;
471 }
472
473 // Views
474 let s_mb = states_t
475 .slice_view(start * state_dim, 1, (end - start) * state_dim)
476 .reshape(vec![(end - start) as i32, state_dim as i32]);
477 let oldlp_mb = old_logp_t
478 .slice_view(start, 1, end - start)
479 .reshape(vec![(end - start) as i32, 1]);
480 let ret_mb = returns_t
481 .slice_view(start, 1, end - start)
482 .reshape(vec![(end - start) as i32, 1]);
483 let adv_mb = adv_t
484 .slice_view(start, 1, end - start)
485 .reshape(vec![(end - start) as i32, 1]);
486 let a_slice = &actions_vec[start..end];
487
488 // Zero grads
489 {
490 let mut ps = actor.parameters();
491 actor_opt.zero_grad(&mut ps);
492 }
493 {
494 let mut ps = critic.parameters();
495 critic_opt.zero_grad(&mut ps);
496 }
497
498 // Forward
499 let logits_mb = actor.forward(&s_mb); // [B,A]
500 let new_logp_mb = log_prob_actions(&logits_mb, a_slice, end - start, action_dim); // [B,1]
501 let ratio = ratio_from_logps(&new_logp_mb, &oldlp_mb);
502 let ratio_clipped = clamp_ratio(&ratio, clip_eps);
503 let pg1 = ratio.mul_tensor(&adv_mb);
504 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
505 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
506 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
507 let actor_loss = actor_min.mul_scalar(-1.0).mean();
508
509 let v_pred = critic.forward(&s_mb);
510 let v_loss = v_pred
511 .sub_tensor(&ret_mb)
512 .pow_scalar(2.0)
513 .mean()
514 .mul_scalar(vf_coef);
515
516 // Entropy bonus from logits (categorical entropy) ≈ -sum p*logp
517 let probs_mb = logits_mb.softmax(1);
518 let logp_all = probs_mb.add_scalar(1e-8).log();
519 let ent = probs_mb
520 .mul_tensor(&logp_all)
521 .sum_dims(&[1], true)
522 .mul_scalar(-1.0)
523 .mean()
524 .mul_scalar(ent_coef);
525
526 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&ent);
527 loss.backward(None);
528
529 // Step actor
530 {
531 let params = actor.parameters();
532 let mut with_grads: Vec<&mut Tensor> = Vec::new();
533 for p in params {
534 if p.grad_owned().is_some() {
535 with_grads.push(p);
536 }
537 }
538 if !with_grads.is_empty() {
539 let _ = grad_global_norm(&mut with_grads);
540 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
541 actor_opt.step(&mut with_grads);
542 actor_opt.zero_grad(&mut with_grads);
543 }
544 }
545
546 // Step critic
547 {
548 let params = critic.parameters();
549 let mut with_grads: Vec<&mut Tensor> = Vec::new();
550 for p in params {
551 if p.grad_owned().is_some() {
552 with_grads.push(p);
553 }
554 }
555 if !with_grads.is_empty() {
556 let _ = grad_global_norm(&mut with_grads);
557 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
558 critic_opt.step(&mut with_grads);
559 critic_opt.zero_grad(&mut with_grads);
560 }
561 }
562
563 if e == 0 && mb == 0 {
564 println!(
565 "update@t={} | actor_loss={:.4} v_loss={:.4}",
566 t,
567 actor_loss.value(),
568 v_loss.value()
569 );
570 }
571
572 clear_all_graphs_known();
573 }
574 }
575 }
576
577 println!("=== PPO discrete training finished ===");
578 Ok(())
579}
Sourcepub fn mul_scalar(&self, scalar: f32) -> Tensor
pub fn mul_scalar(&self, scalar: f32) -> Tensor
Broadcast multiplication with a scalar value.
Multiplies every element by the scalar: output[i] = self[i] * scalar
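Because the operation is a uniform element-wise scale, the gradient with respect to each element is simply the scalar. A minimal sketch of that interaction with gradient tracking, assuming the backward/grad API shown in the gradient-tracking example further down this page:
use train_station::Tensor;
let x = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap().with_requires_grad();
let mut loss = x.mul_scalar(3.0).sum();
loss.backward(None);
// d/dx (3 * x) = 3, so every gradient entry should be 3.0
println!("grad: {:?}", x.grad().map(|g| g.data()));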
§Arguments
scalar - Value to multiply with each element
§Returns
A new tensor with each element multiplied by the scalar
§Examples
§Basic Scalar Multiplication
use train_station::Tensor;
let a = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let b = a.mul_scalar(10.0);
assert_eq!(b.shape().dims(), vec![3]);
assert_eq!(b.get(&[0]), 10.0); // 1.0 * 10.0
assert_eq!(b.get(&[1]), 20.0); // 2.0 * 10.0
assert_eq!(b.get(&[2]), 30.0); // 3.0 * 10.0
§Negative Scalar Multiplication
use train_station::Tensor;
let a = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let b = a.mul_scalar(-2.0);
assert_eq!(b.shape().dims(), vec![3]);
assert_eq!(b.get(&[0]), -2.0); // 1.0 * -2.0
assert_eq!(b.get(&[1]), -4.0); // 2.0 * -2.0
assert_eq!(b.get(&[2]), -6.0); // 3.0 * -2.0
Examples found in repository?
53 pub fn new(input_size: usize, output_size: usize, seed: Option<u64>) -> Self {
54 // Xavier/Glorot initialization: scale by sqrt(1/input_size)
55 let scale = (1.0 / input_size as f32).sqrt();
56
57 let weight = Tensor::randn(vec![input_size, output_size], seed)
58 .mul_scalar(scale)
59 .with_requires_grad();
60 let bias = Tensor::zeros(vec![output_size]).with_requires_grad();
61
62 Self {
63 weight,
64 bias,
65 input_size,
66 output_size,
67 }
68 }
More examples
277fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
278 let mut total_sq = 0.0f32;
279 for p in parameters.iter() {
280 if let Some(g) = p.grad_owned() {
281 for &v in g.data() {
282 total_sq += v * v;
283 }
284 }
285 }
286 let norm = total_sq.sqrt();
287 if norm > max_norm {
288 let scale = max_norm / (norm + eps);
289 for p in parameters.iter_mut() {
290 if let Some(g) = p.grad_owned() {
291 p.set_grad(g.mul_scalar(scale));
292 }
293 }
294 }
295}
296
297fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
298 let mut total_sq = 0.0f32;
299 for p in parameters.iter_mut() {
300 if let Some(g) = p.grad_owned() {
301 for &v in g.data() {
302 total_sq += v * v;
303 }
304 }
305 }
306 total_sq.sqrt()
307}
308
309fn params_l2_norm(parameters: &mut [&mut Tensor]) -> f32 {
310 let _ng = NoGradTrack::new();
311 let mut total_sq = 0.0f32;
312 for p in parameters.iter_mut() {
313 for &v in p.data() {
314 total_sq += v * v;
315 }
316 }
317 total_sq.sqrt()
318}
319
320// Pseudo-Huber loss: sqrt(1 + diff^2) - 1 (smooth, robust)
321fn pseudo_huber_mean(diff: &Tensor) -> Tensor {
322 diff.pow_scalar(2.0)
323 .add_scalar(1.0)
324 .sqrt()
325 .sub_scalar(1.0)
326 .mean()
327}
328
329// -------------------------------
330// Main
331// -------------------------------
332
333pub fn main() -> Result<(), Box<dyn std::error::Error>> {
334 println!("=== DQN Example (YardEnv discrete) ===");
335
336 // Dims
337 let state_dim = 3usize;
338 let action_dim = 3usize;
339
340 // Hparams
341 let gamma = 0.99f32;
342 let batch_size = 64usize;
343 let start_steps = 200usize;
344 let target_update_interval = 200usize; // hard update cadence
345 let max_grad_norm = 1.0f32;
346 let mut epsilon = 1.0f32;
347 let eps_min = 0.05f32;
348 let eps_decay_steps = 2_000usize; // linear decay
349 let total_steps = std::env::var("DQN_STEPS")
350 .ok()
351 .and_then(|v| v.parse::<usize>().ok())
352 .unwrap_or(3000usize);
353
354 // Models
355 let mut q_net = QNet::new(state_dim, action_dim, Some(7));
356 let mut q_targ = QNet::new(state_dim, action_dim, Some(8));
357 q_targ.net.copy_from(&q_net.net);
358 q_targ.set_requires_grad_all(false);
359
360 // Optimizer
361 let mut q_opt = Adam::with_learning_rate(3e-4);
362 for p in q_net.parameters() {
363 q_opt.add_parameter(p);
364 }
365
366 // Replay + env
367 let mut rb = ReplayBuffer::new(100_000, state_dim);
368 let mut env = YardEnv::new(12345);
369 let mut rng = SmallRng::new(999_111);
370
371 // Metrics
372 let mut state = env.reset();
373 let mut episode_return = 0.0f32;
374 let mut episode = 0usize;
375 let mut ema_return: Option<f32> = None;
376 let ema_alpha = 0.05f32;
377 let mut best_return = f32::NEG_INFINITY;
378
379 for t in 0..total_steps {
380 // Epsilon-greedy action
381 let action_index = if t < start_steps || rng.next_f32() < epsilon {
382 rng.sample_index(action_dim)
383 } else {
384 let _ng = NoGradTrack::new();
385 let q_vals = q_net.forward(&state);
386 let row = q_vals.data();
387 let mut best_i = 0usize;
388 let mut best_v = row[0];
389 for (i, &r) in row.iter().enumerate().take(action_dim).skip(1) {
390 if r > best_v {
391 best_v = r;
392 best_i = i;
393 }
394 }
395 best_i
396 };
397
398 // Env step
399 let (next_state, reward, done) = env.step(action_index);
400 episode_return += reward;
401
402 // Store
403 let s_slice = state.data().to_vec();
404 let s2_slice = next_state.data().to_vec();
405 rb.push(
406 &s_slice,
407 action_index,
408 reward,
409 if done { 1.0 } else { 0.0 },
410 &s2_slice,
411 );
412
413 // Reset on done
414 state = if done {
415 let st = env.reset();
416 ema_return = Some(match ema_return {
417 None => episode_return,
418 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
419 });
420 if episode_return > best_return {
421 best_return = episode_return;
422 }
423 println!(
424 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3} | rb_size={}",
425 t,
426 episode,
427 episode_return,
428 ema_return.unwrap_or(episode_return),
429 best_return,
430 rb.size
431 );
432 episode_return = 0.0;
433 episode += 1;
434 st
435 } else {
436 next_state
437 };
438
439 // Epsilon linear decay
440 if t < eps_decay_steps {
441 epsilon = (1.0 - (t as f32) / (eps_decay_steps as f32)) * (1.0 - eps_min) + eps_min;
442 }
443
444 // Train
445 if rb.can_sample(batch_size) {
446 let (s, a_idx, r, d, s2) = rb.sample(batch_size, &mut rng);
447
448 // Double DQN target: a* = argmax_a Q_online(s2,a); y = r + (1-d)*gamma*Q_target(s2, a*)
449 let target_q = {
450 let _ng = NoGradTrack::new();
451 let q_online_s2 = q_net.forward(&s2);
452 // argmax per row (manual on CPU)
453 let row_stride = action_dim;
454 let qd = q_online_s2.data();
455 let mut next_actions: Vec<usize> = Vec::with_capacity(batch_size);
456 for i in 0..batch_size {
457 let base = i * row_stride;
458 let mut bi = 0usize;
459 let mut bv = qd[base];
460 for j in 1..action_dim {
461 let v = qd[base + j];
462 if v > bv {
463 bv = v;
464 bi = j;
465 }
466 }
467 next_actions.push(bi);
468 }
469 let q_targ_s2 = q_targ.forward(&s2);
470 let q_targ_g = q_targ_s2.gather(1, &next_actions, &[batch_size, 1]);
471 let not_done = Tensor::ones(vec![batch_size, 1]).sub_tensor(&d);
472 r.add_tensor(&not_done.mul_scalar(gamma).mul_tensor(&q_targ_g))
473 };
474
475 // Q(s,a) for current actions
476 // Zero grads first
477 {
478 let mut params = q_net.parameters();
479 q_opt.zero_grad(&mut params);
480 }
481
482 let q_all = q_net.forward(&s);
483 let q_sa = q_all.gather(1, &a_idx, &[batch_size, 1]);
484 let diff = q_sa.sub_tensor(&target_q);
485 let mut loss = pseudo_huber_mean(&diff);
486 loss.backward(None);
487
488 // Step (filter only params with grads)
489 {
490 let params = q_net.parameters();
491 let mut with_grads: Vec<&mut Tensor> = Vec::new();
492 for p in params {
493 if p.grad_owned().is_some() {
494 with_grads.push(p);
495 }
496 }
497 if !with_grads.is_empty() {
498 let gn = grad_global_norm(&mut with_grads);
499 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
500 q_opt.step(&mut with_grads);
501 q_opt.zero_grad(&mut with_grads);
502 if t % 100 == 0 {
503 let mut pn = q_net.parameters();
504 let pn_l2 = params_l2_norm(&mut pn);
505 let q_mean = q_all.mean().value();
506 println!(
507 "t={:5} | loss={:.4} | q_mean={:.3} | grad_norm={:.3} | param_norm={:.3} | eps={:.3}",
508 t, loss.value(), q_mean, gn, pn_l2, epsilon
509 );
510 }
511 }
512 }
513
514 // Target hard update
515 if t % target_update_interval == 0 {
516 q_targ.net.copy_from(&q_net.net);
517 }
518
519 // Clear graphs
520 clear_all_graphs_known();
521 }
522 }
523
524 println!("=== DQN training finished ===");
525 Ok(())
526}
252fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
253 let mut total_sq = 0.0f32;
254 for p in parameters.iter() {
255 if let Some(g) = p.grad_owned() {
256 for &v in g.data() {
257 total_sq += v * v;
258 }
259 }
260 }
261 let norm = total_sq.sqrt();
262 if norm > max_norm {
263 let scale = max_norm / (norm + eps);
264 for p in parameters.iter_mut() {
265 if let Some(g) = p.grad_owned() {
266 p.set_grad(g.mul_scalar(scale));
267 }
268 }
269 }
270}
271
272// log-softmax for selected actions: given logits [B,A] and actions Vec<usize> -> log_prob [B,1]
273fn log_prob_actions(
274 logits: &Tensor,
275 actions: &[usize],
276 batch: usize,
277 _action_dim: usize,
278) -> Tensor {
279 let max_logits = logits.max_dims(&[1], true); // [B,1]
280 let shifted = logits.sub_tensor(&max_logits);
281 let exp = shifted.exp();
282 let sum_exp = exp.sum_dims(&[1], true); // [B,1]
283 let log_sum_exp = sum_exp.log(); // [B,1]
284 let log_softmax = shifted.sub_tensor(&log_sum_exp); // [B,A]
285 // gather selected action log-probs
286 log_softmax.gather(1, actions, &[batch, 1])
287}
288
289// probability ratio = exp(new_logp - old_logp)
290fn ratio_from_logps(new_logp: &Tensor, old_logp: &Tensor) -> Tensor {
291 new_logp.sub_tensor(old_logp).exp()
292}
293
294// Clamp ratio to [1-clip, 1+clip] using ReLU-based clamp (no custom ops)
295fn clamp_ratio(ratio: &Tensor, clip_eps: f32) -> Tensor {
296 let b = ratio.shape().dims()[0];
297 let low = Tensor::from_slice(&vec![1.0 - clip_eps; b], vec![b, 1]).unwrap();
298 let high = Tensor::from_slice(&vec![1.0 + clip_eps; b], vec![b, 1]).unwrap();
299 let ge_low = ratio.sub_tensor(&low).relu().add_tensor(&low);
300 high.sub_tensor(&high.sub_tensor(&ge_low).relu()) // min(ge_low, high) = high - relu(high - ge_low)
301}
302
303fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
304 let mut total_sq = 0.0f32;
305 for p in parameters.iter_mut() {
306 if let Some(g) = p.grad_owned() {
307 for &v in g.data() {
308 total_sq += v * v;
309 }
310 }
311 }
312 total_sq.sqrt()
313}
314
315// -------------------------------
316// Main
317// -------------------------------
318
319pub fn main() -> Result<(), Box<dyn std::error::Error>> {
320 println!("=== PPO Discrete Example (YardEnv) ===");
321
322 let state_dim = 3usize;
323 let action_dim = 3usize;
324 let total_steps = std::env::var("PPOD_STEPS")
325 .ok()
326 .and_then(|v| v.parse::<usize>().ok())
327 .unwrap_or(3500usize);
328 let horizon = 128usize;
329 let epochs = 4usize;
330 let mini_batch_size = 64usize;
331 let gamma = 0.99f32;
332 let lam = 0.95f32;
333 let clip_eps = 0.2f32;
334 let vf_coef = 0.5f32;
335 let ent_coef = 0.0f32;
336 let max_grad_norm = 1.0f32;
337
338 let mut actor = Actor::new(state_dim, action_dim, Some(111));
339 let mut critic = Critic::new(state_dim, Some(222));
340 let mut actor_opt = Adam::with_learning_rate(3e-4);
341 for p in actor.parameters() {
342 actor_opt.add_parameter(p);
343 }
344 let mut critic_opt = Adam::with_learning_rate(3e-4);
345 for p in critic.parameters() {
346 critic_opt.add_parameter(p);
347 }
348
349 let mut env = YardEnv::new(1234);
350 let mut rng = SmallRng::new(98765);
351 let mut state = env.reset();
352 let mut episode_return = 0.0f32;
353 let mut episode = 0usize;
354 let mut ema_return: Option<f32> = None;
355 let ema_alpha = 0.05f32;
356 let mut best_return = f32::NEG_INFINITY;
357
358 let mut t = 0usize;
359 while t < total_steps {
360 let mut batch = RolloutBatch::new(horizon, state_dim);
361 for _ in 0..horizon {
362 // Actor logits and categorical sampling
363 let logits = actor.forward(&state); // [1, A]
364 let probs = logits.softmax(1); // [1, A]
365 // sample action from probs (CPU sampling)
366 let p = probs.data();
367 let (p0, p1, _p2) = (p[0], p[1], p[2]);
368 let u = rng.next_f32();
369 let a_idx = if u < p0 {
370 0
371 } else if u < p0 + p1 {
372 1
373 } else {
374 2
375 };
376
377 let old_logp = {
378 let _ng = NoGradTrack::new();
379 let lp = log_prob_actions(&logits, &[a_idx], 1, action_dim);
380 lp.data()[0]
381 };
382
383 // Step env
384 let (next_state, reward, done) = env.step(a_idx);
385 episode_return += reward;
386
387 // Critic value
388 let value_t = critic.forward(&state);
389 let value_v = value_t.data()[0];
390
391 batch.push(
392 state.data(),
393 a_idx,
394 old_logp,
395 reward,
396 if done { 1.0 } else { 0.0 },
397 value_v,
398 next_state.data(),
399 );
400
401 state = if done {
402 let st = env.reset();
403 ema_return = Some(match ema_return {
404 None => episode_return,
405 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
406 });
407 if episode_return > best_return {
408 best_return = episode_return;
409 }
410 println!(
411 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
412 t,
413 episode,
414 episode_return,
415 ema_return.unwrap_or(episode_return),
416 best_return
417 );
418 episode_return = 0.0;
419 episode += 1;
420 st
421 } else {
422 next_state
423 };
424
425 t += 1;
426 if t >= total_steps {
427 break;
428 }
429 }
430
431 // Bootstrap values for GAE
432 let next_values: Vec<f32> = {
433 let mut out = Vec::with_capacity(batch.len());
434 for i in 0..batch.len() {
435 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
436 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
437 out.push(critic.forward(&s2_t).data()[0]);
438 }
439 out
440 };
441
442 let mut returns = vec![0.0f32; batch.len()];
443 let mut adv = vec![0.0f32; batch.len()];
444 compute_gae(
445 &mut returns,
446 &mut adv,
447 &batch.rewards,
448 &batch.dones,
449 &batch.values,
450 &next_values,
451 gamma,
452 lam,
453 );
454 normalize_in_place(&mut adv, 1e-8);
455
456 // Tensors for training
457 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
458 let actions_vec = batch.actions.clone();
459 let old_logp_t = Tensor::from_slice(&batch.old_logps, vec![batch.len(), 1]).unwrap();
460 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
461 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
462
463 // PPO epochs
464 let num_minibatches = batch.len().div_ceil(mini_batch_size);
465 for e in 0..epochs {
466 for mb in 0..num_minibatches {
467 let start = mb * mini_batch_size;
468 let end = (start + mini_batch_size).min(batch.len());
469 if start >= end {
470 break;
471 }
472
473 // Views
474 let s_mb = states_t
475 .slice_view(start * state_dim, 1, (end - start) * state_dim)
476 .reshape(vec![(end - start) as i32, state_dim as i32]);
477 let oldlp_mb = old_logp_t
478 .slice_view(start, 1, end - start)
479 .reshape(vec![(end - start) as i32, 1]);
480 let ret_mb = returns_t
481 .slice_view(start, 1, end - start)
482 .reshape(vec![(end - start) as i32, 1]);
483 let adv_mb = adv_t
484 .slice_view(start, 1, end - start)
485 .reshape(vec![(end - start) as i32, 1]);
486 let a_slice = &actions_vec[start..end];
487
488 // Zero grads
489 {
490 let mut ps = actor.parameters();
491 actor_opt.zero_grad(&mut ps);
492 }
493 {
494 let mut ps = critic.parameters();
495 critic_opt.zero_grad(&mut ps);
496 }
497
498 // Forward
499 let logits_mb = actor.forward(&s_mb); // [B,A]
500 let new_logp_mb = log_prob_actions(&logits_mb, a_slice, end - start, action_dim); // [B,1]
501 let ratio = ratio_from_logps(&new_logp_mb, &oldlp_mb);
502 let ratio_clipped = clamp_ratio(&ratio, clip_eps);
503 let pg1 = ratio.mul_tensor(&adv_mb);
504 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
505 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
506 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
507 let actor_loss = actor_min.mul_scalar(-1.0).mean();
508
509 let v_pred = critic.forward(&s_mb);
510 let v_loss = v_pred
511 .sub_tensor(&ret_mb)
512 .pow_scalar(2.0)
513 .mean()
514 .mul_scalar(vf_coef);
515
516 // Entropy bonus from logits (categorical entropy) ≈ -sum p*logp
517 let probs_mb = logits_mb.softmax(1);
518 let logp_all = probs_mb.add_scalar(1e-8).log();
519 let ent = probs_mb
520 .mul_tensor(&logp_all)
521 .sum_dims(&[1], true)
522 .mul_scalar(-1.0)
523 .mean()
524 .mul_scalar(ent_coef);
525
526 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&ent);
527 loss.backward(None);
528
529 // Step actor
530 {
531 let params = actor.parameters();
532 let mut with_grads: Vec<&mut Tensor> = Vec::new();
533 for p in params {
534 if p.grad_owned().is_some() {
535 with_grads.push(p);
536 }
537 }
538 if !with_grads.is_empty() {
539 let _ = grad_global_norm(&mut with_grads);
540 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
541 actor_opt.step(&mut with_grads);
542 actor_opt.zero_grad(&mut with_grads);
543 }
544 }
545
546 // Step critic
547 {
548 let params = critic.parameters();
549 let mut with_grads: Vec<&mut Tensor> = Vec::new();
550 for p in params {
551 if p.grad_owned().is_some() {
552 with_grads.push(p);
553 }
554 }
555 if !with_grads.is_empty() {
556 let _ = grad_global_norm(&mut with_grads);
557 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
558 critic_opt.step(&mut with_grads);
559 critic_opt.zero_grad(&mut with_grads);
560 }
561 }
562
563 if e == 0 && mb == 0 {
564 println!(
565 "update@t={} | actor_loss={:.4} v_loss={:.4}",
566 t,
567 actor_loss.value(),
568 v_loss.value()
569 );
570 }
571
572 clear_all_graphs_known();
573 }
574 }
575 }
576
577 println!("=== PPO discrete training finished ===");
578 Ok(())
579}
23fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
24 let mut total_sq = 0.0f32;
25 for p in parameters.iter() {
26 if let Some(g) = p.grad_owned() {
27 for &v in g.data() {
28 total_sq += v * v;
29 }
30 }
31 }
32 let norm = total_sq.sqrt();
33 if norm > max_norm {
34 let scale = max_norm / (norm + eps);
35 for p in parameters.iter_mut() {
36 if let Some(g) = p.grad_owned() {
37 p.set_grad(g.mul_scalar(scale));
38 }
39 }
40 }
41}
42
43fn accuracy(pred: &Tensor, targets: &Tensor) -> f32 {
44 // pred: [B,1] with sigmoid; threshold at 0.5
45 let p = pred.data();
46 let t = targets.data();
47 let mut correct = 0usize;
48 for i in 0..p.len() {
49 let yhat = if p[i] >= 0.5 { 1.0 } else { 0.0 };
50 if (yhat - t[i]).abs() < 1e-6 {
51 correct += 1;
52 }
53 }
54 correct as f32 / (p.len() as f32)
55}
56
57// Numerically stable BCE with logits:
58// L = mean( relu(z) - z*y + log(1 + exp(-|z|)) )
59fn bce_with_logits(logits: &Tensor, targets: &Tensor) -> Tensor {
60 let relu_z = logits.relu();
61 let zy = logits.mul_tensor(targets);
62 // |z| = relu(z) + relu(-z)
63 let abs_z = relu_z.add_tensor(&logits.mul_scalar(-1.0).relu());
64 let log_term = abs_z.mul_scalar(-1.0).exp().add_scalar(1.0).log();
65 relu_z.sub_tensor(&zy).add_tensor(&log_term).mean()
66}
23fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
24 let mut total_sq = 0.0f32;
25 for p in parameters.iter() {
26 if let Some(g) = p.grad_owned() {
27 for &v in g.data() {
28 total_sq += v * v;
29 }
30 }
31 }
32 let norm = total_sq.sqrt();
33 if norm > max_norm {
34 let scale = max_norm / (norm + eps);
35 for p in parameters.iter_mut() {
36 if let Some(g) = p.grad_owned() {
37 p.set_grad(g.mul_scalar(scale));
38 }
39 }
40 }
41}
42
43// Cross-entropy over logits: CE = -mean(log_softmax(logits)[range, labels])
44fn cross_entropy_logits(
45 logits: &Tensor,
46 labels: &[usize],
47 batch: usize,
48 _num_classes: usize,
49) -> Tensor {
50 // log_softmax = logits - logsumexp(logits, dim=1)
51 let max_logits = logits.max_dims(&[1], true);
52 let shifted = logits.sub_tensor(&max_logits);
53 let exp = shifted.exp();
54 let sum_exp = exp.sum_dims(&[1], true);
55 let log_sum_exp = sum_exp.log();
56 let log_softmax = shifted.sub_tensor(&log_sum_exp);
57 let ll = log_softmax.gather(1, labels, &[batch, 1]); // selected log-probs
58 ll.mul_scalar(-1.0).mean()
59}
22fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
23 let mut total_sq = 0.0f32;
24 for p in parameters.iter() {
25 if let Some(g) = p.grad_owned() {
26 for &v in g.data() {
27 total_sq += v * v;
28 }
29 }
30 }
31 let norm = total_sq.sqrt();
32 if norm > max_norm {
33 let scale = max_norm / (norm + eps);
34 for p in parameters.iter_mut() {
35 if let Some(g) = p.grad_owned() {
36 p.set_grad(g.mul_scalar(scale));
37 }
38 }
39 }
40}
Source§impl Tensor
impl Tensor
Sourcepub fn pow_scalar(&self, exponent: f32) -> Tensor
pub fn pow_scalar(&self, exponent: f32) -> Tensor
Raises each element to a scalar power.
Computes element-wise power: output[i] = self[i]^exponent
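The element-wise rule carries through to gradients (d/dx x^n = n * x^(n-1)). A minimal sketch, assuming the backward/grad API shown in the gradient-tracking example further down this page:
use train_station::Tensor;
let x = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap().with_requires_grad();
let mut loss = x.pow_scalar(3.0).sum();
loss.backward(None);
// d/dx sum(x^3) = 3 * x^2, so the expected gradient is [3.0, 12.0, 27.0]
println!("grad: {:?}", x.grad().map(|g| g.data()));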
§Arguments
exponent - The scalar exponent to raise each element to
§Returns
A new tensor with each element raised to the given power
§Examples
§Basic Scalar Power
use train_station::Tensor;
let a = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let b = a.pow_scalar(2.0);
assert_eq!(b.shape().dims(), vec![3]);
assert_eq!(b.get(&[0]), 1.0); // 1.0^2 = 1.0
assert_eq!(b.get(&[1]), 4.0); // 2.0^2 = 4.0
assert_eq!(b.get(&[2]), 9.0); // 3.0^2 = 9.0
§Square Root (Power 0.5)
use train_station::Tensor;
let a = Tensor::from_slice(&[1.0, 4.0, 9.0], vec![3]).unwrap();
let b = a.pow_scalar(0.5);
assert_eq!(b.shape().dims(), vec![3]);
assert_eq!(b.get(&[0]), 1.0); // sqrt(1.0) = 1.0
assert_eq!(b.get(&[1]), 2.0); // sqrt(4.0) = 2.0
assert_eq!(b.get(&[2]), 3.0); // sqrt(9.0) = 3.0
Examples found in repository?
42fn mse(pred: &Tensor, target: &Tensor) -> Tensor {
43 pred.sub_tensor(target).pow_scalar(2.0).mean()
44}
45
46fn rmse(pred: &Tensor, target: &Tensor) -> f32 {
47 mse(pred, target).sqrt().value()
48}
49
50fn r2_score(pred: &Tensor, target: &Tensor) -> f32 {
51 // R^2 = 1 - SS_res / SS_tot
52 let y = target;
53 let y_mean = y.mean();
54 let ss_res = pred.sub_tensor(y).pow_scalar(2.0).sum();
55 let ss_tot = y.sub_tensor(&y_mean).pow_scalar(2.0).sum();
56 let ss_res_v = ss_res.value();
57 let ss_tot_v = ss_tot.value().max(1e-12); // avoid divide by zero
58 1.0 - (ss_res_v / ss_tot_v)
59}
More examples
234fn gaussian_log_prob(action: &Tensor, mean: &Tensor, log_std: &Tensor) -> Tensor {
235 // All tensors shaped [B, A] (log_std is broadcastable)
236 let std = log_std.exp();
237 let var = std.pow_scalar(2.0);
238 let log_scale = log_std;
239 let diff = action.sub_tensor(mean);
240 let log_prob = diff
241 .pow_scalar(2.0)
242 .div_tensor(&var)
243 .add_scalar((2.0 * std::f32::consts::PI).ln()) // ln(2*pi) term of the Gaussian log-density
244 .add_tensor(&log_scale.mul_scalar(2.0))
245 .mul_scalar(0.5)
246 .mul_scalar(-1.0);
247 // Sum across action dim (dim=1) -> [B,1]
248 log_prob.sum_dims(&[1], true)
249}
250
251#[allow(clippy::too_many_arguments)]
252fn compute_gae(
253 returns_out: &mut [f32],
254 adv_out: &mut [f32],
255 rewards: &[f32],
256 dones: &[f32],
257 values: &[f32],
258 next_values: &[f32],
259 gamma: f32,
260 lam: f32,
261) {
262 let n = rewards.len();
263 let mut gae = 0.0f32;
264 for t in (0..n).rev() {
265 let not_done = 1.0 - dones[t];
266 let delta = rewards[t] + gamma * next_values[t] * not_done - values[t];
267 gae = delta + gamma * lam * not_done * gae;
268 adv_out[t] = gae;
269 returns_out[t] = gae + values[t];
270 }
271}
272
273fn normalize_in_place(x: &mut [f32], eps: f32) {
274 let n = x.len() as f32;
275 if n <= 1.0 {
276 return;
277 }
278 let mean = x.iter().copied().sum::<f32>() / n;
279 let var = x
280 .iter()
281 .map(|v| {
282 let d = v - mean;
283 d * d
284 })
285 .sum::<f32>()
286 / n;
287 let std = (var + eps).sqrt();
288 for v in x.iter_mut() {
289 *v = (*v - mean) / std;
290 }
291}
292
293fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
294 let mut total_sq = 0.0f32;
295 for p in parameters.iter() {
296 if let Some(g) = p.grad_owned() {
297 for &v in g.data() {
298 total_sq += v * v;
299 }
300 }
301 }
302 let norm = total_sq.sqrt();
303 if norm > max_norm {
304 let scale = max_norm / (norm + eps);
305 for p in parameters.iter_mut() {
306 if let Some(g) = p.grad_owned() {
307 p.set_grad(g.mul_scalar(scale));
308 }
309 }
310 }
311}
312
313fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
314 let mut total_sq = 0.0f32;
315 for p in parameters.iter_mut() {
316 if let Some(g) = p.grad_owned() {
317 for &v in g.data() {
318 total_sq += v * v;
319 }
320 }
321 }
322 total_sq.sqrt()
323}
324
325// -------------------------------
326// Main
327// -------------------------------
328
329pub fn main() -> Result<(), Box<dyn std::error::Error>> {
330 println!("=== PPO Continuous Example (YardEnv) ===");
331
332 let state_dim = 3usize;
333 let action_dim = 1usize;
334
335 // Hparams
336 let total_steps = std::env::var("PPO_STEPS")
337 .ok()
338 .and_then(|v| v.parse::<usize>().ok())
339 .unwrap_or(4000usize);
340 let horizon = 128usize; // rollout length per update
341 let epochs = 4usize; // PPO epochs per update
342 let mini_batch_size = 64usize; // minibatch from horizon
343 let gamma = 0.99f32;
344 let lam = 0.95f32; // GAE lambda
345 let clip_eps = 0.2f32;
346 let vf_coef = 0.5f32;
347 let ent_coef = 0.0f32;
348 let max_grad_norm = 1.0f32;
349
350 // Models
351 let mut actor = Actor::new(state_dim, action_dim, Some(101));
352 let mut critic = Critic::new(state_dim, Some(202));
353
354 // Opts
355 let mut actor_opt = Adam::with_learning_rate(3e-4);
356 for p in actor.parameters() {
357 actor_opt.add_parameter(p);
358 }
359 let mut critic_opt = Adam::with_learning_rate(3e-4);
360 for p in critic.parameters() {
361 critic_opt.add_parameter(p);
362 }
363
364 // Env and RNG
365 let mut env = YardEnv::new(42);
366 let mut rng = SmallRng::new(999);
367 let mut state = env.reset();
368
369 // Metrics
370 let mut episode_return = 0.0f32;
371 let mut episode = 0usize;
372 let mut ema_return: Option<f32> = None;
373 let ema_alpha = 0.05f32;
374 let mut best_return = f32::NEG_INFINITY;
375
376 let mut t = 0usize;
377 while t < total_steps {
378 // Collect a rollout
379 let mut batch = RolloutBatch::new(horizon, state_dim);
380 for _ in 0..horizon {
381 // Policy forward (detached sampling to not blow graph; we use stored log_probs)
382 let (mean, log_std_row) = actor.forward(&state);
383 let mean_v = mean.data()[0];
384 let log_std_v = log_std_row.data()[0];
385 let std_v = log_std_v.exp();
386 let noise = rng.normal();
387 let action_v = (mean_v + std_v * noise).clamp(-1.0, 1.0);
388
389 // Build action tensor [1, A] for log_prob calculation with autograd
390 let action_t = Tensor::from_slice(&[action_v], vec![1, action_dim]).unwrap();
391 let log_prob_t = gaussian_log_prob(&action_t, &mean, &log_std_row);
392 let log_prob_v = log_prob_t.data()[0];
393
394 // Step env
395 let (next_state, reward, done) = env.step(action_v);
396 episode_return += reward;
397
398 // Value
399 let value_t = critic.forward(&state);
400 let value_v = value_t.data()[0];
401
402 // Push
403 batch.push(
404 state.data(),
405 action_v,
406 log_prob_v,
407 reward,
408 if done { 1.0 } else { 0.0 },
409 value_v,
410 next_state.data(),
411 );
412
413 // Reset
414 state = if done {
415 let st = env.reset();
416 ema_return = Some(match ema_return {
417 None => episode_return,
418 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
419 });
420 if episode_return > best_return {
421 best_return = episode_return;
422 }
423 println!(
424 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
425 t,
426 episode,
427 episode_return,
428 ema_return.unwrap_or(episode_return),
429 best_return
430 );
431 episode_return = 0.0;
432 episode += 1;
433 st
434 } else {
435 next_state
436 };
437
438 t += 1;
439 if t >= total_steps {
440 break;
441 }
442 }
443
444 // Bootstrap next values for GAE
445 let next_values: Vec<f32> = {
446 let mut out = Vec::with_capacity(batch.len());
447 for i in 0..batch.len() {
448 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
449 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
450 let v2 = critic.forward(&s2_t).data()[0];
451 out.push(v2);
452 }
453 out
454 };
455
456 // Compute returns and advantages
457 let mut returns = vec![0.0f32; batch.len()];
458 let mut adv = vec![0.0f32; batch.len()];
459 compute_gae(
460 &mut returns,
461 &mut adv,
462 &batch.rewards,
463 &batch.dones,
464 &batch.values,
465 &next_values,
466 gamma,
467 lam,
468 );
469 normalize_in_place(&mut adv, 1e-8);
470
471 // Prepare tensors for training
472 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
473 let actions_t = Tensor::from_slice(&batch.actions, vec![batch.len(), action_dim]).unwrap();
474 let old_logp_t = Tensor::from_slice(&batch.log_probs, vec![batch.len(), 1]).unwrap();
475 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
476 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
477
478 // PPO epochs over the rollout
479 let num_minibatches = batch.len().div_ceil(mini_batch_size);
480 for e in 0..epochs {
481 for mb in 0..num_minibatches {
482 let start = mb * mini_batch_size;
483 let end = (start + mini_batch_size).min(batch.len());
484 if start >= end {
485 break;
486 }
487
488 // Slice views
489 let s_mb = states_t.slice_view(start * state_dim, 1, (end - start) * state_dim);
490 let s_mb = s_mb.reshape(vec![(end - start) as i32, state_dim as i32]);
491 let a_mb = actions_t
492 .slice_view(start * action_dim, 1, (end - start) * action_dim)
493 .reshape(vec![(end - start) as i32, action_dim as i32]);
494 let oldlp_mb = old_logp_t
495 .slice_view(start, 1, end - start)
496 .reshape(vec![(end - start) as i32, 1]);
497 let ret_mb = returns_t
498 .slice_view(start, 1, end - start)
499 .reshape(vec![(end - start) as i32, 1]);
500 let adv_mb = adv_t
501 .slice_view(start, 1, end - start)
502 .reshape(vec![(end - start) as i32, 1]);
503
504 // Zero grads
505 {
506 let mut ps = actor.parameters();
507 actor_opt.zero_grad(&mut ps);
508 }
509 {
510 let mut ps = critic.parameters();
511 critic_opt.zero_grad(&mut ps);
512 }
513
514 // Forward actor and critic
515 let (mean_mb, log_std_row) = actor.forward(&s_mb);
516 let logp_mb = gaussian_log_prob(&a_mb, &mean_mb, &log_std_row);
517 let ratio = logp_mb.sub_tensor(&oldlp_mb).exp(); // exp(new-old)
518 let clip_low =
519 Tensor::from_slice(&vec![1.0 - clip_eps; end - start], vec![end - start, 1])
520 .unwrap();
521 let clip_high =
522 Tensor::from_slice(&vec![1.0 + clip_eps; end - start], vec![end - start, 1])
523 .unwrap();
524 // ratio_clipped = min(max(ratio, low), high) using ReLU identities
525 let ratio_ge_low = ratio.sub_tensor(&clip_low).relu().add_tensor(&clip_low);
526 let ratio_clipped =
527 clip_high.sub_tensor(&clip_high.sub_tensor(&ratio_ge_low).relu()); // min(ratio_ge_low, clip_high)
528 let pg1 = ratio.mul_tensor(&adv_mb);
529 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
530 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
531 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
532 let actor_loss = actor_min.mul_scalar(-1.0).mean();
533
534 let v_pred = critic.forward(&s_mb);
535 let v_loss = v_pred
536 .sub_tensor(&ret_mb)
537 .pow_scalar(2.0)
538 .mean()
539 .mul_scalar(vf_coef);
540
541 // Entropy (approx Gaussian entropy per action)
542 let entropy = log_std_row
543 .add_scalar(0.5 * (2.0 * std::f32::consts::PI * std::f32::consts::E).ln())
544 .sum_dims(&[1], true)
545 .mean()
546 .mul_scalar(ent_coef);
547
548 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&entropy);
549 loss.backward(None);
550
551 // Step actor
552 {
553 let params = actor.parameters();
554 let mut with_grads: Vec<&mut Tensor> = Vec::new();
555 for p in params {
556 if p.grad_owned().is_some() {
557 with_grads.push(p);
558 }
559 }
560 if !with_grads.is_empty() {
561 let _ = grad_global_norm(&mut with_grads);
562 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
563 actor_opt.step(&mut with_grads);
564 actor_opt.zero_grad(&mut with_grads);
565 }
566 }
567
568 // Step critic
569 {
570 let params = critic.parameters();
571 let mut with_grads: Vec<&mut Tensor> = Vec::new();
572 for p in params {
573 if p.grad_owned().is_some() {
574 with_grads.push(p);
575 }
576 }
577 if !with_grads.is_empty() {
578 let _ = grad_global_norm(&mut with_grads);
579 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
580 critic_opt.step(&mut with_grads);
581 critic_opt.zero_grad(&mut with_grads);
582 }
583 }
584
585 // Occasionally log
586 if e == 0 && mb == 0 {
587 println!(
588 "update@t={} | actor_loss={:.4} v_loss={:.4}",
589 t,
590 actor_loss.value(),
591 v_loss.value()
592 );
593 }
594
595 clear_all_graphs_known();
596 }
597 }
598 }
599
600 println!("=== PPO training finished ===");
601 Ok(())
602}
148 pub fn train_non_autoregressive_steps(
149 &mut self,
150 src: &Tensor,
151 tgt: &Tensor,
152 steps: usize,
153 lr: f32,
154 ) {
155 let mut opt = Adam::with_learning_rate(lr);
156 {
157 let params_once = self.parameters();
158 for p in &params_once {
159 opt.add_parameter(p);
160 }
161 }
162 for step in 0..steps {
163 // forward + backward scope (immutable borrow)
164 {
165 let pred = self.forward(src, tgt);
166 let diff = pred.sub_tensor(tgt);
167 let mut loss = diff.pow_scalar(2.0).mean();
168 if step == 0 || step + 1 == steps {
169 println!("NAR train step {}: loss={:.6}", step, loss.value());
170 }
171 loss.backward(None);
172 }
173 // step + zero_grad scope (mutable borrow)
174 let mut params_step = self.parameters();
175 opt.step(&mut params_step);
176 opt.zero_grad(&mut params_step);
177 }
178 }
179
180 /// Auto-regressive training (teacher forcing): predict next token with causal mask
181 pub fn train_autoregressive_steps(
182 &mut self,
183 src: &Tensor,
184 tgt: &Tensor,
185 steps: usize,
186 lr: f32,
187 ) {
188 let mut opt = Adam::with_learning_rate(lr);
189 {
190 let params_once = self.parameters();
191 for p in &params_once {
192 opt.add_parameter(p);
193 }
194 }
195
196 // Build encoder memory once (static dataset demo)
197 let mut memory = src.clone();
198 for enc in &self.encoders {
199 memory = enc.forward(&memory, None);
200 }
201
202 let (b, t, _e) = Self::triple(tgt);
203 // Predict y[t] from y[:t] using causal mask; here we simply predict full seq with mask
204 let causal = Self::build_causal_mask_static(b, self.num_heads, t);
205 for step in 0..steps {
206 // forward + backward scope
207 {
208 let mut out = tgt.clone();
209 for dec in &self.decoders {
210 out = dec.forward(&out, &memory, Some(&causal), None);
211 }
212 let diff = out.sub_tensor(tgt);
213 let mut loss = diff.pow_scalar(2.0).mean();
214 if step == 0 || step + 1 == steps {
215 println!("AR train step {}: loss={:.6}", step, loss.value());
216 }
217 loss.backward(None);
218 }
219 let mut params_step = self.parameters();
220 opt.step(&mut params_step);
221 opt.zero_grad(&mut params_step);
222 }
223 }
131fn demonstrate_standard_methods() -> Result<(), Box<dyn std::error::Error>> {
132 println!("\n--- Standard Iterator Methods ---");
133
134 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5])?;
135
136 // Using map for transformations
137 println!("\nMap transformation (square each element):");
138 let squared: Tensor = tensor.iter().map(|elem| elem.pow_scalar(2.0)).collect();
139 println!(" Squared: {:?}", squared.data());
140
141 // Using enumerate for indexed operations
142 println!("\nEnumerate with indexed operations:");
143 let indexed: Tensor = tensor
144 .iter()
145 .enumerate()
146 .map(|(i, elem)| elem.add_scalar(i as f32))
147 .collect();
148 println!(" Indexed: {:?}", indexed.data());
149
150 // Using fold for reduction
151 println!("\nFold for sum calculation:");
152 let sum: f32 = tensor.iter().fold(0.0, |acc, elem| acc + elem.value());
153 println!(" Sum: {:.1}", sum);
154
155 // Using find for element search
156 println!("\nFind specific element:");
157 if let Some(found) = tensor.iter().find(|elem| elem.value() == 3.0) {
158 println!(" Found element with value 3.0: {:.1}", found.value());
159 }
160
161 // Using any/all for condition checking
162 println!("\nCondition checking:");
163 let all_positive = tensor.iter().all(|elem| elem.value() > 0.0);
164 let any_large = tensor.iter().any(|elem| elem.value() > 4.0);
165 println!(" All positive: {}", all_positive);
166 println!(" Any > 4.0: {}", any_large);
167
168 Ok(())
169}
170
171/// Demonstrate gradient tracking through element operations
172///
173/// Shows how gradient tracking works seamlessly through iterator
174/// operations, maintaining the computational graph for backpropagation.
175fn demonstrate_gradient_tracking() -> Result<(), Box<dyn std::error::Error>> {
176 println!("\n--- Gradient Tracking ---");
177
178 // Create a tensor with gradient tracking enabled
179 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3])?.with_requires_grad();
180 println!("Input tensor (requires_grad): {:?}", tensor.data());
181
182 // Perform element-wise operations through iteration
183 let result: Tensor = tensor
184 .iter()
185 .map(|elem| {
186 // Apply a complex transformation: (x^2 + 1) * 2
187 elem.pow_scalar(2.0).add_scalar(1.0).mul_scalar(2.0)
188 })
189 .collect();
190
191 println!("Result tensor: {:?}", result.data());
192 println!("Result requires_grad: {}", result.requires_grad());
193
194 // Compute gradients
195 let mut loss = result.sum();
196 loss.backward(None);
197
198 println!("Loss: {:.6}", loss.value());
199 println!("Input gradients: {:?}", tensor.grad().map(|g| g.data()));
200
201 Ok(())
202}
84fn demonstrate_default_adam() -> Result<(), Box<dyn std::error::Error>> {
85 println!("--- Default Adam Configuration ---");
86
87 // Create a simple regression problem: y = 2*x + 1
88 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5, 1]).unwrap();
89 let y_true = Tensor::from_slice(&[3.0, 5.0, 7.0, 9.0, 11.0], vec![5, 1]).unwrap();
90
91 // Create model parameters
92 let mut weight = Tensor::randn(vec![1, 1], Some(42)).with_requires_grad();
93 let mut bias = Tensor::zeros(vec![1]).with_requires_grad();
94
95 // Create Adam optimizer with default configuration
96 let mut optimizer = Adam::new();
97 optimizer.add_parameter(&weight);
98 optimizer.add_parameter(&bias);
99
100 println!("Default Adam configuration:");
101 println!(" Learning rate: {}", optimizer.learning_rate());
102 println!(" Initial weight: {:.6}", weight.value());
103 println!(" Initial bias: {:.6}", bias.value());
104
105 // Training loop
106 let num_epochs = 50;
107 let mut losses = Vec::new();
108
109 for epoch in 0..num_epochs {
110 // Forward pass
111 let y_pred = x_data.matmul(&weight) + &bias;
112 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
113
114 // Backward pass
115 loss.backward(None);
116
117 // Optimizer step
118 optimizer.step(&mut [&mut weight, &mut bias]);
119 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
120
121 losses.push(loss.value());
122
123 if epoch % 10 == 0 || epoch == num_epochs - 1 {
124 println!("Epoch {:3}: Loss = {:.6}", epoch, loss.value());
125 }
126 }
127
128 // Evaluate final model
129 let _final_predictions = x_data.matmul(&weight) + &bias;
130 println!("\nFinal model:");
131 println!(" Learned weight: {:.6} (target: 2.0)", weight.value());
132 println!(" Learned bias: {:.6} (target: 1.0)", bias.value());
133 println!(" Final loss: {:.6}", losses[losses.len() - 1]);
134
135 Ok(())
136}
137
138/// Demonstrate learning rate comparison
139fn demonstrate_learning_rate_comparison() -> Result<(), Box<dyn std::error::Error>> {
140 println!("\n--- Learning Rate Comparison ---");
141
142 let learning_rates = [0.001, 0.01, 0.1];
143 let mut results = Vec::new();
144
145 for &lr in &learning_rates {
146 println!("\nTesting learning rate: {}", lr);
147
148 let stats = train_with_config(TrainingConfig {
149 learning_rate: lr,
150 ..Default::default()
151 })?;
152
153 results.push((lr, stats.clone()));
154
155 println!(" Final loss: {:.6}", stats.final_loss);
156 println!(" Convergence epoch: {}", stats.convergence_epoch);
157 }
158
159 // Compare results
160 println!("\nLearning Rate Comparison Summary:");
161 for (lr, stats) in &results {
162 println!(
163 " LR={:6}: Loss={:.6}, Converged@{}",
164 lr, stats.final_loss, stats.convergence_epoch
165 );
166 }
167
168 Ok(())
169}
170
171/// Demonstrate weight decay comparison
172fn demonstrate_weight_decay_comparison() -> Result<(), Box<dyn std::error::Error>> {
173 println!("\n--- Weight Decay Comparison ---");
174
175 let weight_decays = [0.0, 0.001, 0.01];
176 let mut results = Vec::new();
177
178 for &wd in &weight_decays {
179 println!("\nTesting weight decay: {}", wd);
180
181 let stats = train_with_config(TrainingConfig {
182 weight_decay: wd,
183 ..Default::default()
184 })?;
185
186 results.push((wd, stats.clone()));
187
188 println!(" Final loss: {:.6}", stats.final_loss);
189 println!(" Final weight norm: {:.6}", stats.weight_norm);
190 }
191
192 // Compare results
193 println!("\nWeight Decay Comparison Summary:");
194 for (wd, stats) in &results {
195 println!(
196 " WD={:6}: Loss={:.6}, Weight Norm={:.6}",
197 wd, stats.final_loss, stats.weight_norm
198 );
199 }
200
201 Ok(())
202}
203
204/// Demonstrate beta parameter tuning
205fn demonstrate_beta_parameter_tuning() -> Result<(), Box<dyn std::error::Error>> {
206 println!("\n--- Beta Parameter Tuning ---");
207
208 let beta_configs = [
209 (0.9, 0.999), // Default
210 (0.8, 0.999), // More aggressive momentum
211 (0.95, 0.999), // Less aggressive momentum
212 (0.9, 0.99), // Faster second moment decay
213 ];
214
215 let mut results = Vec::new();
216
217 for (i, (beta1, beta2)) in beta_configs.iter().enumerate() {
218 println!(
219 "\nTesting beta configuration {}: beta1={}, beta2={}",
220 i + 1,
221 beta1,
222 beta2
223 );
224
225 let config = TrainingConfig {
226 beta1: *beta1,
227 beta2: *beta2,
228 ..Default::default()
229 };
230
231 let stats = train_with_config(config)?;
232 results.push(((*beta1, *beta2), stats.clone()));
233
234 println!(" Final loss: {:.6}", stats.final_loss);
235 println!(" Convergence epoch: {}", stats.convergence_epoch);
236 }
237
238 // Compare results
239 println!("\nBeta Parameter Comparison Summary:");
240 for ((beta1, beta2), stats) in &results {
241 println!(
242 " B1={:4}, B2={:5}: Loss={:.6}, Converged@{}",
243 beta1, beta2, stats.final_loss, stats.convergence_epoch
244 );
245 }
246
247 Ok(())
248}
249
250/// Demonstrate configuration benchmarking
251fn demonstrate_configuration_benchmarking() -> Result<(), Box<dyn std::error::Error>> {
252 println!("\n--- Configuration Benchmarking ---");
253
254 // Define configurations to benchmark
255 let configs = vec![
256 (
257 "Conservative",
258 TrainingConfig {
259 learning_rate: 0.001,
260 weight_decay: 0.001,
261 beta1: 0.95,
262 ..Default::default()
263 },
264 ),
265 (
266 "Balanced",
267 TrainingConfig {
268 learning_rate: 0.01,
269 weight_decay: 0.0,
270 beta1: 0.9,
271 ..Default::default()
272 },
273 ),
274 (
275 "Aggressive",
276 TrainingConfig {
277 learning_rate: 0.1,
278 weight_decay: 0.0,
279 beta1: 0.8,
280 ..Default::default()
281 },
282 ),
283 ];
284
285 let mut benchmark_results = Vec::new();
286
287 for (name, config) in configs {
288 println!("\nBenchmarking {} configuration:", name);
289
290 let start_time = std::time::Instant::now();
291 let stats = train_with_config(config.clone())?;
292 let elapsed = start_time.elapsed();
293
294 println!(" Training time: {:.2}ms", elapsed.as_millis());
295 println!(" Final loss: {:.6}", stats.final_loss);
296 println!(" Convergence: {} epochs", stats.convergence_epoch);
297
298 benchmark_results.push((name.to_string(), stats, elapsed));
299 }
300
301 // Summary
302 println!("\nBenchmarking Summary:");
303 for (name, stats, elapsed) in &benchmark_results {
304 println!(
305 " {:12}: Loss={:.6}, Time={:4}ms, Converged@{}",
306 name,
307 stats.final_loss,
308 elapsed.as_millis(),
309 stats.convergence_epoch
310 );
311 }
312
313 Ok(())
314}
315
316/// Helper function to train with specific configuration
317fn train_with_config(config: TrainingConfig) -> Result<TrainingStats, Box<dyn std::error::Error>> {
318 // Create training data
319 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5, 1]).unwrap();
320 let y_true = Tensor::from_slice(&[3.0, 5.0, 7.0, 9.0, 11.0], vec![5, 1]).unwrap();
321
322 // Create model parameters
323 let mut weight = Tensor::randn(vec![1, 1], Some(123)).with_requires_grad();
324 let mut bias = Tensor::zeros(vec![1]).with_requires_grad();
325
326 // Create optimizer with custom configuration
327 let adam_config = AdamConfig {
328 learning_rate: config.learning_rate,
329 beta1: config.beta1,
330 beta2: config.beta2,
331 eps: 1e-8,
332 weight_decay: config.weight_decay,
333 amsgrad: false,
334 };
335
336 let mut optimizer = Adam::with_config(adam_config);
337 optimizer.add_parameter(&weight);
338 optimizer.add_parameter(&bias);
339
340 // Training loop
341 let mut losses = Vec::new();
342 let mut convergence_epoch = config.epochs;
343
344 for epoch in 0..config.epochs {
345 // Forward pass
346 let y_pred = x_data.matmul(&weight) + &bias;
347 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
348
349 // Backward pass
350 loss.backward(None);
351
352 // Optimizer step
353 optimizer.step(&mut [&mut weight, &mut bias]);
354 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
355
356 let loss_value = loss.value();
357 losses.push(loss_value);
358
359 // Check for convergence (loss < 0.01)
360 if loss_value < 0.01 && convergence_epoch == config.epochs {
361 convergence_epoch = epoch;
362 }
363 }
364
365 Ok(TrainingStats {
366 config,
367 final_loss: losses[losses.len() - 1],
368 loss_history: losses,
369 convergence_epoch,
370 weight_norm: weight.norm().value(),
371 })
372 }
- examples/optimizers/learning_rate_scheduling.rs
- examples/getting_started/optimizer_basics.rs
- examples/iterators/performance_optimization.rs
- examples/neural_networks/feedforward_network.rs
- examples/iterators/advanced_patterns.rs
- examples/RL_training/../neural_networks/basic_linear_layer.rs
- examples/RL_training/ppo_discrete.rs
- examples/RL_training/td3.rs
Sourcepub fn pow_tensor(&self, exponent: &Tensor) -> Tensor
pub fn pow_tensor(&self, exponent: &Tensor) -> Tensor
Element-wise power with tensor exponents.
Computes element-wise power: output[i] = self[i]^exponent[i]
§Arguments
exponent - Tensor of exponents, must have the same shape as self
§Returns
A new tensor with each element raised to the corresponding power
§Examples
§Basic Tensor Power
use train_station::Tensor;
let base = Tensor::from_slice(&[2.0, 3.0, 4.0], vec![3]).unwrap();
let exp = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let result = base.pow_tensor(&exp);
assert_eq!(result.shape().dims(), vec![3]);
assert_eq!(result.get(&[0]), 2.0); // 2.0^1.0 = 2.0
assert_eq!(result.get(&[1]), 9.0); // 3.0^2.0 = 9.0
assert_eq!(result.get(&[2]), 64.0); // 4.0^3.0 = 64.0
§Mixed Exponents
use train_station::Tensor;
let base = Tensor::from_slice(&[4.0, 9.0, 16.0], vec![3]).unwrap();
let exp = Tensor::from_slice(&[0.5, 1.0, 2.0], vec![3]).unwrap();
let result = base.pow_tensor(&exp);
assert_eq!(result.shape().dims(), vec![3]);
assert_eq!(result.get(&[0]), 2.0); // sqrt(4.0) = 2.0
assert_eq!(result.get(&[1]), 9.0); // 9.0^1.0 = 9.0
assert_eq!(result.get(&[2]), 256.0); // 16.0^2.0 = 256.0
§Panics
Panics if tensor shapes don’t match
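The training examples elsewhere in this documentation differentiate through pow_scalar, so pow_tensor can be expected to participate in gradtrack the same way. A hedged sketch (assuming gradient support for pow_tensor and the accessors with_requires_grad, backward, and grad_owned shown in those examples):
use train_station::Tensor;
let base = Tensor::from_slice(&[2.0, 3.0], vec![2]).unwrap().with_requires_grad();
let exp = Tensor::from_slice(&[2.0, 2.0], vec![2]).unwrap();
let mut loss = base.pow_tensor(&exp).mean();
loss.backward(None);
// d(mean(base^2))/d(base_i) = base_i, so the gradient should equal the base values
if let Some(grad) = base.grad_owned() {
    println!("gradient: {:?}", grad.data());
}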
Source§impl Tensor
impl Tensor
Sourcepub fn relu(&self) -> Tensor
pub fn relu(&self) -> Tensor
Element-wise ReLU (Rectified Linear Unit) activation.
Applies ReLU to each element: output[i] = max(0, self[i])
§Returns
A new tensor with ReLU applied to each element
§Examples
§Basic ReLU Activation
use train_station::Tensor;
let a = Tensor::from_slice(&[-1.0, 0.0, 2.5], vec![3]).unwrap();
let b = a.relu();
assert_eq!(b.shape().dims(), vec![3]);
assert_eq!(b.get(&[0]), 0.0); // max(0, -1.0) = 0.0
assert_eq!(b.get(&[1]), 0.0); // max(0, 0.0) = 0.0
assert_eq!(b.get(&[2]), 2.5); // max(0, 2.5) = 2.5
§Mixed Positive and Negative Values
use train_station::Tensor;
let a = Tensor::from_slice(&[-5.0, -0.1, 0.0, 0.1, 5.0], vec![5]).unwrap();
let b = a.relu();
assert_eq!(b.shape().dims(), vec![5]);
assert_eq!(b.get(&[0]), 0.0); // max(0, -5.0) = 0.0
assert_eq!(b.get(&[1]), 0.0); // max(0, -0.1) = 0.0
assert_eq!(b.get(&[2]), 0.0); // max(0, 0.0) = 0.0
assert_eq!(b.get(&[3]), 0.1); // max(0, 0.1) = 0.1
assert_eq!(b.get(&[4]), 5.0); // max(0, 5.0) = 5.0
Examples found in repository?
More examples
68 fn forward(&self, input: &Tensor) -> Tensor {
69 let mut current: Option<Tensor> = None;
70 for (i, layer) in self.layers.iter().enumerate() {
71 let out = if i == 0 {
72 layer.forward(input)
73 } else {
74 layer.forward(current.as_ref().unwrap())
75 };
76 let is_last = i + 1 == self.layers.len();
77 let out = if !is_last { out.relu() } else { out };
78 current = Some(out);
79 }
80 current.expect("MLP has at least one layer")
81 }
82 fn parameters(&mut self) -> Vec<&mut Tensor> {
83 self.layers
84 .iter_mut()
85 .flat_map(|l| l.parameters())
86 .collect()
87 }
88}
89
90// -------------------------------
91// Actor: mean = MLP(state); log_std is a learnable parameter tensor
92// -------------------------------
93
94struct Actor {
95 net: Mlp,
96 log_std: Tensor, // shape [action_dim]
97}
98impl Actor {
99 fn new(state_dim: usize, action_dim: usize, seed: Option<u64>) -> Self {
100 let net = Mlp::new(&[state_dim, 64, 64, action_dim], seed);
101 let log_std = Tensor::from_slice(&vec![0.0; action_dim], vec![action_dim])
102 .unwrap()
103 .with_requires_grad();
104 Self { net, log_std }
105 }
106 fn forward(&self, state: &Tensor) -> (Tensor, Tensor) {
107 // Returns (mean [B, A], log_std [A])
108 let mean = self.net.forward(state);
109 (
110 mean,
111 self.log_std
112 .view(vec![1, self.log_std.shape().dims()[0] as i32]),
113 )
114 }
115 fn parameters(&mut self) -> Vec<&mut Tensor> {
116 let mut ps = self.net.parameters();
117 ps.push(&mut self.log_std);
118 ps
119 }
120}
121
122// -------------------------------
123// Critic: value function V(s)
124// -------------------------------
125
126struct Critic {
127 net: Mlp,
128}
129impl Critic {
130 fn new(state_dim: usize, seed: Option<u64>) -> Self {
131 Self {
132 net: Mlp::new(&[state_dim, 64, 64, 1], seed),
133 }
134 }
135 fn forward(&self, state: &Tensor) -> Tensor {
136 self.net.forward(state)
137 }
138 fn parameters(&mut self) -> Vec<&mut Tensor> {
139 self.net.parameters()
140 }
141}
142
143// -------------------------------
144// Continuous YardEnv (same dynamics as TD3 env)
145// -------------------------------
146
147struct YardEnv {
148 pos: f32,
149 vel: f32,
150 steps: usize,
151 max_steps: usize,
152 rng: SmallRng,
153}
154impl YardEnv {
155 fn new(seed: u64) -> Self {
156 let mut e = Self {
157 pos: 0.0,
158 vel: 0.0,
159 steps: 0,
160 max_steps: 200,
161 rng: SmallRng::new(seed),
162 };
163 e.reset();
164 e
165 }
166 fn reset(&mut self) -> Tensor {
167 self.pos = (self.rng.next_f32() * 1.0) - 0.5;
168 self.vel = (self.rng.next_f32() * 0.2) - 0.1;
169 self.steps = 0;
170 self.state_tensor()
171 }
172 fn state_tensor(&self) -> Tensor {
173 Tensor::from_slice(&[self.pos, self.vel, 0.0], vec![1, 3]).unwrap()
174 }
175 fn step(&mut self, action_value: f32) -> (Tensor, f32, bool) {
176 let a = action_value.clamp(-1.0, 1.0);
177 self.vel += 0.1 * a - 0.01 * self.pos;
178 self.pos += self.vel;
179 self.steps += 1;
180 let reward = -(self.pos * self.pos) - 0.1 * (a * a);
181 let done = self.pos.abs() > 3.0 || self.steps >= self.max_steps;
182 (self.state_tensor(), reward, done)
183 }
184}
185
186// -------------------------------
187// Trajectory storage
188// -------------------------------
189
190struct RolloutBatch {
191 states: Vec<f32>,
192 actions: Vec<f32>,
193 log_probs: Vec<f32>,
194 rewards: Vec<f32>,
195 dones: Vec<f32>,
196 values: Vec<f32>,
197 next_states: Vec<f32>,
198 _state_dim: usize,
199}
200impl RolloutBatch {
201 fn new(capacity: usize, state_dim: usize) -> Self {
202 Self {
203 states: Vec::with_capacity(capacity * state_dim),
204 actions: Vec::with_capacity(capacity),
205 log_probs: Vec::with_capacity(capacity),
206 rewards: Vec::with_capacity(capacity),
207 dones: Vec::with_capacity(capacity),
208 values: Vec::with_capacity(capacity),
209 next_states: Vec::with_capacity(capacity * state_dim),
210 _state_dim: state_dim,
211 }
212 }
213
214 #[allow(clippy::too_many_arguments)]
215 fn push(&mut self, s: &[f32], a: f32, lp: f32, r: f32, d: f32, v: f32, s2: &[f32]) {
216 self.states.extend_from_slice(s);
217 self.actions.push(a);
218 self.log_probs.push(lp);
219 self.rewards.push(r);
220 self.dones.push(d);
221 self.values.push(v);
222 self.next_states.extend_from_slice(s2);
223 }
224
225 fn len(&self) -> usize {
226 self.actions.len()
227 }
228}
229
230// -------------------------------
231// Math helpers
232// -------------------------------
233
234fn gaussian_log_prob(action: &Tensor, mean: &Tensor, log_std: &Tensor) -> Tensor {
235 // All tensors shaped [B, A] (log_std is broadcastable)
236 let std = log_std.exp();
237 let var = std.pow_scalar(2.0);
238 let log_scale = log_std;
239 let diff = action.sub_tensor(mean);
240 let log_prob = diff
241 .pow_scalar(2.0)
242 .div_tensor(&var)
243 .add_scalar(std::f32::consts::LN_2 + std::f32::consts::PI)
244 .add_tensor(&log_scale.mul_scalar(2.0))
245 .mul_scalar(0.5)
246 .mul_scalar(-1.0);
247 // Sum across action dim (dim=1) -> [B,1]
248 log_prob.sum_dims(&[1], true)
249}
250
251#[allow(clippy::too_many_arguments)]
252fn compute_gae(
253 returns_out: &mut [f32],
254 adv_out: &mut [f32],
255 rewards: &[f32],
256 dones: &[f32],
257 values: &[f32],
258 next_values: &[f32],
259 gamma: f32,
260 lam: f32,
261) {
262 let n = rewards.len();
263 let mut gae = 0.0f32;
264 for t in (0..n).rev() {
265 let not_done = 1.0 - dones[t];
266 let delta = rewards[t] + gamma * next_values[t] * not_done - values[t];
267 gae = delta + gamma * lam * not_done * gae;
268 adv_out[t] = gae;
269 returns_out[t] = gae + values[t];
270 }
271}
272
273fn normalize_in_place(x: &mut [f32], eps: f32) {
274 let n = x.len() as f32;
275 if n <= 1.0 {
276 return;
277 }
278 let mean = x.iter().copied().sum::<f32>() / n;
279 let var = x
280 .iter()
281 .map(|v| {
282 let d = v - mean;
283 d * d
284 })
285 .sum::<f32>()
286 / n;
287 let std = (var + eps).sqrt();
288 for v in x.iter_mut() {
289 *v = (*v - mean) / std;
290 }
291}
292
293fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
294 let mut total_sq = 0.0f32;
295 for p in parameters.iter() {
296 if let Some(g) = p.grad_owned() {
297 for &v in g.data() {
298 total_sq += v * v;
299 }
300 }
301 }
302 let norm = total_sq.sqrt();
303 if norm > max_norm {
304 let scale = max_norm / (norm + eps);
305 for p in parameters.iter_mut() {
306 if let Some(g) = p.grad_owned() {
307 p.set_grad(g.mul_scalar(scale));
308 }
309 }
310 }
311}
312
313fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
314 let mut total_sq = 0.0f32;
315 for p in parameters.iter_mut() {
316 if let Some(g) = p.grad_owned() {
317 for &v in g.data() {
318 total_sq += v * v;
319 }
320 }
321 }
322 total_sq.sqrt()
323}
324
325// -------------------------------
326// Main
327// -------------------------------
328
329pub fn main() -> Result<(), Box<dyn std::error::Error>> {
330 println!("=== PPO Continuous Example (YardEnv) ===");
331
332 let state_dim = 3usize;
333 let action_dim = 1usize;
334
335 // Hparams
336 let total_steps = std::env::var("PPO_STEPS")
337 .ok()
338 .and_then(|v| v.parse::<usize>().ok())
339 .unwrap_or(4000usize);
340 let horizon = 128usize; // rollout length per update
341 let epochs = 4usize; // PPO epochs per update
342 let mini_batch_size = 64usize; // minibatch from horizon
343 let gamma = 0.99f32;
344 let lam = 0.95f32; // GAE lambda
345 let clip_eps = 0.2f32;
346 let vf_coef = 0.5f32;
347 let ent_coef = 0.0f32;
348 let max_grad_norm = 1.0f32;
349
350 // Models
351 let mut actor = Actor::new(state_dim, action_dim, Some(101));
352 let mut critic = Critic::new(state_dim, Some(202));
353
354 // Opts
355 let mut actor_opt = Adam::with_learning_rate(3e-4);
356 for p in actor.parameters() {
357 actor_opt.add_parameter(p);
358 }
359 let mut critic_opt = Adam::with_learning_rate(3e-4);
360 for p in critic.parameters() {
361 critic_opt.add_parameter(p);
362 }
363
364 // Env and RNG
365 let mut env = YardEnv::new(42);
366 let mut rng = SmallRng::new(999);
367 let mut state = env.reset();
368
369 // Metrics
370 let mut episode_return = 0.0f32;
371 let mut episode = 0usize;
372 let mut ema_return: Option<f32> = None;
373 let ema_alpha = 0.05f32;
374 let mut best_return = f32::NEG_INFINITY;
375
376 let mut t = 0usize;
377 while t < total_steps {
378 // Collect a rollout
379 let mut batch = RolloutBatch::new(horizon, state_dim);
380 for _ in 0..horizon {
381 // Policy forward (detached sampling to not blow graph; we use stored log_probs)
382 let (mean, log_std_row) = actor.forward(&state);
383 let mean_v = mean.data()[0];
384 let log_std_v = log_std_row.data()[0];
385 let std_v = log_std_v.exp();
386 let noise = rng.normal();
387 let action_v = (mean_v + std_v * noise).clamp(-1.0, 1.0);
388
389 // Build action tensor [1, A] for log_prob calculation with autograd
390 let action_t = Tensor::from_slice(&[action_v], vec![1, action_dim]).unwrap();
391 let log_prob_t = gaussian_log_prob(&action_t, &mean, &log_std_row);
392 let log_prob_v = log_prob_t.data()[0];
393
394 // Step env
395 let (next_state, reward, done) = env.step(action_v);
396 episode_return += reward;
397
398 // Value
399 let value_t = critic.forward(&state);
400 let value_v = value_t.data()[0];
401
402 // Push
403 batch.push(
404 state.data(),
405 action_v,
406 log_prob_v,
407 reward,
408 if done { 1.0 } else { 0.0 },
409 value_v,
410 next_state.data(),
411 );
412
413 // Reset
414 state = if done {
415 let st = env.reset();
416 ema_return = Some(match ema_return {
417 None => episode_return,
418 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
419 });
420 if episode_return > best_return {
421 best_return = episode_return;
422 }
423 println!(
424 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
425 t,
426 episode,
427 episode_return,
428 ema_return.unwrap_or(episode_return),
429 best_return
430 );
431 episode_return = 0.0;
432 episode += 1;
433 st
434 } else {
435 next_state
436 };
437
438 t += 1;
439 if t >= total_steps {
440 break;
441 }
442 }
443
444 // Bootstrap next values for GAE
445 let next_values: Vec<f32> = {
446 let mut out = Vec::with_capacity(batch.len());
447 for i in 0..batch.len() {
448 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
449 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
450 let v2 = critic.forward(&s2_t).data()[0];
451 out.push(v2);
452 }
453 out
454 };
455
456 // Compute returns and advantages
457 let mut returns = vec![0.0f32; batch.len()];
458 let mut adv = vec![0.0f32; batch.len()];
459 compute_gae(
460 &mut returns,
461 &mut adv,
462 &batch.rewards,
463 &batch.dones,
464 &batch.values,
465 &next_values,
466 gamma,
467 lam,
468 );
469 normalize_in_place(&mut adv, 1e-8);
470
471 // Prepare tensors for training
472 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
473 let actions_t = Tensor::from_slice(&batch.actions, vec![batch.len(), action_dim]).unwrap();
474 let old_logp_t = Tensor::from_slice(&batch.log_probs, vec![batch.len(), 1]).unwrap();
475 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
476 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
477
478 // PPO epochs over the rollout
479 let num_minibatches = batch.len().div_ceil(mini_batch_size);
480 for e in 0..epochs {
481 for mb in 0..num_minibatches {
482 let start = mb * mini_batch_size;
483 let end = (start + mini_batch_size).min(batch.len());
484 if start >= end {
485 break;
486 }
487
488 // Slice views
489 let s_mb = states_t.slice_view(start * state_dim, 1, (end - start) * state_dim);
490 let s_mb = s_mb.reshape(vec![(end - start) as i32, state_dim as i32]);
491 let a_mb = actions_t
492 .slice_view(start * action_dim, 1, (end - start) * action_dim)
493 .reshape(vec![(end - start) as i32, action_dim as i32]);
494 let oldlp_mb = old_logp_t
495 .slice_view(start, 1, end - start)
496 .reshape(vec![(end - start) as i32, 1]);
497 let ret_mb = returns_t
498 .slice_view(start, 1, end - start)
499 .reshape(vec![(end - start) as i32, 1]);
500 let adv_mb = adv_t
501 .slice_view(start, 1, end - start)
502 .reshape(vec![(end - start) as i32, 1]);
503
504 // Zero grads
505 {
506 let mut ps = actor.parameters();
507 actor_opt.zero_grad(&mut ps);
508 }
509 {
510 let mut ps = critic.parameters();
511 critic_opt.zero_grad(&mut ps);
512 }
513
514 // Forward actor and critic
515 let (mean_mb, log_std_row) = actor.forward(&s_mb);
516 let logp_mb = gaussian_log_prob(&a_mb, &mean_mb, &log_std_row);
517 let ratio = logp_mb.sub_tensor(&oldlp_mb).exp(); // exp(new-old)
518 let clip_low =
519 Tensor::from_slice(&vec![1.0 - clip_eps; end - start], vec![end - start, 1])
520 .unwrap();
521 let clip_high =
522 Tensor::from_slice(&vec![1.0 + clip_eps; end - start], vec![end - start, 1])
523 .unwrap();
524 // ratio_clipped = min(max(ratio, low), high) using ReLU identities
525 let ratio_ge_low = ratio.sub_tensor(&clip_low).relu().add_tensor(&clip_low);
526 let ratio_clipped =
527 clip_high.sub_tensor(&ratio_ge_low.sub_tensor(&clip_high).relu());
528 let pg1 = ratio.mul_tensor(&adv_mb);
529 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
530 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
531 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
532 let actor_loss = actor_min.mul_scalar(-1.0).mean();
533
534 let v_pred = critic.forward(&s_mb);
535 let v_loss = v_pred
536 .sub_tensor(&ret_mb)
537 .pow_scalar(2.0)
538 .mean()
539 .mul_scalar(vf_coef);
540
541 // Entropy (approx Gaussian entropy per action)
542 let entropy = log_std_row
543 .add_scalar(0.5 * (2.0 * std::f32::consts::PI * std::f32::consts::E).ln())
544 .sum_dims(&[1], true)
545 .mean()
546 .mul_scalar(ent_coef);
547
548 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&entropy);
549 loss.backward(None);
550
551 // Step actor
552 {
553 let params = actor.parameters();
554 let mut with_grads: Vec<&mut Tensor> = Vec::new();
555 for p in params {
556 if p.grad_owned().is_some() {
557 with_grads.push(p);
558 }
559 }
560 if !with_grads.is_empty() {
561 let _ = grad_global_norm(&mut with_grads);
562 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
563 actor_opt.step(&mut with_grads);
564 actor_opt.zero_grad(&mut with_grads);
565 }
566 }
567
568 // Step critic
569 {
570 let params = critic.parameters();
571 let mut with_grads: Vec<&mut Tensor> = Vec::new();
572 for p in params {
573 if p.grad_owned().is_some() {
574 with_grads.push(p);
575 }
576 }
577 if !with_grads.is_empty() {
578 let _ = grad_global_norm(&mut with_grads);
579 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
580 critic_opt.step(&mut with_grads);
581 critic_opt.zero_grad(&mut with_grads);
582 }
583 }
584
585 // Occasionally log
586 if e == 0 && mb == 0 {
587 println!(
588 "update@t={} | actor_loss={:.4} v_loss={:.4}",
589 t,
590 actor_loss.value(),
591 v_loss.value()
592 );
593 }
594
595 clear_all_graphs_known();
596 }
597 }
598 }
599
600 println!("=== PPO training finished ===");
601 Ok(())
602 }
60 fn forward(&self, input: &Tensor) -> Tensor {
61 let mut current: Option<Tensor> = None;
62 for (i, layer) in self.layers.iter().enumerate() {
63 let out = if i == 0 {
64 layer.forward(input)
65 } else {
66 layer.forward(current.as_ref().unwrap())
67 };
68 let is_last = i + 1 == self.layers.len();
69 let out = if !is_last { out.relu() } else { out };
70 current = Some(out);
71 }
72 current.expect("MLP has at least one layer")
73 }
74 fn parameters(&mut self) -> Vec<&mut Tensor> {
75 self.layers
76 .iter_mut()
77 .flat_map(|l| l.parameters())
78 .collect()
79 }
80}
81
82// -------------------------------
83// Actor (logits) + Critic
84// -------------------------------
85
86struct Actor {
87 net: Mlp,
88}
89impl Actor {
90 fn new(state_dim: usize, action_dim: usize, seed: Option<u64>) -> Self {
91 Self {
92 net: Mlp::new(&[state_dim, 64, 64, action_dim], seed),
93 }
94 }
95 fn forward(&self, state: &Tensor) -> Tensor {
96 self.net.forward(state)
97 } // logits [B, A]
98 fn parameters(&mut self) -> Vec<&mut Tensor> {
99 self.net.parameters()
100 }
101}
102
103struct Critic {
104 net: Mlp,
105}
106impl Critic {
107 fn new(state_dim: usize, seed: Option<u64>) -> Self {
108 Self {
109 net: Mlp::new(&[state_dim, 64, 64, 1], seed),
110 }
111 }
112 fn forward(&self, state: &Tensor) -> Tensor {
113 self.net.forward(state)
114 }
115 fn parameters(&mut self) -> Vec<&mut Tensor> {
116 self.net.parameters()
117 }
118}
119
120// -------------------------------
121// Discrete YardEnv (3 actions -> -1, 0, +1)
122// -------------------------------
123
124struct YardEnv {
125 pos: f32,
126 vel: f32,
127 steps: usize,
128 max_steps: usize,
129 rng: SmallRng,
130}
131impl YardEnv {
132 const ACTIONS: [f32; 3] = [-1.0, 0.0, 1.0];
133 fn new(seed: u64) -> Self {
134 let mut e = Self {
135 pos: 0.0,
136 vel: 0.0,
137 steps: 0,
138 max_steps: 200,
139 rng: SmallRng::new(seed),
140 };
141 e.reset();
142 e
143 }
144 fn reset(&mut self) -> Tensor {
145 self.pos = (self.rng.next_f32() * 1.0) - 0.5;
146 self.vel = (self.rng.next_f32() * 0.2) - 0.1;
147 self.steps = 0;
148 self.state_tensor()
149 }
150 fn state_tensor(&self) -> Tensor {
151 Tensor::from_slice(&[self.pos, self.vel, 0.0], vec![1, 3]).unwrap()
152 }
153 fn step(&mut self, action_idx: usize) -> (Tensor, f32, bool) {
154 let a = Self::ACTIONS[action_idx.min(2)];
155 self.vel += 0.1 * a - 0.01 * self.pos;
156 self.pos += self.vel;
157 self.steps += 1;
158 let reward = -(self.pos * self.pos) - 0.05 * (a * a);
159 let done = self.pos.abs() > 3.0 || self.steps >= self.max_steps;
160 (self.state_tensor(), reward, done)
161 }
162}
163
164// -------------------------------
165// Rollout storage
166// -------------------------------
167
168struct RolloutBatch {
169 states: Vec<f32>,
170 actions: Vec<usize>,
171 old_logps: Vec<f32>,
172 rewards: Vec<f32>,
173 dones: Vec<f32>,
174 values: Vec<f32>,
175 next_states: Vec<f32>,
176 _state_dim: usize,
177}
178impl RolloutBatch {
179 fn new(cap: usize, sd: usize) -> Self {
180 Self {
181 states: Vec::with_capacity(cap * sd),
182 actions: Vec::with_capacity(cap),
183 old_logps: Vec::with_capacity(cap),
184 rewards: Vec::with_capacity(cap),
185 dones: Vec::with_capacity(cap),
186 values: Vec::with_capacity(cap),
187 next_states: Vec::with_capacity(cap * sd),
188 _state_dim: sd,
189 }
190 }
191 #[allow(clippy::too_many_arguments)]
192 fn push(&mut self, s: &[f32], a: usize, lp: f32, r: f32, d: f32, v: f32, s2: &[f32]) {
193 self.states.extend_from_slice(s);
194 self.actions.push(a);
195 self.old_logps.push(lp);
196 self.rewards.push(r);
197 self.dones.push(d);
198 self.values.push(v);
199 self.next_states.extend_from_slice(s2);
200 }
201 fn len(&self) -> usize {
202 self.actions.len()
203 }
204}
205
206// -------------------------------
207// Helpers
208// -------------------------------
209
210#[allow(clippy::too_many_arguments)]
211fn compute_gae(
212 returns_out: &mut [f32],
213 adv_out: &mut [f32],
214 rewards: &[f32],
215 dones: &[f32],
216 values: &[f32],
217 next_values: &[f32],
218 gamma: f32,
219 lam: f32,
220) {
221 let n = rewards.len();
222 let mut gae = 0.0f32;
223 for t in (0..n).rev() {
224 let not_done = 1.0 - dones[t];
225 let delta = rewards[t] + gamma * next_values[t] * not_done - values[t];
226 gae = delta + gamma * lam * not_done * gae;
227 adv_out[t] = gae;
228 returns_out[t] = gae + values[t];
229 }
230}
231
232fn normalize_in_place(x: &mut [f32], eps: f32) {
233 let n = x.len() as f32;
234 if n <= 1.0 {
235 return;
236 }
237 let mean = x.iter().copied().sum::<f32>() / n;
238 let var = x
239 .iter()
240 .map(|v| {
241 let d = v - mean;
242 d * d
243 })
244 .sum::<f32>()
245 / n;
246 let std = (var + eps).sqrt();
247 for v in x.iter_mut() {
248 *v = (*v - mean) / std;
249 }
250}
251
252fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
253 let mut total_sq = 0.0f32;
254 for p in parameters.iter() {
255 if let Some(g) = p.grad_owned() {
256 for &v in g.data() {
257 total_sq += v * v;
258 }
259 }
260 }
261 let norm = total_sq.sqrt();
262 if norm > max_norm {
263 let scale = max_norm / (norm + eps);
264 for p in parameters.iter_mut() {
265 if let Some(g) = p.grad_owned() {
266 p.set_grad(g.mul_scalar(scale));
267 }
268 }
269 }
270}
271
272// log-softmax for selected actions: given logits [B,A] and actions Vec<usize> -> log_prob [B,1]
273fn log_prob_actions(
274 logits: &Tensor,
275 actions: &[usize],
276 batch: usize,
277 _action_dim: usize,
278) -> Tensor {
279 let max_logits = logits.max_dims(&[1], true); // [B,1]
280 let shifted = logits.sub_tensor(&max_logits);
281 let exp = shifted.exp();
282 let sum_exp = exp.sum_dims(&[1], true); // [B,1]
283 let log_sum_exp = sum_exp.log(); // [B,1]
284 let log_softmax = shifted.sub_tensor(&log_sum_exp); // [B,A]
285 // gather selected action log-probs
286 log_softmax.gather(1, actions, &[batch, 1])
287}
288
289// probability ratio = exp(new_logp - old_logp)
290fn ratio_from_logps(new_logp: &Tensor, old_logp: &Tensor) -> Tensor {
291 new_logp.sub_tensor(old_logp).exp()
292}
293
294// Clamp ratio to [1-clip, 1+clip] using ReLU-based clamp (no custom ops)
295fn clamp_ratio(ratio: &Tensor, clip_eps: f32) -> Tensor {
296 let b = ratio.shape().dims()[0];
297 let low = Tensor::from_slice(&vec![1.0 - clip_eps; b], vec![b, 1]).unwrap();
298 let high = Tensor::from_slice(&vec![1.0 + clip_eps; b], vec![b, 1]).unwrap();
299 let ge_low = ratio.sub_tensor(&low).relu().add_tensor(&low);
300 high.sub_tensor(&ge_low.sub_tensor(&high).relu())
301}
302
303fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
304 let mut total_sq = 0.0f32;
305 for p in parameters.iter_mut() {
306 if let Some(g) = p.grad_owned() {
307 for &v in g.data() {
308 total_sq += v * v;
309 }
310 }
311 }
312 total_sq.sqrt()
313}
314
315// -------------------------------
316// Main
317// -------------------------------
318
319 pub fn main() -> Result<(), Box<dyn std::error::Error>> {
320 println!("=== PPO Discrete Example (YardEnv) ===");
321
322 let state_dim = 3usize;
323 let action_dim = 3usize;
324 let total_steps = std::env::var("PPOD_STEPS")
325 .ok()
326 .and_then(|v| v.parse::<usize>().ok())
327 .unwrap_or(3500usize);
328 let horizon = 128usize;
329 let epochs = 4usize;
330 let mini_batch_size = 64usize;
331 let gamma = 0.99f32;
332 let lam = 0.95f32;
333 let clip_eps = 0.2f32;
334 let vf_coef = 0.5f32;
335 let ent_coef = 0.0f32;
336 let max_grad_norm = 1.0f32;
337
338 let mut actor = Actor::new(state_dim, action_dim, Some(111));
339 let mut critic = Critic::new(state_dim, Some(222));
340 let mut actor_opt = Adam::with_learning_rate(3e-4);
341 for p in actor.parameters() {
342 actor_opt.add_parameter(p);
343 }
344 let mut critic_opt = Adam::with_learning_rate(3e-4);
345 for p in critic.parameters() {
346 critic_opt.add_parameter(p);
347 }
348
349 let mut env = YardEnv::new(1234);
350 let mut rng = SmallRng::new(98765);
351 let mut state = env.reset();
352 let mut episode_return = 0.0f32;
353 let mut episode = 0usize;
354 let mut ema_return: Option<f32> = None;
355 let ema_alpha = 0.05f32;
356 let mut best_return = f32::NEG_INFINITY;
357
358 let mut t = 0usize;
359 while t < total_steps {
360 let mut batch = RolloutBatch::new(horizon, state_dim);
361 for _ in 0..horizon {
362 // Actor logits and categorical sampling
363 let logits = actor.forward(&state); // [1, A]
364 let probs = logits.softmax(1); // [1, A]
365 // sample action from probs (CPU sampling)
366 let p = probs.data();
367 let (p0, p1, _p2) = (p[0], p[1], p[2]);
368 let u = rng.next_f32();
369 let a_idx = if u < p0 {
370 0
371 } else if u < p0 + p1 {
372 1
373 } else {
374 2
375 };
376
377 let old_logp = {
378 let _ng = NoGradTrack::new();
379 let lp = log_prob_actions(&logits, &[a_idx], 1, action_dim);
380 lp.data()[0]
381 };
382
383 // Step env
384 let (next_state, reward, done) = env.step(a_idx);
385 episode_return += reward;
386
387 // Critic value
388 let value_t = critic.forward(&state);
389 let value_v = value_t.data()[0];
390
391 batch.push(
392 state.data(),
393 a_idx,
394 old_logp,
395 reward,
396 if done { 1.0 } else { 0.0 },
397 value_v,
398 next_state.data(),
399 );
400
401 state = if done {
402 let st = env.reset();
403 ema_return = Some(match ema_return {
404 None => episode_return,
405 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
406 });
407 if episode_return > best_return {
408 best_return = episode_return;
409 }
410 println!(
411 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
412 t,
413 episode,
414 episode_return,
415 ema_return.unwrap_or(episode_return),
416 best_return
417 );
418 episode_return = 0.0;
419 episode += 1;
420 st
421 } else {
422 next_state
423 };
424
425 t += 1;
426 if t >= total_steps {
427 break;
428 }
429 }
430
431 // Bootstrap values for GAE
432 let next_values: Vec<f32> = {
433 let mut out = Vec::with_capacity(batch.len());
434 for i in 0..batch.len() {
435 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
436 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
437 out.push(critic.forward(&s2_t).data()[0]);
438 }
439 out
440 };
441
442 let mut returns = vec![0.0f32; batch.len()];
443 let mut adv = vec![0.0f32; batch.len()];
444 compute_gae(
445 &mut returns,
446 &mut adv,
447 &batch.rewards,
448 &batch.dones,
449 &batch.values,
450 &next_values,
451 gamma,
452 lam,
453 );
454 normalize_in_place(&mut adv, 1e-8);
455
456 // Tensors for training
457 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
458 let actions_vec = batch.actions.clone();
459 let old_logp_t = Tensor::from_slice(&batch.old_logps, vec![batch.len(), 1]).unwrap();
460 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
461 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
462
463 // PPO epochs
464 let num_minibatches = batch.len().div_ceil(mini_batch_size);
465 for e in 0..epochs {
466 for mb in 0..num_minibatches {
467 let start = mb * mini_batch_size;
468 let end = (start + mini_batch_size).min(batch.len());
469 if start >= end {
470 break;
471 }
472
473 // Views
474 let s_mb = states_t
475 .slice_view(start * state_dim, 1, (end - start) * state_dim)
476 .reshape(vec![(end - start) as i32, state_dim as i32]);
477 let oldlp_mb = old_logp_t
478 .slice_view(start, 1, end - start)
479 .reshape(vec![(end - start) as i32, 1]);
480 let ret_mb = returns_t
481 .slice_view(start, 1, end - start)
482 .reshape(vec![(end - start) as i32, 1]);
483 let adv_mb = adv_t
484 .slice_view(start, 1, end - start)
485 .reshape(vec![(end - start) as i32, 1]);
486 let a_slice = &actions_vec[start..end];
487
488 // Zero grads
489 {
490 let mut ps = actor.parameters();
491 actor_opt.zero_grad(&mut ps);
492 }
493 {
494 let mut ps = critic.parameters();
495 critic_opt.zero_grad(&mut ps);
496 }
497
498 // Forward
499 let logits_mb = actor.forward(&s_mb); // [B,A]
500 let new_logp_mb = log_prob_actions(&logits_mb, a_slice, end - start, action_dim); // [B,1]
501 let ratio = ratio_from_logps(&new_logp_mb, &oldlp_mb);
502 let ratio_clipped = clamp_ratio(&ratio, clip_eps);
503 let pg1 = ratio.mul_tensor(&adv_mb);
504 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
505 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
506 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
507 let actor_loss = actor_min.mul_scalar(-1.0).mean();
508
509 let v_pred = critic.forward(&s_mb);
510 let v_loss = v_pred
511 .sub_tensor(&ret_mb)
512 .pow_scalar(2.0)
513 .mean()
514 .mul_scalar(vf_coef);
515
516 // Entropy bonus from logits (categorical entropy) ≈ -sum p*logp
517 let probs_mb = logits_mb.softmax(1);
518 let logp_all = probs_mb.add_scalar(1e-8).log();
519 let ent = probs_mb
520 .mul_tensor(&logp_all)
521 .sum_dims(&[1], true)
522 .mul_scalar(-1.0)
523 .mean()
524 .mul_scalar(ent_coef);
525
526 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&ent);
527 loss.backward(None);
528
529 // Step actor
530 {
531 let params = actor.parameters();
532 let mut with_grads: Vec<&mut Tensor> = Vec::new();
533 for p in params {
534 if p.grad_owned().is_some() {
535 with_grads.push(p);
536 }
537 }
538 if !with_grads.is_empty() {
539 let _ = grad_global_norm(&mut with_grads);
540 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
541 actor_opt.step(&mut with_grads);
542 actor_opt.zero_grad(&mut with_grads);
543 }
544 }
545
546 // Step critic
547 {
548 let params = critic.parameters();
549 let mut with_grads: Vec<&mut Tensor> = Vec::new();
550 for p in params {
551 if p.grad_owned().is_some() {
552 with_grads.push(p);
553 }
554 }
555 if !with_grads.is_empty() {
556 let _ = grad_global_norm(&mut with_grads);
557 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
558 critic_opt.step(&mut with_grads);
559 critic_opt.zero_grad(&mut with_grads);
560 }
561 }
562
563 if e == 0 && mb == 0 {
564 println!(
565 "update@t={} | actor_loss={:.4} v_loss={:.4}",
566 t,
567 actor_loss.value(),
568 v_loss.value()
569 );
570 }
571
572 clear_all_graphs_known();
573 }
574 }
575 }
576
577 println!("=== PPO discrete training finished ===");
578 Ok(())
579 }
53 pub fn forward(&self, input: &Tensor, attn_mask: Option<&Tensor>) -> Tensor {
54 let attn = self.mha.forward(input, input, input, attn_mask);
55 let res1 = attn.add_tensor(input);
56
57 // Feed-forward network with ReLU and residual
58 let (b, t, e) = Self::triple(input);
59 let x2d = res1.contiguous().view(vec![(b * t) as i32, e as i32]);
60 let hidden = self.ffn_in.forward(&x2d).relu();
61 let out2d = self.ffn_out.forward(&hidden);
62 let out = out2d.view(vec![b as i32, t as i32, e as i32]);
63 out.add_tensor(&res1)
64 }
71 fn forward(&self, input: &Tensor, final_activation: Option<fn(&Tensor) -> Tensor>) -> Tensor {
72 let mut current: Option<Tensor> = None;
73 for (i, layer) in self.layers.iter().enumerate() {
74 let out = if i == 0 {
75 layer.forward(input)
76 } else {
77 layer.forward(current.as_ref().unwrap())
78 };
79 let is_last = i + 1 == self.layers.len();
80 let out = if !is_last {
81 out.relu()
82 } else if let Some(act) = final_activation {
83 act(&out)
84 } else {
85 out
86 };
87 current = Some(out);
88 }
89 current.expect("MLP has at least one layer")
90 }
Source§impl Tensor
impl Tensor
Sourcepub fn sigmoid(&self) -> Tensor
pub fn sigmoid(&self) -> Tensor
Element-wise sigmoid activation function
Computes the sigmoid function for each element: output[i] = 1 / (1 + e^(-self[i]))
Uses a numerically stable implementation that avoids overflow for large positive/negative values by using different computation paths for positive and negative inputs.
§Returns
A new tensor with sigmoid applied to each element, values in range (0, 1)
§Performance Characteristics
- Numerical Stability: Avoids overflow using stable implementation
- Scalar Implementation: Optimized scalar computation for mathematical accuracy
- Cache-friendly: Linear memory access patterns
- Mathematical Accuracy: High-precision exponential and division operations
- Gradient Tracking: Full gradtrack support with efficient gradient computation
§Implementation Details
Uses a numerically stable implementation:
- For x ≥ 0: computes 1 / (1 + e^(-x)) to avoid overflow in e^x for large positive x
- For x < 0: computes e^x / (1 + e^x) to avoid overflow in e^(-x) for large negative x
This ensures the result is always in the range (0, 1) without numerical overflow.
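For readers who want the branch selection spelled out, here is a minimal standalone sketch of the stable formula described above (illustrative only; this is not the crate's internal code):
fn stable_sigmoid(x: f32) -> f32 {
    if x >= 0.0 {
        // e^(-x) <= 1 for non-negative x, so the denominator cannot overflow
        1.0 / (1.0 + (-x).exp())
    } else {
        // e^x <= 1 for negative x; dividing by (1 + e^x) keeps the result in (0, 1)
        let e = x.exp();
        e / (1.0 + e)
    }
}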
§Examples
§Basic Sigmoid Activation
use train_station::Tensor;
let a = Tensor::from_slice(&[-1.0, 0.0, 1.0], vec![3]).unwrap();
let b = a.sigmoid();
assert_eq!(b.shape().dims(), vec![3]);
assert!((b.get(&[0]) - 0.26894143).abs() < 1e-6); // sigmoid(-1.0)
assert!((b.get(&[1]) - 0.5).abs() < 1e-6); // sigmoid(0.0)
assert!((b.get(&[2]) - 0.7310586).abs() < 1e-6); // sigmoid(1.0)
§Extreme Values
use train_station::Tensor;
let a = Tensor::from_slice(&[-10.0, 10.0], vec![2]).unwrap();
let b = a.sigmoid();
assert_eq!(b.shape().dims(), vec![2]);
assert!(b.get(&[0]) < 1e-4); // sigmoid(-10.0) ≈ 0
assert!(b.get(&[1]) > 0.9999); // sigmoid(10.0) ≈ 1
Examples found in repository?
68 pub fn main() -> Result<(), Box<dyn std::error::Error>> {
69 println!("=== Supervised FFN Example (XOR) ===");
70
71 // Dataset: XOR (repeat to form a small batch)
72 let inputs: Vec<f32> = vec![
73 0.0, 0.0, // -> 0
74 0.0, 1.0, // -> 1
75 1.0, 0.0, // -> 1
76 1.0, 1.0, // -> 0
77 ];
78 let targets: Vec<f32> = vec![0.0, 1.0, 1.0, 0.0];
79
80 // Repeat the base patterns to stabilize training
81 let repeats = 64usize; // effective batch = 4 * repeats = 256
82 let mut xs = Vec::with_capacity(repeats * inputs.len());
83 let mut ys = Vec::with_capacity(repeats * targets.len());
84 for _ in 0..repeats {
85 xs.extend_from_slice(&inputs);
86 ys.extend_from_slice(&targets);
87 }
88
89 let batch = xs.len() / 2; // two features
90 let x_t = Tensor::from_slice(&xs, vec![batch, 2]).unwrap();
91 let y_t = Tensor::from_slice(&ys, vec![batch, 1]).unwrap();
92
93 // Model config: 2 -> 32 -> 32 -> 1, final sigmoid via loss path
94 let cfg = FeedForwardConfig {
95 input_size: 2,
96 hidden_sizes: vec![32, 32],
97 output_size: 1,
98 use_bias: true,
99 };
100 let mut net = FeedForwardNetwork::new(cfg, Some(777));
101
102 // Optimizer and parameter linking
103 let mut opt = Adam::with_learning_rate(1e-3);
104 for p in net.parameters() {
105 opt.add_parameter(p);
106 }
107
108 let epochs = 1000usize;
109 let max_grad_norm = 1.0f32;
110 let mut best_loss = f32::INFINITY;
111 let mut best_acc = 0.0f32;
112
113 for e in 0..epochs {
114 // Zero grads each iteration
115 {
116 let mut params = net.parameters();
117 opt.zero_grad(&mut params);
118 }
119
120 // Forward -> logits; use numerically stable BCE-with-logits for loss
121 let logits = net.forward(&x_t);
122 let mut loss = bce_with_logits(&logits, &y_t);
123 loss.backward(None);
124
125 // Step only params with grads
126 {
127 let params = net.parameters();
128 let mut with_grads: Vec<&mut Tensor> = Vec::new();
129 for p in params {
130 if p.grad_owned().is_some() {
131 with_grads.push(p);
132 }
133 }
134 if !with_grads.is_empty() {
135 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
136 opt.step(&mut with_grads);
137 opt.zero_grad(&mut with_grads);
138 }
139 }
140
141 // Metrics (use sigmoid only for reporting accuracy)
142 let preds = logits.sigmoid();
143 let acc = accuracy(&preds, &y_t);
144 if loss.value() < best_loss {
145 best_loss = loss.value();
146 }
147 if acc > best_acc {
148 best_acc = acc;
149 }
150 if e % 10 == 0 || e + 1 == epochs {
151 println!(
152 "epoch {:4} | loss={:.5} acc={:.3} | best_loss={:.5} best_acc={:.3}",
153 e,
154 loss.value(),
155 acc,
156 best_loss,
157 best_acc
158 );
159 }
160
161 // Clear graphs to avoid stale accumulation across epochs
162 clear_all_graphs_known();
163 }
164
165 // Quick sanity check predictions
166 let test = Tensor::from_slice(&inputs, vec![4, 2]).unwrap();
167 let out = net.forward(&test).sigmoid();
168 println!("predictions (approx): {:?}", out.data());
169
170 println!("=== Supervised training finished ===");
171 Ok(())
172 }
Source§impl Tensor
impl Tensor
Sourcepub fn softmax(&self, dim: usize) -> Tensor
pub fn softmax(&self, dim: usize) -> Tensor
Computes softmax activation along the specified dimension
Applies the softmax function along dimension dim, transforming values into
probabilities that sum to 1 along that dimension. Uses numerically stable
computation to avoid overflow: softmax(x_i) = exp(x_i - max(x)) / sum(exp(x_j - max(x)))
§Arguments
dim - Dimension along which to compute softmax (0-based indexing)
§Returns
A new tensor with softmax applied along the specified dimension.
Values are in range (0, 1) and sum to 1 along dim.
§Performance Characteristics
- Numerical Stability: Avoids overflow using max subtraction technique
- Scalar Implementation: Optimized scalar computation for mathematical accuracy
- Cache-friendly: Optimized memory access patterns for dimension operations
- Mathematical Accuracy: High-precision exponential and division operations
- GradTrack Support: Full automatic differentiation with efficient gradient computation
§Implementation Details
Uses a numerically stable three-pass algorithm:
- Max Computation: Find the maximum value along the specified dimension
- Exponential Sum: Compute exp(x - max) and sum the results
- Normalization: Divide each exp(x - max) by the sum to get probabilities
This approach prevents overflow by subtracting the maximum value before computing exponentials, ensuring numerical stability for any input range.
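A minimal standalone sketch of the same three-pass idea over a flat slice of values (illustrative only; the crate applies it along an arbitrary dimension):
fn softmax_1d(x: &[f32]) -> Vec<f32> {
    // Pass 1: find the maximum value to shift by
    let max = x.iter().cloned().fold(f32::NEG_INFINITY, f32::max);
    // Pass 2: compute exp(x - max) and accumulate the sum
    let exps: Vec<f32> = x.iter().map(|&v| (v - max).exp()).collect();
    let sum: f32 = exps.iter().sum();
    // Pass 3: normalize so the outputs sum to 1
    exps.iter().map(|&e| e / sum).collect()
}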
§Examples
§Basic Softmax Activation
use train_station::Tensor;
let a = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let b = a.softmax(0);
assert_eq!(b.shape().dims(), vec![3]);
// Verify probabilities sum to 1
let sum = b.get(&[0]) + b.get(&[1]) + b.get(&[2]);
assert!((sum - 1.0).abs() < 1e-6);
// Verify relative ordering is preserved
assert!(b.get(&[0]) < b.get(&[1]));
assert!(b.get(&[1]) < b.get(&[2]));
§2D Softmax Along Different Dimensions
use train_station::Tensor;
let a = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let b = a.softmax(0); // Softmax along first dimension
assert_eq!(b.shape().dims(), vec![2, 2]);
// Each column should sum to 1
let col1_sum = b.get(&[0, 0]) + b.get(&[1, 0]);
let col2_sum = b.get(&[0, 1]) + b.get(&[1, 1]);
assert!((col1_sum - 1.0).abs() < 1e-6);
assert!((col2_sum - 1.0).abs() < 1e-6);
§Panics
- Panics if dim is out of bounds for the tensor’s rank
- Panics if the dimension size is 0
Examples found in repository?
72 pub fn forward(
73 &self,
74 query: &Tensor,
75 key: &Tensor,
76 value: &Tensor,
77 attn_mask: Option<&Tensor>,
78 ) -> Tensor {
79 let qkv = Self::project_qkv(query, key, value, &self.q_proj, &self.k_proj, &self.v_proj);
80 let (q, k, v) = qkv;
81
82 // Split heads: [b, t, e] -> [b, h, t, d]
83 let (b, tq, _e) = Self::triple(query);
84 let (_b2, tk, _e2) = Self::triple(key);
85 let q = Self::split_heads(&q, b, tq, self.num_heads, self.head_dim);
86 let k = Self::split_heads(&k, b, tk, self.num_heads, self.head_dim);
87 let v = Self::split_heads(&v, b, tk, self.num_heads, self.head_dim);
88
89 // Scaled dot-product attention
90 // logits: [b, h, tq, tk]
91 let k_t = k.transpose(2, 3);
92 let mut logits = q.matmul(&k_t).div_scalar((self.head_dim as f32).sqrt());
93 if let Some(mask) = attn_mask {
94 let dims = mask.shape().dims().to_vec();
95 // If boolean-like mask matching [b,h,tq,tk], apply masked_fill
96 if dims.len() == 4 && dims[0] == b && dims[1] == self.num_heads && dims[2] == tq {
97 // Interpret mask > 0.5 as keep; we invert to build masked positions
98 let cond: Vec<bool> = mask.data().iter().map(|&v| v < 0.5).collect();
99 // Apply masked fill on a flattened view, then reshape back
100 let flat_logits = logits.view(vec![(b * self.num_heads * tq * tk) as i32]);
101 let filled = flat_logits.masked_fill(&cond, f32::NEG_INFINITY);
102 logits = filled.view(vec![b as i32, self.num_heads as i32, tq as i32, tk as i32]);
103 } else {
104 // Fallback: additive mask
105 logits = logits.add_tensor(mask);
106 }
107 }
108 let attn = logits.softmax(3);
109
110 // context: [b, h, tq, d]
111 let context = attn.matmul(&v);
112 let context = context.permute(vec![0, 2, 1, 3]); // [b, tq, h, d]
113 let context = context.contiguous().view(vec![
114 b as i32,
115 tq as i32,
116 (self.num_heads * self.head_dim) as i32,
117 ]);
118
119 // Output projection (flatten to 2D, project, then restore 3D)
120 let flat = context.view(vec![(b * tq) as i32, self.embed_dim as i32]);
121 let out2d = self.out_proj.forward(&flat);
122 out2d.view(vec![b as i32, tq as i32, self.embed_dim as i32])
123 }
More examples
87 pub fn main() -> Result<(), Box<dyn std::error::Error>> {
88 println!("=== Supervised Classification Example (Cross-Entropy) ===");
89
90 // Synthetic 2D inputs, 3 classes with linear-ish separations
91 let n = 1200usize;
92 let classes = 3usize;
93 let mut xs: Vec<f32> = Vec::with_capacity(n * 2);
94 let mut ys: Vec<usize> = Vec::with_capacity(n);
95
96 // Simple RNG
97 let mut state: u64 = 424242;
98 let mut rand_f32 = || {
99 state = state.wrapping_mul(1664525).wrapping_add(1013904223);
100 (state >> 16) as f32 / (u32::MAX as f32)
101 };
102
103 for _ in 0..n {
104 let x1 = rand_f32() * 4.0 - 2.0;
105 let x2 = rand_f32() * 4.0 - 2.0;
106 // Class by quadrant-ish rule with noise
107 let mut c = if x1 + 0.5 * x2 > 0.5 {
108 0
109 } else if x1 - x2 < -0.5 {
110 1
111 } else {
112 2
113 };
114 if rand_f32() < 0.05 {
115 c = (c + 1) % classes;
116 }
117 xs.push(x1);
118 xs.push(x2);
119 ys.push(c);
120 }
121
122 // Normalize inputs per-feature to [-1, 1]
123 let mut min1 = f32::INFINITY;
124 let mut max1 = f32::NEG_INFINITY;
125 let mut min2 = f32::INFINITY;
126 let mut max2 = f32::NEG_INFINITY;
127 for i in (0..xs.len()).step_by(2) {
128 let a = xs[i];
129 let b = xs[i + 1];
130 if a < min1 {
131 min1 = a;
132 }
133 if a > max1 {
134 max1 = a;
135 }
136 if b < min2 {
137 min2 = b;
138 }
139 if b > max2 {
140 max2 = b;
141 }
142 }
143 let rng1 = (max1 - min1).max(1e-8);
144 let rng2 = (max2 - min2).max(1e-8);
145 for i in (0..xs.len()).step_by(2) {
146 let a = xs[i];
147 let b = xs[i + 1];
148 xs[i] = 2.0 * (a - min1) / rng1 - 1.0;
149 xs[i + 1] = 2.0 * (b - min2) / rng2 - 1.0;
150 }
151
152 // Train/Val split (80/20)
153 let n_train = (n as f32 * 0.8) as usize;
154 let x_train = Tensor::from_slice(&xs[..n_train * 2], vec![n_train, 2]).unwrap();
155 let y_train = ys[..n_train].to_vec();
156 let x_val = Tensor::from_slice(&xs[n_train * 2..], vec![n - n_train, 2]).unwrap();
157 let y_val = ys[n_train..].to_vec();
158
159 // Model: 2 -> 64 -> 64 -> 3 (logits)
160 let cfg = FeedForwardConfig {
161 input_size: 2,
162 hidden_sizes: vec![64, 64],
163 output_size: classes,
164 use_bias: true,
165 };
166 let mut net = FeedForwardNetwork::new(cfg, Some(303));
167
168 // Optimizer
169 let mut opt = Adam::with_learning_rate(1e-3);
170 for p in net.parameters() {
171 opt.add_parameter(p);
172 }
173
174 let epochs = 300usize;
175 let max_grad_norm = 1.0f32;
176 let mut best_val_acc = 0.0f32;
177 let mut best_val_loss = f32::INFINITY;
178
179 for e in 0..epochs {
180 // Zero grads
181 {
182 let mut params = net.parameters();
183 opt.zero_grad(&mut params);
184 }
185
186 // Forward logits
187 let logits = net.forward(&x_train);
188 let mut loss = cross_entropy_logits(&logits, &y_train, n_train, classes);
189 loss.backward(None);
190
191 // Step clipped
192 {
193 let params = net.parameters();
194 let mut with_grads: Vec<&mut Tensor> = Vec::new();
195 for p in params {
196 if p.grad_owned().is_some() {
197 with_grads.push(p);
198 }
199 }
200 if !with_grads.is_empty() {
201 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
202 opt.step(&mut with_grads);
203 opt.zero_grad(&mut with_grads);
204 }
205 }
206
207 // Metrics
208 let train_acc = accuracy_from_logits(&logits, &y_train, n_train, classes);
209 let val_logits = net.forward(&x_val);
210 let val_loss = cross_entropy_logits(&val_logits, &y_val, n - n_train, classes).value();
211 let val_acc = accuracy_from_logits(&val_logits, &y_val, n - n_train, classes);
212 if val_acc > best_val_acc {
213 best_val_acc = val_acc;
214 }
215 if val_loss < best_val_loss {
216 best_val_loss = val_loss;
217 }
218
219 if e % 10 == 0 || e + 1 == epochs {
220 println!(
221 "epoch {:4} | loss={:.4} acc={:.3} | val_loss={:.4} val_acc={:.3} | best_val_acc={:.3}",
222 e, loss.value(), train_acc, val_loss, val_acc, best_val_acc
223 );
224 }
225
226 clear_all_graphs_known();
227 }
228
229 // Quick sample preds via softmax
230 let samples = Tensor::from_slice(&[-1.0, -1.0, 0.0, 0.0, 1.0, 1.0], vec![3, 2]).unwrap();
231 let sm = net.forward(&samples).softmax(1);
232 println!("sample class probs: {:?}", sm.data());
233
234 println!("=== Supervised classification finished ===");
235 Ok(())
236 }
319 pub fn main() -> Result<(), Box<dyn std::error::Error>> {
320 println!("=== PPO Discrete Example (YardEnv) ===");
321
322 let state_dim = 3usize;
323 let action_dim = 3usize;
324 let total_steps = std::env::var("PPOD_STEPS")
325 .ok()
326 .and_then(|v| v.parse::<usize>().ok())
327 .unwrap_or(3500usize);
328 let horizon = 128usize;
329 let epochs = 4usize;
330 let mini_batch_size = 64usize;
331 let gamma = 0.99f32;
332 let lam = 0.95f32;
333 let clip_eps = 0.2f32;
334 let vf_coef = 0.5f32;
335 let ent_coef = 0.0f32;
336 let max_grad_norm = 1.0f32;
337
338 let mut actor = Actor::new(state_dim, action_dim, Some(111));
339 let mut critic = Critic::new(state_dim, Some(222));
340 let mut actor_opt = Adam::with_learning_rate(3e-4);
341 for p in actor.parameters() {
342 actor_opt.add_parameter(p);
343 }
344 let mut critic_opt = Adam::with_learning_rate(3e-4);
345 for p in critic.parameters() {
346 critic_opt.add_parameter(p);
347 }
348
349 let mut env = YardEnv::new(1234);
350 let mut rng = SmallRng::new(98765);
351 let mut state = env.reset();
352 let mut episode_return = 0.0f32;
353 let mut episode = 0usize;
354 let mut ema_return: Option<f32> = None;
355 let ema_alpha = 0.05f32;
356 let mut best_return = f32::NEG_INFINITY;
357
358 let mut t = 0usize;
359 while t < total_steps {
360 let mut batch = RolloutBatch::new(horizon, state_dim);
361 for _ in 0..horizon {
362 // Actor logits and categorical sampling
363 let logits = actor.forward(&state); // [1, A]
364 let probs = logits.softmax(1); // [1, A]
365 // sample action from probs (CPU sampling)
366 let p = probs.data();
367 let (p0, p1, _p2) = (p[0], p[1], p[2]);
368 let u = rng.next_f32();
369 let a_idx = if u < p0 {
370 0
371 } else if u < p0 + p1 {
372 1
373 } else {
374 2
375 };
376
377 let old_logp = {
378 let _ng = NoGradTrack::new();
379 let lp = log_prob_actions(&logits, &[a_idx], 1, action_dim);
380 lp.data()[0]
381 };
382
383 // Step env
384 let (next_state, reward, done) = env.step(a_idx);
385 episode_return += reward;
386
387 // Critic value
388 let value_t = critic.forward(&state);
389 let value_v = value_t.data()[0];
390
391 batch.push(
392 state.data(),
393 a_idx,
394 old_logp,
395 reward,
396 if done { 1.0 } else { 0.0 },
397 value_v,
398 next_state.data(),
399 );
400
401 state = if done {
402 let st = env.reset();
403 ema_return = Some(match ema_return {
404 None => episode_return,
405 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
406 });
407 if episode_return > best_return {
408 best_return = episode_return;
409 }
410 println!(
411 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
412 t,
413 episode,
414 episode_return,
415 ema_return.unwrap_or(episode_return),
416 best_return
417 );
418 episode_return = 0.0;
419 episode += 1;
420 st
421 } else {
422 next_state
423 };
424
425 t += 1;
426 if t >= total_steps {
427 break;
428 }
429 }
430
431 // Bootstrap values for GAE
432 let next_values: Vec<f32> = {
433 let mut out = Vec::with_capacity(batch.len());
434 for i in 0..batch.len() {
435 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
436 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
437 out.push(critic.forward(&s2_t).data()[0]);
438 }
439 out
440 };
441
442 let mut returns = vec![0.0f32; batch.len()];
443 let mut adv = vec![0.0f32; batch.len()];
444 compute_gae(
445 &mut returns,
446 &mut adv,
447 &batch.rewards,
448 &batch.dones,
449 &batch.values,
450 &next_values,
451 gamma,
452 lam,
453 );
454 normalize_in_place(&mut adv, 1e-8);
455
456 // Tensors for training
457 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
458 let actions_vec = batch.actions.clone();
459 let old_logp_t = Tensor::from_slice(&batch.old_logps, vec![batch.len(), 1]).unwrap();
460 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
461 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
462
463 // PPO epochs
464 let num_minibatches = batch.len().div_ceil(mini_batch_size);
465 for e in 0..epochs {
466 for mb in 0..num_minibatches {
467 let start = mb * mini_batch_size;
468 let end = (start + mini_batch_size).min(batch.len());
469 if start >= end {
470 break;
471 }
472
473 // Views
474 let s_mb = states_t
475 .slice_view(start * state_dim, 1, (end - start) * state_dim)
476 .reshape(vec![(end - start) as i32, state_dim as i32]);
477 let oldlp_mb = old_logp_t
478 .slice_view(start, 1, end - start)
479 .reshape(vec![(end - start) as i32, 1]);
480 let ret_mb = returns_t
481 .slice_view(start, 1, end - start)
482 .reshape(vec![(end - start) as i32, 1]);
483 let adv_mb = adv_t
484 .slice_view(start, 1, end - start)
485 .reshape(vec![(end - start) as i32, 1]);
486 let a_slice = &actions_vec[start..end];
487
488 // Zero grads
489 {
490 let mut ps = actor.parameters();
491 actor_opt.zero_grad(&mut ps);
492 }
493 {
494 let mut ps = critic.parameters();
495 critic_opt.zero_grad(&mut ps);
496 }
497
498 // Forward
499 let logits_mb = actor.forward(&s_mb); // [B,A]
500 let new_logp_mb = log_prob_actions(&logits_mb, a_slice, end - start, action_dim); // [B,1]
501 let ratio = ratio_from_logps(&new_logp_mb, &oldlp_mb);
502 let ratio_clipped = clamp_ratio(&ratio, clip_eps);
503 let pg1 = ratio.mul_tensor(&adv_mb);
504 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
505 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
506 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
507 let actor_loss = actor_min.mul_scalar(-1.0).mean();
508
509 let v_pred = critic.forward(&s_mb);
510 let v_loss = v_pred
511 .sub_tensor(&ret_mb)
512 .pow_scalar(2.0)
513 .mean()
514 .mul_scalar(vf_coef);
515
516 // Entropy bonus from logits (categorical entropy) ≈ -sum p*logp
517 let probs_mb = logits_mb.softmax(1);
518 let logp_all = probs_mb.add_scalar(1e-8).log();
519 let ent = probs_mb
520 .mul_tensor(&logp_all)
521 .sum_dims(&[1], true)
522 .mul_scalar(-1.0)
523 .mean()
524 .mul_scalar(ent_coef);
525
526 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&ent);
527 loss.backward(None);
528
529 // Step actor
530 {
531 let params = actor.parameters();
532 let mut with_grads: Vec<&mut Tensor> = Vec::new();
533 for p in params {
534 if p.grad_owned().is_some() {
535 with_grads.push(p);
536 }
537 }
538 if !with_grads.is_empty() {
539 let _ = grad_global_norm(&mut with_grads);
540 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
541 actor_opt.step(&mut with_grads);
542 actor_opt.zero_grad(&mut with_grads);
543 }
544 }
545
546 // Step critic
547 {
548 let params = critic.parameters();
549 let mut with_grads: Vec<&mut Tensor> = Vec::new();
550 for p in params {
551 if p.grad_owned().is_some() {
552 with_grads.push(p);
553 }
554 }
555 if !with_grads.is_empty() {
556 let _ = grad_global_norm(&mut with_grads);
557 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
558 critic_opt.step(&mut with_grads);
559 critic_opt.zero_grad(&mut with_grads);
560 }
561 }
562
563 if e == 0 && mb == 0 {
564 println!(
565 "update@t={} | actor_loss={:.4} v_loss={:.4}",
566 t,
567 actor_loss.value(),
568 v_loss.value()
569 );
570 }
571
572 clear_all_graphs_known();
573 }
574 }
575 }
576
577 println!("=== PPO discrete training finished ===");
578 Ok(())
579}
Source§impl Tensor
impl Tensor
Sourcepub fn sqrt(&self) -> Tensor
pub fn sqrt(&self) -> Tensor
Element-wise square root
Computes the square root for each element: output[i] = sqrt(self[i])
Uses SIMD optimization when available for maximum performance, with automatic fallback to optimized scalar computation for non-SIMD hardware.
§Returns
A new tensor with the square root of each element
§Performance Characteristics
- SIMD Optimization: AVX2-optimized with 32-element blocks and 4x unrolling
- Scalar Fallback: 4x unrolled scalar implementation for non-SIMD hardware
- Cache-friendly: Linear memory access patterns
- Mathematical Accuracy: High-precision square root computation
- GradTrack Support: Full automatic differentiation with efficient gradient computation
§Implementation Details
Automatically selects between SIMD and scalar implementations based on hardware capabilities. SIMD implementation uses AVX2 vector square root operations for optimal performance. Scalar implementation uses 4x unrolling for better instruction-level parallelism.
§Examples
§Basic Square Root
use train_station::Tensor;
let a = Tensor::from_slice(&[1.0, 4.0, 9.0], vec![3]).unwrap();
let b = a.sqrt();
assert_eq!(b.shape().dims(), vec![3]);
assert_eq!(b.get(&[0]), 1.0); // sqrt(1.0) = 1.0
assert_eq!(b.get(&[1]), 2.0); // sqrt(4.0) = 2.0
assert_eq!(b.get(&[2]), 3.0); // sqrt(9.0) = 3.0
§Zero and Special Values
use train_station::Tensor;
let a = Tensor::from_slice(&[0.0, 1.0, 16.0], vec![3]).unwrap();
let b = a.sqrt();
assert_eq!(b.shape().dims(), vec![3]);
assert_eq!(b.get(&[0]), 0.0); // sqrt(0.0) = 0.0
assert_eq!(b.get(&[1]), 1.0); // sqrt(1.0) = 1.0
assert_eq!(b.get(&[2]), 4.0); // sqrt(16.0) = 4.0
§Note
Results are undefined for negative values (may produce NaN)
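If negative inputs are possible, one way to avoid NaN is to clamp them to zero with relu() before calling sqrt(). A minimal usage sketch (a guard pattern, not behavior of sqrt itself):
use train_station::Tensor;
// Clamp negatives to zero first so sqrt never sees a negative input
let a = Tensor::from_slice(&[-4.0, 0.0, 9.0], vec![3]).unwrap();
let b = a.relu().sqrt();
assert_eq!(b.get(&[0]), 0.0); // relu(-4.0) = 0.0, sqrt(0.0) = 0.0
assert_eq!(b.get(&[2]), 3.0); // sqrt(9.0) = 3.0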
Examples found in repository?
More examples
162fn demonstrate_memory_optimization() -> Result<(), Box<dyn std::error::Error>> {
163 println!("\n--- Memory Optimization ---");
164
165 // Create a large tensor for memory testing
166 let size = 10000;
167 let data: Vec<f32> = (0..size).map(|i| i as f32).collect();
168 let tensor = Tensor::from_slice(&data, vec![size])?;
169
170 println!("Processing tensor of size: {}", size);
171
172 // Pattern 1: Streaming processing with iterator chunks (process in blocks, collect with shape)
173 println!("\nPattern 1: Streaming Processing");
174 let chunk_size = 1000;
175 let start = Instant::now();
176 let flattened = tensor.view(vec![size as i32]);
177 let _streamed_result: Tensor = flattened
178 .chunks(chunk_size)
179 .map(|c| c.pow_scalar(2.0).sqrt())
180 .collect_shape(vec![size]);
181 let streamed_time = start.elapsed();
182
183 // Pattern 2: Full processing
184 let start = Instant::now();
185 let _full_result: Tensor = tensor
186 .iter_elements()
187 .map(|elem| elem.pow_scalar(2.0).sqrt())
188 .collect_shape(vec![size]);
189 let full_time = start.elapsed();
190
191 println!(" Streaming time: {:?}", streamed_time);
192 println!(" Full processing time: {:?}", full_time);
193 println!(
194 " Memory efficiency ratio: {:.2}x",
195 full_time.as_nanos() as f64 / streamed_time.as_nanos() as f64
196 );
197
198 // Pattern 3: Lazy evaluation with take
199 println!("\nPattern 2: Lazy Evaluation");
200 let start = Instant::now();
201 let lazy_result: Tensor = tensor
202 .iter_elements()
203 .take(1000) // Only process first 1000 elements
204 .map(|elem| elem.pow_scalar(2.0).sqrt())
205 .collect_shape(vec![1000]);
206 let lazy_time = start.elapsed();
207
208 println!(" Lazy processing (1000 elements): {:?}", lazy_time);
209 println!(" Lazy result size: {}", lazy_result.size());
210
211 // Pattern 4: Memory-efficient filtering
212 println!("\nPattern 3: Memory-Efficient Filtering");
213 let start = Instant::now();
214 let filtered_result: Tensor = tensor
215 .iter_elements()
216 .filter(|elem| elem.value() > size as f32 / 2.0) // Keep only large values
217 .map(|elem| elem.mul_scalar(2.0))
218 .collect();
219 let filtered_time = start.elapsed();
220
221 println!(" Filtered processing: {:?}", filtered_time);
222 println!(
223 " Filtered result size: {} (reduced from {})",
224 filtered_result.size(),
225 size
226 );
227
228 Ok(())
229}
230
231/// Demonstrate large-scale processing techniques
232///
233/// Shows how to efficiently process very large datasets using
234/// iterator patterns and optimization strategies.
235fn demonstrate_large_scale_processing() -> Result<(), Box<dyn std::error::Error>> {
236 println!("\n--- Large-Scale Processing ---");
237
238 // Simulate large dataset processing
239 let sizes = vec![10000, 50000, 100000];
240
241 for size in sizes {
242 println!("\nProcessing dataset of size: {}", size);
243
244 // Generate large dataset
245 let data: Vec<f32> = (0..size)
246 .map(|i| {
247 let x = i as f32 / size as f32;
248 x * x + 0.1 * (i % 10) as f32 // Quadratic with noise
249 })
250 .collect();
251
252 let tensor = Tensor::from_slice(&data, vec![size])?;
253
254 // Technique 1: Batch processing
255 let batch_size = 1000;
256 let start = Instant::now();
257
258 let mut batch_results = Vec::new();
259 for batch_start in (0..size).step_by(batch_size) {
260 let batch_end = (batch_start + batch_size).min(size);
261 let batch: Tensor = tensor
262 .iter_range(batch_start, batch_end)
263 .map(|elem| elem.pow_scalar(2.0).add_scalar(1.0))
264 .collect();
265 batch_results.push(batch);
266 }
267 let batch_time = start.elapsed();
268
269 // Technique 2: Parallel-like processing with stride
270 let start = Instant::now();
271 let stride = 4;
272 let strided_result: Tensor = tensor
273 .iter()
274 .enumerate()
275 .filter(|(i, _)| i % stride == 0)
276 .map(|(_, elem)| elem.pow_scalar(2.0).add_scalar(1.0))
277 .collect();
278 let strided_time = start.elapsed();
279
280 // Technique 3: Hierarchical processing
281 let start = Instant::now();
282 let coarse: Tensor = tensor
283 .iter()
284 .enumerate()
285 .filter(|(i, _)| i % 10 == 0) // Every 10th element
286 .map(|(_, elem)| elem.pow_scalar(2.0).add_scalar(1.0))
287 .collect();
288 let fine: Tensor = tensor
289 .iter()
290 .enumerate()
291 .filter(|(i, _)| i % 10 != 0) // Rest of elements
292 .map(|(_, elem)| elem.pow_scalar(1.5).add_scalar(0.5))
293 .collect();
294 let hierarchical_time = start.elapsed();
295
296 // Report performance
297 println!(" Batch processing: {:?}", batch_time);
298 println!(" Strided processing: {:?}", strided_time);
299 println!(" Hierarchical processing: {:?}", hierarchical_time);
300
301 // Memory usage analysis
302 let total_batches = size.div_ceil(batch_size);
303 println!(" Batch count: {}", total_batches);
304 println!(" Strided result size: {}", strided_result.size());
305 println!(
306 " Hierarchical: coarse={}, fine={}",
307 coarse.size(),
308 fine.size()
309 );
310 }
311
312 Ok(())
313}
314
315/// Demonstrate advanced optimization techniques
316///
317/// Shows sophisticated optimization strategies and techniques
318/// for maximizing performance in tensor iterator operations.
319fn demonstrate_optimization_techniques() -> Result<(), Box<dyn std::error::Error>> {
320 println!("\n--- Optimization Techniques ---");
321
322 let size = 50000;
323 let data: Vec<f32> = (0..size).map(|i| i as f32).collect();
324 let tensor = Tensor::from_slice(&data, vec![size])?;
325
326 println!("Optimizing processing for size: {}", size);
327
328 // Technique 1: Operation fusion
329 println!("\nTechnique 1: Operation Fusion");
330 let start = Instant::now();
331 let fused_result: Tensor = tensor
332 .iter()
333 .map(|elem| {
334 // Fuse multiple operations into single chain
335 elem.mul_scalar(2.0).add_scalar(1.0).pow_scalar(2.0).sqrt()
336 })
337 .collect();
338 let fused_time = start.elapsed();
339
340 // Technique 2: Conditional optimization
341 println!("\nTechnique 2: Conditional Optimization");
342 let start = Instant::now();
343 let conditional_result: Tensor = tensor
344 .iter()
345 .map(|elem| {
346 let val = elem.value();
347 if val < size as f32 / 2.0 {
348 elem.mul_scalar(2.0) // Simple operation for small values
349 } else {
350 elem.pow_scalar(2.0).sqrt() // Complex operation for large values
351 }
352 })
353 .collect();
354 let conditional_time = start.elapsed();
355
356 // Technique 3: Cache-friendly processing
357 println!("\nTechnique 3: Cache-Friendly Processing");
358 let start = Instant::now();
359 let cache_friendly_result: Tensor = tensor
360 .iter()
361 .take(1000) // Process in cache-friendly chunks
362 .map(|elem| elem.mul_scalar(2.0))
363 .collect();
364 let cache_friendly_time = start.elapsed();
365
366 // Technique 4: Memory pooling simulation
367 println!("\nTechnique 4: Memory Pooling Simulation");
368 let start = Instant::now();
369 let pooled_result: Tensor = tensor
370 .iter()
371 .enumerate()
372 .filter(|(i, _)| i % 100 == 0) // Process every 100th element
373 .map(|(_, elem)| elem.pow_scalar(2.0))
374 .collect();
375 let pooled_time = start.elapsed();
376
377 // Report optimization results
378 println!(" Fused operations: {:?}", fused_time);
379 println!(" Conditional optimization: {:?}", conditional_time);
380 println!(" Cache-friendly processing: {:?}", cache_friendly_time);
381 println!(" Memory pooling simulation: {:?}", pooled_time);
382
383 // Performance analysis
384 let fastest = fused_time
385 .min(conditional_time)
386 .min(cache_friendly_time)
387 .min(pooled_time);
388 println!(" Fastest technique: {:?}", fastest);
389
390 // Memory efficiency analysis
391 println!(" Fused result size: {}", fused_result.size());
392 println!(" Conditional result size: {}", conditional_result.size());
393 println!(
394 " Cache-friendly result size: {}",
395 cache_friendly_result.size()
396 );
397 println!(" Pooled result size: {}", pooled_result.size());
398
399 // Technique 5: Gradient optimization
400 println!("\nTechnique 5: Gradient Optimization");
401 let grad_tensor = tensor.with_requires_grad();
402 let start = Instant::now();
403
404 let grad_result: Tensor = grad_tensor
405 .iter()
406 .map(|elem| elem.pow_scalar(2.0).add_scalar(1.0))
407 .collect();
408
409 let mut loss = grad_result.sum();
410 loss.backward(None);
411 let grad_time = start.elapsed();
412
413 println!(" Gradient computation: {:?}", grad_time);
414 println!(
415 " Gradient tracking enabled: {}",
416 grad_result.requires_grad()
417 );
418
419 Ok(())
420}
174fn demonstrate_conditional_processing() -> Result<(), Box<dyn std::error::Error>> {
175 println!("\n--- Conditional Processing ---");
176
177 // Create data with mixed characteristics
178 let data = vec![1.0, -2.0, 3.0, -4.0, 5.0, -6.0, 7.0, -8.0, 9.0, -10.0];
179 let tensor = Tensor::from_slice(&data, vec![10])?;
180 println!("Input data: {:?}", tensor.data());
181
182 // Conditional transformation based on sign
183 println!("\nConditional transformation (positive/negative handling):");
184 let processed: Tensor = tensor
185 .iter()
186 .map(|elem| {
187 let val = elem.value();
188 if val > 0.0 {
189 elem.pow_scalar(2.0) // Square positive values
190 } else {
191 elem.mul_scalar(-1.0).sqrt() // Square root of absolute negative values
192 }
193 })
194 .collect();
195 println!(" Processed: {:?}", processed.data());
196
197 // Adaptive filtering based on local statistics
198 println!("\nAdaptive filtering (remove values > 2 std from local mean):");
199 let window_size = 3;
200 let adaptive_filtered: Tensor = tensor
201 .iter()
202 .enumerate()
203 .filter(|(i, elem)| {
204 let start = i.saturating_sub(window_size / 2);
205 let end = (i + window_size / 2 + 1).min(tensor.size());
206
207 // Calculate local mean and std
208 let local_values: Vec<f32> = (start..end)
209 .map(|j| tensor.element_view(j).value())
210 .collect();
211
212 let local_mean = local_values.iter().sum::<f32>() / local_values.len() as f32;
213 let local_variance = local_values
214 .iter()
215 .map(|v| (v - local_mean).powi(2))
216 .sum::<f32>()
217 / local_values.len() as f32;
218 let local_std = local_variance.sqrt();
219
220 let threshold = local_mean + 2.0 * local_std;
221 elem.value() <= threshold
222 })
223 .map(|(_, elem)| elem)
224 .collect();
225 println!(" Adaptive filtered: {:?}", adaptive_filtered.data());
226
227 // Multi-condition processing
228 println!("\nMulti-condition processing:");
229 let multi_processed: Tensor = tensor
230 .iter()
231 .map(|elem| {
232 let val = elem.value();
233 match () {
234 _ if val > 5.0 => elem.mul_scalar(2.0), // Double large values
235 _ if val < -5.0 => elem.div_scalar(2.0), // Halve small values
236 _ if val.abs() < 2.0 => elem.add_scalar(1.0), // Add 1 to small values
237 _ => elem.clone(), // Keep others unchanged
238 }
239 })
240 .collect();
241 println!(" Multi-condition: {:?}", multi_processed.data());
242
243 Ok(())
244}
Source§impl Tensor
impl Tensor
Sourcepub fn sub_tensor(&self, other: &Tensor) -> Tensor
pub fn sub_tensor(&self, other: &Tensor) -> Tensor
Element-wise subtraction with another tensor with broadcasting support
Performs element-wise subtraction with automatic broadcasting: output[i] = self[i] - other[i]
Broadcasting enables subtraction between tensors of different but compatible shapes. Compatible shapes follow NumPy broadcasting rules:
- Dimensions are aligned from the rightmost dimension
- Dimensions are compatible if they are equal, or one of them is 1
- Missing dimensions are treated as 1
§Arguments
other - Tensor to subtract. Shapes must be broadcast-compatible.
§Returns
A new tensor containing the element-wise difference with broadcast result shape
§Performance Characteristics
- Fast Path: Optimized for identical shapes to avoid broadcasting overhead
- SIMD Optimization: AVX2-optimized with 32-element blocks and 4x unrolling
- Broadcasting: Efficient broadcasting for compatible shapes
- Cache-friendly: Linear memory access patterns
- GradTrack Support: Full automatic differentiation with efficient gradient computation
§Implementation Details
Uses a fast path for identical shapes to avoid broadcasting overhead. For different shapes, performs broadcasting followed by optimized element-wise subtraction. Automatically selects between SIMD and scalar implementations based on hardware capabilities.
§Examples
§Same Shape Subtraction
use train_station::Tensor;
let a = Tensor::from_slice(&[5.0, 7.0, 9.0], vec![3]).unwrap();
let b = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let c = a.sub_tensor(&b);
assert_eq!(c.shape().dims(), vec![3]);
assert_eq!(c.get(&[0]), 4.0); // 5.0 - 1.0
assert_eq!(c.get(&[1]), 5.0); // 7.0 - 2.0
assert_eq!(c.get(&[2]), 6.0); // 9.0 - 3.0
§Broadcasting Subtraction
use train_station::Tensor;
let a = Tensor::from_slice(&[5.0, 10.0], vec![2, 1]).unwrap();
let b = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![1, 3]).unwrap();
let c = a.sub_tensor(&b);
assert_eq!(c.shape().dims(), vec![2, 3]);
// Result: [[4.0, 3.0, 2.0], [9.0, 8.0, 7.0]]
assert_eq!(c.get(&[0, 0]), 4.0); // 5.0 - 1.0
assert_eq!(c.get(&[0, 1]), 3.0); // 5.0 - 2.0
assert_eq!(c.get(&[1, 0]), 9.0); // 10.0 - 1.0
§Scalar Subtraction
use train_station::Tensor;
let a = Tensor::ones(vec![2, 3]);
let b = Tensor::from_slice(&[0.5], vec![1]).unwrap();
let c = a.sub_tensor(&b);
assert_eq!(c.shape().dims(), vec![2, 3]);
assert_eq!(c.get(&[0, 0]), 0.5); // 1.0 - 0.5
§Panics
Panics if tensor shapes are not broadcast-compatible
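An additional broadcasting sketch, relying on sum_dims and div_scalar as they are used elsewhere on this page: centering each column of a [2, 3] tensor by subtracting its [1, 3] row of column means.
use train_station::Tensor;
// Center each column of a [2, 3] tensor by subtracting its column mean
let x = Tensor::from_slice(&[1.0, 2.0, 3.0, 3.0, 4.0, 5.0], vec![2, 3]).unwrap();
let col_means = x.sum_dims(&[0], true).div_scalar(2.0); // shape [1, 3]
let centered = x.sub_tensor(&col_means);                // broadcasts to [2, 3]
assert_eq!(centered.get(&[0, 0]), -1.0); // 1.0 - 2.0
assert_eq!(centered.get(&[1, 0]), 1.0);  // 3.0 - 2.0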
Examples found in repository?
42fn mse(pred: &Tensor, target: &Tensor) -> Tensor {
43 pred.sub_tensor(target).pow_scalar(2.0).mean()
44}
45
46fn rmse(pred: &Tensor, target: &Tensor) -> f32 {
47 mse(pred, target).sqrt().value()
48}
49
50fn r2_score(pred: &Tensor, target: &Tensor) -> f32 {
51 // R^2 = 1 - SS_res / SS_tot
52 let y = target;
53 let y_mean = y.mean();
54 let ss_res = pred.sub_tensor(y).pow_scalar(2.0).sum();
55 let ss_tot = y.sub_tensor(&y_mean).pow_scalar(2.0).sum();
56 let ss_res_v = ss_res.value();
57 let ss_tot_v = ss_tot.value().max(1e-12); // avoid divide by zero
58 1.0 - (ss_res_v / ss_tot_v)
59}
More examples
44fn cross_entropy_logits(
45 logits: &Tensor,
46 labels: &[usize],
47 batch: usize,
48 _num_classes: usize,
49) -> Tensor {
50 // log_softmax = logits - logsumexp(logits, dim=1)
51 let max_logits = logits.max_dims(&[1], true);
52 let shifted = logits.sub_tensor(&max_logits);
53 let exp = shifted.exp();
54 let sum_exp = exp.sum_dims(&[1], true);
55 let log_sum_exp = sum_exp.log();
56 let log_softmax = shifted.sub_tensor(&log_sum_exp);
57 let ll = log_softmax.gather(1, labels, &[batch, 1]); // selected log-probs
58 ll.mul_scalar(-1.0).mean()
59}
273fn log_prob_actions(
274 logits: &Tensor,
275 actions: &[usize],
276 batch: usize,
277 _action_dim: usize,
278) -> Tensor {
279 let max_logits = logits.max_dims(&[1], true); // [B,1]
280 let shifted = logits.sub_tensor(&max_logits);
281 let exp = shifted.exp();
282 let sum_exp = exp.sum_dims(&[1], true); // [B,1]
283 let log_sum_exp = sum_exp.log(); // [B,1]
284 let log_softmax = shifted.sub_tensor(&log_sum_exp); // [B,A]
285 // gather selected action log-probs
286 log_softmax.gather(1, actions, &[batch, 1])
287}
288
289// probability ratio = exp(new_logp - old_logp)
290fn ratio_from_logps(new_logp: &Tensor, old_logp: &Tensor) -> Tensor {
291 new_logp.sub_tensor(old_logp).exp()
292}
293
294// Clamp ratio to [1-clip, 1+clip] using ReLU-based clamp (no custom ops)
295fn clamp_ratio(ratio: &Tensor, clip_eps: f32) -> Tensor {
296 let b = ratio.shape().dims()[0];
297 let low = Tensor::from_slice(&vec![1.0 - clip_eps; b], vec![b, 1]).unwrap();
298 let high = Tensor::from_slice(&vec![1.0 + clip_eps; b], vec![b, 1]).unwrap();
299 let ge_low = ratio.sub_tensor(&low).relu().add_tensor(&low);
300 high.sub_tensor(&ge_low.sub_tensor(&high).relu())
301}
302
303fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
304 let mut total_sq = 0.0f32;
305 for p in parameters.iter_mut() {
306 if let Some(g) = p.grad_owned() {
307 for &v in g.data() {
308 total_sq += v * v;
309 }
310 }
311 }
312 total_sq.sqrt()
313}
314
315// -------------------------------
316// Main
317// -------------------------------
318
319pub fn main() -> Result<(), Box<dyn std::error::Error>> {
320 println!("=== PPO Discrete Example (YardEnv) ===");
321
322 let state_dim = 3usize;
323 let action_dim = 3usize;
324 let total_steps = std::env::var("PPOD_STEPS")
325 .ok()
326 .and_then(|v| v.parse::<usize>().ok())
327 .unwrap_or(3500usize);
328 let horizon = 128usize;
329 let epochs = 4usize;
330 let mini_batch_size = 64usize;
331 let gamma = 0.99f32;
332 let lam = 0.95f32;
333 let clip_eps = 0.2f32;
334 let vf_coef = 0.5f32;
335 let ent_coef = 0.0f32;
336 let max_grad_norm = 1.0f32;
337
338 let mut actor = Actor::new(state_dim, action_dim, Some(111));
339 let mut critic = Critic::new(state_dim, Some(222));
340 let mut actor_opt = Adam::with_learning_rate(3e-4);
341 for p in actor.parameters() {
342 actor_opt.add_parameter(p);
343 }
344 let mut critic_opt = Adam::with_learning_rate(3e-4);
345 for p in critic.parameters() {
346 critic_opt.add_parameter(p);
347 }
348
349 let mut env = YardEnv::new(1234);
350 let mut rng = SmallRng::new(98765);
351 let mut state = env.reset();
352 let mut episode_return = 0.0f32;
353 let mut episode = 0usize;
354 let mut ema_return: Option<f32> = None;
355 let ema_alpha = 0.05f32;
356 let mut best_return = f32::NEG_INFINITY;
357
358 let mut t = 0usize;
359 while t < total_steps {
360 let mut batch = RolloutBatch::new(horizon, state_dim);
361 for _ in 0..horizon {
362 // Actor logits and categorical sampling
363 let logits = actor.forward(&state); // [1, A]
364 let probs = logits.softmax(1); // [1, A]
365 // sample action from probs (CPU sampling)
366 let p = probs.data();
367 let (p0, p1, _p2) = (p[0], p[1], p[2]);
368 let u = rng.next_f32();
369 let a_idx = if u < p0 {
370 0
371 } else if u < p0 + p1 {
372 1
373 } else {
374 2
375 };
376
377 let old_logp = {
378 let _ng = NoGradTrack::new();
379 let lp = log_prob_actions(&logits, &[a_idx], 1, action_dim);
380 lp.data()[0]
381 };
382
383 // Step env
384 let (next_state, reward, done) = env.step(a_idx);
385 episode_return += reward;
386
387 // Critic value
388 let value_t = critic.forward(&state);
389 let value_v = value_t.data()[0];
390
391 batch.push(
392 state.data(),
393 a_idx,
394 old_logp,
395 reward,
396 if done { 1.0 } else { 0.0 },
397 value_v,
398 next_state.data(),
399 );
400
401 state = if done {
402 let st = env.reset();
403 ema_return = Some(match ema_return {
404 None => episode_return,
405 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
406 });
407 if episode_return > best_return {
408 best_return = episode_return;
409 }
410 println!(
411 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
412 t,
413 episode,
414 episode_return,
415 ema_return.unwrap_or(episode_return),
416 best_return
417 );
418 episode_return = 0.0;
419 episode += 1;
420 st
421 } else {
422 next_state
423 };
424
425 t += 1;
426 if t >= total_steps {
427 break;
428 }
429 }
430
431 // Bootstrap values for GAE
432 let next_values: Vec<f32> = {
433 let mut out = Vec::with_capacity(batch.len());
434 for i in 0..batch.len() {
435 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
436 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
437 out.push(critic.forward(&s2_t).data()[0]);
438 }
439 out
440 };
441
442 let mut returns = vec![0.0f32; batch.len()];
443 let mut adv = vec![0.0f32; batch.len()];
444 compute_gae(
445 &mut returns,
446 &mut adv,
447 &batch.rewards,
448 &batch.dones,
449 &batch.values,
450 &next_values,
451 gamma,
452 lam,
453 );
454 normalize_in_place(&mut adv, 1e-8);
455
456 // Tensors for training
457 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
458 let actions_vec = batch.actions.clone();
459 let old_logp_t = Tensor::from_slice(&batch.old_logps, vec![batch.len(), 1]).unwrap();
460 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
461 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
462
463 // PPO epochs
464 let num_minibatches = batch.len().div_ceil(mini_batch_size);
465 for e in 0..epochs {
466 for mb in 0..num_minibatches {
467 let start = mb * mini_batch_size;
468 let end = (start + mini_batch_size).min(batch.len());
469 if start >= end {
470 break;
471 }
472
473 // Views
474 let s_mb = states_t
475 .slice_view(start * state_dim, 1, (end - start) * state_dim)
476 .reshape(vec![(end - start) as i32, state_dim as i32]);
477 let oldlp_mb = old_logp_t
478 .slice_view(start, 1, end - start)
479 .reshape(vec![(end - start) as i32, 1]);
480 let ret_mb = returns_t
481 .slice_view(start, 1, end - start)
482 .reshape(vec![(end - start) as i32, 1]);
483 let adv_mb = adv_t
484 .slice_view(start, 1, end - start)
485 .reshape(vec![(end - start) as i32, 1]);
486 let a_slice = &actions_vec[start..end];
487
488 // Zero grads
489 {
490 let mut ps = actor.parameters();
491 actor_opt.zero_grad(&mut ps);
492 }
493 {
494 let mut ps = critic.parameters();
495 critic_opt.zero_grad(&mut ps);
496 }
497
498 // Forward
499 let logits_mb = actor.forward(&s_mb); // [B,A]
500 let new_logp_mb = log_prob_actions(&logits_mb, a_slice, end - start, action_dim); // [B,1]
501 let ratio = ratio_from_logps(&new_logp_mb, &oldlp_mb);
502 let ratio_clipped = clamp_ratio(&ratio, clip_eps);
503 let pg1 = ratio.mul_tensor(&adv_mb);
504 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
505 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
506 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
507 let actor_loss = actor_min.mul_scalar(-1.0).mean();
508
509 let v_pred = critic.forward(&s_mb);
510 let v_loss = v_pred
511 .sub_tensor(&ret_mb)
512 .pow_scalar(2.0)
513 .mean()
514 .mul_scalar(vf_coef);
515
516 // Entropy bonus from logits (categorical entropy) ≈ -sum p*logp
517 let probs_mb = logits_mb.softmax(1);
518 let logp_all = probs_mb.add_scalar(1e-8).log();
519 let ent = probs_mb
520 .mul_tensor(&logp_all)
521 .sum_dims(&[1], true)
522 .mul_scalar(-1.0)
523 .mean()
524 .mul_scalar(ent_coef);
525
526 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&ent);
527 loss.backward(None);
528
529 // Step actor
530 {
531 let params = actor.parameters();
532 let mut with_grads: Vec<&mut Tensor> = Vec::new();
533 for p in params {
534 if p.grad_owned().is_some() {
535 with_grads.push(p);
536 }
537 }
538 if !with_grads.is_empty() {
539 let _ = grad_global_norm(&mut with_grads);
540 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
541 actor_opt.step(&mut with_grads);
542 actor_opt.zero_grad(&mut with_grads);
543 }
544 }
545
546 // Step critic
547 {
548 let params = critic.parameters();
549 let mut with_grads: Vec<&mut Tensor> = Vec::new();
550 for p in params {
551 if p.grad_owned().is_some() {
552 with_grads.push(p);
553 }
554 }
555 if !with_grads.is_empty() {
556 let _ = grad_global_norm(&mut with_grads);
557 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
558 critic_opt.step(&mut with_grads);
559 critic_opt.zero_grad(&mut with_grads);
560 }
561 }
562
563 if e == 0 && mb == 0 {
564 println!(
565 "update@t={} | actor_loss={:.4} v_loss={:.4}",
566 t,
567 actor_loss.value(),
568 v_loss.value()
569 );
570 }
571
572 clear_all_graphs_known();
573 }
574 }
575 }
576
577 println!("=== PPO discrete training finished ===");
578 Ok(())
579}
234fn gaussian_log_prob(action: &Tensor, mean: &Tensor, log_std: &Tensor) -> Tensor {
235 // All tensors shaped [B, A] (log_std is broadcastable)
236 let std = log_std.exp();
237 let var = std.pow_scalar(2.0);
238 let log_scale = log_std;
239 let diff = action.sub_tensor(mean);
240 let log_prob = diff
241 .pow_scalar(2.0)
242 .div_tensor(&var)
243            .add_scalar((2.0 * std::f32::consts::PI).ln()) // ln(2*pi) term of the Gaussian log-density
244 .add_tensor(&log_scale.mul_scalar(2.0))
245 .mul_scalar(0.5)
246 .mul_scalar(-1.0);
247 // Sum across action dim (dim=1) -> [B,1]
248 log_prob.sum_dims(&[1], true)
249}
250
251#[allow(clippy::too_many_arguments)]
252fn compute_gae(
253 returns_out: &mut [f32],
254 adv_out: &mut [f32],
255 rewards: &[f32],
256 dones: &[f32],
257 values: &[f32],
258 next_values: &[f32],
259 gamma: f32,
260 lam: f32,
261) {
262 let n = rewards.len();
263 let mut gae = 0.0f32;
264 for t in (0..n).rev() {
265 let not_done = 1.0 - dones[t];
266 let delta = rewards[t] + gamma * next_values[t] * not_done - values[t];
267 gae = delta + gamma * lam * not_done * gae;
268 adv_out[t] = gae;
269 returns_out[t] = gae + values[t];
270 }
271}
272
273fn normalize_in_place(x: &mut [f32], eps: f32) {
274 let n = x.len() as f32;
275 if n <= 1.0 {
276 return;
277 }
278 let mean = x.iter().copied().sum::<f32>() / n;
279 let var = x
280 .iter()
281 .map(|v| {
282 let d = v - mean;
283 d * d
284 })
285 .sum::<f32>()
286 / n;
287 let std = (var + eps).sqrt();
288 for v in x.iter_mut() {
289 *v = (*v - mean) / std;
290 }
291}
292
293fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
294 let mut total_sq = 0.0f32;
295 for p in parameters.iter() {
296 if let Some(g) = p.grad_owned() {
297 for &v in g.data() {
298 total_sq += v * v;
299 }
300 }
301 }
302 let norm = total_sq.sqrt();
303 if norm > max_norm {
304 let scale = max_norm / (norm + eps);
305 for p in parameters.iter_mut() {
306 if let Some(g) = p.grad_owned() {
307 p.set_grad(g.mul_scalar(scale));
308 }
309 }
310 }
311}
312
313fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
314 let mut total_sq = 0.0f32;
315 for p in parameters.iter_mut() {
316 if let Some(g) = p.grad_owned() {
317 for &v in g.data() {
318 total_sq += v * v;
319 }
320 }
321 }
322 total_sq.sqrt()
323}
324
325// -------------------------------
326// Main
327// -------------------------------
328
329pub fn main() -> Result<(), Box<dyn std::error::Error>> {
330 println!("=== PPO Continuous Example (YardEnv) ===");
331
332 let state_dim = 3usize;
333 let action_dim = 1usize;
334
335 // Hparams
336 let total_steps = std::env::var("PPO_STEPS")
337 .ok()
338 .and_then(|v| v.parse::<usize>().ok())
339 .unwrap_or(4000usize);
340 let horizon = 128usize; // rollout length per update
341 let epochs = 4usize; // PPO epochs per update
342 let mini_batch_size = 64usize; // minibatch from horizon
343 let gamma = 0.99f32;
344 let lam = 0.95f32; // GAE lambda
345 let clip_eps = 0.2f32;
346 let vf_coef = 0.5f32;
347 let ent_coef = 0.0f32;
348 let max_grad_norm = 1.0f32;
349
350 // Models
351 let mut actor = Actor::new(state_dim, action_dim, Some(101));
352 let mut critic = Critic::new(state_dim, Some(202));
353
354 // Opts
355 let mut actor_opt = Adam::with_learning_rate(3e-4);
356 for p in actor.parameters() {
357 actor_opt.add_parameter(p);
358 }
359 let mut critic_opt = Adam::with_learning_rate(3e-4);
360 for p in critic.parameters() {
361 critic_opt.add_parameter(p);
362 }
363
364 // Env and RNG
365 let mut env = YardEnv::new(42);
366 let mut rng = SmallRng::new(999);
367 let mut state = env.reset();
368
369 // Metrics
370 let mut episode_return = 0.0f32;
371 let mut episode = 0usize;
372 let mut ema_return: Option<f32> = None;
373 let ema_alpha = 0.05f32;
374 let mut best_return = f32::NEG_INFINITY;
375
376 let mut t = 0usize;
377 while t < total_steps {
378 // Collect a rollout
379 let mut batch = RolloutBatch::new(horizon, state_dim);
380 for _ in 0..horizon {
381 // Policy forward (detached sampling to not blow graph; we use stored log_probs)
382 let (mean, log_std_row) = actor.forward(&state);
383 let mean_v = mean.data()[0];
384 let log_std_v = log_std_row.data()[0];
385 let std_v = log_std_v.exp();
386 let noise = rng.normal();
387 let action_v = (mean_v + std_v * noise).clamp(-1.0, 1.0);
388
389 // Build action tensor [1, A] for log_prob calculation with autograd
390 let action_t = Tensor::from_slice(&[action_v], vec![1, action_dim]).unwrap();
391 let log_prob_t = gaussian_log_prob(&action_t, &mean, &log_std_row);
392 let log_prob_v = log_prob_t.data()[0];
393
394 // Step env
395 let (next_state, reward, done) = env.step(action_v);
396 episode_return += reward;
397
398 // Value
399 let value_t = critic.forward(&state);
400 let value_v = value_t.data()[0];
401
402 // Push
403 batch.push(
404 state.data(),
405 action_v,
406 log_prob_v,
407 reward,
408 if done { 1.0 } else { 0.0 },
409 value_v,
410 next_state.data(),
411 );
412
413 // Reset
414 state = if done {
415 let st = env.reset();
416 ema_return = Some(match ema_return {
417 None => episode_return,
418 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
419 });
420 if episode_return > best_return {
421 best_return = episode_return;
422 }
423 println!(
424 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
425 t,
426 episode,
427 episode_return,
428 ema_return.unwrap_or(episode_return),
429 best_return
430 );
431 episode_return = 0.0;
432 episode += 1;
433 st
434 } else {
435 next_state
436 };
437
438 t += 1;
439 if t >= total_steps {
440 break;
441 }
442 }
443
444 // Bootstrap next values for GAE
445 let next_values: Vec<f32> = {
446 let mut out = Vec::with_capacity(batch.len());
447 for i in 0..batch.len() {
448 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
449 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
450 let v2 = critic.forward(&s2_t).data()[0];
451 out.push(v2);
452 }
453 out
454 };
455
456 // Compute returns and advantages
457 let mut returns = vec![0.0f32; batch.len()];
458 let mut adv = vec![0.0f32; batch.len()];
459 compute_gae(
460 &mut returns,
461 &mut adv,
462 &batch.rewards,
463 &batch.dones,
464 &batch.values,
465 &next_values,
466 gamma,
467 lam,
468 );
469 normalize_in_place(&mut adv, 1e-8);
470
471 // Prepare tensors for training
472 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
473 let actions_t = Tensor::from_slice(&batch.actions, vec![batch.len(), action_dim]).unwrap();
474 let old_logp_t = Tensor::from_slice(&batch.log_probs, vec![batch.len(), 1]).unwrap();
475 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
476 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
477
478 // PPO epochs over the rollout
479 let num_minibatches = batch.len().div_ceil(mini_batch_size);
480 for e in 0..epochs {
481 for mb in 0..num_minibatches {
482 let start = mb * mini_batch_size;
483 let end = (start + mini_batch_size).min(batch.len());
484 if start >= end {
485 break;
486 }
487
488 // Slice views
489 let s_mb = states_t.slice_view(start * state_dim, 1, (end - start) * state_dim);
490 let s_mb = s_mb.reshape(vec![(end - start) as i32, state_dim as i32]);
491 let a_mb = actions_t
492 .slice_view(start * action_dim, 1, (end - start) * action_dim)
493 .reshape(vec![(end - start) as i32, action_dim as i32]);
494 let oldlp_mb = old_logp_t
495 .slice_view(start, 1, end - start)
496 .reshape(vec![(end - start) as i32, 1]);
497 let ret_mb = returns_t
498 .slice_view(start, 1, end - start)
499 .reshape(vec![(end - start) as i32, 1]);
500 let adv_mb = adv_t
501 .slice_view(start, 1, end - start)
502 .reshape(vec![(end - start) as i32, 1]);
503
504 // Zero grads
505 {
506 let mut ps = actor.parameters();
507 actor_opt.zero_grad(&mut ps);
508 }
509 {
510 let mut ps = critic.parameters();
511 critic_opt.zero_grad(&mut ps);
512 }
513
514 // Forward actor and critic
515 let (mean_mb, log_std_row) = actor.forward(&s_mb);
516 let logp_mb = gaussian_log_prob(&a_mb, &mean_mb, &log_std_row);
517 let ratio = logp_mb.sub_tensor(&oldlp_mb).exp(); // exp(new-old)
518 let clip_low =
519 Tensor::from_slice(&vec![1.0 - clip_eps; end - start], vec![end - start, 1])
520 .unwrap();
521 let clip_high =
522 Tensor::from_slice(&vec![1.0 + clip_eps; end - start], vec![end - start, 1])
523 .unwrap();
524 // ratio_clipped = min(max(ratio, low), high) using ReLU identities
525 let ratio_ge_low = ratio.sub_tensor(&clip_low).relu().add_tensor(&clip_low);
526 let ratio_clipped =
527 clip_high.sub_tensor(&ratio_ge_low.sub_tensor(&clip_high).relu());
528 let pg1 = ratio.mul_tensor(&adv_mb);
529 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
530 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
531 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
532 let actor_loss = actor_min.mul_scalar(-1.0).mean();
533
534 let v_pred = critic.forward(&s_mb);
535 let v_loss = v_pred
536 .sub_tensor(&ret_mb)
537 .pow_scalar(2.0)
538 .mean()
539 .mul_scalar(vf_coef);
540
541 // Entropy (approx Gaussian entropy per action)
542 let entropy = log_std_row
543 .add_scalar(0.5 * (2.0 * std::f32::consts::PI * std::f32::consts::E).ln())
544 .sum_dims(&[1], true)
545 .mean()
546 .mul_scalar(ent_coef);
547
548 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&entropy);
549 loss.backward(None);
550
551 // Step actor
552 {
553 let params = actor.parameters();
554 let mut with_grads: Vec<&mut Tensor> = Vec::new();
555 for p in params {
556 if p.grad_owned().is_some() {
557 with_grads.push(p);
558 }
559 }
560 if !with_grads.is_empty() {
561 let _ = grad_global_norm(&mut with_grads);
562 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
563 actor_opt.step(&mut with_grads);
564 actor_opt.zero_grad(&mut with_grads);
565 }
566 }
567
568 // Step critic
569 {
570 let params = critic.parameters();
571 let mut with_grads: Vec<&mut Tensor> = Vec::new();
572 for p in params {
573 if p.grad_owned().is_some() {
574 with_grads.push(p);
575 }
576 }
577 if !with_grads.is_empty() {
578 let _ = grad_global_norm(&mut with_grads);
579 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
580 critic_opt.step(&mut with_grads);
581 critic_opt.zero_grad(&mut with_grads);
582 }
583 }
584
585 // Occasionally log
586 if e == 0 && mb == 0 {
587 println!(
588 "update@t={} | actor_loss={:.4} v_loss={:.4}",
589 t,
590 actor_loss.value(),
591 v_loss.value()
592 );
593 }
594
595 clear_all_graphs_known();
596 }
597 }
598 }
599
600 println!("=== PPO training finished ===");
601 Ok(())
602}
89fn demonstrate_basic_operations() {
90 println!("\n--- Basic Operations ---");
91
92 let a = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
93 let b = Tensor::from_slice(&[5.0, 6.0, 7.0, 8.0], vec![2, 2]).unwrap();
94
95 // Addition
96 let sum = a.add_tensor(&b);
97 println!("A + B: {:?}", sum.data());
98
99 // Subtraction
100 let diff = a.sub_tensor(&b);
101 println!("A - B: {:?}", diff.data());
102
103 // Multiplication
104 let product = a.mul_tensor(&b);
105 println!("A * B: {:?}", product.data());
106
107 // Division
108 let quotient = a.div_tensor(&b);
109 println!("A / B: {:?}", quotient.data());
110
111 // Scalar operations
112 let scalar_add = a.add_scalar(5.0);
113 println!("A + 5.0: {:?}", scalar_add.data());
114
115 let scalar_mul = a.mul_scalar(2.0);
116 println!("A * 2.0: {:?}", scalar_mul.data());
117}
Sourcepub fn sub_scalar(&self, scalar: f32) -> Tensor
pub fn sub_scalar(&self, scalar: f32) -> Tensor
Element-wise subtraction of a scalar from this tensor
Performs element-wise subtraction of a scalar value: output[i] = self[i] - scalar
§Arguments
scalar - The scalar value to subtract from each element
§Returns
A new tensor with the scalar subtracted from each element
§Performance Characteristics
- SIMD Optimization: AVX2-optimized with 32-element blocks and 4x unrolling
- Scalar Fallback: 4x unrolled scalar implementation for non-SIMD hardware
- Cache-friendly: Linear memory access patterns
- Mathematical Accuracy: High-precision subtraction computation
- GradTrack Support: Full automatic differentiation with efficient gradient computation
§Examples
§Basic Scalar Subtraction
use train_station::Tensor;
let a = Tensor::from_slice(&[5.0, 7.0, 9.0], vec![3]).unwrap();
let b = a.sub_scalar(2.0);
assert_eq!(b.shape().dims(), vec![3]);
assert_eq!(b.get(&[0]), 3.0); // 5.0 - 2.0
assert_eq!(b.get(&[1]), 5.0); // 7.0 - 2.0
assert_eq!(b.get(&[2]), 7.0); // 9.0 - 2.0
§Negative Scalar Subtraction
use train_station::Tensor;
let a = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let b = a.sub_scalar(-2.0); // Subtracting negative = adding
assert_eq!(b.shape().dims(), vec![3]);
assert_eq!(b.get(&[0]), 3.0); // 1.0 - (-2.0) = 3.0
assert_eq!(b.get(&[1]), 4.0); // 2.0 - (-2.0) = 4.0
assert_eq!(b.get(&[2]), 5.0); // 3.0 - (-2.0) = 5.0
Examples found in repository?
More examples
87fn demonstrate_data_pipeline() -> Result<(), Box<dyn std::error::Error>> {
88 println!("\n--- Data Processing Pipeline ---");
89
90 // Simulate raw sensor data with noise
91 let raw_data: Vec<f32> = (0..20)
92 .map(|i| {
93 let base = i as f32 * 0.5;
94 let noise = (i % 3) as f32 * 0.1;
95 base + noise
96 })
97 .collect();
98
99 let tensor = Tensor::from_slice(&raw_data, vec![20])?;
100 println!("Raw sensor data: {:?}", tensor.data());
101
102 // Multi-stage processing pipeline
103 println!("\nProcessing pipeline:");
104 println!("1. Normalize data (z-score)");
105 println!("2. Apply smoothing filter");
106 println!("3. Detect outliers");
107 println!("4. Apply feature scaling");
108
109 // Stage 1: Normalization
110 let mean = tensor.mean().value();
111 let std = tensor.std().value();
112 let normalized: Tensor = tensor
113 .iter()
114 .map(|elem| elem.sub_scalar(mean).div_scalar(std))
115 .collect();
116 println!(
117 " Normalized (mean={:.3}, std={:.3}): {:?}",
118 mean,
119 std,
120 normalized.data()
121 );
122
123 // Stage 2: Smoothing (simple moving average)
124 let smoothed: Tensor = normalized
125 .iter()
126 .enumerate()
127 .map(|(i, elem)| {
128 if i == 0 || i == normalized.size() - 1 {
129 elem.clone()
130 } else {
131 // Simple 3-point average
132 let prev = normalized.element_view(i - 1);
133 let next = normalized.element_view(i + 1);
134 elem.add_tensor(&prev).add_tensor(&next).div_scalar(3.0)
135 }
136 })
137 .collect();
138 println!(" Smoothed: {:?}", smoothed.data());
139
140 // Stage 3: Outlier detection and removal
141 let outlier_threshold = 2.0;
142 let cleaned: Tensor = smoothed
143 .iter()
144 .filter(|elem| elem.value().abs() < outlier_threshold)
145 .collect();
146 println!(
147 " Outliers removed (threshold={}): {:?}",
148 outlier_threshold,
149 cleaned.data()
150 );
151
152 // Stage 4: Feature scaling to [0, 1] range
153 let min_val = cleaned
154 .iter()
155 .map(|e| e.value())
156 .fold(f32::INFINITY, f32::min);
157 let max_val = cleaned
158 .iter()
159 .map(|e| e.value())
160 .fold(f32::NEG_INFINITY, f32::max);
161 let scaled: Tensor = cleaned
162 .iter()
163 .map(|elem| elem.sub_scalar(min_val).div_scalar(max_val - min_val))
164 .collect();
165 println!(" Scaled to [0,1]: {:?}", scaled.data());
166
167 Ok(())
168}
169
170/// Demonstrate conditional processing patterns
171///
172/// Shows how to implement dynamic filtering and transformation
173/// based on data characteristics and conditions.
174fn demonstrate_conditional_processing() -> Result<(), Box<dyn std::error::Error>> {
175 println!("\n--- Conditional Processing ---");
176
177 // Create data with mixed characteristics
178 let data = vec![1.0, -2.0, 3.0, -4.0, 5.0, -6.0, 7.0, -8.0, 9.0, -10.0];
179 let tensor = Tensor::from_slice(&data, vec![10])?;
180 println!("Input data: {:?}", tensor.data());
181
182 // Conditional transformation based on sign
183 println!("\nConditional transformation (positive/negative handling):");
184 let processed: Tensor = tensor
185 .iter()
186 .map(|elem| {
187 let val = elem.value();
188 if val > 0.0 {
189 elem.pow_scalar(2.0) // Square positive values
190 } else {
191 elem.mul_scalar(-1.0).sqrt() // Square root of absolute negative values
192 }
193 })
194 .collect();
195 println!(" Processed: {:?}", processed.data());
196
197 // Adaptive filtering based on local statistics
198 println!("\nAdaptive filtering (remove values > 2 std from local mean):");
199 let window_size = 3;
200 let adaptive_filtered: Tensor = tensor
201 .iter()
202 .enumerate()
203 .filter(|(i, elem)| {
204 let start = i.saturating_sub(window_size / 2);
205 let end = (i + window_size / 2 + 1).min(tensor.size());
206
207 // Calculate local mean and std
208 let local_values: Vec<f32> = (start..end)
209 .map(|j| tensor.element_view(j).value())
210 .collect();
211
212 let local_mean = local_values.iter().sum::<f32>() / local_values.len() as f32;
213 let local_variance = local_values
214 .iter()
215 .map(|v| (v - local_mean).powi(2))
216 .sum::<f32>()
217 / local_values.len() as f32;
218 let local_std = local_variance.sqrt();
219
220 let threshold = local_mean + 2.0 * local_std;
221 elem.value() <= threshold
222 })
223 .map(|(_, elem)| elem)
224 .collect();
225 println!(" Adaptive filtered: {:?}", adaptive_filtered.data());
226
227 // Multi-condition processing
228 println!("\nMulti-condition processing:");
229 let multi_processed: Tensor = tensor
230 .iter()
231 .map(|elem| {
232 let val = elem.value();
233 match () {
234 _ if val > 5.0 => elem.mul_scalar(2.0), // Double large values
235 _ if val < -5.0 => elem.div_scalar(2.0), // Halve small values
236 _ if val.abs() < 2.0 => elem.add_scalar(1.0), // Add 1 to small values
237 _ => elem.clone(), // Keep others unchanged
238 }
239 })
240 .collect();
241 println!(" Multi-condition: {:?}", multi_processed.data());
242
243 Ok(())
244}
245
246/// Demonstrate batch processing operations
247///
248/// Shows efficient processing of large datasets using iterator
249/// patterns and batch operations for performance optimization.
250fn demonstrate_batch_operations() -> Result<(), Box<dyn std::error::Error>> {
251 println!("\n--- Batch Operations ---");
252
253 // Create a larger dataset for batch processing
254 let size = 100;
255 let data: Vec<f32> = (0..size)
256 .map(|i| {
257 let x = i as f32 / size as f32;
258 x * x + 0.1 * (i % 7) as f32 // Quadratic with some noise
259 })
260 .collect();
261
262 let tensor = Tensor::from_slice(&data, vec![size])?;
263 println!("Dataset size: {}", tensor.size());
264
265 // Batch processing with windowing (iterator views)
266 println!("\nBatch processing with sliding windows:");
267 let batch_size = 10;
268 let batches: Vec<Tensor> = tensor
269 .iter()
270 .collect::<Vec<_>>()
271 .chunks(batch_size)
272 .map(|chunk| {
273 // Process each batch independently
274 chunk
275 .iter()
276 .map(|elem| elem.pow_scalar(2.0).add_scalar(1.0))
277 .collect()
278 })
279 .collect();
280
281 println!(
282 " Processed {} batches of size {}",
283 batches.len(),
284 batch_size
285 );
286 for (i, batch) in batches.iter().enumerate() {
287 println!(
288 " Batch {}: mean={:.3}, std={:.3}",
289 i,
290 batch.mean().value(),
291 batch.std().value()
292 );
293 }
294
295 // Parallel-like processing with stride
296 println!("\nStrided processing (every nth element):");
297 let stride = 5;
298 let strided: Tensor = tensor
299 .iter()
300 .enumerate()
301 .filter(|(i, _)| i % stride == 0)
302 .map(|(_, elem)| elem)
303 .collect();
304 println!(" Strided (every {}th): {:?}", stride, strided.data());
305
306 // Hierarchical processing
307 println!("\nHierarchical processing (coarse to fine):");
308 let coarse: Tensor = tensor
309 .iter()
310 .enumerate()
311 .filter(|(i, _)| i % 4 == 0) // Take every 4th element
312 .map(|(_, elem)| elem)
313 .collect();
314
315 let fine: Tensor = tensor
316 .iter()
317 .enumerate()
318 .filter(|(i, _)| i % 4 != 0) // Take the rest
319 .map(|(_, elem)| elem)
320 .collect();
321
322 println!(" Coarse (every 4th): {:?}", coarse.data());
323 println!(" Fine (rest): {:?}", fine.data());
324
325 // Combine coarse and fine with different processing
326 let combined: Tensor = coarse
327 .iter()
328 .map(|elem| elem.mul_scalar(2.0)) // Scale coarse
329 .chain(fine.iter().map(|elem| elem.div_scalar(2.0))) // Scale fine
330 .collect();
331 println!(" Combined: {:?}", combined.data());
332
333 Ok(())
334}
335
336/// Demonstrate real-world processing scenarios
337///
338/// Shows practical applications of iterator patterns for
339/// common data processing tasks in machine learning and analytics.
340fn demonstrate_real_world_scenarios() -> Result<(), Box<dyn std::error::Error>> {
341 println!("\n--- Real-world Scenarios ---");
342
343 // Scenario 1: Time series analysis
344 println!("\nScenario 1: Time Series Analysis");
345 let time_series: Vec<f32> = (0..24)
346 .map(|hour| {
347 let base = 20.0 + 10.0 * (hour as f32 * std::f32::consts::PI / 12.0).sin();
348 base + (hour % 3) as f32 * 2.0 // Add some noise
349 })
350 .collect();
351
352 let series = Tensor::from_slice(&time_series, vec![24])?;
353 println!(" Time series (24 hours): {:?}", series.data());
354
355 // Calculate moving average with view-based iteration
356 let window_size = 3;
357 let moving_avg: Tensor = series
358 .iter()
359 .enumerate()
360 .map(|(i, _)| {
361 let start = i.saturating_sub(window_size / 2);
362 let end = (i + window_size / 2 + 1).min(series.size());
363 let window = series.iter_range(start, end);
364 window.fold(0.0, |acc, elem| acc + elem.value()) / (end - start) as f32
365 })
366 .map(|val| Tensor::from_slice(&[val], vec![1]).unwrap())
367 .collect();
368 println!(
369 " Moving average (window={}): {:?}",
370 window_size,
371 moving_avg.data()
372 );
373
374 // Inference pipeline with NoGrad + streaming
375 println!("\nInference pipeline (NoGrad + streaming)");
376 let features = Tensor::from_slice(
377 &(0..48).map(|i| i as f32 * 0.125).collect::<Vec<_>>(),
378 vec![6, 8],
379 )?;
380 let fast = with_no_grad(|| {
381 // Stream values directly, apply light affine, and collect back to same shape
382 features
383 .data()
384 .iter()
385 .copied()
386 .map(|x| 0.75 * x + 0.1)
387 .collect_shape(vec![6, 8])
388 });
389 println!(
390 " NoGrad streamed transform shape: {:?}",
391 fast.shape().dims()
392 );
393
394 // Row-wise iteration with shape-preserving collection (GradTrack-friendly)
395 let per_row: Tensor = features
396 .iter()
397 .map(|row| row.mul_scalar(0.5).add_scalar(2.0))
398 .collect_shape(vec![6, 8]);
399 println!(" Row-wise mapped shape: {:?}", per_row.shape().dims());
400
401 // Scenario 2: Feature engineering
402 println!("\nScenario 2: Feature Engineering");
403 let features = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5])?;
404 println!(" Original features: {:?}", features.data());
405
406 // Create polynomial features
407 let poly_features: Tensor = features
408 .iter()
409 .flat_map(|elem| {
410 vec![
411 elem.clone(), // x^1
412 elem.pow_scalar(2.0), // x^2
413 elem.pow_scalar(3.0), // x^3
414 ]
415 })
416 .collect();
417 println!(
418 " Polynomial features (x, x^2, x^3): {:?}",
419 poly_features.data()
420 );
421
422 // Scenario 3: Data augmentation
423 println!("\nScenario 3: Data Augmentation");
424 let original = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3])?;
425 println!(" Original data: {:?}", original.data());
426
427 // Augment with noise and scaling
428 let augmented: Tensor = original
429 .iter()
430 .flat_map(|elem| {
431 vec![
432 elem.clone(), // Original
433 elem.add_scalar(0.1), // Add noise
434 elem.sub_scalar(0.1), // Subtract noise
435 elem.mul_scalar(1.1), // Scale up
436 elem.mul_scalar(0.9), // Scale down
437 ]
438 })
439 .collect();
440 println!(" Augmented data: {:?}", augmented.data());
441
442 // Scenario 4: Statistical analysis
443 println!("\nScenario 4: Statistical Analysis");
444 let sample_data = Tensor::from_slice(&[1.1, 2.3, 1.8, 2.1, 1.9, 2.0, 1.7, 2.2], vec![8])?;
445 println!(" Sample data: {:?}", sample_data.data());
446
447 // Calculate various statistics
448 let mean = sample_data.mean().value();
449 let std = sample_data.std().value();
450 let min = sample_data
451 .iter()
452 .map(|e| e.value())
453 .fold(f32::INFINITY, f32::min);
454 let max = sample_data
455 .iter()
456 .map(|e| e.value())
457 .fold(f32::NEG_INFINITY, f32::max);
458
459 // Z-score normalization
460 let z_scores: Tensor = sample_data
461 .iter()
462 .map(|elem| elem.sub_scalar(mean).div_scalar(std))
463 .collect();
464
465 println!(
466 " Statistics: mean={:.3}, std={:.3}, min={:.3}, max={:.3}",
467 mean, std, min, max
468 );
469 println!(" Z-scores: {:?}", z_scores.data());
470
471 Ok(())
472}
Source§impl Tensor
impl Tensor
Sourcepub fn tanh(&self) -> Tensor
pub fn tanh(&self) -> Tensor
Element-wise hyperbolic tangent activation
Computes hyperbolic tangent for each element: output[i] = tanh(self[i])
The hyperbolic tangent function maps any real number to the range (-1, 1), making it useful as an activation function in neural networks.
§Returns
A new tensor with tanh applied to each element, values in range (-1, 1)
§Performance Characteristics
- High Precision: Accurate scalar implementation for mathematical validation
- 4x Unrolling: Optimized scalar operations with instruction-level parallelism
- Cache-friendly: Linear memory access patterns
- Numerical Stability: Robust handling of extreme input values
- GradTrack Support: Full automatic differentiation with efficient gradient computation
§Mathematical Properties
- Range: Output values are in the range (-1, 1)
- Symmetry: tanh(-x) = -tanh(x) (odd function)
- Zero: tanh(0) = 0
- Gradient: ∂tanh(x)/∂x = 1 - tanh²(x) = sech²(x)
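A quick check of the odd-symmetry property above, written as a sketch using only methods documented on this page:
use train_station::Tensor;
// tanh(-x) == -tanh(x) for every element
let x = Tensor::from_slice(&[0.25, 0.5, 2.0], vec![3]).unwrap();
let pos = x.tanh();
let neg = x.mul_scalar(-1.0).tanh();
for i in 0..3 {
    assert!((pos.get(&[i]) + neg.get(&[i])).abs() < 1e-6);
}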
§Examples
§Basic Hyperbolic Tangent
use train_station::Tensor;
let a = Tensor::from_slice(&[-1.0, 0.0, 1.0], vec![3]).unwrap();
let b = a.tanh();
assert_eq!(b.shape().dims(), vec![3]);
assert!((b.get(&[0]) - (-0.7615942)).abs() < 1e-6); // tanh(-1.0)
assert!((b.get(&[1]) - 0.0).abs() < 1e-6); // tanh(0.0)
assert!((b.get(&[2]) - 0.7615942).abs() < 1e-6); // tanh(1.0)
§Extreme Values
use train_station::Tensor;
let a = Tensor::from_slice(&[-10.0, 10.0], vec![2]).unwrap();
let b = a.tanh();
assert_eq!(b.shape().dims(), vec![2]);
assert!((b.get(&[0]) - (-1.0)).abs() < 1e-6); // tanh(-10.0) ≈ -1
assert!((b.get(&[1]) - 1.0).abs() < 1e-6); // tanh(10.0) ≈ 1
Source§impl Tensor
impl Tensor
Sourcepub fn argmax(&self) -> Tensor
pub fn argmax(&self) -> Tensor
Returns the index of the maximum value across all elements in the tensor
This operation finds the flat index (0-based) of the element with the highest value. If multiple elements have the same maximum value, the index of the first occurrence is returned. The output is a scalar tensor with shape [1] containing the index as a float.
This operation is non-differentiable and the output never requires gradients.
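A common use is picking the predicted class from a row of logits. A minimal sketch (the softmax call is optional here, since it does not change which index is largest):
use train_station::Tensor;
// Pick the predicted class index for a single row of logits
let logits = Tensor::from_slice(&[0.1, 2.5, -0.3], vec![1, 3]).unwrap();
let probs = logits.softmax(1);   // monotonic, so the argmax is unchanged
let pred = probs.argmax();       // flat index of the largest probability
assert_eq!(pred.get(&[0]), 1.0); // class 1 has the highest score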
§Returns
A tensor with shape [1] containing the flat index of the maximum value
§Examples
use train_station::Tensor;
// 1D tensor
let tensor = Tensor::from_slice(&[1.0, 5.0, 3.0, 2.0], vec![4]).unwrap();
let max_idx = tensor.argmax();
assert_eq!(max_idx.shape().dims(), vec![1]);
assert_eq!(max_idx.get(&[0]), 1.0); // Index 1 has value 5.0
use train_station::Tensor;
// 2D tensor
let tensor = Tensor::from_slice(&[1.0, 3.0, 2.0, 4.0, 0.0, 5.0], vec![2, 3]).unwrap();
let max_idx = tensor.argmax();
assert_eq!(max_idx.get(&[0]), 5.0); // Flat index 5 has value 5.0
use train_station::Tensor;
// Tied values return first occurrence
let tensor = Tensor::from_slice(&[3.0, 5.0, 5.0, 2.0], vec![4]).unwrap();
let max_idx = tensor.argmax();
assert_eq!(max_idx.get(&[0]), 1.0); // First occurrence of 5.0 at index 1
Sourcepub fn argmax_dim(&self, dim: usize, keepdim: bool) -> Tensor
pub fn argmax_dim(&self, dim: usize, keepdim: bool) -> Tensor
Returns the indices of maximum values along a specified dimension
This operation finds the indices of maximum values along the specified dimension. For each slice along the dimension, it returns the index of the maximum value. If multiple elements have the same maximum value, the index of the first occurrence is returned.
The output shape depends on the keepdim parameter:
- If keepdim is true, the reduced dimension is kept with size 1
- If keepdim is false, the reduced dimension is removed
This operation is non-differentiable and the output never requires gradients.
§Arguments
dim - The dimension along which to find argmax indices (0-based)
keepdim - Whether to keep the reduced dimension with size 1
§Returns
A tensor containing the indices of maximum values along the specified dimension
§Panics
Panics if dim is out of bounds for the tensor’s rank or if the dimension size is 0.
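In addition to the examples below, a per-row prediction sketch for a small batch of logits:
use train_station::Tensor;
// Predicted class per row for a [2, 3] batch of logits
let logits = Tensor::from_slice(&[0.1, 2.5, -0.3, 1.0, 0.0, 3.0], vec![2, 3]).unwrap();
let preds = logits.argmax_dim(1, false); // shape [2]
assert_eq!(preds.shape().dims(), vec![2]);
assert_eq!(preds.get(&[0]), 1.0); // row 0: max is 2.5 at index 1
assert_eq!(preds.get(&[1]), 2.0); // row 1: max is 3.0 at index 2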
§Examples
use train_station::Tensor;
// 2D tensor: [[1.0, 3.0, 2.0],
// [4.0, 0.0, 5.0]]
let tensor = Tensor::from_slice(&[1.0, 3.0, 2.0, 4.0, 0.0, 5.0], vec![2, 3]).unwrap();
// argmax along columns (dim=1)
let col_max_idx = tensor.argmax_dim(1, false);
assert_eq!(col_max_idx.shape().dims(), vec![2]);
assert_eq!(col_max_idx.get(&[0]), 1.0); // Row 0: max at index 1 (value 3.0)
assert_eq!(col_max_idx.get(&[1]), 2.0); // Row 1: max at index 2 (value 5.0)
use train_station::Tensor;
// argmax along rows (dim=0) with keepdim
let tensor = Tensor::from_slice(&[1.0, 3.0, 2.0, 4.0, 0.0, 5.0], vec![2, 3]).unwrap();
let row_max_idx = tensor.argmax_dim(0, true);
assert_eq!(row_max_idx.shape().dims(), vec![1, 3]);
assert_eq!(row_max_idx.get(&[0, 0]), 1.0); // Col 0: max at index 1 (value 4.0)
assert_eq!(row_max_idx.get(&[0, 1]), 0.0); // Col 1: max at index 0 (value 3.0)
assert_eq!(row_max_idx.get(&[0, 2]), 1.0); // Col 2: max at index 1 (value 5.0)
use train_station::Tensor;
// 1D tensor edge case
let tensor = Tensor::from_slice(&[5.0, 1.0, 8.0, 3.0], vec![4]).unwrap();
let max_idx = tensor.argmax_dim(0, false);
assert_eq!(max_idx.shape().dims(), vec![1]); // Special case: becomes [1] not []
assert_eq!(max_idx.get(&[0]), 2.0); // Index 2 has maximum value 8.0
Source§impl Tensor
impl Tensor
Sourcepub fn argmin(&self) -> Tensor
pub fn argmin(&self) -> Tensor
Returns the index of the minimum value in the tensor
This method finds the flat index of the minimum value across all elements in the tensor. The result is a scalar tensor containing the index as a floating-point value. This operation is non-differentiable and the output never requires gradient tracking.
§Returns
A tensor with shape [1] containing the flat index of the minimum value
as a f32. If the input tensor is empty, returns 0.0.
§Examples
use train_station::Tensor;
let tensor = Tensor::from_slice(&[3.0, -2.0, 5.0, -1.0], vec![4]).unwrap();
let min_index = tensor.argmin();
assert_eq!(min_index.get(&[0]), 1.0); // -2.0 is at index 1
use train_station::Tensor;
// Empty tensor case
let empty_tensor = Tensor::new(vec![0]);
let min_index = empty_tensor.argmin();
assert_eq!(min_index.get(&[0]), 0.0);
pub fn argmin_dim(&self, dim: usize, keepdim: bool) -> Tensor
Returns the indices of minimum values along a specified dimension
This method finds the indices of minimum values along the specified dimension. The result contains the indices where the minimum values occur in that dimension. This operation is non-differentiable and the output never requires gradient tracking.
§Arguments
- dim - The dimension along which to find minimum indices (0-based)
- keepdim - Whether to keep the reduced dimension in the output shape
  - If true, the reduced dimension is kept with size 1
  - If false, the reduced dimension is removed from the output shape
§Returns
A tensor containing the indices of minimum values along the specified dimension.
The output shape depends on keepdim:
- If keepdim is true, the reduced dimension has size 1
- If keepdim is false, the reduced dimension is removed
§Panics
- If dim is out of bounds for the tensor’s rank
- If the dimension to reduce has size 0
§Examples
use train_station::Tensor;
let tensor = Tensor::from_slice(&[3.0, -2.0, 5.0, -1.0, 0.0, -3.0], vec![2, 3]).unwrap();
// Find minimum indices along dimension 1 (columns), keeping the dimension
let indices = tensor.argmin_dim(1, true);
assert_eq!(indices.shape().dims(), vec![2, 1]);
assert_eq!(indices.get(&[0, 0]), 1.0); // -2.0 is at index 1 in first row
assert_eq!(indices.get(&[1, 0]), 2.0); // -3.0 is at index 2 in second row
use train_station::Tensor;
let tensor = Tensor::from_slice(&[3.0, -2.0, 5.0, -1.0, 0.0, -3.0], vec![2, 3]).unwrap();
// Find minimum indices along dimension 1 (columns), removing the dimension
let indices = tensor.argmin_dim(1, false);
assert_eq!(indices.shape().dims(), vec![2]);
assert_eq!(indices.get(&[0]), 1.0); // -2.0 is at index 1 in first row
assert_eq!(indices.get(&[1]), 2.0); // -3.0 is at index 2 in second row
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
// Find minimum index in a 1D tensor
let index = tensor.argmin_dim(0, false);
assert_eq!(index.shape().dims(), vec![1]);
assert_eq!(index.get(&[0]), 0.0); // 1.0 is at index 0
impl Tensor
pub fn max(&self) -> Tensor
Computes the maximum value over all elements in the tensor
Returns a scalar tensor containing the maximum value. For empty tensors, returns negative infinity. This operation supports gradient tracking through the GradTrack system.
§Returns
A tensor with shape [1] containing the maximum value
§Examples
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 5.0, 3.0, 2.0], vec![2, 2]).unwrap();
let max_val = tensor.max();
assert_eq!(max_val.get(&[0]), 5.0);
§GradTrack Support
When requires_grad is true, this operation is tracked for automatic
differentiation. The gradient computation uses the saved input and output
for efficient backward pass.
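The backward pass can be exercised directly. The following is an illustrative sketch rather than an official example; it assumes the gradient of max is routed to the element that achieved the maximum and that backward(None) seeds the gradient with 1.0, as in the sum gradient example further down this page.
use train_station::Tensor;
let x = Tensor::from_slice(&[1.0, 5.0, 3.0, 2.0], vec![2, 2])
    .unwrap()
    .with_requires_grad();
let mut max_val = x.max();
max_val.backward(None);
// Inspect how the gradient is distributed over the input (index 1 holds the maximum 5.0)
let grad = x.grad_owned().expect("gradient should exist");
println!("gradient of max: {:?}", grad.data());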
pub fn max_dims(&self, dims: &[usize], keepdim: bool) -> Tensor
Computes the maximum value over specified dimensions
Reduces the tensor along the specified dimensions by computing the maximum
value in each reduction group. The keepdim parameter determines whether
reduced dimensions are kept with size 1 or removed entirely.
§Arguments
- dims - Dimensions to reduce over (must be valid for the tensor’s rank)
- keepdim - If true, reduced dimensions are kept with size 1; if false, they are removed
§Returns
A tensor with the specified dimensions reduced
§Examples
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
// Max over columns (dim 1), keeping dimensions
let max_cols = tensor.max_dims(&[1], true);
assert_eq!(max_cols.shape().dims(), vec![2, 1]);
assert_eq!(max_cols.get(&[0, 0]), 3.0);
assert_eq!(max_cols.get(&[1, 0]), 6.0);
// Max over rows (dim 0), removing dimensions
let max_rows = tensor.max_dims(&[0], false);
assert_eq!(max_rows.shape().dims(), vec![3]);
assert_eq!(max_rows.get(&[0]), 4.0);
assert_eq!(max_rows.get(&[1]), 5.0);
assert_eq!(max_rows.get(&[2]), 6.0);
§Panics
Panics if:
- dims is empty
- Any dimension in dims is out of bounds for the tensor’s rank
§GradTrack Support
When requires_grad is true, this operation is tracked for automatic
differentiation. The gradient computation preserves the original input
shape and handles broadcasting correctly.
Examples found in repository
44fn cross_entropy_logits(
45 logits: &Tensor,
46 labels: &[usize],
47 batch: usize,
48 _num_classes: usize,
49) -> Tensor {
50 // log_softmax = logits - logsumexp(logits, dim=1)
51 let max_logits = logits.max_dims(&[1], true);
52 let shifted = logits.sub_tensor(&max_logits);
53 let exp = shifted.exp();
54 let sum_exp = exp.sum_dims(&[1], true);
55 let log_sum_exp = sum_exp.log();
56 let log_softmax = shifted.sub_tensor(&log_sum_exp);
57 let ll = log_softmax.gather(1, labels, &[batch, 1]); // selected log-probs
58 ll.mul_scalar(-1.0).mean()
59}
More examples
273fn log_prob_actions(
274 logits: &Tensor,
275 actions: &[usize],
276 batch: usize,
277 _action_dim: usize,
278) -> Tensor {
279 let max_logits = logits.max_dims(&[1], true); // [B,1]
280 let shifted = logits.sub_tensor(&max_logits);
281 let exp = shifted.exp();
282 let sum_exp = exp.sum_dims(&[1], true); // [B,1]
283 let log_sum_exp = sum_exp.log(); // [B,1]
284 let log_softmax = shifted.sub_tensor(&log_sum_exp); // [B,A]
285 // gather selected action log-probs
286 log_softmax.gather(1, actions, &[batch, 1])
287}
impl Tensor
pub fn mean(&self) -> Tensor
Computes the arithmetic mean of all elements in the tensor
This method calculates the average value across all tensor elements by summing all values and dividing by the total number of elements. The result is a scalar tensor containing the mean value. This operation supports gradient tracking through the GradTrack system.
§Returns
A tensor with shape [1] containing the arithmetic mean of all elements.
For empty tensors, returns 0.0 as a safe default.
§Performance Characteristics
- Linear Time: O(n) complexity for computing the sum
- Memory Efficient: Single pass through tensor data with SIMD-optimized accumulation
- Numerical Stability: Uses direct accumulation for typical tensor sizes
- Edge Case Handling: Returns 0.0 for empty tensors
§Examples
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let mean_val = tensor.mean();
assert_eq!(mean_val.get(&[0]), 2.5); // (1+2+3+4)/4 = 2.5
use train_station::Tensor;
// Empty tensor case
let empty_tensor = Tensor::new(vec![0]);
let mean_val = empty_tensor.mean();
assert_eq!(mean_val.get(&[0]), 0.0);
§GradTrack Support
When requires_grad is true, this operation is tracked for automatic
differentiation. The gradient computation distributes the gradient equally
across all input elements.
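As an illustrative sketch of the behavior described above (each element receives 1/n of the upstream gradient; this assumes backward(None) seeds the gradient with 1.0, as in the sum gradient example further down this page):
use train_station::Tensor;
let x = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![4])
    .unwrap()
    .with_requires_grad();
let mut m = x.mean();
m.backward(None);
let grad = x.grad_owned().expect("gradient should exist");
// Four elements, so each gradient entry is 1/4 = 0.25
assert!((grad.get(&[0]) - 0.25).abs() < 1e-6);
assert!((grad.get(&[3]) - 0.25).abs() < 1e-6);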
Examples found in repository
42fn mse(pred: &Tensor, target: &Tensor) -> Tensor {
43 pred.sub_tensor(target).pow_scalar(2.0).mean()
44}
45
46fn rmse(pred: &Tensor, target: &Tensor) -> f32 {
47 mse(pred, target).sqrt().value()
48}
49
50fn r2_score(pred: &Tensor, target: &Tensor) -> f32 {
51 // R^2 = 1 - SS_res / SS_tot
52 let y = target;
53 let y_mean = y.mean();
54 let ss_res = pred.sub_tensor(y).pow_scalar(2.0).sum();
55 let ss_tot = y.sub_tensor(&y_mean).pow_scalar(2.0).sum();
56 let ss_res_v = ss_res.value();
57 let ss_tot_v = ss_tot.value().max(1e-12); // avoid divide by zero
58 1.0 - (ss_res_v / ss_tot_v)
59}
More examples
321fn pseudo_huber_mean(diff: &Tensor) -> Tensor {
322 diff.pow_scalar(2.0)
323 .add_scalar(1.0)
324 .sqrt()
325 .sub_scalar(1.0)
326 .mean()
327}
328
329// -------------------------------
330// Main
331// -------------------------------
332
333pub fn main() -> Result<(), Box<dyn std::error::Error>> {
334 println!("=== DQN Example (YardEnv discrete) ===");
335
336 // Dims
337 let state_dim = 3usize;
338 let action_dim = 3usize;
339
340 // Hparams
341 let gamma = 0.99f32;
342 let batch_size = 64usize;
343 let start_steps = 200usize;
344 let target_update_interval = 200usize; // hard update cadence
345 let max_grad_norm = 1.0f32;
346 let mut epsilon = 1.0f32;
347 let eps_min = 0.05f32;
348 let eps_decay_steps = 2_000usize; // linear decay
349 let total_steps = std::env::var("DQN_STEPS")
350 .ok()
351 .and_then(|v| v.parse::<usize>().ok())
352 .unwrap_or(3000usize);
353
354 // Models
355 let mut q_net = QNet::new(state_dim, action_dim, Some(7));
356 let mut q_targ = QNet::new(state_dim, action_dim, Some(8));
357 q_targ.net.copy_from(&q_net.net);
358 q_targ.set_requires_grad_all(false);
359
360 // Optimizer
361 let mut q_opt = Adam::with_learning_rate(3e-4);
362 for p in q_net.parameters() {
363 q_opt.add_parameter(p);
364 }
365
366 // Replay + env
367 let mut rb = ReplayBuffer::new(100_000, state_dim);
368 let mut env = YardEnv::new(12345);
369 let mut rng = SmallRng::new(999_111);
370
371 // Metrics
372 let mut state = env.reset();
373 let mut episode_return = 0.0f32;
374 let mut episode = 0usize;
375 let mut ema_return: Option<f32> = None;
376 let ema_alpha = 0.05f32;
377 let mut best_return = f32::NEG_INFINITY;
378
379 for t in 0..total_steps {
380 // Epsilon-greedy action
381 let action_index = if t < start_steps || rng.next_f32() < epsilon {
382 rng.sample_index(action_dim)
383 } else {
384 let _ng = NoGradTrack::new();
385 let q_vals = q_net.forward(&state);
386 let row = q_vals.data();
387 let mut best_i = 0usize;
388 let mut best_v = row[0];
389 for (i, &r) in row.iter().enumerate().take(action_dim).skip(1) {
390 if r > best_v {
391 best_v = r;
392 best_i = i;
393 }
394 }
395 best_i
396 };
397
398 // Env step
399 let (next_state, reward, done) = env.step(action_index);
400 episode_return += reward;
401
402 // Store
403 let s_slice = state.data().to_vec();
404 let s2_slice = next_state.data().to_vec();
405 rb.push(
406 &s_slice,
407 action_index,
408 reward,
409 if done { 1.0 } else { 0.0 },
410 &s2_slice,
411 );
412
413 // Reset on done
414 state = if done {
415 let st = env.reset();
416 ema_return = Some(match ema_return {
417 None => episode_return,
418 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
419 });
420 if episode_return > best_return {
421 best_return = episode_return;
422 }
423 println!(
424 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3} | rb_size={}",
425 t,
426 episode,
427 episode_return,
428 ema_return.unwrap_or(episode_return),
429 best_return,
430 rb.size
431 );
432 episode_return = 0.0;
433 episode += 1;
434 st
435 } else {
436 next_state
437 };
438
439 // Epsilon linear decay
440 if t < eps_decay_steps {
441 epsilon = (1.0 - (t as f32) / (eps_decay_steps as f32)) * (1.0 - eps_min) + eps_min;
442 }
443
444 // Train
445 if rb.can_sample(batch_size) {
446 let (s, a_idx, r, d, s2) = rb.sample(batch_size, &mut rng);
447
448 // Double DQN target: a* = argmax_a Q_online(s2,a); y = r + (1-d)*gamma*Q_target(s2, a*)
449 let target_q = {
450 let _ng = NoGradTrack::new();
451 let q_online_s2 = q_net.forward(&s2);
452 // argmax per row (manual on CPU)
453 let row_stride = action_dim;
454 let qd = q_online_s2.data();
455 let mut next_actions: Vec<usize> = Vec::with_capacity(batch_size);
456 for i in 0..batch_size {
457 let base = i * row_stride;
458 let mut bi = 0usize;
459 let mut bv = qd[base];
460 for j in 1..action_dim {
461 let v = qd[base + j];
462 if v > bv {
463 bv = v;
464 bi = j;
465 }
466 }
467 next_actions.push(bi);
468 }
469 let q_targ_s2 = q_targ.forward(&s2);
470 let q_targ_g = q_targ_s2.gather(1, &next_actions, &[batch_size, 1]);
471 let not_done = Tensor::ones(vec![batch_size, 1]).sub_tensor(&d);
472 r.add_tensor(¬_done.mul_scalar(gamma).mul_tensor(&q_targ_g))
473 };
474
475 // Q(s,a) for current actions
476 // Zero grads first
477 {
478 let mut params = q_net.parameters();
479 q_opt.zero_grad(&mut params);
480 }
481
482 let q_all = q_net.forward(&s);
483 let q_sa = q_all.gather(1, &a_idx, &[batch_size, 1]);
484 let diff = q_sa.sub_tensor(&target_q);
485 let mut loss = pseudo_huber_mean(&diff);
486 loss.backward(None);
487
488 // Step (filter only params with grads)
489 {
490 let params = q_net.parameters();
491 let mut with_grads: Vec<&mut Tensor> = Vec::new();
492 for p in params {
493 if p.grad_owned().is_some() {
494 with_grads.push(p);
495 }
496 }
497 if !with_grads.is_empty() {
498 let gn = grad_global_norm(&mut with_grads);
499 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
500 q_opt.step(&mut with_grads);
501 q_opt.zero_grad(&mut with_grads);
502 if t % 100 == 0 {
503 let mut pn = q_net.parameters();
504 let pn_l2 = params_l2_norm(&mut pn);
505 let q_mean = q_all.mean().value();
506 println!(
507 "t={:5} | loss={:.4} | q_mean={:.3} | grad_norm={:.3} | param_norm={:.3} | eps={:.3}",
508 t, loss.value(), q_mean, gn, pn_l2, epsilon
509 );
510 }
511 }
512 }
513
514 // Target hard update
515 if t % target_update_interval == 0 {
516 q_targ.net.copy_from(&q_net.net);
517 }
518
519 // Clear graphs
520 clear_all_graphs_known();
521 }
522 }
523
524 println!("=== DQN training finished ===");
525 Ok(())
526}
44fn cross_entropy_logits(
45 logits: &Tensor,
46 labels: &[usize],
47 batch: usize,
48 _num_classes: usize,
49) -> Tensor {
50 // log_softmax = logits - logsumexp(logits, dim=1)
51 let max_logits = logits.max_dims(&[1], true);
52 let shifted = logits.sub_tensor(&max_logits);
53 let exp = shifted.exp();
54 let sum_exp = exp.sum_dims(&[1], true);
55 let log_sum_exp = sum_exp.log();
56 let log_softmax = shifted.sub_tensor(&log_sum_exp);
57 let ll = log_softmax.gather(1, labels, &[batch, 1]); // selected log-probs
58 ll.mul_scalar(-1.0).mean()
59}
73fn main() -> Result<(), Box<dyn std::error::Error>> {
74 println!("=== Basic Encoder Example ===");
75
76 let batch = 2usize;
77 let seq = 6usize;
78 let embed = 32usize;
79 let heads = 4usize;
80
81 let input = Tensor::randn(vec![batch, seq, embed], Some(11));
82 let mut enc = EncoderBlock::new(embed, heads, Some(123));
83
84 // Example: no mask (set Some(mask) to use masking)
85 let out = enc.forward(&input, None);
86 println!("Output shape: {:?}", out.shape().dims());
87
88 // Verify gradients/optimization
89 let mut opt = Adam::with_learning_rate(0.01);
90 let mut params = enc.parameters();
91 for p in ¶ms {
92 opt.add_parameter(p);
93 }
94 let mut loss = out.mean();
95 loss.backward(None);
96 opt.step(&mut params);
97 opt.zero_grad(&mut params);
98 println!("Loss: {:.6}", loss.value());
99 println!("=== Done ===");
100 Ok(())
101}
179fn demonstrate_utility_functions() {
180 println!("\n--- Utility Functions ---");
181
182 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
183
184 // Basic properties
185 println!("Shape: {:?}", tensor.shape().dims());
186 println!("Size: {}", tensor.size());
187 println!("Is contiguous: {}", tensor.is_contiguous());
188 println!("Device: {:?}", tensor.device());
189
190 // Mathematical operations
191 let sum = tensor.sum();
192 println!("Sum: {}", sum.value());
193
194 let mean = tensor.mean();
195 println!("Mean: {}", mean.value());
196
197 let norm = tensor.norm();
198 println!("Norm: {}", norm.value());
199
200 // Device placement
201 let cpu_tensor = Tensor::zeros_on_device(vec![3, 3], train_station::Device::cpu());
202 println!(
203 "CPU tensor: shape {:?}, device: {:?}",
204 cpu_tensor.shape().dims(),
205 cpu_tensor.device()
206 );
207}
- examples/neural_networks/basic_decoder.rs
- examples/neural_networks/basic_transformer.rs
- examples/neural_networks/multi_head_attention.rs
- examples/optimizers/adam_configurations.rs
- examples/optimizers/learning_rate_scheduling.rs
- examples/getting_started/optimizer_basics.rs
- examples/neural_networks/feedforward_network.rs
- examples/iterators/advanced_patterns.rs
- examples/RL_training/../neural_networks/basic_linear_layer.rs
- examples/RL_training/ppo_discrete.rs
- examples/RL_training/ppo_continuous.rs
- examples/RL_training/td3.rs
pub fn mean_dims(&self, dims: &[usize], keepdim: bool) -> Tensor
Computes the arithmetic mean over specified dimensions
This method calculates the mean value along the specified dimensions by first
computing the sum over those dimensions and then dividing by the product of
the reduced dimension sizes. The keepdim parameter determines whether
reduced dimensions are kept with size 1 or removed entirely.
§Arguments
- dims - Dimensions to reduce over (must be valid for the tensor’s rank)
- keepdim - If true, reduced dimensions are kept with size 1; if false, they are removed
§Returns
A tensor with the specified dimensions reduced by computing the mean.
The output shape depends on keepdim:
- If keepdim is true, reduced dimensions have size 1
- If keepdim is false, reduced dimensions are removed
§Performance Characteristics
- Efficient Implementation: Uses sum_dims followed by scalar multiplication
- Memory Optimized: Leverages existing sum reduction for optimal performance
- Shape Computation: Fast output shape calculation with dimension preservation
- Numerical Stability: Maintains precision through direct computation
§Examples
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
// Mean over columns (dim 1), keeping dimensions
let mean_cols = tensor.mean_dims(&[1], true);
assert_eq!(mean_cols.shape().dims(), vec![2, 1]);
assert_eq!(mean_cols.get(&[0, 0]), 2.0); // (1+2+3)/3 = 2.0
assert_eq!(mean_cols.get(&[1, 0]), 5.0); // (4+5+6)/3 = 5.0
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
// Mean over rows (dim 0), removing dimensions
let mean_rows = tensor.mean_dims(&[0], false);
assert_eq!(mean_rows.shape().dims(), vec![3]);
assert_eq!(mean_rows.get(&[0]), 2.5); // (1+4)/2 = 2.5
assert_eq!(mean_rows.get(&[1]), 3.5); // (2+5)/2 = 3.5
assert_eq!(mean_rows.get(&[2]), 4.5); // (3+6)/2 = 4.5
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
// Mean over multiple dimensions
let mean_all = tensor.mean_dims(&[0, 1], false);
assert_eq!(mean_all.shape().dims(), vec![1]);
assert_eq!(mean_all.get(&[0]), 2.5); // (1+2+3+4)/4 = 2.5
§Panics
Panics if:
- dims is empty
- Any dimension in dims is out of bounds for the tensor’s rank
§GradTrack Support
When requires_grad is true, this operation is tracked for automatic
differentiation. The gradient computation preserves the original input
shape and handles broadcasting correctly through the ReduceMeanDims gradient function.
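As a small consistency sketch of the sum_dims-plus-scaling implementation described under Performance Characteristics (an illustration, not an official example):
use train_station::Tensor;
let t = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
// Mean over dim 1 should agree with the sum over dim 1 scaled by 1/3
let via_mean = t.mean_dims(&[1], true);
let via_sum = t.sum_dims(&[1], true).mul_scalar(1.0 / 3.0);
assert!((via_mean.get(&[0, 0]) - via_sum.get(&[0, 0])).abs() < 1e-6);
assert!((via_mean.get(&[1, 0]) - via_sum.get(&[1, 0])).abs() < 1e-6);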
impl Tensor
pub fn min(&self) -> Tensor
Computes the minimum value over all elements in the tensor
Returns a scalar tensor containing the minimum value. For empty tensors, returns positive infinity. This operation supports gradient tracking through the GradTrack system.
§Returns
A tensor with shape [1] containing the minimum value
§Examples
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 5.0, 3.0, 2.0], vec![2, 2]).unwrap();
let min_val = tensor.min();
assert_eq!(min_val.get(&[0]), 1.0);
use train_station::Tensor;
// Empty tensor case
let empty_tensor = Tensor::new(vec![0]);
let min_val = empty_tensor.min();
assert_eq!(min_val.get(&[0]), f32::INFINITY);
§GradTrack Support
When requires_grad is true, this operation is tracked for automatic
differentiation. The gradient computation uses the saved input and output
for efficient backward pass.
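A minimal sketch mirroring the max case, assuming the gradient is routed to the element that achieved the minimum:
use train_station::Tensor;
let x = Tensor::from_slice(&[1.0, 5.0, 3.0, 2.0], vec![2, 2])
    .unwrap()
    .with_requires_grad();
let mut min_val = x.min();
min_val.backward(None);
if let Some(grad) = x.grad_owned() {
    // Index 0 holds the minimum 1.0, so the gradient is expected to concentrate there
    println!("gradient of min: {:?}", grad.data());
}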
pub fn min_dims(&self, dims: &[usize], keepdim: bool) -> Tensor
Computes the minimum value over specified dimensions
Reduces the tensor along the specified dimensions by computing the minimum
value in each reduction group. The keepdim parameter determines whether
reduced dimensions are kept with size 1 or removed entirely.
§Arguments
- dims - Dimensions to reduce over (must be valid for the tensor’s rank)
- keepdim - If true, reduced dimensions are kept with size 1; if false, they are removed
§Returns
A tensor with the specified dimensions reduced
§Examples
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
// Min over columns (dim 1), keeping dimensions
let min_cols = tensor.min_dims(&[1], true);
assert_eq!(min_cols.shape().dims(), vec![2, 1]);
assert_eq!(min_cols.get(&[0, 0]), 1.0);
assert_eq!(min_cols.get(&[1, 0]), 4.0);
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
// Min over rows (dim 0), removing dimensions
let min_rows = tensor.min_dims(&[0], false);
assert_eq!(min_rows.shape().dims(), vec![3]);
assert_eq!(min_rows.get(&[0]), 1.0);
assert_eq!(min_rows.get(&[1]), 2.0);
assert_eq!(min_rows.get(&[2]), 3.0);
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
// Min over multiple dimensions
let min_all = tensor.min_dims(&[0, 1], false);
assert_eq!(min_all.shape().dims(), vec![1]);
assert_eq!(min_all.get(&[0]), 1.0);
§Panics
Panics if:
- dims is empty
- Any dimension in dims is out of bounds for the tensor’s rank
§GradTrack Support
When requires_grad is true, this operation is tracked for automatic
differentiation. The gradient computation preserves the original input
shape and handles broadcasting correctly.
impl Tensor
pub fn norm(&self) -> Tensor
Computes the L2 norm (Euclidean norm) over all elements
The L2 norm is calculated as sqrt(sum(x²)) where x represents each element in the tensor. This operation reduces the tensor to a scalar tensor with shape [1].
§Returns
A scalar tensor containing the L2 norm value
§Examples
use train_station::Tensor;
// Basic L2 norm calculation
let tensor = Tensor::from_slice(&[3.0, 4.0], vec![2]).unwrap();
let norm = tensor.norm();
assert!((norm.get(&[0]) - 5.0).abs() < 1e-6); // sqrt(3² + 4²) = 5
use train_station::Tensor;
// L2 norm of a larger tensor
let data = vec![1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0];
let tensor = Tensor::from_slice(&data, vec![2, 2, 2]).unwrap();
let norm = tensor.norm();
// sqrt(1² + 2² + 3² + 4² + 5² + 6² + 7² + 8²) = sqrt(204) ≈ 14.283
let expected = 204.0_f32.sqrt();
assert!((norm.get(&[0]) - expected).abs() < 1e-5);
§Performance
Uses optimized contiguous tensor path with 4x loop unrolling for better performance. Non-contiguous tensors use stride-aware iteration.
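For intuition, the result should agree with composing the element-wise and reduction operations documented elsewhere on this page; the sketch below is an illustration that assumes pow_scalar, sum, and sqrt chain as shown in the repository examples:
use train_station::Tensor;
let t = Tensor::from_slice(&[3.0, 4.0], vec![2]).unwrap();
let direct = t.norm();
// sqrt(sum(x^2)) rebuilt from pow_scalar, sum, and sqrt
let composed = t.pow_scalar(2.0).sum().sqrt();
assert!((direct.get(&[0]) - composed.get(&[0])).abs() < 1e-6);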
Examples found in repository
179fn demonstrate_utility_functions() {
180 println!("\n--- Utility Functions ---");
181
182 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
183
184 // Basic properties
185 println!("Shape: {:?}", tensor.shape().dims());
186 println!("Size: {}", tensor.size());
187 println!("Is contiguous: {}", tensor.is_contiguous());
188 println!("Device: {:?}", tensor.device());
189
190 // Mathematical operations
191 let sum = tensor.sum();
192 println!("Sum: {}", sum.value());
193
194 let mean = tensor.mean();
195 println!("Mean: {}", mean.value());
196
197 let norm = tensor.norm();
198 println!("Norm: {}", norm.value());
199
200 // Device placement
201 let cpu_tensor = Tensor::zeros_on_device(vec![3, 3], train_station::Device::cpu());
202 println!(
203 "CPU tensor: shape {:?}, device: {:?}",
204 cpu_tensor.shape().dims(),
205 cpu_tensor.device()
206 );
207}
More examples
317fn train_with_config(config: TrainingConfig) -> Result<TrainingStats, Box<dyn std::error::Error>> {
318 // Create training data
319 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5, 1]).unwrap();
320 let y_true = Tensor::from_slice(&[3.0, 5.0, 7.0, 9.0, 11.0], vec![5, 1]).unwrap();
321
322 // Create model parameters
323 let mut weight = Tensor::randn(vec![1, 1], Some(123)).with_requires_grad();
324 let mut bias = Tensor::zeros(vec![1]).with_requires_grad();
325
326 // Create optimizer with custom configuration
327 let adam_config = AdamConfig {
328 learning_rate: config.learning_rate,
329 beta1: config.beta1,
330 beta2: config.beta2,
331 eps: 1e-8,
332 weight_decay: config.weight_decay,
333 amsgrad: false,
334 };
335
336 let mut optimizer = Adam::with_config(adam_config);
337 optimizer.add_parameter(&weight);
338 optimizer.add_parameter(&bias);
339
340 // Training loop
341 let mut losses = Vec::new();
342 let mut convergence_epoch = config.epochs;
343
344 for epoch in 0..config.epochs {
345 // Forward pass
346 let y_pred = x_data.matmul(&weight) + &bias;
347 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
348
349 // Backward pass
350 loss.backward(None);
351
352 // Optimizer step
353 optimizer.step(&mut [&mut weight, &mut bias]);
354 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
355
356 let loss_value = loss.value();
357 losses.push(loss_value);
358
359 // Check for convergence (loss < 0.01)
360 if loss_value < 0.01 && convergence_epoch == config.epochs {
361 convergence_epoch = epoch;
362 }
363 }
364
365 Ok(TrainingStats {
366 config,
367 final_loss: losses[losses.len() - 1],
368 loss_history: losses,
369 convergence_epoch,
370 weight_norm: weight.norm().value(),
371 })
372}
176fn demonstrate_advanced_training() -> Result<(), Box<dyn std::error::Error>> {
177 println!("\n--- Advanced Training Patterns ---");
178
179 // Create a more complex model
180 let mut weight = Tensor::randn(vec![1, 2], Some(44)).with_requires_grad();
181 let mut bias = Tensor::zeros(vec![2]).with_requires_grad();
182
183 // Create optimizer with different learning rate
184 let mut optimizer = Adam::with_learning_rate(0.005);
185 optimizer.add_parameter(&weight);
186 optimizer.add_parameter(&bias);
187
188 // Create training data: y = 2*x + [1, 3]
189 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5, 1]).unwrap();
190 let y_true = Tensor::from_slice(
191 &[3.0, 5.0, 7.0, 9.0, 11.0, 6.0, 8.0, 10.0, 12.0, 14.0],
192 vec![5, 2],
193 )
194 .unwrap();
195
196 println!("Advanced training with monitoring:");
197 println!(" Initial learning rate: {}", optimizer.learning_rate());
198
199 // Training loop with monitoring
200 let num_epochs = 50;
201 let mut losses = Vec::new();
202 let mut weight_norms = Vec::new();
203 let mut gradient_norms = Vec::new();
204
205 for epoch in 0..num_epochs {
206 // Forward pass
207 let y_pred = x_data.matmul(&weight) + &bias;
208 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
209
210 // Backward pass
211 loss.backward(None);
212
213 // Compute gradient norm before optimizer step
214 let gradient_norm = weight.grad_owned().unwrap().norm();
215
216 // Optimizer step
217 optimizer.step(&mut [&mut weight, &mut bias]);
218 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
219
220 // Learning rate scheduling: reduce every 10 epochs
221 if epoch > 0 && epoch % 10 == 0 {
222 let current_lr = optimizer.learning_rate();
223 let new_lr = current_lr * 0.5;
224 optimizer.set_learning_rate(new_lr);
225 println!(
226 "Epoch {:2}: Reduced learning rate from {:.3} to {:.3}",
227 epoch, current_lr, new_lr
228 );
229 }
230
231 // Record metrics
232 losses.push(loss.value());
233 weight_norms.push(weight.norm().value());
234 gradient_norms.push(gradient_norm.value());
235
236 // Print detailed progress
237 if epoch % 10 == 0 || epoch == num_epochs - 1 {
238 println!(
239 "Epoch {:2}: Loss = {:.6}, Weight Norm = {:.6}, Gradient Norm = {:.6}",
240 epoch,
241 loss.value(),
242 weight.norm().value(),
243 gradient_norm.value()
244 );
245 }
246 }
247
248 println!("Final learning rate: {}", optimizer.learning_rate());
249
250 // Analyze training progression
251 let initial_loss = losses[0];
252 let final_loss = losses[losses.len() - 1];
253 let loss_reduction = (initial_loss - final_loss) / initial_loss * 100.0;
254
255 println!("\nTraining Analysis:");
256 println!(" Initial loss: {:.6}", initial_loss);
257 println!(" Final loss: {:.6}", final_loss);
258 println!(" Loss reduction: {:.1}%", loss_reduction);
259 println!(" Final weight norm: {:.6}", weight.norm().value());
260 println!(" Final bias: {:?}", bias.data());
261
262 Ok(())
263}
264
265/// Demonstrate learning rate scheduling
266fn demonstrate_learning_rate_scheduling() -> Result<(), Box<dyn std::error::Error>> {
267 println!("\n--- Learning Rate Scheduling ---");
268
269 // Create simple model
270 let mut weight = Tensor::randn(vec![1, 1], Some(45)).with_requires_grad();
271 let mut bias = Tensor::zeros(vec![1]).with_requires_grad();
272
273 // Create optimizer with high initial learning rate
274 let mut optimizer = Adam::with_learning_rate(0.1);
275 optimizer.add_parameter(&weight);
276 optimizer.add_parameter(&bias);
277
278 // Simple data
279 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3, 1]).unwrap();
280 let y_true = Tensor::from_slice(&[2.0, 4.0, 6.0], vec![3, 1]).unwrap();
281
282 println!("Initial learning rate: {}", optimizer.learning_rate());
283
284 // Training loop with learning rate scheduling
285 let num_epochs = 50;
286 let mut losses = Vec::new();
287
288 for epoch in 0..num_epochs {
289 // Forward pass
290 let y_pred = x_data.matmul(&weight) + &bias;
291 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
292
293 // Backward pass
294 loss.backward(None);
295
296 // Optimizer step
297 optimizer.step(&mut [&mut weight, &mut bias]);
298 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
299
300 // Learning rate scheduling: reduce every 10 epochs
301 if epoch > 0 && epoch % 10 == 0 {
302 let current_lr = optimizer.learning_rate();
303 let new_lr = current_lr * 0.5;
304 optimizer.set_learning_rate(new_lr);
305 println!(
306 "Epoch {:2}: Reduced learning rate from {:.3} to {:.3}",
307 epoch, current_lr, new_lr
308 );
309 }
310
311 losses.push(loss.value());
312
313 // Print progress
314 if epoch % 10 == 0 || epoch == num_epochs - 1 {
315 println!(
316 "Epoch {:2}: Loss = {:.6}, LR = {:.3}",
317 epoch,
318 loss.value(),
319 optimizer.learning_rate()
320 );
321 }
322 }
323
324 println!("Final learning rate: {}", optimizer.learning_rate());
325
326 Ok(())
327}
328
329/// Demonstrate training monitoring and analysis
330fn demonstrate_training_monitoring() -> Result<(), Box<dyn std::error::Error>> {
331 println!("\n--- Training Monitoring ---");
332
333 // Create model
334 let mut weight = Tensor::randn(vec![1, 1], Some(46)).with_requires_grad();
335 let mut bias = Tensor::zeros(vec![1]).with_requires_grad();
336
337 // Create optimizer
338 let mut optimizer = Adam::with_learning_rate(0.01);
339 optimizer.add_parameter(&weight);
340 optimizer.add_parameter(&bias);
341
342 // Training data
343 let x_data = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![4, 1]).unwrap();
344 let y_true = Tensor::from_slice(&[3.0, 5.0, 7.0, 9.0], vec![4, 1]).unwrap();
345
346 // Training loop with comprehensive monitoring
347 let num_epochs = 30;
348 let mut losses = Vec::new();
349 let mut weight_history = Vec::new();
350 let mut bias_history = Vec::new();
351
352 for epoch in 0..num_epochs {
353 // Forward pass
354 let y_pred = x_data.matmul(&weight) + &bias;
355 let mut loss = (&y_pred - &y_true).pow_scalar(2.0).mean();
356
357 // Backward pass
358 loss.backward(None);
359
360 // Optimizer step
361 optimizer.step(&mut [&mut weight, &mut bias]);
362 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
363
364 // Record history
365 losses.push(loss.value());
366 weight_history.push(weight.value());
367 bias_history.push(bias.value());
368
369 // Print detailed monitoring
370 if epoch % 5 == 0 || epoch == num_epochs - 1 {
371 println!(
372 "Epoch {:2}: Loss = {:.6}, Weight = {:.6}, Bias = {:.6}",
373 epoch,
374 loss.value(),
375 weight.value(),
376 bias.value()
377 );
378 }
379 }
380
381 // Analyze training progression
382 println!("\nTraining Analysis:");
383 println!(" Initial loss: {:.6}", losses[0]);
384 println!(" Final loss: {:.6}", losses[losses.len() - 1]);
385 println!(
386 " Loss reduction: {:.1}%",
387 (losses[0] - losses[losses.len() - 1]) / losses[0] * 100.0
388 );
389
390 // Compute statistics
391 let loss_mean = compute_mean(&losses);
392 let loss_std = compute_std(&losses);
393 let weight_change = (weight_history[weight_history.len() - 1] - weight_history[0]).abs();
394 let bias_change = (bias_history[bias_history.len() - 1] - bias_history[0]).abs();
395
396 println!(" Average loss: {:.6} ± {:.6}", loss_mean, loss_std);
397 println!(" Weight change: {:.6}", weight_change);
398 println!(" Bias change: {:.6}", bias_change);
399 println!(" Final weight norm: {:.6}", weight.norm().value());
400 println!(" Final bias: {:.6}", bias.value());
401
402 Ok(())
403}
pub fn norm_dims(&self, dims: &[usize], keepdim: bool) -> Tensor
Computes the L2 norm over specified dimensions
Reduces the tensor along the specified dimensions by computing the L2 norm of each slice. The result maintains the original tensor structure with reduced dimensions optionally preserved as size-1 dimensions.
§Arguments
- dims - Vector of dimension indices to reduce over (must be valid for tensor rank)
- keepdim - Whether to keep reduced dimensions as size-1 dimensions
§Returns
A tensor with L2 norm computed over the specified dimensions
§Examples
use train_station::Tensor;
// Norm along rows (dimension 1) with keepdim=true
let matrix = Tensor::from_slice(&[3.0, 4.0, 0.0, 5.0], vec![2, 2]).unwrap();
let row_norms = matrix.norm_dims(&[1], true);
assert_eq!(row_norms.shape().dims(), vec![2, 1]);
assert!((row_norms.get(&[0, 0]) - 5.0).abs() < 1e-6); // sqrt(3² + 4²)
assert!((row_norms.get(&[1, 0]) - 5.0).abs() < 1e-6); // sqrt(0² + 5²)
use train_station::Tensor;
// Norm along columns (dimension 0) with keepdim=false
let matrix = Tensor::from_slice(&[3.0, 4.0, 0.0, 5.0], vec![2, 2]).unwrap();
let col_norms = matrix.norm_dims(&[0], false);
assert_eq!(col_norms.shape().dims(), vec![2]);
assert!((col_norms.get(&[0]) - 3.0).abs() < 1e-6); // sqrt(3² + 0²)
assert!((col_norms.get(&[1]) - 6.403).abs() < 1e-3); // sqrt(4² + 5²)
use train_station::Tensor;
// Norm over multiple dimensions
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let norm_all = tensor.norm_dims(&[0, 1], false);
assert_eq!(norm_all.shape().dims(), vec![1]);
// sqrt(1² + 2² + 3² + 4²) = sqrt(30) ≈ 5.477
assert!((norm_all.get(&[0]) - 30.0_f32.sqrt()).abs() < 1e-5);
§Panics
- If dims is empty
- If any dimension index is out of bounds for the tensor rank
§Performance
Uses efficient coordinate-based iteration that works correctly with both contiguous and non-contiguous tensor layouts.
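The same decomposition works per dimension; an illustrative sketch assuming pow_scalar, sum_dims, and sqrt compose as in the other examples on this page:
use train_station::Tensor;
let m = Tensor::from_slice(&[3.0, 4.0, 0.0, 5.0], vec![2, 2]).unwrap();
let direct = m.norm_dims(&[1], true);
// Per-row L2 norm rebuilt from pow_scalar, sum_dims, and sqrt
let composed = m.pow_scalar(2.0).sum_dims(&[1], true).sqrt();
assert!((direct.get(&[0, 0]) - composed.get(&[0, 0])).abs() < 1e-6);
assert!((direct.get(&[1, 0]) - composed.get(&[1, 0])).abs() < 1e-6);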
impl Tensor
pub fn std(&self) -> Tensor
Computes the standard deviation over all elements
The standard deviation is calculated as sqrt(variance) where variance is the mean of squared differences from the mean. This operation reduces the tensor to a scalar tensor with shape [1].
The implementation uses population standard deviation (divides by n rather than n-1) to match PyTorch’s default behavior.
§Returns
A scalar tensor containing the standard deviation value
§Examples
use train_station::Tensor;
// Basic standard deviation calculation
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![4]).unwrap();
let std_dev = tensor.std();
assert!((std_dev.get(&[0]) - 1.118_034).abs() < 1e-5);
use train_station::Tensor;
// Standard deviation of a larger dataset
let data = vec![1.0, 3.0, 5.0, 7.0, 2.0, 4.0, 6.0, 8.0];
let tensor = Tensor::from_slice(&data, vec![2, 2, 2]).unwrap();
let std_dev = tensor.std();
// mean=4.5, var=5.25, std=sqrt(5.25)≈2.291
let expected = 5.25_f32.sqrt();
assert!((std_dev.get(&[0]) - expected).abs() < 1e-5);
use train_station::Tensor;
// Standard deviation of constant values (should be 0)
let tensor = Tensor::from_slice(&[5.0, 5.0, 5.0, 5.0], vec![4]).unwrap();
let std_dev = tensor.std();
assert!((std_dev.get(&[0]) - 0.0).abs() < 1e-6);
§Performance
Uses optimized contiguous tensor path with 4x loop unrolling for better performance. Non-contiguous tensors use stride-aware iteration. The algorithm performs two passes: first to compute the mean, then to compute the variance.
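The two-pass definition can be cross-checked against the documented building blocks; this is an illustrative sketch assuming sub_scalar, pow_scalar, mean, and sqrt compose as shown elsewhere on this page:
use train_station::Tensor;
let x = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![4]).unwrap();
let direct = x.std();
// Population standard deviation rebuilt by hand: sqrt(mean((x - mean(x))^2))
let mu = x.mean().value();
let composed = x.sub_scalar(mu).pow_scalar(2.0).mean().sqrt();
assert!((direct.get(&[0]) - composed.get(&[0])).abs() < 1e-5);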
Examples found in repository
87fn demonstrate_data_pipeline() -> Result<(), Box<dyn std::error::Error>> {
88 println!("\n--- Data Processing Pipeline ---");
89
90 // Simulate raw sensor data with noise
91 let raw_data: Vec<f32> = (0..20)
92 .map(|i| {
93 let base = i as f32 * 0.5;
94 let noise = (i % 3) as f32 * 0.1;
95 base + noise
96 })
97 .collect();
98
99 let tensor = Tensor::from_slice(&raw_data, vec![20])?;
100 println!("Raw sensor data: {:?}", tensor.data());
101
102 // Multi-stage processing pipeline
103 println!("\nProcessing pipeline:");
104 println!("1. Normalize data (z-score)");
105 println!("2. Apply smoothing filter");
106 println!("3. Detect outliers");
107 println!("4. Apply feature scaling");
108
109 // Stage 1: Normalization
110 let mean = tensor.mean().value();
111 let std = tensor.std().value();
112 let normalized: Tensor = tensor
113 .iter()
114 .map(|elem| elem.sub_scalar(mean).div_scalar(std))
115 .collect();
116 println!(
117 " Normalized (mean={:.3}, std={:.3}): {:?}",
118 mean,
119 std,
120 normalized.data()
121 );
122
123 // Stage 2: Smoothing (simple moving average)
124 let smoothed: Tensor = normalized
125 .iter()
126 .enumerate()
127 .map(|(i, elem)| {
128 if i == 0 || i == normalized.size() - 1 {
129 elem.clone()
130 } else {
131 // Simple 3-point average
132 let prev = normalized.element_view(i - 1);
133 let next = normalized.element_view(i + 1);
134 elem.add_tensor(&prev).add_tensor(&next).div_scalar(3.0)
135 }
136 })
137 .collect();
138 println!(" Smoothed: {:?}", smoothed.data());
139
140 // Stage 3: Outlier detection and removal
141 let outlier_threshold = 2.0;
142 let cleaned: Tensor = smoothed
143 .iter()
144 .filter(|elem| elem.value().abs() < outlier_threshold)
145 .collect();
146 println!(
147 " Outliers removed (threshold={}): {:?}",
148 outlier_threshold,
149 cleaned.data()
150 );
151
152 // Stage 4: Feature scaling to [0, 1] range
153 let min_val = cleaned
154 .iter()
155 .map(|e| e.value())
156 .fold(f32::INFINITY, f32::min);
157 let max_val = cleaned
158 .iter()
159 .map(|e| e.value())
160 .fold(f32::NEG_INFINITY, f32::max);
161 let scaled: Tensor = cleaned
162 .iter()
163 .map(|elem| elem.sub_scalar(min_val).div_scalar(max_val - min_val))
164 .collect();
165 println!(" Scaled to [0,1]: {:?}", scaled.data());
166
167 Ok(())
168}
169
170/// Demonstrate conditional processing patterns
171///
172/// Shows how to implement dynamic filtering and transformation
173/// based on data characteristics and conditions.
174fn demonstrate_conditional_processing() -> Result<(), Box<dyn std::error::Error>> {
175 println!("\n--- Conditional Processing ---");
176
177 // Create data with mixed characteristics
178 let data = vec![1.0, -2.0, 3.0, -4.0, 5.0, -6.0, 7.0, -8.0, 9.0, -10.0];
179 let tensor = Tensor::from_slice(&data, vec![10])?;
180 println!("Input data: {:?}", tensor.data());
181
182 // Conditional transformation based on sign
183 println!("\nConditional transformation (positive/negative handling):");
184 let processed: Tensor = tensor
185 .iter()
186 .map(|elem| {
187 let val = elem.value();
188 if val > 0.0 {
189 elem.pow_scalar(2.0) // Square positive values
190 } else {
191 elem.mul_scalar(-1.0).sqrt() // Square root of absolute negative values
192 }
193 })
194 .collect();
195 println!(" Processed: {:?}", processed.data());
196
197 // Adaptive filtering based on local statistics
198 println!("\nAdaptive filtering (remove values > 2 std from local mean):");
199 let window_size = 3;
200 let adaptive_filtered: Tensor = tensor
201 .iter()
202 .enumerate()
203 .filter(|(i, elem)| {
204 let start = i.saturating_sub(window_size / 2);
205 let end = (i + window_size / 2 + 1).min(tensor.size());
206
207 // Calculate local mean and std
208 let local_values: Vec<f32> = (start..end)
209 .map(|j| tensor.element_view(j).value())
210 .collect();
211
212 let local_mean = local_values.iter().sum::<f32>() / local_values.len() as f32;
213 let local_variance = local_values
214 .iter()
215 .map(|v| (v - local_mean).powi(2))
216 .sum::<f32>()
217 / local_values.len() as f32;
218 let local_std = local_variance.sqrt();
219
220 let threshold = local_mean + 2.0 * local_std;
221 elem.value() <= threshold
222 })
223 .map(|(_, elem)| elem)
224 .collect();
225 println!(" Adaptive filtered: {:?}", adaptive_filtered.data());
226
227 // Multi-condition processing
228 println!("\nMulti-condition processing:");
229 let multi_processed: Tensor = tensor
230 .iter()
231 .map(|elem| {
232 let val = elem.value();
233 match () {
234 _ if val > 5.0 => elem.mul_scalar(2.0), // Double large values
235 _ if val < -5.0 => elem.div_scalar(2.0), // Halve small values
236 _ if val.abs() < 2.0 => elem.add_scalar(1.0), // Add 1 to small values
237 _ => elem.clone(), // Keep others unchanged
238 }
239 })
240 .collect();
241 println!(" Multi-condition: {:?}", multi_processed.data());
242
243 Ok(())
244}
245
246/// Demonstrate batch processing operations
247///
248/// Shows efficient processing of large datasets using iterator
249/// patterns and batch operations for performance optimization.
250fn demonstrate_batch_operations() -> Result<(), Box<dyn std::error::Error>> {
251 println!("\n--- Batch Operations ---");
252
253 // Create a larger dataset for batch processing
254 let size = 100;
255 let data: Vec<f32> = (0..size)
256 .map(|i| {
257 let x = i as f32 / size as f32;
258 x * x + 0.1 * (i % 7) as f32 // Quadratic with some noise
259 })
260 .collect();
261
262 let tensor = Tensor::from_slice(&data, vec![size])?;
263 println!("Dataset size: {}", tensor.size());
264
265 // Batch processing with windowing (iterator views)
266 println!("\nBatch processing with sliding windows:");
267 let batch_size = 10;
268 let batches: Vec<Tensor> = tensor
269 .iter()
270 .collect::<Vec<_>>()
271 .chunks(batch_size)
272 .map(|chunk| {
273 // Process each batch independently
274 chunk
275 .iter()
276 .map(|elem| elem.pow_scalar(2.0).add_scalar(1.0))
277 .collect()
278 })
279 .collect();
280
281 println!(
282 " Processed {} batches of size {}",
283 batches.len(),
284 batch_size
285 );
286 for (i, batch) in batches.iter().enumerate() {
287 println!(
288 " Batch {}: mean={:.3}, std={:.3}",
289 i,
290 batch.mean().value(),
291 batch.std().value()
292 );
293 }
294
295 // Parallel-like processing with stride
296 println!("\nStrided processing (every nth element):");
297 let stride = 5;
298 let strided: Tensor = tensor
299 .iter()
300 .enumerate()
301 .filter(|(i, _)| i % stride == 0)
302 .map(|(_, elem)| elem)
303 .collect();
304 println!(" Strided (every {}th): {:?}", stride, strided.data());
305
306 // Hierarchical processing
307 println!("\nHierarchical processing (coarse to fine):");
308 let coarse: Tensor = tensor
309 .iter()
310 .enumerate()
311 .filter(|(i, _)| i % 4 == 0) // Take every 4th element
312 .map(|(_, elem)| elem)
313 .collect();
314
315 let fine: Tensor = tensor
316 .iter()
317 .enumerate()
318 .filter(|(i, _)| i % 4 != 0) // Take the rest
319 .map(|(_, elem)| elem)
320 .collect();
321
322 println!(" Coarse (every 4th): {:?}", coarse.data());
323 println!(" Fine (rest): {:?}", fine.data());
324
325 // Combine coarse and fine with different processing
326 let combined: Tensor = coarse
327 .iter()
328 .map(|elem| elem.mul_scalar(2.0)) // Scale coarse
329 .chain(fine.iter().map(|elem| elem.div_scalar(2.0))) // Scale fine
330 .collect();
331 println!(" Combined: {:?}", combined.data());
332
333 Ok(())
334}
335
336/// Demonstrate real-world processing scenarios
337///
338/// Shows practical applications of iterator patterns for
339/// common data processing tasks in machine learning and analytics.
340fn demonstrate_real_world_scenarios() -> Result<(), Box<dyn std::error::Error>> {
341 println!("\n--- Real-world Scenarios ---");
342
343 // Scenario 1: Time series analysis
344 println!("\nScenario 1: Time Series Analysis");
345 let time_series: Vec<f32> = (0..24)
346 .map(|hour| {
347 let base = 20.0 + 10.0 * (hour as f32 * std::f32::consts::PI / 12.0).sin();
348 base + (hour % 3) as f32 * 2.0 // Add some noise
349 })
350 .collect();
351
352 let series = Tensor::from_slice(&time_series, vec![24])?;
353 println!(" Time series (24 hours): {:?}", series.data());
354
355 // Calculate moving average with view-based iteration
356 let window_size = 3;
357 let moving_avg: Tensor = series
358 .iter()
359 .enumerate()
360 .map(|(i, _)| {
361 let start = i.saturating_sub(window_size / 2);
362 let end = (i + window_size / 2 + 1).min(series.size());
363 let window = series.iter_range(start, end);
364 window.fold(0.0, |acc, elem| acc + elem.value()) / (end - start) as f32
365 })
366 .map(|val| Tensor::from_slice(&[val], vec![1]).unwrap())
367 .collect();
368 println!(
369 " Moving average (window={}): {:?}",
370 window_size,
371 moving_avg.data()
372 );
373
374 // Inference pipeline with NoGrad + streaming
375 println!("\nInference pipeline (NoGrad + streaming)");
376 let features = Tensor::from_slice(
377 &(0..48).map(|i| i as f32 * 0.125).collect::<Vec<_>>(),
378 vec![6, 8],
379 )?;
380 let fast = with_no_grad(|| {
381 // Stream values directly, apply light affine, and collect back to same shape
382 features
383 .data()
384 .iter()
385 .copied()
386 .map(|x| 0.75 * x + 0.1)
387 .collect_shape(vec![6, 8])
388 });
389 println!(
390 " NoGrad streamed transform shape: {:?}",
391 fast.shape().dims()
392 );
393
394 // Row-wise iteration with shape-preserving collection (GradTrack-friendly)
395 let per_row: Tensor = features
396 .iter()
397 .map(|row| row.mul_scalar(0.5).add_scalar(2.0))
398 .collect_shape(vec![6, 8]);
399 println!(" Row-wise mapped shape: {:?}", per_row.shape().dims());
400
401 // Scenario 2: Feature engineering
402 println!("\nScenario 2: Feature Engineering");
403 let features = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5])?;
404 println!(" Original features: {:?}", features.data());
405
406 // Create polynomial features
407 let poly_features: Tensor = features
408 .iter()
409 .flat_map(|elem| {
410 vec![
411 elem.clone(), // x^1
412 elem.pow_scalar(2.0), // x^2
413 elem.pow_scalar(3.0), // x^3
414 ]
415 })
416 .collect();
417 println!(
418 " Polynomial features (x, x^2, x^3): {:?}",
419 poly_features.data()
420 );
421
422 // Scenario 3: Data augmentation
423 println!("\nScenario 3: Data Augmentation");
424 let original = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3])?;
425 println!(" Original data: {:?}", original.data());
426
427 // Augment with noise and scaling
428 let augmented: Tensor = original
429 .iter()
430 .flat_map(|elem| {
431 vec![
432 elem.clone(), // Original
433 elem.add_scalar(0.1), // Add noise
434 elem.sub_scalar(0.1), // Subtract noise
435 elem.mul_scalar(1.1), // Scale up
436 elem.mul_scalar(0.9), // Scale down
437 ]
438 })
439 .collect();
440 println!(" Augmented data: {:?}", augmented.data());
441
442 // Scenario 4: Statistical analysis
443 println!("\nScenario 4: Statistical Analysis");
444 let sample_data = Tensor::from_slice(&[1.1, 2.3, 1.8, 2.1, 1.9, 2.0, 1.7, 2.2], vec![8])?;
445 println!(" Sample data: {:?}", sample_data.data());
446
447 // Calculate various statistics
448 let mean = sample_data.mean().value();
449 let std = sample_data.std().value();
450 let min = sample_data
451 .iter()
452 .map(|e| e.value())
453 .fold(f32::INFINITY, f32::min);
454 let max = sample_data
455 .iter()
456 .map(|e| e.value())
457 .fold(f32::NEG_INFINITY, f32::max);
458
459 // Z-score normalization
460 let z_scores: Tensor = sample_data
461 .iter()
462 .map(|elem| elem.sub_scalar(mean).div_scalar(std))
463 .collect();
464
465 println!(
466 " Statistics: mean={:.3}, std={:.3}, min={:.3}, max={:.3}",
467 mean, std, min, max
468 );
469 println!(" Z-scores: {:?}", z_scores.data());
470
471 Ok(())
472}
pub fn std_dims(&self, dims: &[usize], keepdim: bool) -> Tensor
Computes the standard deviation over specified dimensions
Reduces the tensor along the specified dimensions by computing the standard deviation of each slice. The result maintains the original tensor structure with reduced dimensions optionally preserved as size-1 dimensions.
Uses population standard deviation (divides by n rather than n-1) to match PyTorch’s default behavior.
§Arguments
- dims - Vector of dimension indices to reduce over (must be valid for tensor rank)
- keepdim - Whether to keep reduced dimensions as size-1 dimensions
§Returns
A tensor with standard deviation computed over the specified dimensions
§Examples
use train_station::Tensor;
// Standard deviation along rows (dimension 1) with keepdim=true
let matrix = Tensor::from_slice(&[1.0, 3.0, 2.0, 2.0], vec![2, 2]).unwrap();
let row_stds = matrix.std_dims(&[1], true);
assert_eq!(row_stds.shape().dims(), vec![2, 1]);
assert!((row_stds.get(&[0, 0]) - 1.0).abs() < 1e-6); // std([1, 3]) = 1.0
assert!((row_stds.get(&[1, 0]) - 0.0).abs() < 1e-6); // std([2, 2]) = 0.0
use train_station::Tensor;
// Standard deviation along columns (dimension 0) with keepdim=false
let matrix = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let col_stds = matrix.std_dims(&[0], false);
assert_eq!(col_stds.shape().dims(), vec![2]);
// std([1, 3]) = 1.0, std([2, 4]) = 1.0
assert!((col_stds.get(&[0]) - 1.0).abs() < 1e-6);
assert!((col_stds.get(&[1]) - 1.0).abs() < 1e-6);
use train_station::Tensor;
// Standard deviation over multiple dimensions
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let std_all = tensor.std_dims(&[0, 1], false);
assert_eq!(std_all.shape().dims(), vec![1]);
// std([1, 2, 3, 4]) = sqrt(1.25) ≈ 1.118
assert!((std_all.get(&[0]) - 1.25_f32.sqrt()).abs() < 1e-5);
§Panics
- If dims is empty
- If any dimension index is out of bounds for the tensor rank
- If the reduced size is 0 (invalid for standard deviation calculation)
§Performance
Uses efficient coordinate-based iteration that works correctly with both contiguous and non-contiguous tensor layouts. The algorithm performs two passes: first to compute means, then to compute variances.
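A per-row standardization sketch; this illustration assumes sub_tensor broadcasts a keepdim result across the reduced dimension, as it does in the log-softmax repository examples above:
use train_station::Tensor;
let x = Tensor::from_slice(&[1.0, 3.0, 2.0, 2.0], vec![2, 2]).unwrap();
// Center each row, then inspect the per-row spread
let centered = x.sub_tensor(&x.mean_dims(&[1], true)); // broadcasts the [2, 1] row means
let row_stds = x.std_dims(&[1], true);
assert_eq!(centered.shape().dims(), vec![2, 2]);
assert!((row_stds.get(&[0, 0]) - 1.0).abs() < 1e-6); // std([1, 3]) = 1.0
assert!((row_stds.get(&[1, 0]) - 0.0).abs() < 1e-6); // std([2, 2]) = 0.0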
impl Tensor
pub fn sum(&self) -> Tensor
Returns the sum of all elements in the tensor
This operation computes the sum of all elements across all dimensions, reducing the tensor to a scalar value. The output is a tensor with shape [1] containing the sum as a float.
When requires_grad is enabled, this operation supports automatic gradient
tracking through the GradTrack system.
§Returns
A tensor with shape [1] containing the sum of all elements
§Examples
use train_station::Tensor;
// Basic sum calculation
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let total = tensor.sum();
assert_eq!(total.shape().dims(), vec![1]);
assert_eq!(total.get(&[0]), 10.0); // 1 + 2 + 3 + 4 = 10
use train_station::Tensor;
// Sum with gradient tracking
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3])
.unwrap()
.with_requires_grad();
let mut total = tensor.sum();
total.backward(None);
let grad = tensor.grad_owned().expect("gradient should exist");
// Gradient should be [1.0, 1.0, 1.0] for each element
assert_eq!(grad.get(&[0]), 1.0);
assert_eq!(grad.get(&[1]), 1.0);
assert_eq!(grad.get(&[2]), 1.0);
use train_station::Tensor;
// Sum of empty tensor
let tensor = Tensor::new(vec![0]);
let total = tensor.sum();
assert_eq!(total.get(&[0]), 0.0); // Sum of empty tensor is 0
§Performance
Uses optimized contiguous tensor path with 4x loop unrolling for better performance. Non-contiguous tensors use stride-aware iteration.
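As a quick cross-check against mean (an illustration, assuming div_scalar and size behave as documented on this page):
use train_station::Tensor;
let t = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
// mean should equal sum divided by the element count
let scaled_sum = t.sum().div_scalar(t.size() as f32);
assert!((scaled_sum.get(&[0]) - t.mean().get(&[0])).abs() < 1e-6);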
Examples found in repository
More examples
179fn demonstrate_utility_functions() {
180 println!("\n--- Utility Functions ---");
181
182 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
183
184 // Basic properties
185 println!("Shape: {:?}", tensor.shape().dims());
186 println!("Size: {}", tensor.size());
187 println!("Is contiguous: {}", tensor.is_contiguous());
188 println!("Device: {:?}", tensor.device());
189
190 // Mathematical operations
191 let sum = tensor.sum();
192 println!("Sum: {}", sum.value());
193
194 let mean = tensor.mean();
195 println!("Mean: {}", mean.value());
196
197 let norm = tensor.norm();
198 println!("Norm: {}", norm.value());
199
200 // Device placement
201 let cpu_tensor = Tensor::zeros_on_device(vec![3, 3], train_station::Device::cpu());
202 println!(
203 "CPU tensor: shape {:?}, device: {:?}",
204 cpu_tensor.shape().dims(),
205 cpu_tensor.device()
206 );
207}
175fn demonstrate_gradient_tracking() -> Result<(), Box<dyn std::error::Error>> {
176 println!("\n--- Gradient Tracking ---");
177
178 // Create a tensor with gradient tracking enabled
179 let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3])?.with_requires_grad();
180 println!("Input tensor (requires_grad): {:?}", tensor.data());
181
182 // Perform element-wise operations through iteration
183 let result: Tensor = tensor
184 .iter()
185 .map(|elem| {
186 // Apply a complex transformation: (x^2 + 1) * 2
187 elem.pow_scalar(2.0).add_scalar(1.0).mul_scalar(2.0)
188 })
189 .collect();
190
191 println!("Result tensor: {:?}", result.data());
192 println!("Result requires_grad: {}", result.requires_grad());
193
194 // Compute gradients
195 let mut loss = result.sum();
196 loss.backward(None);
197
198 println!("Loss: {:.6}", loss.value());
199 println!("Input gradients: {:?}", tensor.grad().map(|g| g.data()));
200
201 Ok(())
202}
109fn demonstrate_optimizer_serialization() -> Result<(), Box<dyn std::error::Error>> {
110 println!("\n--- Optimizer Serialization ---");
111
112 // Create an optimizer with some parameters
113 let mut weight = Tensor::randn(vec![2, 2], Some(42)).with_requires_grad();
114 let mut bias = Tensor::randn(vec![2], Some(43)).with_requires_grad();
115
116 let config = AdamConfig {
117 learning_rate: 0.001,
118 beta1: 0.9,
119 beta2: 0.999,
120 eps: 1e-8,
121 weight_decay: 0.0,
122 amsgrad: false,
123 };
124
125 let mut optimizer = Adam::with_config(config);
126 optimizer.add_parameter(&weight);
127 optimizer.add_parameter(&bias);
128
129 println!(
130 "Created optimizer with {} parameters",
131 optimizer.parameter_count()
132 );
133 println!("Learning rate: {}", optimizer.learning_rate());
134
135 // Simulate some training steps
136 for _ in 0..3 {
137 let mut loss = weight.sum() + bias.sum();
138 loss.backward(None);
139 optimizer.step(&mut [&mut weight, &mut bias]);
140 optimizer.zero_grad(&mut [&mut weight, &mut bias]);
141 }
142
143 // Save optimizer state
144 let optimizer_path = "temp_optimizer.json";
145 optimizer.save_json(optimizer_path)?;
146 println!("Saved optimizer to: {}", optimizer_path);
147
148 // Load optimizer state
149 let loaded_optimizer = Adam::load_json(optimizer_path)?;
150 println!(
151 "Loaded optimizer with {} parameters",
152 loaded_optimizer.parameter_count()
153 );
154 println!("Learning rate: {}", loaded_optimizer.learning_rate());
155
156 // Verify optimizer state
157 assert_eq!(
158 optimizer.parameter_count(),
159 loaded_optimizer.parameter_count()
160 );
161 assert_eq!(optimizer.learning_rate(), loaded_optimizer.learning_rate());
162 println!("Optimizer serialization verification: PASSED");
163
164 Ok(())
165}
166
167/// Demonstrate format comparison and performance characteristics
168fn demonstrate_format_comparison() -> Result<(), Box<dyn std::error::Error>> {
169 println!("\n--- Format Comparison ---");
170
171 // Create a larger tensor for comparison
172 let tensor = Tensor::randn(vec![10, 10], Some(44));
173
174 // Save in both formats
175 tensor.save_json("temp_comparison.json")?;
176 tensor.save_binary("temp_comparison.bin")?;
177
178 // Compare file sizes
179 let json_size = fs::metadata("temp_comparison.json")?.len();
180 let binary_size = fs::metadata("temp_comparison.bin")?.len();
181
182 println!("JSON file size: {} bytes", json_size);
183 println!("Binary file size: {} bytes", binary_size);
184 println!(
185 "Compression ratio: {:.2}x",
186 json_size as f64 / binary_size as f64
187 );
188
189 // Load and verify both formats
190 let json_tensor = Tensor::load_json("temp_comparison.json")?;
191 let binary_tensor = Tensor::load_binary("temp_comparison.bin")?;
192
193 assert_eq!(tensor.shape().dims(), json_tensor.shape().dims());
194 assert_eq!(tensor.shape().dims(), binary_tensor.shape().dims());
195 assert_eq!(tensor.data(), json_tensor.data());
196 assert_eq!(tensor.data(), binary_tensor.data());
197
198 println!("Format comparison verification: PASSED");
199
200 Ok(())
201}
202
203/// Demonstrate a basic model checkpointing workflow
204fn demonstrate_model_checkpointing() -> Result<(), Box<dyn std::error::Error>> {
205 println!("\n--- Model Checkpointing ---");
206
207 // Create a simple model (weights and bias)
208 let mut weights = Tensor::randn(vec![2, 1], Some(45)).with_requires_grad();
209 let mut bias = Tensor::randn(vec![1], Some(46)).with_requires_grad();
210
211 // Create optimizer
212 let mut optimizer = Adam::with_learning_rate(0.01);
213 optimizer.add_parameter(&weights);
214 optimizer.add_parameter(&bias);
215
216 println!("Initial weights: {:?}", weights.data());
217 println!("Initial bias: {:?}", bias.data());
218
219 // Simulate training
220 for epoch in 0..5 {
221 let mut loss = weights.sum() + bias.sum();
222 loss.backward(None);
223 optimizer.step(&mut [&mut weights, &mut bias]);
224 optimizer.zero_grad(&mut [&mut weights, &mut bias]);
225
226 if epoch % 2 == 0 {
227 // Save checkpoint
228 let checkpoint_dir = format!("checkpoint_epoch_{}", epoch);
229 fs::create_dir_all(&checkpoint_dir)?;
230
231 weights.save_json(format!("{}/weights.json", checkpoint_dir))?;
232 bias.save_json(format!("{}/bias.json", checkpoint_dir))?;
233 optimizer.save_json(format!("{}/optimizer.json", checkpoint_dir))?;
234
235 println!("Saved checkpoint for epoch {}", epoch);
236 }
237 }
238
239 // Load from checkpoint
240 let loaded_weights = Tensor::load_json("checkpoint_epoch_4/weights.json")?;
241 let loaded_bias = Tensor::load_json("checkpoint_epoch_4/bias.json")?;
242 let loaded_optimizer = Adam::load_json("checkpoint_epoch_4/optimizer.json")?;
243
244 println!("Loaded weights: {:?}", loaded_weights.data());
245 println!("Loaded bias: {:?}", loaded_bias.data());
246 println!(
247 "Loaded optimizer learning rate: {}",
248 loaded_optimizer.learning_rate()
249 );
250
251 // Verify checkpoint integrity
252 assert_eq!(weights.shape().dims(), loaded_weights.shape().dims());
253 assert_eq!(bias.shape().dims(), loaded_bias.shape().dims());
254 assert_eq!(optimizer.learning_rate(), loaded_optimizer.learning_rate());
255
256 println!("Checkpointing verification: PASSED");
257
258 Ok(())
259}
319fn demonstrate_optimization_techniques() -> Result<(), Box<dyn std::error::Error>> {
320 println!("\n--- Optimization Techniques ---");
321
322 let size = 50000;
323 let data: Vec<f32> = (0..size).map(|i| i as f32).collect();
324 let tensor = Tensor::from_slice(&data, vec![size])?;
325
326 println!("Optimizing processing for size: {}", size);
327
328 // Technique 1: Operation fusion
329 println!("\nTechnique 1: Operation Fusion");
330 let start = Instant::now();
331 let fused_result: Tensor = tensor
332 .iter()
333 .map(|elem| {
334 // Fuse multiple operations into single chain
335 elem.mul_scalar(2.0).add_scalar(1.0).pow_scalar(2.0).sqrt()
336 })
337 .collect();
338 let fused_time = start.elapsed();
339
340 // Technique 2: Conditional optimization
341 println!("\nTechnique 2: Conditional Optimization");
342 let start = Instant::now();
343 let conditional_result: Tensor = tensor
344 .iter()
345 .map(|elem| {
346 let val = elem.value();
347 if val < size as f32 / 2.0 {
348 elem.mul_scalar(2.0) // Simple operation for small values
349 } else {
350 elem.pow_scalar(2.0).sqrt() // Complex operation for large values
351 }
352 })
353 .collect();
354 let conditional_time = start.elapsed();
355
356 // Technique 3: Cache-friendly processing
357 println!("\nTechnique 3: Cache-Friendly Processing");
358 let start = Instant::now();
359 let cache_friendly_result: Tensor = tensor
360 .iter()
361 .take(1000) // Process in cache-friendly chunks
362 .map(|elem| elem.mul_scalar(2.0))
363 .collect();
364 let cache_friendly_time = start.elapsed();
365
366 // Technique 4: Memory pooling simulation
367 println!("\nTechnique 4: Memory Pooling Simulation");
368 let start = Instant::now();
369 let pooled_result: Tensor = tensor
370 .iter()
371 .enumerate()
372 .filter(|(i, _)| i % 100 == 0) // Process every 100th element
373 .map(|(_, elem)| elem.pow_scalar(2.0))
374 .collect();
375 let pooled_time = start.elapsed();
376
377 // Report optimization results
378 println!(" Fused operations: {:?}", fused_time);
379 println!(" Conditional optimization: {:?}", conditional_time);
380 println!(" Cache-friendly processing: {:?}", cache_friendly_time);
381 println!(" Memory pooling simulation: {:?}", pooled_time);
382
383 // Performance analysis
384 let fastest = fused_time
385 .min(conditional_time)
386 .min(cache_friendly_time)
387 .min(pooled_time);
388 println!(" Fastest technique: {:?}", fastest);
389
390 // Memory efficiency analysis
391 println!(" Fused result size: {}", fused_result.size());
392 println!(" Conditional result size: {}", conditional_result.size());
393 println!(
394 " Cache-friendly result size: {}",
395 cache_friendly_result.size()
396 );
397 println!(" Pooled result size: {}", pooled_result.size());
398
399 // Technique 5: Gradient optimization
400 println!("\nTechnique 5: Gradient Optimization");
401 let grad_tensor = tensor.with_requires_grad();
402 let start = Instant::now();
403
404 let grad_result: Tensor = grad_tensor
405 .iter()
406 .map(|elem| elem.pow_scalar(2.0).add_scalar(1.0))
407 .collect();
408
409 let mut loss = grad_result.sum();
410 loss.backward(None);
411 let grad_time = start.elapsed();
412
413 println!(" Gradient computation: {:?}", grad_time);
414 println!(
415 " Gradient tracking enabled: {}",
416 grad_result.requires_grad()
417 );
418
419 Ok(())
420}
Sourcepub fn sum_dims(&self, dims: &[usize], keepdim: bool) -> Tensor
pub fn sum_dims(&self, dims: &[usize], keepdim: bool) -> Tensor
Returns the sum of elements along specified dimensions
This operation computes the sum of elements along the specified dimensions, reducing the tensor while optionally preserving the reduced dimensions as size-1 dimensions.
The output shape depends on the keepdim parameter:
- If keepdim is true, the reduced dimensions are kept with size 1
- If keepdim is false, the reduced dimensions are removed
When requires_grad is enabled, this operation supports automatic gradient
tracking through the GradTrack system.
§Arguments
- dims - Vector of dimension indices to sum over (must be valid for tensor rank)
- keepdim - Whether to keep reduced dimensions as size-1 dimensions
§Returns
A tensor with sum computed over the specified dimensions
§Panics
- If dims is empty
- If any dimension index is out of bounds for the tensor rank
§Examples
use train_station::Tensor;
// Sum along rows (dimension 0) with keepdim=false
let matrix = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let row_sums = matrix.sum_dims(&[0], false);
assert_eq!(row_sums.shape().dims(), vec![2]);
assert_eq!(row_sums.get(&[0]), 4.0); // 1 + 3 = 4
assert_eq!(row_sums.get(&[1]), 6.0); // 2 + 4 = 6
use train_station::Tensor;
// Sum along columns (dimension 1) with keepdim=true
let matrix = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let col_sums = matrix.sum_dims(&[1], true);
assert_eq!(col_sums.shape().dims(), vec![2, 1]);
assert_eq!(col_sums.get(&[0, 0]), 3.0); // 1 + 2 = 3
assert_eq!(col_sums.get(&[1, 0]), 7.0); // 3 + 4 = 7
use train_station::Tensor;
// Sum over multiple dimensions
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let total = tensor.sum_dims(&[0, 1], false);
assert_eq!(total.shape().dims(), vec![1]);
assert_eq!(total.get(&[0]), 10.0); // 1 + 2 + 3 + 4 = 10
use train_station::Tensor;
// Sum with gradient tracking
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2])
.unwrap()
.with_requires_grad();
let mut row_sums = tensor.sum_dims(&[0], false);
row_sums.backward(None);
let grad = tensor.grad_owned().expect("gradient should exist");
// Gradient should be [1.0, 1.0, 1.0, 1.0] for each element
assert_eq!(grad.get(&[0, 0]), 1.0);
assert_eq!(grad.get(&[0, 1]), 1.0);
assert_eq!(grad.get(&[1, 0]), 1.0);
assert_eq!(grad.get(&[1, 1]), 1.0);
§Performance
Uses efficient coordinate-based iteration that works correctly with both contiguous and non-contiguous tensor layouts.
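Because the reduction iterates by coordinates rather than by raw memory offset, it can be applied to non-contiguous views as well. A minimal sketch, assuming transpose behaves as documented elsewhere on this page:
use train_station::Tensor;
// Sum a transposed (non-contiguous) view along dimension 0
let matrix = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let transposed = matrix.transpose(0, 1); // logical values [[1, 3], [2, 4]]
let sums = transposed.sum_dims(&[0], false);
assert_eq!(sums.shape().dims(), vec![2]);
assert_eq!(sums.get(&[0]), 3.0); // 1 + 2
assert_eq!(sums.get(&[1]), 7.0); // 3 + 4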
Examples found in repository?
44fn cross_entropy_logits(
45 logits: &Tensor,
46 labels: &[usize],
47 batch: usize,
48 _num_classes: usize,
49) -> Tensor {
50 // log_softmax = logits - logsumexp(logits, dim=1)
51 let max_logits = logits.max_dims(&[1], true);
52 let shifted = logits.sub_tensor(&max_logits);
53 let exp = shifted.exp();
54 let sum_exp = exp.sum_dims(&[1], true);
55 let log_sum_exp = sum_exp.log();
56 let log_softmax = shifted.sub_tensor(&log_sum_exp);
57 let ll = log_softmax.gather(1, labels, &[batch, 1]); // selected log-probs
58 ll.mul_scalar(-1.0).mean()
59}
More examples
273fn log_prob_actions(
274 logits: &Tensor,
275 actions: &[usize],
276 batch: usize,
277 _action_dim: usize,
278) -> Tensor {
279 let max_logits = logits.max_dims(&[1], true); // [B,1]
280 let shifted = logits.sub_tensor(&max_logits);
281 let exp = shifted.exp();
282 let sum_exp = exp.sum_dims(&[1], true); // [B,1]
283 let log_sum_exp = sum_exp.log(); // [B,1]
284 let log_softmax = shifted.sub_tensor(&log_sum_exp); // [B,A]
285 // gather selected action log-probs
286 log_softmax.gather(1, actions, &[batch, 1])
287}
288
289// probability ratio = exp(new_logp - old_logp)
290fn ratio_from_logps(new_logp: &Tensor, old_logp: &Tensor) -> Tensor {
291 new_logp.sub_tensor(old_logp).exp()
292}
293
294// Clamp ratio to [1-clip, 1+clip] using ReLU-based clamp (no custom ops)
295fn clamp_ratio(ratio: &Tensor, clip_eps: f32) -> Tensor {
296 let b = ratio.shape().dims()[0];
297 let low = Tensor::from_slice(&vec![1.0 - clip_eps; b], vec![b, 1]).unwrap();
298 let high = Tensor::from_slice(&vec![1.0 + clip_eps; b], vec![b, 1]).unwrap();
299 let ge_low = ratio.sub_tensor(&low).relu().add_tensor(&low);
300 high.sub_tensor(&ge_low.sub_tensor(&high).relu())
301}
302
303fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
304 let mut total_sq = 0.0f32;
305 for p in parameters.iter_mut() {
306 if let Some(g) = p.grad_owned() {
307 for &v in g.data() {
308 total_sq += v * v;
309 }
310 }
311 }
312 total_sq.sqrt()
313}
314
315// -------------------------------
316// Main
317// -------------------------------
318
319pub fn main() -> Result<(), Box<dyn std::error::Error>> {
320 println!("=== PPO Discrete Example (YardEnv) ===");
321
322 let state_dim = 3usize;
323 let action_dim = 3usize;
324 let total_steps = std::env::var("PPOD_STEPS")
325 .ok()
326 .and_then(|v| v.parse::<usize>().ok())
327 .unwrap_or(3500usize);
328 let horizon = 128usize;
329 let epochs = 4usize;
330 let mini_batch_size = 64usize;
331 let gamma = 0.99f32;
332 let lam = 0.95f32;
333 let clip_eps = 0.2f32;
334 let vf_coef = 0.5f32;
335 let ent_coef = 0.0f32;
336 let max_grad_norm = 1.0f32;
337
338 let mut actor = Actor::new(state_dim, action_dim, Some(111));
339 let mut critic = Critic::new(state_dim, Some(222));
340 let mut actor_opt = Adam::with_learning_rate(3e-4);
341 for p in actor.parameters() {
342 actor_opt.add_parameter(p);
343 }
344 let mut critic_opt = Adam::with_learning_rate(3e-4);
345 for p in critic.parameters() {
346 critic_opt.add_parameter(p);
347 }
348
349 let mut env = YardEnv::new(1234);
350 let mut rng = SmallRng::new(98765);
351 let mut state = env.reset();
352 let mut episode_return = 0.0f32;
353 let mut episode = 0usize;
354 let mut ema_return: Option<f32> = None;
355 let ema_alpha = 0.05f32;
356 let mut best_return = f32::NEG_INFINITY;
357
358 let mut t = 0usize;
359 while t < total_steps {
360 let mut batch = RolloutBatch::new(horizon, state_dim);
361 for _ in 0..horizon {
362 // Actor logits and categorical sampling
363 let logits = actor.forward(&state); // [1, A]
364 let probs = logits.softmax(1); // [1, A]
365 // sample action from probs (CPU sampling)
366 let p = probs.data();
367 let (p0, p1, _p2) = (p[0], p[1], p[2]);
368 let u = rng.next_f32();
369 let a_idx = if u < p0 {
370 0
371 } else if u < p0 + p1 {
372 1
373 } else {
374 2
375 };
376
377 let old_logp = {
378 let _ng = NoGradTrack::new();
379 let lp = log_prob_actions(&logits, &[a_idx], 1, action_dim);
380 lp.data()[0]
381 };
382
383 // Step env
384 let (next_state, reward, done) = env.step(a_idx);
385 episode_return += reward;
386
387 // Critic value
388 let value_t = critic.forward(&state);
389 let value_v = value_t.data()[0];
390
391 batch.push(
392 state.data(),
393 a_idx,
394 old_logp,
395 reward,
396 if done { 1.0 } else { 0.0 },
397 value_v,
398 next_state.data(),
399 );
400
401 state = if done {
402 let st = env.reset();
403 ema_return = Some(match ema_return {
404 None => episode_return,
405 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
406 });
407 if episode_return > best_return {
408 best_return = episode_return;
409 }
410 println!(
411 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
412 t,
413 episode,
414 episode_return,
415 ema_return.unwrap_or(episode_return),
416 best_return
417 );
418 episode_return = 0.0;
419 episode += 1;
420 st
421 } else {
422 next_state
423 };
424
425 t += 1;
426 if t >= total_steps {
427 break;
428 }
429 }
430
431 // Bootstrap values for GAE
432 let next_values: Vec<f32> = {
433 let mut out = Vec::with_capacity(batch.len());
434 for i in 0..batch.len() {
435 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
436 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
437 out.push(critic.forward(&s2_t).data()[0]);
438 }
439 out
440 };
441
442 let mut returns = vec![0.0f32; batch.len()];
443 let mut adv = vec![0.0f32; batch.len()];
444 compute_gae(
445 &mut returns,
446 &mut adv,
447 &batch.rewards,
448 &batch.dones,
449 &batch.values,
450 &next_values,
451 gamma,
452 lam,
453 );
454 normalize_in_place(&mut adv, 1e-8);
455
456 // Tensors for training
457 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
458 let actions_vec = batch.actions.clone();
459 let old_logp_t = Tensor::from_slice(&batch.old_logps, vec![batch.len(), 1]).unwrap();
460 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
461 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
462
463 // PPO epochs
464 let num_minibatches = batch.len().div_ceil(mini_batch_size);
465 for e in 0..epochs {
466 for mb in 0..num_minibatches {
467 let start = mb * mini_batch_size;
468 let end = (start + mini_batch_size).min(batch.len());
469 if start >= end {
470 break;
471 }
472
473 // Views
474 let s_mb = states_t
475 .slice_view(start * state_dim, 1, (end - start) * state_dim)
476 .reshape(vec![(end - start) as i32, state_dim as i32]);
477 let oldlp_mb = old_logp_t
478 .slice_view(start, 1, end - start)
479 .reshape(vec![(end - start) as i32, 1]);
480 let ret_mb = returns_t
481 .slice_view(start, 1, end - start)
482 .reshape(vec![(end - start) as i32, 1]);
483 let adv_mb = adv_t
484 .slice_view(start, 1, end - start)
485 .reshape(vec![(end - start) as i32, 1]);
486 let a_slice = &actions_vec[start..end];
487
488 // Zero grads
489 {
490 let mut ps = actor.parameters();
491 actor_opt.zero_grad(&mut ps);
492 }
493 {
494 let mut ps = critic.parameters();
495 critic_opt.zero_grad(&mut ps);
496 }
497
498 // Forward
499 let logits_mb = actor.forward(&s_mb); // [B,A]
500 let new_logp_mb = log_prob_actions(&logits_mb, a_slice, end - start, action_dim); // [B,1]
501 let ratio = ratio_from_logps(&new_logp_mb, &oldlp_mb);
502 let ratio_clipped = clamp_ratio(&ratio, clip_eps);
503 let pg1 = ratio.mul_tensor(&adv_mb);
504 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
505 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
506 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
507 let actor_loss = actor_min.mul_scalar(-1.0).mean();
508
509 let v_pred = critic.forward(&s_mb);
510 let v_loss = v_pred
511 .sub_tensor(&ret_mb)
512 .pow_scalar(2.0)
513 .mean()
514 .mul_scalar(vf_coef);
515
516 // Entropy bonus from logits (categorical entropy) ≈ -sum p*logp
517 let probs_mb = logits_mb.softmax(1);
518 let logp_all = probs_mb.add_scalar(1e-8).log();
519 let ent = probs_mb
520 .mul_tensor(&logp_all)
521 .sum_dims(&[1], true)
522 .mul_scalar(-1.0)
523 .mean()
524 .mul_scalar(ent_coef);
525
526 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&ent);
527 loss.backward(None);
528
529 // Step actor
530 {
531 let params = actor.parameters();
532 let mut with_grads: Vec<&mut Tensor> = Vec::new();
533 for p in params {
534 if p.grad_owned().is_some() {
535 with_grads.push(p);
536 }
537 }
538 if !with_grads.is_empty() {
539 let _ = grad_global_norm(&mut with_grads);
540 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
541 actor_opt.step(&mut with_grads);
542 actor_opt.zero_grad(&mut with_grads);
543 }
544 }
545
546 // Step critic
547 {
548 let params = critic.parameters();
549 let mut with_grads: Vec<&mut Tensor> = Vec::new();
550 for p in params {
551 if p.grad_owned().is_some() {
552 with_grads.push(p);
553 }
554 }
555 if !with_grads.is_empty() {
556 let _ = grad_global_norm(&mut with_grads);
557 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
558 critic_opt.step(&mut with_grads);
559 critic_opt.zero_grad(&mut with_grads);
560 }
561 }
562
563 if e == 0 && mb == 0 {
564 println!(
565 "update@t={} | actor_loss={:.4} v_loss={:.4}",
566 t,
567 actor_loss.value(),
568 v_loss.value()
569 );
570 }
571
572 clear_all_graphs_known();
573 }
574 }
575 }
576
577 println!("=== PPO discrete training finished ===");
578 Ok(())
579}
234fn gaussian_log_prob(action: &Tensor, mean: &Tensor, log_std: &Tensor) -> Tensor {
235 // All tensors shaped [B, A] (log_std is broadcastable)
236 let std = log_std.exp();
237 let var = std.pow_scalar(2.0);
238 let log_scale = log_std;
239 let diff = action.sub_tensor(mean);
240 let log_prob = diff
241 .pow_scalar(2.0)
242 .div_tensor(&var)
243 .add_scalar(std::f32::consts::LN_2 + std::f32::consts::PI)
244 .add_tensor(&log_scale.mul_scalar(2.0))
245 .mul_scalar(0.5)
246 .mul_scalar(-1.0);
247 // Sum across action dim (dim=1) -> [B,1]
248 log_prob.sum_dims(&[1], true)
249}
250
251#[allow(clippy::too_many_arguments)]
252fn compute_gae(
253 returns_out: &mut [f32],
254 adv_out: &mut [f32],
255 rewards: &[f32],
256 dones: &[f32],
257 values: &[f32],
258 next_values: &[f32],
259 gamma: f32,
260 lam: f32,
261) {
262 let n = rewards.len();
263 let mut gae = 0.0f32;
264 for t in (0..n).rev() {
265 let not_done = 1.0 - dones[t];
266 let delta = rewards[t] + gamma * next_values[t] * not_done - values[t];
267 gae = delta + gamma * lam * not_done * gae;
268 adv_out[t] = gae;
269 returns_out[t] = gae + values[t];
270 }
271}
272
273fn normalize_in_place(x: &mut [f32], eps: f32) {
274 let n = x.len() as f32;
275 if n <= 1.0 {
276 return;
277 }
278 let mean = x.iter().copied().sum::<f32>() / n;
279 let var = x
280 .iter()
281 .map(|v| {
282 let d = v - mean;
283 d * d
284 })
285 .sum::<f32>()
286 / n;
287 let std = (var + eps).sqrt();
288 for v in x.iter_mut() {
289 *v = (*v - mean) / std;
290 }
291}
292
293fn clip_gradients(parameters: &mut [&mut Tensor], max_norm: f32, eps: f32) {
294 let mut total_sq = 0.0f32;
295 for p in parameters.iter() {
296 if let Some(g) = p.grad_owned() {
297 for &v in g.data() {
298 total_sq += v * v;
299 }
300 }
301 }
302 let norm = total_sq.sqrt();
303 if norm > max_norm {
304 let scale = max_norm / (norm + eps);
305 for p in parameters.iter_mut() {
306 if let Some(g) = p.grad_owned() {
307 p.set_grad(g.mul_scalar(scale));
308 }
309 }
310 }
311}
312
313fn grad_global_norm(parameters: &mut [&mut Tensor]) -> f32 {
314 let mut total_sq = 0.0f32;
315 for p in parameters.iter_mut() {
316 if let Some(g) = p.grad_owned() {
317 for &v in g.data() {
318 total_sq += v * v;
319 }
320 }
321 }
322 total_sq.sqrt()
323}
324
325// -------------------------------
326// Main
327// -------------------------------
328
329pub fn main() -> Result<(), Box<dyn std::error::Error>> {
330 println!("=== PPO Continuous Example (YardEnv) ===");
331
332 let state_dim = 3usize;
333 let action_dim = 1usize;
334
335 // Hparams
336 let total_steps = std::env::var("PPO_STEPS")
337 .ok()
338 .and_then(|v| v.parse::<usize>().ok())
339 .unwrap_or(4000usize);
340 let horizon = 128usize; // rollout length per update
341 let epochs = 4usize; // PPO epochs per update
342 let mini_batch_size = 64usize; // minibatch from horizon
343 let gamma = 0.99f32;
344 let lam = 0.95f32; // GAE lambda
345 let clip_eps = 0.2f32;
346 let vf_coef = 0.5f32;
347 let ent_coef = 0.0f32;
348 let max_grad_norm = 1.0f32;
349
350 // Models
351 let mut actor = Actor::new(state_dim, action_dim, Some(101));
352 let mut critic = Critic::new(state_dim, Some(202));
353
354 // Opts
355 let mut actor_opt = Adam::with_learning_rate(3e-4);
356 for p in actor.parameters() {
357 actor_opt.add_parameter(p);
358 }
359 let mut critic_opt = Adam::with_learning_rate(3e-4);
360 for p in critic.parameters() {
361 critic_opt.add_parameter(p);
362 }
363
364 // Env and RNG
365 let mut env = YardEnv::new(42);
366 let mut rng = SmallRng::new(999);
367 let mut state = env.reset();
368
369 // Metrics
370 let mut episode_return = 0.0f32;
371 let mut episode = 0usize;
372 let mut ema_return: Option<f32> = None;
373 let ema_alpha = 0.05f32;
374 let mut best_return = f32::NEG_INFINITY;
375
376 let mut t = 0usize;
377 while t < total_steps {
378 // Collect a rollout
379 let mut batch = RolloutBatch::new(horizon, state_dim);
380 for _ in 0..horizon {
381 // Policy forward (detached sampling to not blow graph; we use stored log_probs)
382 let (mean, log_std_row) = actor.forward(&state);
383 let mean_v = mean.data()[0];
384 let log_std_v = log_std_row.data()[0];
385 let std_v = log_std_v.exp();
386 let noise = rng.normal();
387 let action_v = (mean_v + std_v * noise).clamp(-1.0, 1.0);
388
389 // Build action tensor [1, A] for log_prob calculation with autograd
390 let action_t = Tensor::from_slice(&[action_v], vec![1, action_dim]).unwrap();
391 let log_prob_t = gaussian_log_prob(&action_t, &mean, &log_std_row);
392 let log_prob_v = log_prob_t.data()[0];
393
394 // Step env
395 let (next_state, reward, done) = env.step(action_v);
396 episode_return += reward;
397
398 // Value
399 let value_t = critic.forward(&state);
400 let value_v = value_t.data()[0];
401
402 // Push
403 batch.push(
404 state.data(),
405 action_v,
406 log_prob_v,
407 reward,
408 if done { 1.0 } else { 0.0 },
409 value_v,
410 next_state.data(),
411 );
412
413 // Reset
414 state = if done {
415 let st = env.reset();
416 ema_return = Some(match ema_return {
417 None => episode_return,
418 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
419 });
420 if episode_return > best_return {
421 best_return = episode_return;
422 }
423 println!(
424 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
425 t,
426 episode,
427 episode_return,
428 ema_return.unwrap_or(episode_return),
429 best_return
430 );
431 episode_return = 0.0;
432 episode += 1;
433 st
434 } else {
435 next_state
436 };
437
438 t += 1;
439 if t >= total_steps {
440 break;
441 }
442 }
443
444 // Bootstrap next values for GAE
445 let next_values: Vec<f32> = {
446 let mut out = Vec::with_capacity(batch.len());
447 for i in 0..batch.len() {
448 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
449 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
450 let v2 = critic.forward(&s2_t).data()[0];
451 out.push(v2);
452 }
453 out
454 };
455
456 // Compute returns and advantages
457 let mut returns = vec![0.0f32; batch.len()];
458 let mut adv = vec![0.0f32; batch.len()];
459 compute_gae(
460 &mut returns,
461 &mut adv,
462 &batch.rewards,
463 &batch.dones,
464 &batch.values,
465 &next_values,
466 gamma,
467 lam,
468 );
469 normalize_in_place(&mut adv, 1e-8);
470
471 // Prepare tensors for training
472 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
473 let actions_t = Tensor::from_slice(&batch.actions, vec![batch.len(), action_dim]).unwrap();
474 let old_logp_t = Tensor::from_slice(&batch.log_probs, vec![batch.len(), 1]).unwrap();
475 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
476 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
477
478 // PPO epochs over the rollout
479 let num_minibatches = batch.len().div_ceil(mini_batch_size);
480 for e in 0..epochs {
481 for mb in 0..num_minibatches {
482 let start = mb * mini_batch_size;
483 let end = (start + mini_batch_size).min(batch.len());
484 if start >= end {
485 break;
486 }
487
488 // Slice views
489 let s_mb = states_t.slice_view(start * state_dim, 1, (end - start) * state_dim);
490 let s_mb = s_mb.reshape(vec![(end - start) as i32, state_dim as i32]);
491 let a_mb = actions_t
492 .slice_view(start * action_dim, 1, (end - start) * action_dim)
493 .reshape(vec![(end - start) as i32, action_dim as i32]);
494 let oldlp_mb = old_logp_t
495 .slice_view(start, 1, end - start)
496 .reshape(vec![(end - start) as i32, 1]);
497 let ret_mb = returns_t
498 .slice_view(start, 1, end - start)
499 .reshape(vec![(end - start) as i32, 1]);
500 let adv_mb = adv_t
501 .slice_view(start, 1, end - start)
502 .reshape(vec![(end - start) as i32, 1]);
503
504 // Zero grads
505 {
506 let mut ps = actor.parameters();
507 actor_opt.zero_grad(&mut ps);
508 }
509 {
510 let mut ps = critic.parameters();
511 critic_opt.zero_grad(&mut ps);
512 }
513
514 // Forward actor and critic
515 let (mean_mb, log_std_row) = actor.forward(&s_mb);
516 let logp_mb = gaussian_log_prob(&a_mb, &mean_mb, &log_std_row);
517 let ratio = logp_mb.sub_tensor(&oldlp_mb).exp(); // exp(new-old)
518 let clip_low =
519 Tensor::from_slice(&vec![1.0 - clip_eps; end - start], vec![end - start, 1])
520 .unwrap();
521 let clip_high =
522 Tensor::from_slice(&vec![1.0 + clip_eps; end - start], vec![end - start, 1])
523 .unwrap();
524 // ratio_clipped = min(max(ratio, low), high) using ReLU identities
525 let ratio_ge_low = ratio.sub_tensor(&clip_low).relu().add_tensor(&clip_low);
526 let ratio_clipped =
527 clip_high.sub_tensor(&ratio_ge_low.sub_tensor(&clip_high).relu());
528 let pg1 = ratio.mul_tensor(&adv_mb);
529 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
530 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
531 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
532 let actor_loss = actor_min.mul_scalar(-1.0).mean();
533
534 let v_pred = critic.forward(&s_mb);
535 let v_loss = v_pred
536 .sub_tensor(&ret_mb)
537 .pow_scalar(2.0)
538 .mean()
539 .mul_scalar(vf_coef);
540
541 // Entropy (approx Gaussian entropy per action)
542 let entropy = log_std_row
543 .add_scalar(0.5 * (2.0 * std::f32::consts::PI * std::f32::consts::E).ln())
544 .sum_dims(&[1], true)
545 .mean()
546 .mul_scalar(ent_coef);
547
548 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&entropy);
549 loss.backward(None);
550
551 // Step actor
552 {
553 let params = actor.parameters();
554 let mut with_grads: Vec<&mut Tensor> = Vec::new();
555 for p in params {
556 if p.grad_owned().is_some() {
557 with_grads.push(p);
558 }
559 }
560 if !with_grads.is_empty() {
561 let _ = grad_global_norm(&mut with_grads);
562 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
563 actor_opt.step(&mut with_grads);
564 actor_opt.zero_grad(&mut with_grads);
565 }
566 }
567
568 // Step critic
569 {
570 let params = critic.parameters();
571 let mut with_grads: Vec<&mut Tensor> = Vec::new();
572 for p in params {
573 if p.grad_owned().is_some() {
574 with_grads.push(p);
575 }
576 }
577 if !with_grads.is_empty() {
578 let _ = grad_global_norm(&mut with_grads);
579 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
580 critic_opt.step(&mut with_grads);
581 critic_opt.zero_grad(&mut with_grads);
582 }
583 }
584
585 // Occasionally log
586 if e == 0 && mb == 0 {
587 println!(
588 "update@t={} | actor_loss={:.4} v_loss={:.4}",
589 t,
590 actor_loss.value(),
591 v_loss.value()
592 );
593 }
594
595 clear_all_graphs_known();
596 }
597 }
598 }
599
600 println!("=== PPO training finished ===");
601 Ok(())
602}
Source§impl Tensor
impl Tensor
Sourcepub fn var(&self) -> Tensor
pub fn var(&self) -> Tensor
Computes the variance over all elements
The variance is calculated as the mean of squared differences from the mean. This operation reduces the tensor to a single-element tensor with shape [1].
The implementation uses population variance (divides by n rather than n-1) to match PyTorch’s default behavior.
§Returns
A scalar tensor containing the variance value
§Examples
use train_station::Tensor;
// Basic variance calculation
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![4]).unwrap();
let variance = tensor.var();
assert!((variance.get(&[0]) - 1.25).abs() < 1e-5);
use train_station::Tensor;
// Variance of a larger dataset
let data = vec![1.0, 3.0, 5.0, 7.0, 2.0, 4.0, 6.0, 8.0];
let tensor = Tensor::from_slice(&data, vec![2, 2, 2]).unwrap();
let variance = tensor.var();
// mean=4.5, var=mean([3.5², 1.5², 0.5², 2.5², 2.5², 0.5², 1.5², 3.5²]) = 5.25
assert!((variance.get(&[0]) - 5.25).abs() < 1e-5);
use train_station::Tensor;
// Variance of constant values (should be 0)
let tensor = Tensor::from_slice(&[5.0, 5.0, 5.0, 5.0], vec![4]).unwrap();
let variance = tensor.var();
assert!((variance.get(&[0]) - 0.0).abs() < 1e-6);
§Performance
Uses optimized contiguous tensor path with manual loop unrolling for better performance. Non-contiguous tensors use stride-aware iteration. The algorithm performs two passes: first to compute the mean, then to compute the variance.
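If a sample (Bessel-corrected) estimate is needed instead, it can be derived from the population variance by rescaling with n / (n - 1). A minimal sketch using only the var and size methods documented on this page:
use train_station::Tensor;
// Population variance of [1, 2, 3, 4] is 1.25; the sample estimate rescales by n / (n - 1)
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![4]).unwrap();
let n = tensor.size() as f32;
let sample_var = tensor.var().get(&[0]) * n / (n - 1.0);
assert!((sample_var - 1.25 * 4.0 / 3.0).abs() < 1e-5);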
Sourcepub fn var_dims(&self, dims: &[usize], keepdim: bool) -> Tensor
pub fn var_dims(&self, dims: &[usize], keepdim: bool) -> Tensor
Computes the variance over specified dimensions
Reduces the tensor along the specified dimensions by computing the variance of each slice. The result maintains the original tensor structure with reduced dimensions optionally preserved as size-1 dimensions.
Uses population variance (divides by n rather than n-1) to match PyTorch’s default behavior.
§Arguments
- dims - Vector of dimension indices to reduce over (must be valid for tensor rank)
- keepdim - Whether to keep reduced dimensions as size-1 dimensions
§Returns
A tensor with variance computed over the specified dimensions
§Examples
use train_station::Tensor;
// Variance along rows (dimension 1) with keepdim=true
let matrix = Tensor::from_slice(&[1.0, 3.0, 2.0, 2.0], vec![2, 2]).unwrap();
let row_vars = matrix.var_dims(&[1], true);
assert_eq!(row_vars.shape().dims(), vec![2, 1]);
assert!((row_vars.get(&[0, 0]) - 1.0).abs() < 1e-6); // var([1, 3]) = 1.0
assert!((row_vars.get(&[1, 0]) - 0.0).abs() < 1e-6); // var([2, 2]) = 0.0
use train_station::Tensor;
// Variance along columns (dimension 0) with keepdim=false
let matrix = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let col_vars = matrix.var_dims(&[0], false);
assert_eq!(col_vars.shape().dims(), vec![2]);
// var([1, 3]) = 1.0, var([2, 4]) = 1.0
assert!((col_vars.get(&[0]) - 1.0).abs() < 1e-6);
assert!((col_vars.get(&[1]) - 1.0).abs() < 1e-6);
use train_station::Tensor;
// Variance over multiple dimensions
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let var_all = tensor.var_dims(&[0, 1], false);
assert_eq!(var_all.shape().dims(), vec![1]);
// var([1, 2, 3, 4]) = 1.25
assert!((var_all.get(&[0]) - 1.25).abs() < 1e-5);
§Panics
- If dims is empty
- If any dimension index is out of bounds for the tensor rank
- If the reduced size is 0 (invalid for variance calculation)
§Performance
Uses efficient coordinate-based iteration that works correctly with both contiguous and non-contiguous tensor layouts. The algorithm performs two passes: first to compute means, then to compute variances.
Source§impl Tensor
impl Tensor
Sourcepub fn cat(tensors: &[Tensor], dim: usize) -> Tensor
pub fn cat(tensors: &[Tensor], dim: usize) -> Tensor
Concatenate tensors along a given dimension
Joins multiple tensors along the specified dimension, creating a new tensor with the combined data. All input tensors must have the same rank and matching dimensions except for the concatenation dimension.
§Arguments
- tensors - Slice of tensors to concatenate (must not be empty)
- dim - Dimension along which to concatenate (must be < tensor rank)
§Returns
A new tensor containing the concatenated data with shape where the concatenation dimension is the sum of all input tensor sizes along that dimension.
§Panics
- If tensors is empty
- If dim is out of bounds for the tensor rank
- If tensors have different ranks
- If tensors have mismatched dimensions (except along concatenation dimension)
§Examples
use train_station::Tensor;
// Concatenate 1D tensors
let a = Tensor::from_slice(&[1.0, 2.0], vec![2]).unwrap();
let b = Tensor::from_slice(&[3.0, 4.0], vec![2]).unwrap();
let result = Tensor::cat(&[a, b], 0);
assert_eq!(result.shape().dims(), vec![4]);
assert_eq!(result.get(&[0]), 1.0);
assert_eq!(result.get(&[1]), 2.0);
assert_eq!(result.get(&[2]), 3.0);
assert_eq!(result.get(&[3]), 4.0);
use train_station::Tensor;
// Concatenate 2D tensors along dimension 1
let a = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let b = Tensor::from_slice(&[5.0, 6.0], vec![2, 1]).unwrap();
let result = Tensor::cat(&[a, b], 1);
assert_eq!(result.shape().dims(), vec![2, 3]);
assert_eq!(result.get(&[0, 0]), 1.0);
assert_eq!(result.get(&[0, 1]), 2.0);
assert_eq!(result.get(&[0, 2]), 5.0);
use train_station::Tensor;
// Concatenate with gradient tracking
let mut a = Tensor::from_slice(&[1.0, 2.0], vec![2]).unwrap();
let mut b = Tensor::from_slice(&[3.0, 4.0], vec![2]).unwrap();
a.set_requires_grad(true);
b.set_requires_grad(true);
let result = Tensor::cat(&[a, b], 0);
assert!(result.requires_grad());
Source§impl Tensor
impl Tensor
Sourcepub fn contiguous(&self) -> Tensor
pub fn contiguous(&self) -> Tensor
Creates a contiguous copy of the tensor
This operation ensures that the tensor data is stored in a linear, cache-friendly memory layout. If the tensor is already contiguous, this operation returns a clone. For non-contiguous tensors, it creates a new tensor with the same data but in contiguous memory layout.
The operation uses different optimization strategies based on tensor size:
- Small tensors (≤64 elements): Simple coordinate-based copy
- Medium tensors (65-1023 elements): Unrolled copy for better performance
- Large tensors (≥1024 elements): Blocked copy with cache optimization
§Returns
A new tensor with contiguous memory layout containing the same data
§Examples
use train_station::Tensor;
// Already contiguous tensor
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let contiguous = tensor.contiguous();
assert!(contiguous.is_contiguous());
assert_eq!(contiguous.shape().dims(), vec![2, 2]);
use train_station::Tensor;
// Non-contiguous tensor from transpose
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let transposed = tensor.transpose(0, 1);
assert!(!transposed.is_contiguous());
let contiguous = transposed.contiguous();
assert!(contiguous.is_contiguous());
assert_eq!(contiguous.get(&[0, 0]), 1.0);
assert_eq!(contiguous.get(&[0, 1]), 3.0);
use train_station::Tensor;
// Preserves gradient tracking
let mut tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
tensor.set_requires_grad(true);
let contiguous = tensor.contiguous();
assert!(contiguous.requires_grad());
§Performance
- Already contiguous: O(1) time complexity, returns a clone
- Non-contiguous: O(n) time complexity with size-dependent optimizations
- Memory usage: Creates a new tensor with the same size as the original
Examples found in repository?
53 pub fn forward(&self, input: &Tensor, attn_mask: Option<&Tensor>) -> Tensor {
54 let attn = self.mha.forward(input, input, input, attn_mask);
55 let res1 = attn.add_tensor(input);
56
57 // Feed-forward network with ReLU and residual
58 let (b, t, e) = Self::triple(input);
59 let x2d = res1.contiguous().view(vec![(b * t) as i32, e as i32]);
60 let hidden = self.ffn_in.forward(&x2d).relu();
61 let out2d = self.ffn_out.forward(&hidden);
62 let out = out2d.view(vec![b as i32, t as i32, e as i32]);
63 out.add_tensor(&res1)
64 }
More examples
56 pub fn forward(
57 &self,
58 tgt: &Tensor,
59 memory: &Tensor,
60 causal_mask: Option<&Tensor>,
61 cross_mask: Option<&Tensor>,
62 ) -> Tensor {
63 let self_attn = self.self_attn.forward(tgt, tgt, tgt, causal_mask);
64 let res1 = self_attn.add_tensor(tgt);
65
66 let cross = self.cross_attn.forward(&res1, memory, memory, cross_mask);
67 let res2 = cross.add_tensor(&res1);
68
69 let (b, t, e) = Self::triple(tgt);
70 let x2d = res2.contiguous().view(vec![(b * t) as i32, e as i32]);
71 let hidden = self.ffn_in.forward(&x2d).relu();
72 let out2d = self.ffn_out.forward(&hidden);
73 let out = out2d.view(vec![b as i32, t as i32, e as i32]);
74 out.add_tensor(&res2)
75 }
72 pub fn forward(
73 &self,
74 query: &Tensor,
75 key: &Tensor,
76 value: &Tensor,
77 attn_mask: Option<&Tensor>,
78 ) -> Tensor {
79 let qkv = Self::project_qkv(query, key, value, &self.q_proj, &self.k_proj, &self.v_proj);
80 let (q, k, v) = qkv;
81
82 // Split heads: [b, t, e] -> [b, h, t, d]
83 let (b, tq, _e) = Self::triple(query);
84 let (_b2, tk, _e2) = Self::triple(key);
85 let q = Self::split_heads(&q, b, tq, self.num_heads, self.head_dim);
86 let k = Self::split_heads(&k, b, tk, self.num_heads, self.head_dim);
87 let v = Self::split_heads(&v, b, tk, self.num_heads, self.head_dim);
88
89 // Scaled dot-product attention
90 // logits: [b, h, tq, tk]
91 let k_t = k.transpose(2, 3);
92 let mut logits = q.matmul(&k_t).div_scalar((self.head_dim as f32).sqrt());
93 if let Some(mask) = attn_mask {
94 let dims = mask.shape().dims().to_vec();
95 // If boolean-like mask matching [b,h,tq,tk], apply masked_fill
96 if dims.len() == 4 && dims[0] == b && dims[1] == self.num_heads && dims[2] == tq {
97 // Interpret mask > 0.5 as keep; we invert to build masked positions
98 let cond: Vec<bool> = mask.data().iter().map(|&v| v < 0.5).collect();
99 // Apply masked fill on a flattened view, then reshape back
100 let flat_logits = logits.view(vec![(b * self.num_heads * tq * tk) as i32]);
101 let filled = flat_logits.masked_fill(&cond, f32::NEG_INFINITY);
102 logits = filled.view(vec![b as i32, self.num_heads as i32, tq as i32, tk as i32]);
103 } else {
104 // Fallback: additive mask
105 logits = logits.add_tensor(mask);
106 }
107 }
108 let attn = logits.softmax(3);
109
110 // context: [b, h, tq, d]
111 let context = attn.matmul(&v);
112 let context = context.permute(vec![0, 2, 1, 3]); // [b, tq, h, d]
113 let context = context.contiguous().view(vec![
114 b as i32,
115 tq as i32,
116 (self.num_heads * self.head_dim) as i32,
117 ]);
118
119 // Output projection (flatten to 2D, project, then restore 3D)
120 let flat = context.view(vec![(b * tq) as i32, self.embed_dim as i32]);
121 let out2d = self.out_proj.forward(&flat);
122 out2d.view(vec![b as i32, tq as i32, self.embed_dim as i32])
123 }
Source§impl Tensor
impl Tensor
Sourcepub fn flatten(&self) -> Tensor
pub fn flatten(&self) -> Tensor
Flatten the tensor into a 1D representation
Transforms a multi-dimensional tensor into a 1D tensor by reshaping
all dimensions into a single dimension. This is equivalent to
reshape(vec![-1]) where -1 automatically calculates the size
based on the total number of elements.
The flatten operation preserves the total number of elements while changing the tensor’s shape to have a single dimension. This is commonly used in neural networks to prepare tensor data for linear layers or feature extraction.
§Returns
A 1D tensor containing the same data as the original tensor, with
shape [total_elements] where total_elements is the product of
all original dimensions.
§Examples
use train_station::Tensor;
// Flatten a 2D tensor
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let flattened = tensor.flatten();
assert_eq!(flattened.shape().dims(), vec![4]);
assert_eq!(flattened.get(&[0]), 1.0);
assert_eq!(flattened.get(&[1]), 2.0);
assert_eq!(flattened.get(&[2]), 3.0);
assert_eq!(flattened.get(&[3]), 4.0);
use train_station::Tensor;
// Flatten a 3D tensor
let data: Vec<f32> = (0..12).map(|i| i as f32).collect();
let tensor = Tensor::from_slice(&data, vec![2, 2, 3]).unwrap();
let flattened = tensor.flatten();
assert_eq!(flattened.shape().dims(), vec![12]);
assert_eq!(flattened.size(), 12);
use train_station::Tensor;
// Flatten with gradient tracking
let mut tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
tensor.set_requires_grad(true);
let flattened = tensor.flatten();
assert!(flattened.requires_grad());
assert_eq!(flattened.shape().dims(), vec![4]);
use train_station::Tensor;
// Flatten an already 1D tensor (no change)
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let flattened = tensor.flatten();
assert_eq!(flattened.shape().dims(), vec![3]);
assert_eq!(flattened.size(), 3);
§Performance
- Time Complexity: O(1) - Returns a view when possible
- Memory Usage: No additional memory allocation for view operations
- Gradient Tracking: Preserves gradient requirements and tracking
§Relationship to Other Operations
This operation is equivalent to:
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let flattened = tensor.reshape(vec![-1]);
Where -1 is a special value that automatically calculates the
dimension size based on the total number of elements in the tensor.
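As a usage note for the neural-network case mentioned above, per-example features are typically flattened and then regrouped into a [batch, features] matrix before a linear layer. A minimal sketch combining flatten with reshape (documented below); the [2, 6] batch split is illustrative only:
use train_station::Tensor;
// Flatten a [2, 2, 3] activation tensor, then regroup it as [batch, features] = [2, 6]
let data: Vec<f32> = (0..12).map(|i| i as f32).collect();
let activations = Tensor::from_slice(&data, vec![2, 2, 3]).unwrap();
let flat = activations.flatten();
assert_eq!(flat.shape().dims(), vec![12]);
let per_example = flat.reshape(vec![2, -1]);
assert_eq!(per_example.shape().dims(), vec![2, 6]);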
Source§impl Tensor
impl Tensor
Sourcepub fn permute(&self, dims: Vec<usize>) -> Tensor
pub fn permute(&self, dims: Vec<usize>) -> Tensor
Permute tensor dimensions according to specified order
Rearranges the dimensions of the tensor according to the provided dimension order. This operation returns a view with reordered strides, avoiding data copying while changing the logical arrangement of the tensor’s dimensions.
The permutation is specified as a vector where each element represents
the new position of the corresponding dimension from the original tensor.
For example, permute(vec![1, 0]) swaps the first two dimensions.
§Arguments
- dims - Vector specifying the new order of dimensions (must have length equal to tensor rank)
§Returns
A new tensor view with rearranged dimensions and correspondingly adjusted strides. The total number of elements remains unchanged.
§Panics
- If dims length does not equal the tensor rank
- If any dimension index is out of bounds for the tensor rank
- If dims contains duplicate dimension indices
§Examples
use train_station::Tensor;
// Permute 2D tensor (swap dimensions)
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
let permuted = tensor.permute(vec![1, 0]);
assert_eq!(permuted.shape().dims(), vec![3, 2]);
assert_eq!(permuted.get(&[0, 0]), 1.0);
assert_eq!(permuted.get(&[1, 0]), 2.0);
assert_eq!(permuted.get(&[2, 1]), 6.0);
use train_station::Tensor;
// Permute 3D tensor (reorder dimensions)
let data: Vec<f32> = (0..24).map(|i| i as f32).collect();
let tensor = Tensor::from_slice(&data, vec![2, 3, 4]).unwrap();
let permuted = tensor.permute(vec![2, 0, 1]);
assert_eq!(permuted.shape().dims(), vec![4, 2, 3]);
assert_eq!(permuted.size(), 24); // Total elements unchanged
use train_station::Tensor;
// Permute with gradient tracking
let mut tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
tensor.set_requires_grad(true);
let permuted = tensor.permute(vec![1, 0]);
assert!(permuted.requires_grad());
assert_eq!(permuted.shape().dims(), vec![2, 2]);
use train_station::Tensor;
// Identity permutation (no change)
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let permuted = tensor.permute(vec![0, 1]);
assert_eq!(permuted.shape().dims(), vec![2, 2]);
assert_eq!(permuted.get(&[0, 0]), 1.0);
assert_eq!(permuted.get(&[1, 1]), 4.0);
§Performance
- Time Complexity: O(1) - Returns a view with reordered strides
- Memory Usage: No additional memory allocation (view operation)
- Gradient Tracking: Preserves gradient requirements and tracking
§Relationship to Other Operations
This operation is similar to transpose() but more general:
- transpose(dim0, dim1) is equivalent to permute() with a swap of two dimensions
- permute() can handle arbitrary dimension reordering for tensors of any rank
§Memory Layout
The permuted tensor maintains the same underlying data but with reordered strides. This means the tensor becomes non-contiguous unless the permutation is the identity permutation.
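A minimal sketch of this contiguity behavior, combining permute with the contiguous operation documented above:
use train_station::Tensor;
// A non-identity permutation reorders strides, so the view is no longer contiguous
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
let permuted = tensor.permute(vec![1, 0]);
assert!(!permuted.is_contiguous());
// A contiguous copy repacks the same logical values into row-major order
let packed = permuted.contiguous();
assert!(packed.is_contiguous());
assert_eq!(packed.shape().dims(), vec![3, 2]);
assert_eq!(packed.get(&[2, 1]), 6.0);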
Examples found in repository?
72 pub fn forward(
73 &self,
74 query: &Tensor,
75 key: &Tensor,
76 value: &Tensor,
77 attn_mask: Option<&Tensor>,
78 ) -> Tensor {
79 let qkv = Self::project_qkv(query, key, value, &self.q_proj, &self.k_proj, &self.v_proj);
80 let (q, k, v) = qkv;
81
82 // Split heads: [b, t, e] -> [b, h, t, d]
83 let (b, tq, _e) = Self::triple(query);
84 let (_b2, tk, _e2) = Self::triple(key);
85 let q = Self::split_heads(&q, b, tq, self.num_heads, self.head_dim);
86 let k = Self::split_heads(&k, b, tk, self.num_heads, self.head_dim);
87 let v = Self::split_heads(&v, b, tk, self.num_heads, self.head_dim);
88
89 // Scaled dot-product attention
90 // logits: [b, h, tq, tk]
91 let k_t = k.transpose(2, 3);
92 let mut logits = q.matmul(&k_t).div_scalar((self.head_dim as f32).sqrt());
93 if let Some(mask) = attn_mask {
94 let dims = mask.shape().dims().to_vec();
95 // If boolean-like mask matching [b,h,tq,tk], apply masked_fill
96 if dims.len() == 4 && dims[0] == b && dims[1] == self.num_heads && dims[2] == tq {
97 // Interpret mask > 0.5 as keep; we invert to build masked positions
98 let cond: Vec<bool> = mask.data().iter().map(|&v| v < 0.5).collect();
99 // Apply masked fill on a flattened view, then reshape back
100 let flat_logits = logits.view(vec![(b * self.num_heads * tq * tk) as i32]);
101 let filled = flat_logits.masked_fill(&cond, f32::NEG_INFINITY);
102 logits = filled.view(vec![b as i32, self.num_heads as i32, tq as i32, tk as i32]);
103 } else {
104 // Fallback: additive mask
105 logits = logits.add_tensor(mask);
106 }
107 }
108 let attn = logits.softmax(3);
109
110 // context: [b, h, tq, d]
111 let context = attn.matmul(&v);
112 let context = context.permute(vec![0, 2, 1, 3]); // [b, tq, h, d]
113 let context = context.contiguous().view(vec![
114 b as i32,
115 tq as i32,
116 (self.num_heads * self.head_dim) as i32,
117 ]);
118
119 // Output projection (flatten to 2D, project, then restore 3D)
120 let flat = context.view(vec![(b * tq) as i32, self.embed_dim as i32]);
121 let out2d = self.out_proj.forward(&flat);
122 out2d.view(vec![b as i32, tq as i32, self.embed_dim as i32])
123 }
124
125 fn project_qkv(
126 query: &Tensor,
127 key: &Tensor,
128 value: &Tensor,
129 q_proj: &LinearLayer,
130 k_proj: &LinearLayer,
131 v_proj: &LinearLayer,
132 ) -> (Tensor, Tensor, Tensor) {
133 let (bq, tq, eq) = Self::triple(query);
134 let (bk, tk, ek) = Self::triple(key);
135 let (_bv, tv, ev) = Self::triple(value);
136 assert!(eq == ek && ek == ev, "Q,K,V embed dims must match");
137 let q2d = query.view(vec![(bq * tq) as i32, eq as i32]);
138 let k2d = key.view(vec![(bk * tk) as i32, ek as i32]);
139 let v2d = value.view(vec![(_bv * tv) as i32, ev as i32]);
140 let q = q_proj
141 .forward(&q2d)
142 .view(vec![bq as i32, tq as i32, eq as i32]);
143 let k = k_proj
144 .forward(&k2d)
145 .view(vec![bk as i32, tk as i32, ek as i32]);
146 let v = v_proj
147 .forward(&v2d)
148 .view(vec![bk as i32, tv as i32, ev as i32]);
149 (q, k, v)
150 }
151
152 fn split_heads(x: &Tensor, b: usize, t: usize, h: usize, d: usize) -> Tensor {
153 x.view(vec![b as i32, t as i32, h as i32, d as i32])
154 .permute(vec![0, 2, 1, 3])
155 }Source§impl Tensor
impl Tensor
Sourcepub fn reshape(&self, new_shape: Vec<i32>) -> Tensor
pub fn reshape(&self, new_shape: Vec<i32>) -> Tensor
Reshape the tensor to the specified dimensions
Changes the shape of the tensor while preserving the total number of elements. This operation returns a view when the tensor is contiguous, avoiding data copying. For non-contiguous tensors, data is copied to ensure the reshape is valid.
The reshape operation supports automatic dimension inference using -1, which allows one dimension to be automatically calculated based on the total number of elements and the other specified dimensions.
§Arguments
- new_shape - Target shape for the tensor. Use -1 for one dimension to have it automatically inferred from the total size.
§Returns
A new tensor with the specified shape containing the same data as the original tensor.
§Panics
- If more than one dimension is -1
- If the total number of elements doesn’t match the original tensor
- If any dimension size is 0 or less than -1
- If the inferred dimension size is not a whole number
§Examples
use train_station::Tensor;
// Basic reshape
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
let reshaped = tensor.reshape(vec![3, 2]);
assert_eq!(reshaped.shape().dims(), vec![3, 2]);
assert_eq!(reshaped.get(&[0, 0]), 1.0);
assert_eq!(reshaped.get(&[2, 1]), 6.0);
use train_station::Tensor;
// Using -1 for automatic dimension inference
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![4]).unwrap();
let reshaped = tensor.reshape(vec![2, -1]);
assert_eq!(reshaped.shape().dims(), vec![2, 2]);
assert_eq!(reshaped.get(&[0, 0]), 1.0);
assert_eq!(reshaped.get(&[1, 1]), 4.0);
use train_station::Tensor;
// Reshape with gradient tracking
let mut tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
tensor.set_requires_grad(true);
let reshaped = tensor.reshape(vec![4]);
assert!(reshaped.requires_grad());
assert_eq!(reshaped.shape().dims(), vec![4]);
use train_station::Tensor;
// Reshape 3D tensor
let data: Vec<f32> = (0..24).map(|i| i as f32).collect();
let tensor = Tensor::from_slice(&data, vec![2, 3, 4]).unwrap();
let reshaped = tensor.reshape(vec![6, 4]);
assert_eq!(reshaped.shape().dims(), vec![6, 4]);
assert_eq!(reshaped.size(), 24);
§Performance
- Contiguous tensors: O(1) time complexity, returns a view
- Non-contiguous tensors: O(n) time complexity with data copying
- Memory usage: No additional allocation for view operations
- Gradient tracking: Preserves gradient requirements and tracking
§Automatic Dimension Inference
When using -1 for a dimension, the size is automatically calculated:
use train_station::Tensor;
// For a tensor with 12 elements
let data: Vec<f32> = (0..12).map(|i| i as f32).collect();
let tensor = Tensor::from_slice(&data, vec![3, 4]).unwrap();
let reshaped1 = tensor.reshape(vec![3, -1]); // Results in shape [3, 4]
let reshaped2 = tensor.reshape(vec![-1, 6]); // Results in shape [2, 6]
let reshaped3 = tensor.reshape(vec![-1]); // Results in shape [12]
Examples found in repository?
319pub fn main() -> Result<(), Box<dyn std::error::Error>> {
320 println!("=== PPO Discrete Example (YardEnv) ===");
321
322 let state_dim = 3usize;
323 let action_dim = 3usize;
324 let total_steps = std::env::var("PPOD_STEPS")
325 .ok()
326 .and_then(|v| v.parse::<usize>().ok())
327 .unwrap_or(3500usize);
328 let horizon = 128usize;
329 let epochs = 4usize;
330 let mini_batch_size = 64usize;
331 let gamma = 0.99f32;
332 let lam = 0.95f32;
333 let clip_eps = 0.2f32;
334 let vf_coef = 0.5f32;
335 let ent_coef = 0.0f32;
336 let max_grad_norm = 1.0f32;
337
338 let mut actor = Actor::new(state_dim, action_dim, Some(111));
339 let mut critic = Critic::new(state_dim, Some(222));
340 let mut actor_opt = Adam::with_learning_rate(3e-4);
341 for p in actor.parameters() {
342 actor_opt.add_parameter(p);
343 }
344 let mut critic_opt = Adam::with_learning_rate(3e-4);
345 for p in critic.parameters() {
346 critic_opt.add_parameter(p);
347 }
348
349 let mut env = YardEnv::new(1234);
350 let mut rng = SmallRng::new(98765);
351 let mut state = env.reset();
352 let mut episode_return = 0.0f32;
353 let mut episode = 0usize;
354 let mut ema_return: Option<f32> = None;
355 let ema_alpha = 0.05f32;
356 let mut best_return = f32::NEG_INFINITY;
357
358 let mut t = 0usize;
359 while t < total_steps {
360 let mut batch = RolloutBatch::new(horizon, state_dim);
361 for _ in 0..horizon {
362 // Actor logits and categorical sampling
363 let logits = actor.forward(&state); // [1, A]
364 let probs = logits.softmax(1); // [1, A]
365 // sample action from probs (CPU sampling)
366 let p = probs.data();
367 let (p0, p1, _p2) = (p[0], p[1], p[2]);
368 let u = rng.next_f32();
369 let a_idx = if u < p0 {
370 0
371 } else if u < p0 + p1 {
372 1
373 } else {
374 2
375 };
376
377 let old_logp = {
378 let _ng = NoGradTrack::new();
379 let lp = log_prob_actions(&logits, &[a_idx], 1, action_dim);
380 lp.data()[0]
381 };
382
383 // Step env
384 let (next_state, reward, done) = env.step(a_idx);
385 episode_return += reward;
386
387 // Critic value
388 let value_t = critic.forward(&state);
389 let value_v = value_t.data()[0];
390
391 batch.push(
392 state.data(),
393 a_idx,
394 old_logp,
395 reward,
396 if done { 1.0 } else { 0.0 },
397 value_v,
398 next_state.data(),
399 );
400
401 state = if done {
402 let st = env.reset();
403 ema_return = Some(match ema_return {
404 None => episode_return,
405 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
406 });
407 if episode_return > best_return {
408 best_return = episode_return;
409 }
410 println!(
411 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
412 t,
413 episode,
414 episode_return,
415 ema_return.unwrap_or(episode_return),
416 best_return
417 );
418 episode_return = 0.0;
419 episode += 1;
420 st
421 } else {
422 next_state
423 };
424
425 t += 1;
426 if t >= total_steps {
427 break;
428 }
429 }
430
431 // Bootstrap values for GAE
432 let next_values: Vec<f32> = {
433 let mut out = Vec::with_capacity(batch.len());
434 for i in 0..batch.len() {
435 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
436 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
437 out.push(critic.forward(&s2_t).data()[0]);
438 }
439 out
440 };
441
442 let mut returns = vec![0.0f32; batch.len()];
443 let mut adv = vec![0.0f32; batch.len()];
444 compute_gae(
445 &mut returns,
446 &mut adv,
447 &batch.rewards,
448 &batch.dones,
449 &batch.values,
450 &next_values,
451 gamma,
452 lam,
453 );
454 normalize_in_place(&mut adv, 1e-8);
455
456 // Tensors for training
457 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
458 let actions_vec = batch.actions.clone();
459 let old_logp_t = Tensor::from_slice(&batch.old_logps, vec![batch.len(), 1]).unwrap();
460 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
461 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
462
463 // PPO epochs
464 let num_minibatches = batch.len().div_ceil(mini_batch_size);
465 for e in 0..epochs {
466 for mb in 0..num_minibatches {
467 let start = mb * mini_batch_size;
468 let end = (start + mini_batch_size).min(batch.len());
469 if start >= end {
470 break;
471 }
472
473 // Views
474 let s_mb = states_t
475 .slice_view(start * state_dim, 1, (end - start) * state_dim)
476 .reshape(vec![(end - start) as i32, state_dim as i32]);
477 let oldlp_mb = old_logp_t
478 .slice_view(start, 1, end - start)
479 .reshape(vec![(end - start) as i32, 1]);
480 let ret_mb = returns_t
481 .slice_view(start, 1, end - start)
482 .reshape(vec![(end - start) as i32, 1]);
483 let adv_mb = adv_t
484 .slice_view(start, 1, end - start)
485 .reshape(vec![(end - start) as i32, 1]);
486 let a_slice = &actions_vec[start..end];
487
488 // Zero grads
489 {
490 let mut ps = actor.parameters();
491 actor_opt.zero_grad(&mut ps);
492 }
493 {
494 let mut ps = critic.parameters();
495 critic_opt.zero_grad(&mut ps);
496 }
497
498 // Forward
499 let logits_mb = actor.forward(&s_mb); // [B,A]
500 let new_logp_mb = log_prob_actions(&logits_mb, a_slice, end - start, action_dim); // [B,1]
501 let ratio = ratio_from_logps(&new_logp_mb, &oldlp_mb);
502 let ratio_clipped = clamp_ratio(&ratio, clip_eps);
503 let pg1 = ratio.mul_tensor(&adv_mb);
504 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
505 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
506 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
507 let actor_loss = actor_min.mul_scalar(-1.0).mean();
508
509 let v_pred = critic.forward(&s_mb);
510 let v_loss = v_pred
511 .sub_tensor(&ret_mb)
512 .pow_scalar(2.0)
513 .mean()
514 .mul_scalar(vf_coef);
515
516 // Entropy bonus from logits (categorical entropy) ≈ -sum p*logp
517 let probs_mb = logits_mb.softmax(1);
518 let logp_all = probs_mb.add_scalar(1e-8).log();
519 let ent = probs_mb
520 .mul_tensor(&logp_all)
521 .sum_dims(&[1], true)
522 .mul_scalar(-1.0)
523 .mean()
524 .mul_scalar(ent_coef);
525
526 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&ent);
527 loss.backward(None);
528
529 // Step actor
530 {
531 let params = actor.parameters();
532 let mut with_grads: Vec<&mut Tensor> = Vec::new();
533 for p in params {
534 if p.grad_owned().is_some() {
535 with_grads.push(p);
536 }
537 }
538 if !with_grads.is_empty() {
539 let _ = grad_global_norm(&mut with_grads);
540 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
541 actor_opt.step(&mut with_grads);
542 actor_opt.zero_grad(&mut with_grads);
543 }
544 }
545
546 // Step critic
547 {
548 let params = critic.parameters();
549 let mut with_grads: Vec<&mut Tensor> = Vec::new();
550 for p in params {
551 if p.grad_owned().is_some() {
552 with_grads.push(p);
553 }
554 }
555 if !with_grads.is_empty() {
556 let _ = grad_global_norm(&mut with_grads);
557 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
558 critic_opt.step(&mut with_grads);
559 critic_opt.zero_grad(&mut with_grads);
560 }
561 }
562
563 if e == 0 && mb == 0 {
564 println!(
565 "update@t={} | actor_loss={:.4} v_loss={:.4}",
566 t,
567 actor_loss.value(),
568 v_loss.value()
569 );
570 }
571
572 clear_all_graphs_known();
573 }
574 }
575 }
576
577 println!("=== PPO discrete training finished ===");
578 Ok(())
579}
More examples
329pub fn main() -> Result<(), Box<dyn std::error::Error>> {
330 println!("=== PPO Continuous Example (YardEnv) ===");
331
332 let state_dim = 3usize;
333 let action_dim = 1usize;
334
335 // Hparams
336 let total_steps = std::env::var("PPO_STEPS")
337 .ok()
338 .and_then(|v| v.parse::<usize>().ok())
339 .unwrap_or(4000usize);
340 let horizon = 128usize; // rollout length per update
341 let epochs = 4usize; // PPO epochs per update
342 let mini_batch_size = 64usize; // minibatch from horizon
343 let gamma = 0.99f32;
344 let lam = 0.95f32; // GAE lambda
345 let clip_eps = 0.2f32;
346 let vf_coef = 0.5f32;
347 let ent_coef = 0.0f32;
348 let max_grad_norm = 1.0f32;
349
350 // Models
351 let mut actor = Actor::new(state_dim, action_dim, Some(101));
352 let mut critic = Critic::new(state_dim, Some(202));
353
354 // Opts
355 let mut actor_opt = Adam::with_learning_rate(3e-4);
356 for p in actor.parameters() {
357 actor_opt.add_parameter(p);
358 }
359 let mut critic_opt = Adam::with_learning_rate(3e-4);
360 for p in critic.parameters() {
361 critic_opt.add_parameter(p);
362 }
363
364 // Env and RNG
365 let mut env = YardEnv::new(42);
366 let mut rng = SmallRng::new(999);
367 let mut state = env.reset();
368
369 // Metrics
370 let mut episode_return = 0.0f32;
371 let mut episode = 0usize;
372 let mut ema_return: Option<f32> = None;
373 let ema_alpha = 0.05f32;
374 let mut best_return = f32::NEG_INFINITY;
375
376 let mut t = 0usize;
377 while t < total_steps {
378 // Collect a rollout
379 let mut batch = RolloutBatch::new(horizon, state_dim);
380 for _ in 0..horizon {
381                // Policy forward (sampling is detached so the graph does not grow; we use stored log_probs)
382 let (mean, log_std_row) = actor.forward(&state);
383 let mean_v = mean.data()[0];
384 let log_std_v = log_std_row.data()[0];
385 let std_v = log_std_v.exp();
386 let noise = rng.normal();
387 let action_v = (mean_v + std_v * noise).clamp(-1.0, 1.0);
388
389 // Build action tensor [1, A] for log_prob calculation with autograd
390 let action_t = Tensor::from_slice(&[action_v], vec![1, action_dim]).unwrap();
391 let log_prob_t = gaussian_log_prob(&action_t, &mean, &log_std_row);
392 let log_prob_v = log_prob_t.data()[0];
393
394 // Step env
395 let (next_state, reward, done) = env.step(action_v);
396 episode_return += reward;
397
398 // Value
399 let value_t = critic.forward(&state);
400 let value_v = value_t.data()[0];
401
402 // Push
403 batch.push(
404 state.data(),
405 action_v,
406 log_prob_v,
407 reward,
408 if done { 1.0 } else { 0.0 },
409 value_v,
410 next_state.data(),
411 );
412
413 // Reset
414 state = if done {
415 let st = env.reset();
416 ema_return = Some(match ema_return {
417 None => episode_return,
418 Some(prev) => prev * (1.0 - ema_alpha) + ema_alpha * episode_return,
419 });
420 if episode_return > best_return {
421 best_return = episode_return;
422 }
423 println!(
424 "step {:5} | episode {:4} return={:.3} ema={:.3} best={:.3}",
425 t,
426 episode,
427 episode_return,
428 ema_return.unwrap_or(episode_return),
429 best_return
430 );
431 episode_return = 0.0;
432 episode += 1;
433 st
434 } else {
435 next_state
436 };
437
438 t += 1;
439 if t >= total_steps {
440 break;
441 }
442 }
443
444 // Bootstrap next values for GAE
445 let next_values: Vec<f32> = {
446 let mut out = Vec::with_capacity(batch.len());
447 for i in 0..batch.len() {
448 let s2 = &batch.next_states[i * state_dim..(i + 1) * state_dim];
449 let s2_t = Tensor::from_slice(s2, vec![1, state_dim]).unwrap();
450 let v2 = critic.forward(&s2_t).data()[0];
451 out.push(v2);
452 }
453 out
454 };
455
456 // Compute returns and advantages
457 let mut returns = vec![0.0f32; batch.len()];
458 let mut adv = vec![0.0f32; batch.len()];
459 compute_gae(
460 &mut returns,
461 &mut adv,
462 &batch.rewards,
463 &batch.dones,
464 &batch.values,
465 &next_values,
466 gamma,
467 lam,
468 );
469 normalize_in_place(&mut adv, 1e-8);
470
471 // Prepare tensors for training
472 let states_t = Tensor::from_slice(&batch.states, vec![batch.len(), state_dim]).unwrap();
473 let actions_t = Tensor::from_slice(&batch.actions, vec![batch.len(), action_dim]).unwrap();
474 let old_logp_t = Tensor::from_slice(&batch.log_probs, vec![batch.len(), 1]).unwrap();
475 let returns_t = Tensor::from_slice(&returns, vec![batch.len(), 1]).unwrap();
476 let adv_t = Tensor::from_slice(&adv, vec![batch.len(), 1]).unwrap();
477
478 // PPO epochs over the rollout
479 let num_minibatches = batch.len().div_ceil(mini_batch_size);
480 for e in 0..epochs {
481 for mb in 0..num_minibatches {
482 let start = mb * mini_batch_size;
483 let end = (start + mini_batch_size).min(batch.len());
484 if start >= end {
485 break;
486 }
487
488 // Slice views
489 let s_mb = states_t.slice_view(start * state_dim, 1, (end - start) * state_dim);
490 let s_mb = s_mb.reshape(vec![(end - start) as i32, state_dim as i32]);
491 let a_mb = actions_t
492 .slice_view(start * action_dim, 1, (end - start) * action_dim)
493 .reshape(vec![(end - start) as i32, action_dim as i32]);
494 let oldlp_mb = old_logp_t
495 .slice_view(start, 1, end - start)
496 .reshape(vec![(end - start) as i32, 1]);
497 let ret_mb = returns_t
498 .slice_view(start, 1, end - start)
499 .reshape(vec![(end - start) as i32, 1]);
500 let adv_mb = adv_t
501 .slice_view(start, 1, end - start)
502 .reshape(vec![(end - start) as i32, 1]);
503
504 // Zero grads
505 {
506 let mut ps = actor.parameters();
507 actor_opt.zero_grad(&mut ps);
508 }
509 {
510 let mut ps = critic.parameters();
511 critic_opt.zero_grad(&mut ps);
512 }
513
514 // Forward actor and critic
515 let (mean_mb, log_std_row) = actor.forward(&s_mb);
516 let logp_mb = gaussian_log_prob(&a_mb, &mean_mb, &log_std_row);
517 let ratio = logp_mb.sub_tensor(&oldlp_mb).exp(); // exp(new-old)
518 let clip_low =
519 Tensor::from_slice(&vec![1.0 - clip_eps; end - start], vec![end - start, 1])
520 .unwrap();
521 let clip_high =
522 Tensor::from_slice(&vec![1.0 + clip_eps; end - start], vec![end - start, 1])
523 .unwrap();
524 // ratio_clipped = min(max(ratio, low), high) using ReLU identities
525 let ratio_ge_low = ratio.sub_tensor(&clip_low).relu().add_tensor(&clip_low);
526 let ratio_clipped =
527 clip_high.sub_tensor(&ratio_ge_low.sub_tensor(&clip_high).relu());
528 let pg1 = ratio.mul_tensor(&adv_mb);
529 let pg2 = ratio_clipped.mul_tensor(&adv_mb);
530 // min(pg1, pg2) = pg2 - relu(pg2 - pg1)
531 let actor_min = pg2.sub_tensor(&pg2.sub_tensor(&pg1).relu());
532 let actor_loss = actor_min.mul_scalar(-1.0).mean();
533
534 let v_pred = critic.forward(&s_mb);
535 let v_loss = v_pred
536 .sub_tensor(&ret_mb)
537 .pow_scalar(2.0)
538 .mean()
539 .mul_scalar(vf_coef);
540
541 // Entropy (approx Gaussian entropy per action)
542 let entropy = log_std_row
543 .add_scalar(0.5 * (2.0 * std::f32::consts::PI * std::f32::consts::E).ln())
544 .sum_dims(&[1], true)
545 .mean()
546 .mul_scalar(ent_coef);
547
548 let mut loss = actor_loss.add_tensor(&v_loss).sub_tensor(&entropy);
549 loss.backward(None);
550
551 // Step actor
552 {
553 let params = actor.parameters();
554 let mut with_grads: Vec<&mut Tensor> = Vec::new();
555 for p in params {
556 if p.grad_owned().is_some() {
557 with_grads.push(p);
558 }
559 }
560 if !with_grads.is_empty() {
561 let _ = grad_global_norm(&mut with_grads);
562 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
563 actor_opt.step(&mut with_grads);
564 actor_opt.zero_grad(&mut with_grads);
565 }
566 }
567
568 // Step critic
569 {
570 let params = critic.parameters();
571 let mut with_grads: Vec<&mut Tensor> = Vec::new();
572 for p in params {
573 if p.grad_owned().is_some() {
574 with_grads.push(p);
575 }
576 }
577 if !with_grads.is_empty() {
578 let _ = grad_global_norm(&mut with_grads);
579 clip_gradients(&mut with_grads, max_grad_norm, 1e-6);
580 critic_opt.step(&mut with_grads);
581 critic_opt.zero_grad(&mut with_grads);
582 }
583 }
584
585 // Occasionally log
586 if e == 0 && mb == 0 {
587 println!(
588 "update@t={} | actor_loss={:.4} v_loss={:.4}",
589 t,
590 actor_loss.value(),
591 v_loss.value()
592 );
593 }
594
595 clear_all_graphs_known();
596 }
597 }
598 }
599
600 println!("=== PPO training finished ===");
601 Ok(())
602}
Source§impl Tensor
impl Tensor
Sourcepub fn split(&self, split_size: usize, dim: usize) -> Vec<Tensor>
pub fn split(&self, split_size: usize, dim: usize) -> Vec<Tensor>
Split tensor into chunks of equal size along specified dimension
Divides the tensor into multiple smaller tensors along the specified dimension, where each chunk (except possibly the last) has the same size. The last chunk may be smaller if the dimension size is not evenly divisible by the split size.
This operation returns a vector of tensors, where each tensor is a view or copy of a portion of the original tensor. The first chunk is returned as a view when possible (zero-copy), while subsequent chunks may require data copying for non-zero base offsets.
§Arguments
- split_size - Size of each chunk along the specified dimension (must be > 0)
- dim - Dimension along which to split the tensor (must be < tensor rank)
§Returns
A vector of tensors, each representing a chunk of the original tensor. The number of chunks depends on the dimension size and split size.
§Panics
- If tensor rank is 0 (scalar tensors cannot be split)
- If dim is out of bounds for the tensor rank
- If split_size is 0
§Examples
use train_station::Tensor;
// Split 2D tensor into equal chunks along dimension 1
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
let parts = tensor.split(1, 1);
assert_eq!(parts.len(), 3);
assert_eq!(parts[0].shape().dims(), vec![2, 1]);
assert_eq!(parts[1].shape().dims(), vec![2, 1]);
assert_eq!(parts[2].shape().dims(), vec![2, 1]);
assert_eq!(parts[0].get(&[0, 0]), 1.0);
assert_eq!(parts[1].get(&[0, 0]), 2.0);
assert_eq!(parts[2].get(&[1, 0]), 6.0);
use train_station::Tensor;
// Split with uneven division (last chunk smaller)
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![1, 5]).unwrap();
let parts = tensor.split(2, 1);
assert_eq!(parts.len(), 3);
assert_eq!(parts[0].shape().dims(), vec![1, 2]);
assert_eq!(parts[1].shape().dims(), vec![1, 2]);
assert_eq!(parts[2].shape().dims(), vec![1, 1]); // Last chunk smaller
use train_station::Tensor;
// Split with gradient tracking
let mut tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
tensor.set_requires_grad(true);
let parts = tensor.split(1, 1);
assert_eq!(parts.len(), 2);
assert!(parts[0].requires_grad());
assert!(parts[1].requires_grad());
use train_station::Tensor;
// Split 1D tensor
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![6]).unwrap();
let parts = tensor.split(2, 0);
assert_eq!(parts.len(), 3);
assert_eq!(parts[0].shape().dims(), vec![2]);
assert_eq!(parts[1].shape().dims(), vec![2]);
assert_eq!(parts[2].shape().dims(), vec![2]);
§Performance
- First Chunk: O(1) - Returns a view when possible (zero-copy)
- Subsequent Chunks: O(n) - May require data copying for non-zero offsets
- Memory Usage: Minimal allocation for view operations, copying for non-zero offsets
- Gradient Tracking: Each chunk preserves gradient requirements and tracking
§Relationship to Other Operations
This operation is related to other tensor transformations:
- split_with_sizes() - More general version with explicit chunk sizes
- cat() - Inverse operation that concatenates tensors back together
- chunk() - Alternative splitting operation with different semantics
§Memory Layout
The first chunk maintains the same underlying data as a view when the base offset is zero. Subsequent chunks may require data copying to handle non-zero base offsets, ensuring proper memory layout.
Sourcepub fn split_with_sizes(&self, split_sizes: &[usize], dim: usize) -> Vec<Tensor>
pub fn split_with_sizes(&self, split_sizes: &[usize], dim: usize) -> Vec<Tensor>
Split tensor into chunks with explicit sizes along specified dimension
Divides the tensor into multiple smaller tensors along the specified
dimension according to the provided size specifications. Each chunk
has the exact size specified in the split_sizes array, and the sum
of all sizes must equal the size of the specified dimension.
This operation provides precise control over the size of each resulting
chunk, unlike split() which creates equal-sized chunks. The first
chunk is returned as a view when possible (zero-copy), while subsequent
chunks may require data copying for non-zero base offsets.
§Arguments
- split_sizes - Array specifying the size of each chunk along the dimension
- dim - Dimension along which to split the tensor (must be < tensor rank)
§Returns
A vector of tensors, each representing a chunk of the original tensor
with the specified size. The number of chunks equals the length of split_sizes.
§Panics
- If tensor rank is 0 (scalar tensors cannot be split)
- If dim is out of bounds for the tensor rank
- If sum of split_sizes does not equal the size of the specified dimension
§Examples
use train_station::Tensor;
// Split with explicit sizes
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![1, 5]).unwrap();
let parts = tensor.split_with_sizes(&[2, 3], 1);
assert_eq!(parts.len(), 2);
assert_eq!(parts[0].shape().dims(), vec![1, 2]);
assert_eq!(parts[1].shape().dims(), vec![1, 3]);
assert_eq!(parts[0].get(&[0, 0]), 1.0);
assert_eq!(parts[0].get(&[0, 1]), 2.0);
assert_eq!(parts[1].get(&[0, 0]), 3.0);
use train_station::Tensor;
// Split 2D tensor with different chunk sizes
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
let parts = tensor.split_with_sizes(&[1, 2], 1);
assert_eq!(parts.len(), 2);
assert_eq!(parts[0].shape().dims(), vec![2, 1]);
assert_eq!(parts[1].shape().dims(), vec![2, 2]);
use train_station::Tensor;
// Split with gradient tracking
let mut tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
tensor.set_requires_grad(true);
let parts = tensor.split_with_sizes(&[1, 1], 1);
assert_eq!(parts.len(), 2);
assert!(parts[0].requires_grad());
assert!(parts[1].requires_grad());
use train_station::Tensor;
// Split 1D tensor with explicit sizes
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![6]).unwrap();
let parts = tensor.split_with_sizes(&[2, 2, 2], 0);
assert_eq!(parts.len(), 3);
assert_eq!(parts[0].shape().dims(), vec![2]);
assert_eq!(parts[1].shape().dims(), vec![2]);
assert_eq!(parts[2].shape().dims(), vec![2]);
§Performance
- First Chunk: O(1) - Returns a view when possible (zero-copy)
- Subsequent Chunks: O(n) - May require data copying for non-zero offsets
- Memory Usage: Minimal allocation for view operations, copying for non-zero offsets
- Gradient Tracking: Each chunk preserves gradient requirements and tracking
§Relationship to Other Operations
This operation is related to other tensor transformations:
- split() - Simplified version with equal-sized chunks
- cat() - Inverse operation that concatenates tensors back together
- chunk() - Alternative splitting operation with different semantics
§Memory Layout
The first chunk maintains the same underlying data as a view when the base offset is zero. Subsequent chunks may require data copying to handle non-zero base offsets, ensuring proper memory layout. Zero-sized chunks are handled by creating empty tensors with appropriate shapes.
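The zero-sized-chunk behavior described above can be exercised directly. The following is a small sketch (not taken from the crate's doc tests) that assumes a zero entry is accepted as long as the sizes still sum to the dimension size:
use train_station::Tensor;
// Split a [1, 3] tensor into a zero-sized chunk and a full chunk (0 + 3 = 3)
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![1, 3]).unwrap();
let parts = tensor.split_with_sizes(&[0, 3], 1);
assert_eq!(parts.len(), 2);
// The first chunk is an empty tensor (assumed behavior per the note above)
assert_eq!(parts[0].size(), 0);
// The second chunk carries all of the data
assert_eq!(parts[1].shape().dims(), vec![1, 3]);
assert_eq!(parts[1].get(&[0, 2]), 3.0);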
Source§impl Tensor
impl Tensor
Sourcepub fn squeeze(&self, dim: Option<usize>) -> Tensor
pub fn squeeze(&self, dim: Option<usize>) -> Tensor
Remove dimensions of size 1 from the tensor
Removes singleton dimensions (dimensions with size 1) from the tensor, reducing its rank while preserving the total number of elements. This operation is useful for cleaning up tensor shapes and preparing data for operations that expect specific dimensionality.
The squeeze operation can remove either all size-1 dimensions or a
specific dimension if it has size 1. When all dimensions are size 1,
the result is a scalar tensor with shape [1] rather than an empty
tensor to maintain mathematical consistency.
§Arguments
- dim - Optional specific dimension to squeeze. If None, all size-1 dimensions are removed. If Some(d), only dimension d is removed if it has size 1.
§Returns
A new tensor with size-1 dimensions removed. The total number of elements remains unchanged.
§Panics
- If dim is specified but out of bounds for the tensor rank
- If dim is specified but the dimension does not have size 1
§Examples
use train_station::Tensor;
// Squeeze all size-1 dimensions
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![1, 3, 1]).unwrap();
let squeezed = tensor.squeeze(None);
assert_eq!(squeezed.shape().dims(), vec![3]);
assert_eq!(squeezed.get(&[0]), 1.0);
assert_eq!(squeezed.get(&[1]), 2.0);
assert_eq!(squeezed.get(&[2]), 3.0);
use train_station::Tensor;
// Squeeze specific dimension
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![1, 3, 1]).unwrap();
let squeezed = tensor.squeeze(Some(0));
assert_eq!(squeezed.shape().dims(), vec![3, 1]);
assert_eq!(squeezed.get(&[0, 0]), 1.0);
assert_eq!(squeezed.get(&[1, 0]), 2.0);
assert_eq!(squeezed.get(&[2, 0]), 3.0);
use train_station::Tensor;
// Squeeze preserves data integrity
let data = vec![1.0, 2.0, 3.0, 4.0];
let tensor = Tensor::from_slice(&data, vec![1, 2, 1, 2]).unwrap();
let squeezed = tensor.squeeze(None);
assert_eq!(squeezed.shape().dims(), vec![2, 2]);
assert_eq!(squeezed.size(), 4);
assert_eq!(squeezed.get(&[0, 0]), data[0]);
assert_eq!(squeezed.get(&[0, 1]), data[1]);
assert_eq!(squeezed.get(&[1, 0]), data[2]);
assert_eq!(squeezed.get(&[1, 1]), data[3]);
use train_station::Tensor;
// Handle edge case: all dimensions are size 1
let tensor = Tensor::from_slice(&[5.0], vec![1, 1, 1]).unwrap();
let squeezed = tensor.squeeze(None);
assert_eq!(squeezed.shape().dims(), vec![1]); // Not empty!
assert_eq!(squeezed.get(&[0]), 5.0);
use train_station::Tensor;
// Squeeze with gradient tracking
let mut tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![1, 3, 1]).unwrap();
tensor.set_requires_grad(true);
let squeezed = tensor.squeeze(None);
assert!(squeezed.requires_grad());
assert_eq!(squeezed.shape().dims(), vec![3]);
use train_station::Tensor;
// Squeeze and unsqueeze roundtrip
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let unsqueezed = tensor.unsqueeze(0);
assert_eq!(unsqueezed.shape().dims(), vec![1, 3]);
let squeezed = unsqueezed.squeeze(Some(0));
assert_eq!(squeezed.shape().dims(), vec![3]);
assert_eq!(squeezed.get(&[0]), 1.0);
assert_eq!(squeezed.get(&[2]), 3.0);
§Performance
- Time Complexity: O(1) - Returns a view through reshape operation
- Memory Usage: No additional memory allocation (view operation)
- Gradient Tracking: Preserves gradient requirements and tracking
- Shape Transformation: Reduces tensor rank by removing singleton dimensions
§Relationship to Other Operations
This operation is related to other tensor transformations:
- unsqueeze() - Inverse operation that adds size-1 dimensions
- reshape() - More general shape transformation operation
- flatten() - Reduces tensor to 1D by combining all dimensions
§Memory Layout
The squeezed tensor maintains the same underlying data as the original tensor through the reshape operation. This ensures zero-copy behavior when the tensor is contiguous, with only the shape metadata being modified to reflect the reduced dimensionality.
§Edge Cases
- All size-1 dimensions: Returns a tensor with shape [1] rather than an empty tensor to maintain mathematical consistency
- No size-1 dimensions: Returns a tensor with the same shape as the input
- Mixed dimensions: Only removes dimensions with size 1, preserving others
Source§impl Tensor
impl Tensor
Sourcepub fn stack(tensors: &[Tensor], dim: usize) -> Tensor
pub fn stack(tensors: &[Tensor], dim: usize) -> Tensor
Stack a list of tensors along a new dimension
Combines multiple tensors by adding a new dimension at the specified
position. All input tensors must have identical shapes, and the output
tensor will have a new dimension of size equal to the number of input
tensors. This operation is similar to PyTorch’s torch.stack function.
The stacking operation creates a new axis in the output tensor, unlike concatenation which operates along existing dimensions. This makes stacking useful for creating batch dimensions, combining feature maps, and implementing operations that require adding new tensor axes.
§Arguments
- tensors - Array of tensors to stack. All tensors must have identical shapes.
- dim - Index of the new axis in the output shape (0 <= dim <= rank)
§Returns
A new tensor with the stacked data. The output shape is the input shape
with a new dimension of size tensors.len() inserted at position dim.
§Panics
- If the tensor array is empty
- If any tensor has a different shape than the first tensor
- If dim is out of bounds (dim > rank of input tensors)
§Examples
use train_station::Tensor;
// Stack two 1D tensors along dimension 0
let a = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let b = Tensor::from_slice(&[4.0, 5.0, 6.0], vec![3]).unwrap();
let stacked = Tensor::stack(&[a, b], 0);
assert_eq!(stacked.shape().dims(), vec![2, 3]);
assert_eq!(stacked.get(&[0, 0]), 1.0);
assert_eq!(stacked.get(&[1, 2]), 6.0);
use train_station::Tensor;
// Stack multiple 2D tensors along dimension 1
let a = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let b = Tensor::from_slice(&[5.0, 6.0, 7.0, 8.0], vec![2, 2]).unwrap();
let c = Tensor::from_slice(&[9.0, 10.0, 11.0, 12.0], vec![2, 2]).unwrap();
let stacked = Tensor::stack(&[a, b, c], 1);
assert_eq!(stacked.shape().dims(), vec![2, 3, 2]);
assert_eq!(stacked.get(&[0, 0, 0]), 1.0);
assert_eq!(stacked.get(&[1, 2, 1]), 12.0);
use train_station::Tensor;
// Stack with gradient tracking
let mut a = Tensor::from_slice(&[1.0, 2.0], vec![2]).unwrap();
let mut b = Tensor::from_slice(&[3.0, 4.0], vec![2]).unwrap();
a.set_requires_grad(true);
b.set_requires_grad(true);
let stacked = Tensor::stack(&[a, b], 0);
assert!(stacked.requires_grad());
assert_eq!(stacked.shape().dims(), vec![2, 2]);
use train_station::Tensor;
// Stack 3D tensors along the last dimension
let data1: Vec<f32> = (0..8).map(|i| i as f32).collect();
let data2: Vec<f32> = (8..16).map(|i| i as f32).collect();
let a = Tensor::from_slice(&data1, vec![2, 2, 2]).unwrap();
let b = Tensor::from_slice(&data2, vec![2, 2, 2]).unwrap();
let stacked = Tensor::stack(&[a, b], 3);
assert_eq!(stacked.shape().dims(), vec![2, 2, 2, 2]);
assert_eq!(stacked.get(&[0, 0, 0, 0]), 0.0);
assert_eq!(stacked.get(&[1, 1, 1, 1]), 15.0);
§Performance
- Time Complexity: O(n) where n is the total number of elements
- Memory Usage: Allocates new contiguous tensor for output
- SIMD Optimization: Uses AVX2 acceleration for large block copies
- Block-wise Copying: Optimized copying strategy for better cache performance
- Gradient Tracking: Preserves gradient requirements and tracking
§Relationship to Other Operations
This operation is related to other tensor transformations:
- cat() - Concatenates tensors along existing dimensions
- unsqueeze() - Adds a single dimension of size 1
- reshape() - Changes tensor shape without adding dimensions
§Memory Layout
The output tensor is always contiguous, with elements arranged so that the stacked dimension is the fastest-changing index. This ensures optimal performance for subsequent operations and maintains compatibility with SIMD optimizations.
§Gradient Computation
During backward passes, gradients are split along the stacked dimension and distributed back to the original input tensors. This is implemented using the same gradient function as concatenation, treating the stack operation as concatenation along a new axis.
Source§impl Tensor
impl Tensor
Sourcepub fn transpose(&self, dim0: usize, dim1: usize) -> Tensor
pub fn transpose(&self, dim0: usize, dim1: usize) -> Tensor
Transpose two dimensions of the tensor
Swaps two specified dimensions of the tensor, modifying the shape and memory access pattern. When possible, this operation returns a zero-copy view using stride manipulation. For complex cases or non-contiguous tensors, data is copied to ensure correct transposition.
The transpose operation is its own inverse - applying transpose twice with the same dimensions returns the original tensor.
§Arguments
- dim0 - First dimension to swap (must be < tensor rank)
- dim1 - Second dimension to swap (must be < tensor rank)
§Returns
A new tensor with the specified dimensions transposed. The total number of elements remains unchanged.
§Panics
- If dim0 is out of bounds for the tensor rank
- If dim1 is out of bounds for the tensor rank
§Examples
use train_station::Tensor;
// Basic 2D transpose
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
let transposed = tensor.transpose(0, 1);
assert_eq!(transposed.shape().dims(), vec![3, 2]);
assert_eq!(transposed.get(&[0, 0]), 1.0);
assert_eq!(transposed.get(&[0, 1]), 4.0);
assert_eq!(transposed.get(&[1, 0]), 2.0);
assert_eq!(transposed.get(&[1, 1]), 5.0);
assert_eq!(transposed.get(&[2, 0]), 3.0);
assert_eq!(transposed.get(&[2, 1]), 6.0);
use train_station::Tensor;
// 3D tensor transpose
let data: Vec<f32> = (0..24).map(|i| i as f32).collect();
let tensor = Tensor::from_slice(&data, vec![2, 3, 4]).unwrap();
let transposed = tensor.transpose(0, 1);
assert_eq!(transposed.shape().dims(), vec![3, 2, 4]);
use train_station::Tensor;
// Transpose with gradient tracking
let mut tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
tensor.set_requires_grad(true);
let transposed = tensor.transpose(0, 1);
assert!(transposed.requires_grad());
assert_eq!(transposed.shape().dims(), vec![2, 2]);
use train_station::Tensor;
// Transpose same dimension (no change)
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let result = tensor.transpose(1, 1);
assert_eq!(result.shape().dims(), tensor.shape().dims());
assert_eq!(result.get(&[0, 0]), tensor.get(&[0, 0]));
use train_station::Tensor;
// Transpose is its own inverse
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let transposed = tensor.transpose(0, 1);
let double_transposed = transposed.transpose(0, 1);
assert_eq!(double_transposed.shape().dims(), tensor.shape().dims());
assert_eq!(double_transposed.get(&[0, 0]), tensor.get(&[0, 0]));
§Performance
- Contiguous tensors: O(1) time complexity, returns a view
- Non-contiguous tensors: O(n) time complexity with data copying
- Memory usage: No additional allocation for view operations
- Gradient tracking: Preserves gradient requirements and tracking
§Relationship to Other Operations
This operation is related to other tensor transformations:
- t() - Convenience method for matrix transpose (last two dimensions)
- permute() - More general dimension reordering operation
- reshape() - Changes shape without changing dimension order
§Memory Layout
For contiguous tensors, transpose returns a view with modified strides, making the tensor non-contiguous. For non-contiguous tensors or complex cases, data is copied to ensure correct transposition.
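The view behavior described above can be checked directly. This is a small sketch (not part of the crate's doc tests), assuming the non-contiguity of the returned view matches the description above:
use train_station::Tensor;
// A contiguous 2x3 tensor
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
assert!(tensor.is_contiguous());
// Transpose returns a strided view, so the result is no longer contiguous
let transposed = tensor.transpose(0, 1);
assert!(!transposed.is_contiguous());
// contiguous() materializes the data back into row-major storage
let materialized = transposed.contiguous();
assert!(materialized.is_contiguous());
assert_eq!(materialized.get(&[0, 1]), 4.0);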
Examples found in repository?
72 pub fn forward(
73 &self,
74 query: &Tensor,
75 key: &Tensor,
76 value: &Tensor,
77 attn_mask: Option<&Tensor>,
78 ) -> Tensor {
79 let qkv = Self::project_qkv(query, key, value, &self.q_proj, &self.k_proj, &self.v_proj);
80 let (q, k, v) = qkv;
81
82 // Split heads: [b, t, e] -> [b, h, t, d]
83 let (b, tq, _e) = Self::triple(query);
84 let (_b2, tk, _e2) = Self::triple(key);
85 let q = Self::split_heads(&q, b, tq, self.num_heads, self.head_dim);
86 let k = Self::split_heads(&k, b, tk, self.num_heads, self.head_dim);
87 let v = Self::split_heads(&v, b, tk, self.num_heads, self.head_dim);
88
89 // Scaled dot-product attention
90 // logits: [b, h, tq, tk]
91 let k_t = k.transpose(2, 3);
92 let mut logits = q.matmul(&k_t).div_scalar((self.head_dim as f32).sqrt());
93 if let Some(mask) = attn_mask {
94 let dims = mask.shape().dims().to_vec();
95 // If boolean-like mask matching [b,h,tq,tk], apply masked_fill
96 if dims.len() == 4 && dims[0] == b && dims[1] == self.num_heads && dims[2] == tq {
97 // Interpret mask > 0.5 as keep; we invert to build masked positions
98 let cond: Vec<bool> = mask.data().iter().map(|&v| v < 0.5).collect();
99 // Apply masked fill on a flattened view, then reshape back
100 let flat_logits = logits.view(vec![(b * self.num_heads * tq * tk) as i32]);
101 let filled = flat_logits.masked_fill(&cond, f32::NEG_INFINITY);
102 logits = filled.view(vec![b as i32, self.num_heads as i32, tq as i32, tk as i32]);
103 } else {
104 // Fallback: additive mask
105 logits = logits.add_tensor(mask);
106 }
107 }
108 let attn = logits.softmax(3);
109
110 // context: [b, h, tq, d]
111 let context = attn.matmul(&v);
112 let context = context.permute(vec![0, 2, 1, 3]); // [b, tq, h, d]
113 let context = context.contiguous().view(vec![
114 b as i32,
115 tq as i32,
116 (self.num_heads * self.head_dim) as i32,
117 ]);
118
119 // Output projection (flatten to 2D, project, then restore 3D)
120 let flat = context.view(vec![(b * tq) as i32, self.embed_dim as i32]);
121 let out2d = self.out_proj.forward(&flat);
122 out2d.view(vec![b as i32, tq as i32, self.embed_dim as i32])
123 }
Sourcepub fn t(&self) -> Tensor
pub fn t(&self) -> Tensor
Matrix transpose (transpose last two dimensions)
Convenience method for the common case of matrix transposition. For 2D tensors, this performs a standard matrix transpose. For higher-dimensional tensors, this transposes the last two dimensions, treating the tensor as a batch of matrices.
This method is equivalent to transpose(rank-2, rank-1) where
rank is the number of dimensions in the tensor.
§Returns
A new tensor with the last two dimensions transposed
§Panics
- If the tensor has less than 2 dimensions
§Examples
use train_station::Tensor;
// 2D matrix transpose
let matrix = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let transposed = matrix.t();
assert_eq!(transposed.shape().dims(), vec![2, 2]);
assert_eq!(transposed.get(&[0, 0]), 1.0);
assert_eq!(transposed.get(&[0, 1]), 3.0);
assert_eq!(transposed.get(&[1, 0]), 2.0);
assert_eq!(transposed.get(&[1, 1]), 4.0);
use train_station::Tensor;
// 3D tensor (batch of matrices)
let data: Vec<f32> = (0..12).map(|i| i as f32).collect();
let tensor = Tensor::from_slice(&data, vec![2, 2, 3]).unwrap();
let transposed = tensor.t();
assert_eq!(transposed.shape().dims(), vec![2, 3, 2]);
use train_station::Tensor;
// Matrix transpose with gradient tracking
let mut matrix = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
matrix.set_requires_grad(true);
let transposed = matrix.t();
assert!(transposed.requires_grad());
assert_eq!(transposed.shape().dims(), vec![2, 2]);
§Performance
- Time Complexity: Same as transpose() - O(1) for views, O(n) for copies
- Memory Usage: Same as transpose() - no allocation for views
- Gradient Tracking: Preserves gradient requirements and tracking
§Relationship to Other Operations
This operation is equivalent to:
use train_station::Tensor;
let tensor = Tensor::new(vec![2, 3, 4]);
let rank = tensor.shape().rank();
let transposed1 = tensor.t();
let transposed2 = tensor.transpose(rank - 2, rank - 1);
// transposed1 and transposed2 are identical
Source§impl Tensor
impl Tensor
Sourcepub fn unsqueeze(&self, dim: usize) -> Tensor
pub fn unsqueeze(&self, dim: usize) -> Tensor
Add a dimension of size 1 at the specified position
Inserts a new dimension of size 1 at the specified position in the tensor’s shape, increasing the rank by 1 while preserving the total number of elements. This operation is useful for preparing tensors for broadcasting, creating batch dimensions, and adapting tensor shapes for specific neural network operations.
The unsqueeze operation is the inverse of squeeze() - unsqueezing
a dimension and then squeezing it at the same position returns the
original tensor.
§Arguments
- dim - Position to insert the new dimension (0 <= dim <= rank)
§Returns
A new tensor with an additional dimension of size 1 at the specified position. The total number of elements remains unchanged.
§Panics
- If dim is out of bounds (dim > rank of the tensor)
§Examples
use train_station::Tensor;
// Add dimension at the beginning
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let unsqueezed = tensor.unsqueeze(0);
assert_eq!(unsqueezed.shape().dims(), vec![1, 3]);
assert_eq!(unsqueezed.get(&[0, 0]), 1.0);
assert_eq!(unsqueezed.get(&[0, 1]), 2.0);
assert_eq!(unsqueezed.get(&[0, 2]), 3.0);
use train_station::Tensor;
// Add dimension at the end
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let unsqueezed = tensor.unsqueeze(1);
assert_eq!(unsqueezed.shape().dims(), vec![3, 1]);
assert_eq!(unsqueezed.get(&[0, 0]), 1.0);
assert_eq!(unsqueezed.get(&[1, 0]), 2.0);
assert_eq!(unsqueezed.get(&[2, 0]), 3.0);
use train_station::Tensor;
// Add dimension in the middle of 2D tensor
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let unsqueezed = tensor.unsqueeze(1);
assert_eq!(unsqueezed.shape().dims(), vec![2, 1, 2]);
assert_eq!(unsqueezed.get(&[0, 0, 0]), 1.0);
assert_eq!(unsqueezed.get(&[0, 0, 1]), 2.0);
assert_eq!(unsqueezed.get(&[1, 0, 0]), 3.0);
assert_eq!(unsqueezed.get(&[1, 0, 1]), 4.0);
use train_station::Tensor;
// Unsqueeze preserves data integrity
let data = vec![1.0, 2.0, 3.0, 4.0];
let tensor = Tensor::from_slice(&data, vec![4]).unwrap();
let unsqueezed = tensor.unsqueeze(0);
assert_eq!(unsqueezed.shape().dims(), vec![1, 4]);
assert_eq!(unsqueezed.size(), 4);
for (i, &d) in data.iter().enumerate() {
assert_eq!(unsqueezed.get(&[0, i]), d);
}
use train_station::Tensor;
// Unsqueeze with gradient tracking
let mut tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
tensor.set_requires_grad(true);
let unsqueezed = tensor.unsqueeze(0);
assert!(unsqueezed.requires_grad());
assert_eq!(unsqueezed.shape().dims(), vec![1, 3]);
use train_station::Tensor;
// Unsqueeze and squeeze roundtrip
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let unsqueezed = tensor.unsqueeze(0);
assert_eq!(unsqueezed.shape().dims(), vec![1, 3]);
let squeezed = unsqueezed.squeeze(Some(0));
assert_eq!(squeezed.shape().dims(), vec![3]);
assert_eq!(squeezed.get(&[0]), 1.0);
assert_eq!(squeezed.get(&[2]), 3.0);
use train_station::Tensor;
// Multiple unsqueeze operations
let tensor = Tensor::from_slice(&[42.0], vec![1]).unwrap();
let unsqueezed1 = tensor.unsqueeze(0);
assert_eq!(unsqueezed1.shape().dims(), vec![1, 1]);
let unsqueezed2 = unsqueezed1.unsqueeze(0);
assert_eq!(unsqueezed2.shape().dims(), vec![1, 1, 1]);
assert_eq!(unsqueezed2.get(&[0, 0, 0]), 42.0);
§Performance
- Time Complexity: O(1) - Returns a view through reshape operation
- Memory Usage: No additional memory allocation (view operation)
- Gradient Tracking: Preserves gradient requirements and tracking
- Shape Transformation: Increases tensor rank by adding singleton dimensions
§Relationship to Other Operations
This operation is related to other tensor transformations:
- squeeze() - Inverse operation that removes size-1 dimensions
- reshape() - More general shape transformation operation
- expand() - Broadcasts dimensions to larger sizes
§Memory Layout
The unsqueezed tensor maintains the same underlying data as the original tensor through the reshape operation. This ensures zero-copy behavior when the tensor is contiguous, with only the shape metadata being modified to reflect the increased dimensionality.
§Broadcasting Applications
Unsqueeze is commonly used for broadcasting operations:
use train_station::Tensor;
// Prepare for broadcasting: [3] -> [1, 3] for row-wise operations
let vector = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let row_vector = vector.unsqueeze(0); // Shape: [1, 3]
// Prepare for broadcasting: [3] -> [3, 1] for column-wise operations
let column_vector = vector.unsqueeze(1); // Shape: [3, 1]
§Neural Network Applications
Unsqueeze is essential for neural network operations:
use train_station::Tensor;
// Single sample -> batch dimension for neural network input
let sample = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let batch = sample.unsqueeze(0); // Shape: [1, 3] for batch processing
// Add channel dimension for convolutional operations
let feature_map = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0], vec![2, 2]).unwrap();
let with_channels = feature_map.unsqueeze(0); // Shape: [1, 2, 2] for conv layers
Trait Implementations§
Source§impl Add<Tensor> for f32
Scalar-tensor addition operator implementations
impl Add<Tensor> for f32
Scalar-tensor addition operator implementations
Provides addition operations between scalars and tensors.
All implementations delegate to the underlying add_scalar method.
Source§impl Add<f32> for Tensor
Tensor-scalar addition operator implementations
impl Add<f32> for Tensor
Tensor-scalar addition operator implementations
Provides addition operations between tensors and scalars.
All implementations delegate to the underlying add_scalar method.
Source§impl Add for Tensor
Tensor addition operator implementations
impl Add for Tensor
Tensor addition operator implementations
Provides addition operations between tensors with various reference combinations.
All implementations delegate to the underlying add_tensor method for optimal performance.
Source§impl AddAssign<&Tensor> for Tensor
impl AddAssign<&Tensor> for Tensor
Source§fn add_assign(&mut self, other: &Tensor)
fn add_assign(&mut self, other: &Tensor)
Adds another tensor reference to this tensor in-place
Source§impl AddAssign<f32> for Tensor
Tensor-scalar addition assignment operator implementations
impl AddAssign<f32> for Tensor
Tensor-scalar addition assignment operator implementations
Provides in-place addition operations between tensors and scalars.
Source§fn add_assign(&mut self, scalar: f32)
fn add_assign(&mut self, scalar: f32)
Adds a scalar to each element of this tensor in-place
Source§impl AddAssign for Tensor
Tensor addition assignment operator implementations
impl AddAssign for Tensor
Tensor addition assignment operator implementations
Provides in-place addition operations between tensors.
All implementations delegate to the underlying add_tensor method.
Source§fn add_assign(&mut self, other: Tensor)
fn add_assign(&mut self, other: Tensor)
Adds another tensor to this tensor in-place
Source§impl Clone for Tensor
Clone implementation for Tensor
impl Clone for Tensor
Clone implementation for Tensor
Creates a deep copy of the tensor data but resets gradtrack state (new tensor won’t track gradients unless explicitly set)
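A minimal sketch of this behavior (not taken from the crate's doc tests): the clone keeps the data but does not inherit gradient tracking.
use train_station::Tensor;
// Original tensor tracks gradients
let original = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap().with_requires_grad();
assert!(original.requires_grad());
// Clone copies the data but resets the gradtrack state
let copy = original.clone();
assert_eq!(copy.data(), original.data());
assert!(!copy.requires_grad());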
Source§impl Div<Tensor> for f32
Scalar-tensor division operator implementations
impl Div<Tensor> for f32
Scalar-tensor division operator implementations
Provides division operations between scalars and tensors.
Computes scalar / tensor by computing the reciprocal of the tensor and multiplying by the scalar.
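For illustration, a small sketch (not from the crate's doc tests) of the scalar / tensor form; powers of two are used so the reciprocal-based computation is exact:
use train_station::Tensor;
// Scalar divided by a tensor: each result element is scalar / element
let t = Tensor::from_slice(&[1.0, 2.0, 4.0], vec![3]).unwrap();
let result = 8.0 / t;
assert_eq!(result.data(), &[8.0, 4.0, 2.0]);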
Source§impl Div<f32> for Tensor
Tensor-scalar division operator implementations
impl Div<f32> for Tensor
Tensor-scalar division operator implementations
Provides division operations between tensors and scalars.
All implementations delegate to the underlying div_scalar method.
Source§impl Div for Tensor
Tensor division operator implementations
impl Div for Tensor
Tensor division operator implementations
Provides element-wise division operations between tensors with various reference combinations.
All implementations delegate to the underlying div_tensor method for optimal performance.
Source§impl DivAssign<&Tensor> for Tensor
impl DivAssign<&Tensor> for Tensor
Source§fn div_assign(&mut self, other: &Tensor)
fn div_assign(&mut self, other: &Tensor)
Divides this tensor by another tensor reference in-place
Source§impl DivAssign<f32> for Tensor
Tensor-scalar division assignment operator implementations
impl DivAssign<f32> for Tensor
Tensor-scalar division assignment operator implementations
Provides in-place division operations between tensors and scalars.
Source§fn div_assign(&mut self, scalar: f32)
fn div_assign(&mut self, scalar: f32)
Divides each element of this tensor by a scalar in-place
Source§impl DivAssign for Tensor
Tensor division assignment operator implementations
impl DivAssign for Tensor
Tensor division assignment operator implementations
Provides in-place division operations between tensors.
All implementations delegate to the underlying div_tensor method.
Source§fn div_assign(&mut self, other: Tensor)
fn div_assign(&mut self, other: Tensor)
Divides this tensor by another tensor in-place
Source§impl FromFieldValue for Tensor
impl FromFieldValue for Tensor
Source§fn from_field_value(
value: FieldValue,
field_name: &str,
) -> SerializationResult<Self>
fn from_field_value( value: FieldValue, field_name: &str, ) -> SerializationResult<Self>
Source§impl FromIterator<Tensor> for Tensor
impl FromIterator<Tensor> for Tensor
Source§fn from_iter<I: IntoIterator<Item = Tensor>>(iter: I) -> Self
fn from_iter<I: IntoIterator<Item = Tensor>>(iter: I) -> Self
Collect element view tensors back into a single tensor
This method reconstructs a tensor from an iterator of element view tensors. It includes optimizations for common patterns and maintains gradient tracking when appropriate.
The collection process automatically detects whether all elements are scalar
views (shape [1]) and uses optimized collection strategies accordingly.
Gradient tracking is preserved when any input element requires gradients.
§Performance
- Optimized Collection: Specialized paths for scalar and mixed views
- Memory Efficient: Direct memory copying without intermediate allocations
- Gradient Preservation: Maintains gradtrack functionality when enabled
- Shape Detection: Automatic detection of element shapes for optimization
§Implementation Details
The method performs the following steps:
- Element Collection: Gathers all element tensors from the iterator
- Shape Analysis: Determines if all elements are scalar views
- Optimized Path: Uses specialized collection for scalar views
- General Path: Handles mixed shapes by flattening into 1D tensor
- Gradient Setup: Preserves gradient tracking when appropriate
§Examples
§Basic Collection
use train_station::Tensor;
let original = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let doubled: Tensor = original.iter()
.map(|elem| elem.mul_scalar(2.0))
.collect();
assert_eq!(doubled.data(), &[2.0, 4.0, 6.0]);§Collection with Gradient Tracking
use train_station::Tensor;
let original = Tensor::from_slice(&[1.0, 2.0], vec![2])
.unwrap()
.with_requires_grad();
let result: Tensor = original.iter()
.map(|elem| elem.mul_scalar(2.0))
.collect();
assert!(result.requires_grad());
assert_eq!(result.data(), &[2.0, 4.0]);§Empty Iterator Handling
use train_station::Tensor;
let empty: Tensor = Vec::<Tensor>::new().into_iter().collect();
assert_eq!(empty.size(), 0);
assert_eq!(empty.shape().dims(), vec![0]);Source§impl FromIterator<f32> for Tensor
impl FromIterator<f32> for Tensor
Source§fn from_iter<I: IntoIterator<Item = f32>>(iter: I) -> Self
fn from_iter<I: IntoIterator<Item = f32>>(iter: I) -> Self
Collect f32 values into a 1D contiguous, SIMD-aligned Tensor
- Streams directly when iterator reports exact size_hint
- Falls back to temporary Vec and optimized_copy otherwise
- No gradient tracking is set on the result
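A small usage sketch (illustrative values, not from the crate's doc tests) of collecting plain f32 values into a tensor:
use train_station::Tensor;
// Collect an exact-size iterator directly into a 1D tensor
let tensor: Tensor = (0..4).map(|i| i as f32).collect();
assert_eq!(tensor.shape().dims(), vec![4]);
assert_eq!(tensor.data(), &[0.0, 1.0, 2.0, 3.0]);
// The collected tensor does not track gradients by default
assert!(!tensor.requires_grad());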
Source§impl<'a> IntoIterator for &'a Tensor
High-performance iterator over tensor elements as view tensors
impl<'a> IntoIterator for &'a Tensor
High-performance iterator over tensor elements as view tensors
Each element becomes a proper Tensor view of shape [1] that can use
all existing tensor operations and gradient tracking. Implements all
standard iterator traits for maximum compatibility with Rust’s ecosystem.
This iterator provides zero-copy access to tensor elements through view tensors, enabling efficient element-wise operations while maintaining full compatibility with Rust’s standard library iterator methods.
§Performance
- Zero-Copy Views: Each element is a view tensor sharing memory with source
- O(1) Element Access: Constant-time view creation for each element
- Memory Efficient: ~64 bytes overhead per element view
- SIMD Compatible: All tensor operations use existing optimizations
- Gradient Tracking: Full gradtrack support through element operations
§Implementation Details
The iterator creates lightweight view tensors on-demand, sharing the same memory allocation as the source tensor. This ensures zero-copy semantics while maintaining full tensor operation compatibility.
Each element view is created using Tensor::element_view(), which provides
a true view of the underlying data without any copying. The view tensors
support all standard tensor operations including gradient tracking.
§Standard Library Compatibility
This iterator implements all standard iterator traits:
- Iterator: Basic iteration with next() and size_hint()
- ExactSizeIterator: Precise size information with len()
- DoubleEndedIterator: Reverse iteration with next_back()
- FusedIterator: Fused iteration for better performance
- IntoIterator: Automatic conversion for for loops
§Examples
§Basic Iteration
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
// Basic iteration
for element in tensor.iter() {
println!("Element value: {}", element.value());
}
// Standard library methods
let sum: f32 = tensor.iter()
.map(|elem| elem.value())
.sum();
assert_eq!(sum, 6.0);
§Element Operations
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
// Tensor operations on elements
let transformed: Tensor = tensor.iter()
.map(|elem| elem.mul_scalar(2.0).add_scalar(1.0)) // 2x + 1
.collect();
assert_eq!(transformed.data(), &[3.0, 5.0, 7.0]);
§Advanced Iterator Methods
use train_station::Tensor;
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0], vec![5]).unwrap();
// Filter and transform
let result: Tensor = tensor.iter()
.filter(|elem| elem.value() > 2.0)
.map(|elem| elem.mul_scalar(10.0))
.collect();
assert_eq!(result.data(), &[30.0, 40.0, 50.0]);
// Reverse iteration
let reversed: Tensor = tensor.iter().rev().collect();
assert_eq!(reversed.data(), &[5.0, 4.0, 3.0, 2.0, 1.0]);
IntoIterator for &Tensor now iterates outermost dimension, yielding sub-tensors (views)
Source§impl IntoIterator for Tensor
IntoIterator for owned Tensor: iterate outermost dimension producing sub-tensors.
Enables .into_iter().flatten() patterns on owned tensors.
impl IntoIterator for Tensor
IntoIterator for owned Tensor: iterate outermost dimension producing sub-tensors.
Enables .into_iter().flatten() patterns on owned tensors.
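A sketch of this pattern (not from the crate's doc tests; the exact sub-tensor shapes are an assumption based on the outermost-dimension description above):
use train_station::Tensor;
// Iterate the outermost dimension of an owned [2, 3] tensor
let tensor = Tensor::from_slice(&[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], vec![2, 3]).unwrap();
let rows: Vec<Tensor> = tensor.into_iter().collect();
// One sub-tensor per slot of the outermost dimension
assert_eq!(rows.len(), 2);
// Each sub-tensor is assumed to hold that row's three elements
assert_eq!(rows[0].size(), 3);
assert_eq!(rows[1].size(), 3);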
Source§impl Mul<Tensor> for f32
Scalar-tensor multiplication operator implementations
impl Mul<Tensor> for f32
Scalar-tensor multiplication operator implementations
Provides multiplication operations between scalars and tensors.
All implementations delegate to the underlying mul_scalar method.
Source§impl Mul<f32> for Tensor
Tensor-scalar multiplication operator implementations
impl Mul<f32> for Tensor
Tensor-scalar multiplication operator implementations
Provides multiplication operations between tensors and scalars.
All implementations delegate to the underlying mul_scalar method.
Source§impl Mul for Tensor
Tensor multiplication operator implementations
impl Mul for Tensor
Tensor multiplication operator implementations
Provides element-wise multiplication operations between tensors with various reference combinations.
All implementations delegate to the underlying mul_tensor method for optimal performance.
Source§impl MulAssign<&Tensor> for Tensor
impl MulAssign<&Tensor> for Tensor
Source§fn mul_assign(&mut self, other: &Tensor)
fn mul_assign(&mut self, other: &Tensor)
Multiplies this tensor by another tensor reference in-place
Source§impl MulAssign<f32> for Tensor
Tensor-scalar multiplication assignment operator implementations
impl MulAssign<f32> for Tensor
Tensor-scalar multiplication assignment operator implementations
Provides in-place multiplication operations between tensors and scalars.
Source§fn mul_assign(&mut self, scalar: f32)
fn mul_assign(&mut self, scalar: f32)
Multiplies each element of this tensor by a scalar in-place
Source§impl MulAssign for Tensor
Tensor multiplication assignment operator implementations
impl MulAssign for Tensor
Tensor multiplication assignment operator implementations
Provides in-place multiplication operations between tensors.
All implementations delegate to the underlying mul_tensor method.
Source§fn mul_assign(&mut self, other: Tensor)
fn mul_assign(&mut self, other: Tensor)
Multiplies this tensor by another tensor in-place
Source§impl Neg for Tensor
Tensor negation operator implementations
impl Neg for Tensor
Tensor negation operator implementations
Provides unary negation operations for tensors.
All implementations delegate to the underlying mul_scalar method with -1.0.
Source§impl Serializable for Tensor
impl Serializable for Tensor
Source§fn to_json(&self) -> SerializationResult<String>
fn to_json(&self) -> SerializationResult<String>
Serialize the tensor to JSON format
This method converts the tensor into a human-readable JSON string representation that includes all tensor data, shape information, device placement, and gradtrack state. The JSON format is suitable for debugging, configuration files, and cross-language interoperability.
§Returns
JSON string representation of the tensor on success, or SerializationError on failure
§Examples
use train_station::Tensor;
use train_station::serialization::Serializable;
let mut tensor = Tensor::zeros(vec![2, 3]);
tensor.set(&[0, 0], 1.0);
tensor.set(&[1, 2], 5.0);
let json = tensor.to_json().unwrap();
assert!(!json.is_empty());
assert!(json.contains("data"));
assert!(json.contains("shape"));Source§fn from_json(json: &str) -> SerializationResult<Self>
fn from_json(json: &str) -> SerializationResult<Self>
Deserialize a tensor from JSON format
This method parses a JSON string and reconstructs a tensor with all its data, shape information, device placement, and gradtrack state. The JSON must contain all necessary fields in the expected format.
§Arguments
- json - JSON string containing serialized tensor data
§Returns
The deserialized tensor on success, or SerializationError on failure
§Examples
use train_station::Tensor;
use train_station::serialization::Serializable;
let mut original = Tensor::ones(vec![2, 2]);
original.set(&[0, 1], 3.0);
original.set_requires_grad(true);
let json = original.to_json().unwrap();
let restored = Tensor::from_json(&json).unwrap();
assert_eq!(original.shape().dims(), restored.shape().dims());
assert_eq!(original.get(&[0, 1]), restored.get(&[0, 1]));
assert_eq!(original.requires_grad(), restored.requires_grad());
Source§fn to_binary(&self) -> SerializationResult<Vec<u8>>
fn to_binary(&self) -> SerializationResult<Vec<u8>>
Serialize the tensor to binary format
This method converts the tensor into a compact binary representation optimized for storage and transmission. The binary format provides maximum performance and minimal file sizes, making it ideal for large tensors and production use.
§Returns
Binary representation of the tensor on success, or SerializationError on failure
§Examples
use train_station::Tensor;
use train_station::serialization::Serializable;
let mut tensor = Tensor::zeros(vec![100, 100]);
for i in 0..10 {
tensor.set(&[i, i], i as f32);
}
let binary = tensor.to_binary().unwrap();
assert!(!binary.is_empty());
// Binary format is more compact than JSON for large tensors
Source§fn from_binary(data: &[u8]) -> SerializationResult<Self>
fn from_binary(data: &[u8]) -> SerializationResult<Self>
Deserialize a tensor from binary format
This method parses binary data and reconstructs a tensor with all its data, shape information, device placement, and gradtrack state. The binary data must contain complete serialized information in the expected format.
§Arguments
- data - Binary data containing serialized tensor information
§Returns
The deserialized tensor on success, or SerializationError on failure
§Examples
use train_station::Tensor;
use train_station::serialization::Serializable;
let mut original = Tensor::ones(vec![3, 4]);
original.set(&[2, 3], 7.5);
original.set_requires_grad(true);
let binary = original.to_binary().unwrap();
let restored = Tensor::from_binary(&binary).unwrap();
assert_eq!(original.shape().dims(), restored.shape().dims());
assert_eq!(original.get(&[2, 3]), restored.get(&[2, 3]));
assert_eq!(original.requires_grad(), restored.requires_grad());
Source§fn save<P: AsRef<Path>>(
&self,
path: P,
format: Format,
) -> SerializationResult<()>
fn save<P: AsRef<Path>>( &self, path: P, format: Format, ) -> SerializationResult<()>
Source§fn save_to_writer<W: Write>(
&self,
writer: &mut W,
format: Format,
) -> SerializationResult<()>
fn save_to_writer<W: Write>( &self, writer: &mut W, format: Format, ) -> SerializationResult<()>
Source§fn load<P: AsRef<Path>>(path: P, format: Format) -> SerializationResult<Self>
fn load<P: AsRef<Path>>(path: P, format: Format) -> SerializationResult<Self>
Source§fn load_from_reader<R: Read>(
reader: &mut R,
format: Format,
) -> SerializationResult<Self>
fn load_from_reader<R: Read>( reader: &mut R, format: Format, ) -> SerializationResult<Self>
Source§impl StructSerializable for Tensor
impl StructSerializable for Tensor
Source§fn to_serializer(&self) -> StructSerializer
fn to_serializer(&self) -> StructSerializer
Convert Tensor to StructSerializer for serialization
Serializes tensor data, shape, device, and gradtrack state. Runtime state (id, grad, grad_fn, allocation_owner) is not serialized.
§Returns
StructSerializer containing all persistent tensor state
Source§fn from_deserializer(
deserializer: &mut StructDeserializer,
) -> SerializationResult<Self>
fn from_deserializer( deserializer: &mut StructDeserializer, ) -> SerializationResult<Self>
Create Tensor from StructDeserializer
Reconstructs tensor from serialized data, shape, device, and gradtrack state. Allocates new memory and generates new tensor ID.
§Arguments
- deserializer - StructDeserializer containing tensor data
§Returns
Reconstructed Tensor instance or error if deserialization fails
Source§fn save_json<P: AsRef<Path>>(&self, path: P) -> SerializationResult<()>
fn save_json<P: AsRef<Path>>(&self, path: P) -> SerializationResult<()>
Source§fn save_binary<P: AsRef<Path>>(&self, path: P) -> SerializationResult<()>
fn save_binary<P: AsRef<Path>>(&self, path: P) -> SerializationResult<()>
Source§fn load_json<P: AsRef<Path>>(path: P) -> SerializationResult<Self>
fn load_json<P: AsRef<Path>>(path: P) -> SerializationResult<Self>
Source§fn load_binary<P: AsRef<Path>>(path: P) -> SerializationResult<Self>
fn load_binary<P: AsRef<Path>>(path: P) -> SerializationResult<Self>
Source§fn to_json(&self) -> SerializationResult<String>
fn to_json(&self) -> SerializationResult<String>
Source§fn to_binary(&self) -> SerializationResult<Vec<u8>>
fn to_binary(&self) -> SerializationResult<Vec<u8>>
Source§fn from_json(json: &str) -> SerializationResult<Self>
fn from_json(json: &str) -> SerializationResult<Self>
Source§fn from_binary(data: &[u8]) -> SerializationResult<Self>
fn from_binary(data: &[u8]) -> SerializationResult<Self>
Source§impl Sub<Tensor> for f32
Scalar-tensor subtraction operator implementations
impl Sub<Tensor> for f32
Scalar-tensor subtraction operator implementations
Provides subtraction operations between scalars and tensors.
Computes scalar - tensor by negating the tensor and adding the scalar.
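A small sketch (not from the crate's doc tests) of the scalar - tensor form:
use train_station::Tensor;
// Scalar minus tensor: each result element is scalar - element
let t = Tensor::from_slice(&[1.0, 2.0, 3.0], vec![3]).unwrap();
let result = 10.0 - t;
assert_eq!(result.data(), &[9.0, 8.0, 7.0]);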
Source§impl Sub<f32> for Tensor
Tensor-scalar subtraction operator implementations
impl Sub<f32> for Tensor
Tensor-scalar subtraction operator implementations
Provides subtraction operations between tensors and scalars.
All implementations delegate to the underlying sub_scalar method.
Source§impl Sub for Tensor
Tensor subtraction operator implementations
impl Sub for Tensor
Tensor subtraction operator implementations
Provides subtraction operations between tensors with various reference combinations.
All implementations delegate to the underlying sub_tensor method for optimal performance.
Source§impl SubAssign<&Tensor> for Tensor
impl SubAssign<&Tensor> for Tensor
Source§fn sub_assign(&mut self, other: &Tensor)
fn sub_assign(&mut self, other: &Tensor)
Subtracts another tensor reference from this tensor in-place
Source§impl SubAssign<f32> for Tensor
Tensor-scalar subtraction assignment operator implementations
impl SubAssign<f32> for Tensor
Tensor-scalar subtraction assignment operator implementations
Provides in-place subtraction operations between tensors and scalars.
Source§fn sub_assign(&mut self, scalar: f32)
fn sub_assign(&mut self, scalar: f32)
Subtracts a scalar from each element of this tensor in-place
Source§impl SubAssign for Tensor
Tensor subtraction assignment operator implementations
impl SubAssign for Tensor
Tensor subtraction assignment operator implementations
Provides in-place subtraction operations between tensors.
All implementations delegate to the underlying sub_tensor method.
Source§fn sub_assign(&mut self, other: Tensor)
fn sub_assign(&mut self, other: Tensor)
Subtracts another tensor from this tensor in-place