pub struct Tensor<B: Backend> { /* private fields */ }
An n-dimensional array of numbers on a specific backend.
Tensors are the fundamental data type in Shrew. All neural network operations accept and return tensors.
§Type Parameter
B: Backend — the compute backend (e.g., CpuBackend, CudaBackend)
§Example
use shrew_core::Tensor;
use shrew_cpu::CpuBackend;
let a = Tensor::<CpuBackend>::from_slice(&[1.0, 2.0, 3.0, 4.0], (2, 2))?;
let b = Tensor::<CpuBackend>::ones((2, 2), DType::F32, &CpuDevice)?;
let c = a.add(&b)?;

§Implementations
impl<B: Backend> Tensor<B>
pub fn elem_count(&self) -> usize
Total number of elements.
pub fn is_contiguous(&self) -> bool
Whether this tensor is contiguous in memory.
pub fn is_variable(&self) -> bool
Whether this tensor tracks gradients.
pub fn storage(&self) -> RwLockReadGuard<'_, B::Storage>
Access the underlying storage (read lock).
pub fn update_data_inplace(&self, new_data: &[f64]) -> Result<()>
Update the underlying storage data in place.
This writes new_data directly into the existing Arc<RwLock<Storage>>,
so any other tensor sharing this storage (e.g., a clone held by a Module)
will also see the updated values.
This is the mechanism that makes optimizer parameter updates visible to model layers without needing to re-assign parameters.
§Safety (logical)
The new data must have the same number of elements and dtype as the current storage. The shape is not changed.
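The shared-storage behavior described above can be sketched in plain Rust, independent of Shrew's actual `Storage` type (the `write_shared` helper here is hypothetical, standing in for `update_data_inplace`):

```rust
use std::sync::{Arc, RwLock};

// Two "tensors" cloning the same Arc<RwLock<..>> both observe an
// in-place write, which is what makes optimizer updates visible to
// a Module holding its own clone of the parameter.
fn write_shared(storage: &Arc<RwLock<Vec<f64>>>, new_data: &[f64]) {
    let mut guard = storage.write().unwrap();
    // Same element count as the current storage, as required.
    assert_eq!(guard.len(), new_data.len());
    guard.copy_from_slice(new_data);
}

fn main() {
    let param = Arc::new(RwLock::new(vec![1.0, 2.0]));
    let clone_held_by_module = Arc::clone(&param);
    write_shared(&param, &[0.9, 1.9]); // optimizer step
    assert_eq!(*clone_held_by_module.read().unwrap(), vec![0.9, 1.9]);
}
```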
pub fn zeros(shape: impl Into<Shape>, dtype: DType, device: &B::Device) -> Result<Self>
Create a tensor filled with zeros.
pub fn ones(shape: impl Into<Shape>, dtype: DType, device: &B::Device) -> Result<Self>
Create a tensor filled with ones.
pub fn full(shape: impl Into<Shape>, val: f64, dtype: DType, device: &B::Device) -> Result<Self>
Create a tensor filled with a constant value.
pub fn from_f64_slice(data: &[f64], shape: impl Into<Shape>, dtype: DType, device: &B::Device) -> Result<Self>
Create a tensor from a flat slice of f64 values. The data is converted to the specified dtype.
pub fn rand(shape: impl Into<Shape>, dtype: DType, device: &B::Device) -> Result<Self>
Create a tensor with random uniform values in [0, 1).
pub fn randn(shape: impl Into<Shape>, dtype: DType, device: &B::Device) -> Result<Self>
Create a tensor with random normal values (mean=0, std=1).
pub fn linspace(start: f64, end: f64, steps: usize, dtype: DType, device: &B::Device) -> Result<Self>
Create a 1-D tensor with steps evenly spaced values from start to end (inclusive).
let t = Tensor::linspace(0.0, 1.0, 5, DType::F64, &dev)?;
// => [0.0, 0.25, 0.5, 0.75, 1.0]

pub fn eye(n: usize, dtype: DType, device: &B::Device) -> Result<Self>
Create an identity matrix of size n × n.
let I = Tensor::eye(3, DType::F64, &dev)?;
// [[1, 0, 0],
// [0, 1, 0],
// [0, 0, 1]]

pub fn zeros_like(other: &Self) -> Result<Self>
Create a tensor of zeros with the same shape, dtype, and device as other.
pub fn ones_like(other: &Self) -> Result<Self>
Create a tensor of ones with the same shape, dtype, and device as other.
pub fn full_like(other: &Self, val: f64) -> Result<Self>
Create a tensor filled with val, with the same shape, dtype, and device as other.
pub fn set_variable(self) -> Self
Mark this tensor as a variable (trainable parameter). Variables accumulate gradients during backward().
pub fn transpose(&self, dim0: usize, dim1: usize) -> Result<Self>
Transpose two dimensions (no data copy).
pub fn narrow(&self, dim: usize, start: usize, len: usize) -> Result<Self>
Narrow (slice) along a dimension.
pub fn reshape(&self, new_shape: impl Into<Shape>) -> Result<Self>
Reshape to a new shape. The new shape must have the same total elements. If the tensor is not contiguous, it will be made contiguous first.
pub fn contiguous(&self) -> Result<Self>
Ensure the tensor is contiguous in memory. If already contiguous, returns a clone (cheap Arc copy). Otherwise, copies the data into a new contiguous storage.
pub fn unsqueeze(&self, dim: usize) -> Result<Self>
Add a dimension of size 1 at the given position.
unsqueeze(0) on [3, 4] → [1, 3, 4]
unsqueeze(2) on [3, 4] → [3, 4, 1]
pub fn squeeze_all(&self) -> Self
Remove all dimensions of size 1.
squeeze_all on [1, 3, 1, 4] → [3, 4]
pub fn squeeze(&self, dim: usize) -> Result<Self>
Remove a specific dimension of size 1.
squeeze(1) on [3, 1, 4] → [3, 4]
Returns an error if the specified dimension is not size 1.
pub fn permute(&self, dims: &[usize]) -> Result<Self>
Permute the dimensions of this tensor.
permute(&[2, 0, 1]) on [A, B, C] → [C, A, B]
This is a generalization of transpose to arbitrary dimension orderings. No data copy — just changes strides.
pub fn cumsum(&self, dim: usize) -> Result<Self>
Cumulative sum along dimension dim.
// [1, 2, 3] → [1, 3, 6]
let y = x.cumsum(0)?;

pub fn sort(&self, dim: usize, descending: bool) -> Result<(Self, Self)>
Sort along a dimension. Returns (sorted_values, sorted_indices).
let (vals, idxs) = x.sort(0, false)?; // ascending along dim 0

pub fn argsort(&self, dim: usize, descending: bool) -> Result<Self>
Argsort: returns indices that would sort the tensor along dim.
let indices = x.argsort(0, false)?; // ascending

pub fn eq(&self, rhs: &Self) -> Result<Self>
Element-wise equal: self == rhs. Returns a U8 tensor (0 or 1).
pub fn ne(&self, rhs: &Self) -> Result<Self>
Element-wise not-equal: self != rhs. Returns a U8 tensor (0 or 1).
pub fn gt(&self, rhs: &Self) -> Result<Self>
Element-wise greater-than: self > rhs. Returns a U8 tensor (0 or 1).
pub fn ge(&self, rhs: &Self) -> Result<Self>
Element-wise greater-or-equal: self >= rhs. Returns a U8 tensor (0 or 1).
pub fn lt(&self, rhs: &Self) -> Result<Self>
Element-wise less-than: self < rhs. Returns a U8 tensor (0 or 1).
pub fn le(&self, rhs: &Self) -> Result<Self>
Element-wise less-or-equal: self <= rhs. Returns a U8 tensor (0 or 1).
pub fn where_cond(mask: &Self, on_true: &Self, on_false: &Self) -> Result<Self>
Conditional select: result[i] = if mask[i] != 0 { on_true[i] } else { on_false[i] }.
mask is typically a U8 tensor from comparison ops.
on_true and on_false must have the same shape and dtype.
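The selection rule can be sketched over flat slices in plain Rust, independent of Shrew's storage types:

```rust
// Element-wise select, mirroring where_cond's documented rule:
// result[i] = if mask[i] != 0 { on_true[i] } else { on_false[i] }.
fn where_cond(mask: &[u8], on_true: &[f64], on_false: &[f64]) -> Vec<f64> {
    mask.iter()
        .zip(on_true.iter().zip(on_false))
        .map(|(&m, (&t, &f))| if m != 0 { t } else { f })
        .collect()
}

fn main() {
    let out = where_cond(&[1, 0, 1], &[10.0, 20.0, 30.0], &[-1.0, -2.0, -3.0]);
    assert_eq!(out, vec![10.0, -2.0, 30.0]);
}
```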
pub fn gather(&self, dim: usize, index: &Self) -> Result<Self>
Gather elements along dim using an index tensor.
output[i][j][k] = input[index[i][j][k]][j][k] (when dim=0)
The index tensor must have the same number of dimensions as self. The output has the same shape as the index tensor.
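The dim=0 formula above can be sketched for the 2-D, row-major case in plain Rust (the real kernel is n-dimensional and backend-specific; this only illustrates the indexing rule):

```rust
// 2-D gather along dim 0 on row-major data:
// output[i][j] = input[index[i][j]][j].
fn gather_dim0(input: &[f64], cols: usize, index: &[usize]) -> Vec<f64> {
    index
        .iter()
        .enumerate()
        .map(|(pos, &src_row)| {
            let j = pos % cols;       // column of this output element
            input[src_row * cols + j] // pick from the indexed row
        })
        .collect()
}

fn main() {
    // input = [[1, 2], [3, 4]], index = [[1, 0]] → output [[3, 2]]
    let out = gather_dim0(&[1.0, 2.0, 3.0, 4.0], 2, &[1, 0]);
    assert_eq!(out, vec![3.0, 2.0]);
}
```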
pub fn masked_fill(&self, mask: &Self, value: f64) -> Result<Self>
Fill elements where mask != 0 with value, keeping other elements.
result[i] = if mask[i] != 0 { value } else { self[i] }
This is implemented via where_cond so autograd is automatic.
pub fn pad(&self, padding: &[[usize; 2]], value: f64) -> Result<Self>
Pad the last N dimensions with constant value.
padding is a list of [before, after] pairs, one per dimension,
applied to the last dimensions of the tensor.
Example: pad(&[[1, 1], [2, 2]], 0.0) pads the last 2 dims.
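For a single dimension, the [before, after] rule can be sketched in plain Rust (1-D only; the real op applies one such pair per trailing dimension):

```rust
// Constant padding of a 1-D slice with [before, after] counts.
fn pad1d(x: &[f64], before: usize, after: usize, value: f64) -> Vec<f64> {
    let mut out = vec![value; before];      // leading pad
    out.extend_from_slice(x);               // original data
    out.extend(std::iter::repeat(value).take(after)); // trailing pad
    out
}

fn main() {
    assert_eq!(pad1d(&[1.0, 2.0], 1, 2, 0.0), vec![0.0, 1.0, 2.0, 0.0, 0.0]);
}
```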
pub fn topk(&self, k: usize, dim: usize) -> Result<(Self, Vec<usize>)>
Return the k largest elements along dim.
Returns (values, indices): values has the same shape as self except dimension dim has size k, and indices holds the positions of those values along dim.
Non-differentiable (returns detached values).
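The 1-D case can be sketched in plain Rust (sort index/value pairs descending by value, keep the first k), independent of Shrew's kernel:

```rust
// Top-k over a 1-D slice, returning (values, indices).
fn topk(x: &[f64], k: usize) -> (Vec<f64>, Vec<usize>) {
    let mut pairs: Vec<(usize, f64)> = x.iter().cloned().enumerate().collect();
    // Descending by value (assumes no NaNs).
    pairs.sort_by(|a, b| b.1.partial_cmp(&a.1).unwrap());
    pairs.truncate(k);
    (
        pairs.iter().map(|p| p.1).collect(),
        pairs.iter().map(|p| p.0).collect(),
    )
}

fn main() {
    let (vals, idxs) = topk(&[0.1, 0.7, 0.3], 2);
    assert_eq!(vals, vec![0.7, 0.3]);
    assert_eq!(idxs, vec![1, 2]);
}
```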
pub fn mean(&self, dim: usize, keep_dim: bool) -> Result<Self>
Mean along a specific dimension.
pub fn argmax(&self, dim: usize, keep_dim: bool) -> Result<Self>
ArgMax along a specific dimension (returns i64 indices).
pub fn argmin(&self, dim: usize, keep_dim: bool) -> Result<Self>
ArgMin along a specific dimension (returns i64 indices).
pub fn softmax(&self, dim: usize) -> Result<Self>
Softmax along a dimension: softmax(x)_i = exp(x_i) / sum(exp(x_j))
Uses the numerically stable trick: subtract max before exp. This is built from existing differentiable ops (exp, sum, div, sub) so gradients flow through automatically.
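The subtract-max trick can be sketched over a 1-D slice in plain Rust; the result is identical to the naive formula but avoids overflow in exp:

```rust
// Numerically stable softmax: subtract the max before exponentiating.
fn softmax(x: &[f64]) -> Vec<f64> {
    let max = x.iter().cloned().fold(f64::NEG_INFINITY, f64::max);
    let exps: Vec<f64> = x.iter().map(|&v| (v - max).exp()).collect();
    let sum: f64 = exps.iter().sum();
    exps.iter().map(|&e| e / sum).collect()
}

fn main() {
    let p = softmax(&[1.0, 2.0, 3.0]);
    // Probabilities sum to 1 and preserve the input's ordering.
    assert!((p.iter().sum::<f64>() - 1.0).abs() < 1e-12);
    assert!(p[2] > p[1] && p[1] > p[0]);
}
```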
pub fn log_softmax(&self, dim: usize) -> Result<Self>
Log-softmax along a dimension: log(softmax(x)) but numerically stable.
log_softmax(x)_i = x_i - max(x) - log(sum(exp(x - max(x))))
pub fn var(&self, dim: usize, keep_dim: bool) -> Result<Self>
Variance along a dimension: var(x) = mean((x - mean(x))²)
pub fn cat(tensors: &[Self], dim: usize) -> Result<Self>
Concatenate tensors along a dimension.
All tensors must have the same shape except in the concatenation dimension. This creates a new tensor by copying data from all inputs.
pub fn chunk(&self, n: usize, dim: usize) -> Result<Vec<Self>>
Split a tensor into n equal chunks along a dimension.
If the dimension size is not evenly divisible, the last chunk is smaller.
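The size bookkeeping implied by "last chunk is smaller" can be sketched in plain Rust, assuming ceil-division chunk sizes (an assumption; the crate may compute sizes differently):

```rust
// Chunk sizes for splitting a dimension of size `len` into `n` chunks:
// each chunk is ceil(len / n) long, with a smaller final chunk when
// `len` is not evenly divisible.
fn chunk_sizes(len: usize, n: usize) -> Vec<usize> {
    let chunk = (len + n - 1) / n; // ceil(len / n)
    let mut sizes = Vec::new();
    let mut remaining = len;
    while remaining > 0 {
        let s = chunk.min(remaining);
        sizes.push(s);
        remaining -= s;
    }
    sizes
}

fn main() {
    assert_eq!(chunk_sizes(10, 3), vec![4, 4, 2]); // last chunk smaller
    assert_eq!(chunk_sizes(9, 3), vec![3, 3, 3]);  // evenly divisible
}
```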
pub fn expand(&self, target_shape: impl Into<Shape>) -> Result<Self>
Expand a tensor to a larger shape by repeating data along size-1 dims. Only dims that are currently size 1 can be expanded. A size of -1 (usize::MAX) means don’t change that dim.
pub fn stack(tensors: &[Self], dim: usize) -> Result<Self>
Stack tensors along a new dimension.
All tensors must have the same shape. Inserts a new dimension at dim.
stack([a, b], dim=0) where a,b are shape [2,3] → [2, 2, 3].
pub fn arange(n: usize, dtype: DType, device: &B::Device) -> Result<Self>
Create a 1-D tensor with values [0, 1, …, n-1].
pub fn arange_step(start: f64, end: f64, step: f64, dtype: DType, device: &B::Device) -> Result<Self>
Create a 1-D tensor with values start, start+step, …, stopping before end (end is exclusive).
pub fn triu(n: usize, m: usize, diagonal: i64, dtype: DType, device: &B::Device) -> Result<Self>
Upper triangular mask: returns a 2-D tensor of shape [n, m] where
elements on and above the diagonal-th diagonal are 1.0, rest 0.0.
diagonal = 0 → main diagonal. diagonal > 0 → above. diagonal < 0 → below.
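The mask rule above amounts to cell (i, j) being 1.0 when j ≥ i + diagonal, which can be sketched as a flat row-major matrix in plain Rust:

```rust
// Upper-triangular mask as a flat row-major [n, m] matrix.
fn triu_mask(n: usize, m: usize, diagonal: i64) -> Vec<f64> {
    let mut out = vec![0.0; n * m];
    for i in 0..n {
        for j in 0..m {
            // On and above the `diagonal`-th diagonal → 1.0.
            if (j as i64) >= (i as i64) + diagonal {
                out[i * m + j] = 1.0;
            }
        }
    }
    out
}

fn main() {
    // triu(2, 2, 0) → [[1, 1], [0, 1]]
    assert_eq!(triu_mask(2, 2, 0), vec![1.0, 1.0, 0.0, 1.0]);
    // diagonal = 1 excludes the main diagonal
    assert_eq!(triu_mask(2, 2, 1), vec![0.0, 1.0, 0.0, 0.0]);
}
```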
pub fn tril(n: usize, m: usize, diagonal: i64, dtype: DType, device: &B::Device) -> Result<Self>
Lower triangular mask: returns a 2-D tensor of shape [n, m] where
elements on and below the diagonal-th diagonal are 1.0, rest 0.0.
pub fn matmul(&self, rhs: &Self) -> Result<Self>
Matrix multiplication: self @ rhs.
- [m, k] @ [k, n] → [m, n]
- Batched: [b, m, k] @ [b, k, n] → [b, m, n]
pub fn conv2d(&self, weight: &Self, bias: Option<&Self>, stride: [usize; 2], padding: [usize; 2]) -> Result<Self>
2D convolution: applies convolution filters to a 4D input tensor.
- self (input): [N, C_in, H, W]
- weight: [C_out, C_in, kH, kW]
- bias: optional [C_out]
- stride: [sH, sW]
- padding: [pH, pW]
Returns tensor of shape [N, C_out, H_out, W_out] where
H_out = (H + 2*pH - kH) / sH + 1.
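The output-size formula can be checked with a small plain-Rust helper (integer division, as in the formula above; W_out is computed the same way):

```rust
// Output spatial size of a 2D convolution:
// H_out = (H + 2*pH - kH) / sH + 1, and likewise for width.
fn conv2d_out_size(
    h: usize,
    w: usize,
    k: [usize; 2],
    s: [usize; 2],
    p: [usize; 2],
) -> (usize, usize) {
    let h_out = (h + 2 * p[0] - k[0]) / s[0] + 1;
    let w_out = (w + 2 * p[1] - k[1]) / s[1] + 1;
    (h_out, w_out)
}

fn main() {
    // 28x28 input, 3x3 kernel, stride 1, padding 1 → same 28x28
    assert_eq!(conv2d_out_size(28, 28, [3, 3], [1, 1], [1, 1]), (28, 28));
    // stride 2, no padding → 13x13
    assert_eq!(conv2d_out_size(28, 28, [3, 3], [2, 2], [0, 0]), (13, 13));
}
```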
pub fn max_pool2d(&self, kernel_size: [usize; 2], stride: [usize; 2], padding: [usize; 2]) -> Result<Self>
2D max pooling on a 4D input tensor [N, C, H, W].
Argmax positions (flat indices into the input) are recorded for the backward pass.
pub fn avg_pool2d(&self, kernel_size: [usize; 2], stride: [usize; 2], padding: [usize; 2]) -> Result<Self>
Apply 2D average pooling to a 4D tensor [N, C, H, W].
pub fn conv1d(&self, weight: &Self, bias: Option<&Self>, stride: usize, padding: usize) -> Result<Self>
Apply 1D convolution to a 3D tensor [N, C_in, L]. weight: [C_out, C_in, K]
pub fn affine(&self, mul: f64, add: f64) -> Result<Self>
Affine transform: result[i] = self[i] * mul + add. Useful for normalization and scaling.
pub fn to_f64_vec(&self) -> Result<Vec<f64>>
Extract all elements as a flat Vec<f64>.
pub fn to_scalar_f64(&self) -> Result<f64>
Extract a scalar value (tensor must have exactly 1 element).
pub fn to_dtype(&self, dtype: DType) -> Result<Self>
Convert this tensor to a different dtype.
Returns a new tensor with the same shape but different element type. Uses the backend’s on-device cast when available, avoiding host round-trips. Records Op::ToDtype so gradients flow back through dtype conversions.
pub fn to_string_with_data(&self) -> Result<String>
Display the tensor contents in a human-readable format.
pub fn backward(&self) -> Result<GradStore<B>>
Compute gradients via reverse-mode automatic differentiation.
This tensor must be a scalar (single element). Returns a GradStore containing gradients for all tensors in the computation graph.
§Example
let a = Tensor::from_f64_slice(&[2.0], 1, DType::F32, &dev)?.set_variable();
let b = Tensor::from_f64_slice(&[3.0], 1, DType::F32, &dev)?.set_variable();
let c = a.mul(&b)?;
let grads = c.backward()?;
// grad_a = b = 3.0, grad_b = a = 2.0

pub fn detach(&self) -> Self
Create a detached copy: same data but no gradient tracking. The new tensor has Op::None and a fresh TensorId.
pub fn freeze(&self) -> Self
Freeze this tensor: same data and id, but is_variable = false.
Frozen tensors do NOT accumulate gradients during backward().
This is the functional equivalent of PyTorch’s param.requires_grad_(False).
pub fn unfreeze(&self) -> Self
Unfreeze this tensor: same data and id, but is_variable = true.
This is the opposite of freeze().
pub fn index_select(&self, dim: usize, indices: &Self) -> Result<Self>
Select entries along dim using the given 1-D index tensor.
The output has the same rank, with dim resized to indices.len().
Wraps the Backend::index_select kernel.
pub fn split(&self, split_size: usize, dim: usize) -> Result<Vec<Self>>
Split a tensor into chunks of split_size along dim.
The last chunk may be smaller if the dimension is not evenly divisible.
pub fn flatten(&self, start_dim: usize, end_dim: usize) -> Result<Self>
Flatten dimensions start_dim..=end_dim into a single dimension.
Negative-style indexing is not supported; both bounds are inclusive and zero-based.
pub fn std(&self, dim: usize, keep_dim: bool) -> Result<Self>
Standard deviation along a dimension.
Computed as sqrt(var(x, dim)).
pub fn reciprocal(&self) -> Result<Self>
Element-wise reciprocal: 1 / x.
pub fn sign(&self) -> Result<Self>
Element-wise sign: returns -1, 0, or +1.
Implemented via x / (|x| + eps) clamped to [-1, 1], with exact 0 for
inputs that are exactly zero.
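The documented behavior (including exact 0 for exact zeros) can be sketched directly in plain Rust, without the eps-based formulation the kernel uses internally:

```rust
// Element-wise sign: -1, 0, or +1, with exact 0 for exact zeros.
fn sign(x: &[f64]) -> Vec<f64> {
    x.iter()
        .map(|&v| {
            if v == 0.0 {
                0.0
            } else if v > 0.0 {
                1.0
            } else {
                -1.0
            }
        })
        .collect()
}

fn main() {
    assert_eq!(sign(&[-2.5, 0.0, 3.0]), vec![-1.0, 0.0, 1.0]);
}
```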