jepa_vision::vit

Struct VitEncoder

pub struct VitEncoder<B: Backend> { /* private fields */ }

Vision Transformer encoder.

Maps images to patch-level representations via:

1. Patch embedding (linear projection of flattened patches)
2. 2D Rotary Position Encoding
3. Stack of transformer blocks (self-attention + MLP)
4. Final layer normalization

Output shape: [batch, num_patches, embed_dim]
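
As a rough illustration of where num_patches in the output shape comes from, the sketch below assumes square, non-overlapping patches and no extra class token. Neither assumption is confirmed by this page, and `patch_grid`, `output_shape`, and the `patch_size` parameter are hypothetical names, not part of jepa_vision.

```rust
/// Hypothetical helper: grid dimensions (rows, cols) for non-overlapping
/// square patches. Returns None when the image size is not evenly divisible
/// by the patch size.
fn patch_grid(height: usize, width: usize, patch_size: usize) -> Option<(usize, usize)> {
    if patch_size == 0 || height % patch_size != 0 || width % patch_size != 0 {
        return None;
    }
    Some((height / patch_size, width / patch_size))
}

/// Expected output shape [batch, num_patches, embed_dim] under the
/// assumptions stated above (one token per patch, no class token).
fn output_shape(
    batch: usize,
    height: usize,
    width: usize,
    patch_size: usize,
    embed_dim: usize,
) -> Option<[usize; 3]> {
    let (rows, cols) = patch_grid(height, width, patch_size)?;
    Some([batch, rows * cols, embed_dim])
}
```

Under these assumptions, a batch of two 224×224 images with 16×16 patches and an embedding width of 768 would give a shape of [2, 196, 768].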

Implementations

impl<B: Backend> VitEncoder<B>

pub fn forward(&self, images: &Tensor<B, 4>) -> Representation<B>

Forward pass: image → representation.

Arguments

images - Input images. Shape: [batch, channels, height, width]

Returns

Patch-level representations. Shape: [batch, num_patches, embed_dim]

pub fn forward_visible_tokens(&self, images: &Tensor<B, 4>, visible_indices: &[usize]) -> Representation<B>

Encode only the visible patch tokens for strict JEPA context encoding.

The image is patchified and position-encoded using the full grid so the surviving tokens retain their real flattened positions, then masked tokens are removed before self-attention runs.
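
The ordering described above (assign positions on the full grid first, drop masked tokens second) can be sketched without any tensor library. This is a simplified model and not the crate's implementation: tokens are plain slice elements, row-major flattening is assumed, and `grid_position` and `select_visible` are hypothetical names.

```rust
/// Hypothetical helper: convert a flattened patch index into (row, col)
/// on the full patch grid, assuming row-major flattening.
fn grid_position(index: usize, grid_cols: usize) -> (usize, usize) {
    (index / grid_cols, index % grid_cols)
}

/// Keep only the tokens listed in `visible_indices`, pairing each one with
/// the position it had on the full grid. Returns None if the grid has no
/// columns or any index is out of range.
fn select_visible<T: Clone>(
    tokens: &[T],
    grid_cols: usize,
    visible_indices: &[usize],
) -> Option<Vec<((usize, usize), T)>> {
    if grid_cols == 0 {
        return None;
    }
    visible_indices
        .iter()
        .map(|&i| tokens.get(i).map(|t| (grid_position(i, grid_cols), t.clone())))
        .collect()
}
```

Because each position is derived from the original flattened index and not from the token's rank among the survivors, a visible token keeps the same coordinates regardless of how many of its neighbours were masked.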

pub fn load_named_tensors(self, tensors: &HashMap<String, TensorData>) -> Result<Self, VitLoadError>

Load a ViT encoder from a map of burn-style parameter names to tensor data.

Expected parameter names match the burn module record layout, for example patch_embed.projection.weight and blocks.0.attn.out_proj.bias.
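
Before calling a loader like this, it can help to check that the map contains the names you expect. The sketch below is only an illustration. `Vec<f32>` stands in for `TensorData` so that it compiles without external crates, `out_proj_bias_name` and `missing_names` are hypothetical helpers, and the only name patterns used are the two quoted above. The full set of names the crate requires is not listed on this page.

```rust
use std::collections::HashMap;

/// Hypothetical helper: name of the attention output-projection bias for
/// block `block`, following the `blocks.0.attn.out_proj.bias` pattern.
fn out_proj_bias_name(block: usize) -> String {
    format!("blocks.{block}.attn.out_proj.bias")
}

/// Return the expected names that are absent from the map, in the order
/// given. Vec<f32> is a stand-in for TensorData in this sketch.
fn missing_names(tensors: &HashMap<String, Vec<f32>>, expected: &[String]) -> Vec<String> {
    expected
        .iter()
        .filter(|name| !tensors.contains_key(*name))
        .cloned()
        .collect()
}
```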

pub fn ema_update_from(self, online: &Self, ema: &Ema, step: usize) -> Self

Update this encoder toward an online encoder using EMA.

The returned encoder preserves the gradient setting of the target encoder parameters while detaching the blended tensors from any active autodiff graph.
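
For intuition, an exponential moving average update blends each target parameter toward the matching online parameter. The sketch below assumes a constant momentum and operates on plain `f32` slices. How the `Ema` type derives its momentum from `step` is not documented here, and `ema_blend` is a hypothetical name, not the crate's API.

```rust
/// Blend target parameters toward online parameters in place:
/// target = momentum * target + (1 - momentum) * online.
/// A momentum close to 1.0 makes the target change slowly.
fn ema_blend(target: &mut [f32], online: &[f32], momentum: f32) {
    assert_eq!(target.len(), online.len(), "parameter shapes must match");
    for (t, o) in target.iter_mut().zip(online) {
        *t = momentum * *t + (1.0 - momentum) * *o;
    }
}
```

Unlike this in-place sketch, `ema_update_from` consumes the target encoder and returns the updated one.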

Trait Implementations

impl<B> AutodiffModule<B> for VitEncoder<B>
where B: AutodiffBackend + Backend, <B as AutodiffBackend>::InnerBackend: Backend,

type InnerModule = VitEncoder<<B as AutodiffBackend>::InnerBackend>

Inner module without auto-differentiation.

fn valid(&self) -> Self::InnerModule

Get the same module, but on the inner backend without auto-differentiation.

impl<B: Backend> Clone for VitEncoder<B>

fn clone(&self) -> Self

Returns a duplicate of the value.

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.

impl<B: Debug + Backend> Debug for VitEncoder<B>

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.

impl<B: Backend> Display for VitEncoder<B>

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.

impl<B: Backend> Encoder<B> for VitEncoder<B>

type Input = Tensor<B, 4>

The type of input this encoder accepts.

fn encode(&self, input: &Self::Input) -> Representation<B>

Encode input into a representation.

fn embed_dim(&self) -> usize

Get the output embedding dimension.

impl<B: Backend> Module<B> for VitEncoder<B>

type Record = VitEncoderRecord<B>

Type to save and load the module.

fn load_record(self, record: Self::Record) -> Self

Load the module state from a record.

fn into_record(self) -> Self::Record

Convert the module into a record containing the state.

fn num_params(&self) -> usize

Get the number of parameters the module has, including all of its sub-modules.

fn visit<Visitor: ModuleVisitor<B>>(&self, visitor: &mut Visitor)

Visit each tensor parameter in the module with a visitor.

fn map<Mapper: ModuleMapper<B>>(self, mapper: &mut Mapper) -> Self

Map each tensor parameter in the module with a mapper.

fn collect_devices(&self, devices: Devices<B>) -> Devices<B>

Returns all devices found in the module tree, added to the given vector without duplicates.

fn to_device(self, device: &B::Device) -> Self

Move the module and all of its sub-modules to the given device.

fn fork(self, device: &B::Device) -> Self

Fork the module and all of its sub-modules to the given device.

fn devices(&self) -> Vec<<B as Backend>::Device>

Returns all devices found in the module tree, without duplicates.

fn no_grad(self) -> Self

Each tensor in the module tree will not require grad.

fn save_file<FR, PB>(self, file_path: PB, recorder: &FR) -> Result<(), RecorderError>
where FR: FileRecorder<B>, PB: Into<PathBuf>,

Save the module to a file using the provided file recorder.

fn load_file<FR, PB>(self, file_path: PB, recorder: &FR, device: &<B as Backend>::Device) -> Result<Self, RecorderError>
where FR: FileRecorder<B>, PB: Into<PathBuf>,

Load the module from a file using the provided file recorder.

fn quantize_weights(self, quantizer: &mut Quantizer) -> Self

Quantize the weights of the module.

impl<B: Backend> ModuleDisplay for VitEncoder<B>

fn format(&self, passed_settings: DisplaySettings) -> String

Formats the module with provided display settings.

fn custom_settings(&self) -> Option<DisplaySettings>

Custom display settings for the module.

fn custom_content(&self, _content: Content) -> Option<Content>

Custom attributes for the module.

impl<B: Backend> ModuleDisplayDefault for VitEncoder<B>

fn content(&self, content: Content) -> Option<Content>

Attributes of the module used for display purposes.

fn num_params(&self) -> usize

Gets the number of the parameters of the module.

Auto Trait Implementations

impl<B> !Freeze for VitEncoder<B>

impl<B> !RefUnwindSafe for VitEncoder<B>

impl<B> Send for VitEncoder<B>

impl<B> !Sync for VitEncoder<B>

impl<B> Unpin for VitEncoder<B>
where <B as Backend>::FloatTensorPrimitive: Unpin, <B as Backend>::QuantizedTensorPrimitive: Unpin, <B as Backend>::Device: Unpin,

impl<B> UnsafeUnpin for VitEncoder<B>
where <B as Backend>::FloatTensorPrimitive: UnsafeUnpin, <B as Backend>::QuantizedTensorPrimitive: UnsafeUnpin, <B as Backend>::Device: UnsafeUnpin,

impl<B> !UnwindSafe for VitEncoder<B>

Blanket Implementations

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self.

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value.

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)

Performs copy-assignment from self to dest.

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper.

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> IntoComptime for T

fn comptime(self) -> Self

impl<T> ToOwned for T
where T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning.

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning.

impl<T> ToString for T
where T: Display + ?Sized,

fn to_string(&self) -> String

Converts the given value to a String.

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

fn vzip(self) -> V

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper.

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper.