pub struct QuantizePerTokenBackwardArgs<'a, TIn: Element> {
pub d_output: TensorRef<'a, TIn, 2>,
pub input: TensorRef<'a, TIn, 2>,
pub scale: TensorRef<'a, TIn, 1>,
pub zero_point: TensorRef<'a, i32, 1>,
pub d_input: TensorMut<'a, TIn, 2>,
}Expand description
Args bundle for the per-token BW launch.
Fields§
§d_output: TensorRef<'a, TIn, 2>Upstream gradient [N, D].
input: TensorRef<'a, TIn, 2>Saved input from FW (needed for the in-range mask) [N, D].
scale: TensorRef<'a, TIn, 1>Saved scale [N].
zero_point: TensorRef<'a, i32, 1>Saved zero-point [N].
d_input: TensorMut<'a, TIn, 2>Output dx [N, D].
Auto Trait Implementations§
impl<'a, TIn> !UnwindSafe for QuantizePerTokenBackwardArgs<'a, TIn>
impl<'a, TIn> Freeze for QuantizePerTokenBackwardArgs<'a, TIn>
impl<'a, TIn> RefUnwindSafe for QuantizePerTokenBackwardArgs<'a, TIn>where
TIn: RefUnwindSafe,
impl<'a, TIn> Send for QuantizePerTokenBackwardArgs<'a, TIn>
impl<'a, TIn> Sync for QuantizePerTokenBackwardArgs<'a, TIn>where
TIn: Sync,
impl<'a, TIn> Unpin for QuantizePerTokenBackwardArgs<'a, TIn>
impl<'a, TIn> UnsafeUnpin for QuantizePerTokenBackwardArgs<'a, TIn>
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more