pub struct QuantizePerTokenArgs<'a, TIn: Element, TOut: IntElement> {
pub input: TensorRef<'a, TIn, 2>,
pub scale: TensorRef<'a, TIn, 1>,
pub zero_point: TensorRef<'a, i32, 1>,
pub output: TensorMut<'a, TOut, 2>,
}Expand description
Args bundle for a quantize_per_token forward launch.
Fields§
§input: TensorRef<'a, TIn, 2>Input [N, D] in FP.
scale: TensorRef<'a, TIn, 1>Per-row scale [N] in FP.
zero_point: TensorRef<'a, i32, 1>Per-row zero-point [N] in i32.
output: TensorMut<'a, TOut, 2>Output [N, D] in int.
Auto Trait Implementations§
impl<'a, TIn, TOut> !UnwindSafe for QuantizePerTokenArgs<'a, TIn, TOut>
impl<'a, TIn, TOut> Freeze for QuantizePerTokenArgs<'a, TIn, TOut>
impl<'a, TIn, TOut> RefUnwindSafe for QuantizePerTokenArgs<'a, TIn, TOut>where
TIn: RefUnwindSafe,
TOut: RefUnwindSafe,
impl<'a, TIn, TOut> Send for QuantizePerTokenArgs<'a, TIn, TOut>
impl<'a, TIn, TOut> Sync for QuantizePerTokenArgs<'a, TIn, TOut>
impl<'a, TIn, TOut> Unpin for QuantizePerTokenArgs<'a, TIn, TOut>
impl<'a, TIn, TOut> UnsafeUnpin for QuantizePerTokenArgs<'a, TIn, TOut>
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more