Module speculative_decoding


Speculative decoding utilities for draft/target verification.
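
As a rough illustration of what draft/target verification means, the sketch below shows one common greedy acceptance rule: keep the longest prefix of draft tokens that matches the target model's own choices, then append the target's token at the first disagreement (or its extra token if everything matched). The function `greedy_verify` and its slice-based signature are hypothetical and are not part of this module's API; the actual `SpeculativeDecoder` and `VerificationStrategy` types may work differently, for example over token trees or with sampling-based acceptance.

```rust
/// Hypothetical helper for illustration only; not part of this module.
///
/// `draft` holds the tokens proposed by the draft model.
/// `target` holds the target model's greedy choice at each of those
/// positions, plus one extra token for the position after the last
/// draft token, so `target.len()` must equal `draft.len() + 1`.
fn greedy_verify(draft: &[u32], target: &[u32]) -> Vec<u32> {
    assert_eq!(
        target.len(),
        draft.len() + 1,
        "target must supply one more token than the draft"
    );

    // Count how many leading draft tokens agree with the target.
    let accepted = draft
        .iter()
        .zip(target)
        .take_while(|(d, t)| d == t)
        .count();

    // Keep the agreed prefix, then take the target's token at the first
    // mismatch (or its bonus token when every draft token was accepted).
    let mut out = draft[..accepted].to_vec();
    out.push(target[accepted]);
    out
}
```

With a draft of `[1, 2, 3]` and target choices `[1, 2, 9, 4]`, the first two draft tokens are accepted and the target's `9` replaces the rejected third token, so each verification step yields at least one token even when the draft is entirely wrong.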

Structs

PredictionConfig
Configuration for the prediction head.
SpeculativeCandidates
Batch of speculative trees.
SpeculativeDecoder
Speculative decoder for draft/target model verification.
SpeculativeToken
Token entry in the speculative tree.
SpeculativeTree
Speculative token tree arranged by depth levels.
SpeculativeVerification
Verification output with accepted tokens and updated cache.
TreeConfig
Configuration for speculative tree expansion.

Enums

PredictionHeadType
Prediction head choices.
VerificationStrategy
Verification strategy for speculative decoding.