pub fn greedy_sample(logits: &Tensor) -> Result<i64, Error>
Greedy sampling: always returns the most likely token, i.e. the index of the largest logit.
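
To illustrate the argmax rule behind greedy sampling, here is a minimal sketch that works on a plain `&[f32]` slice rather than a `Tensor`. The name `greedy_sample_slice`, the `Option` return type, and the tie-breaking and NaN handling are assumptions made for this example; the source does not say how `greedy_sample` handles those cases.

```rust
/// Hypothetical stand-in for `greedy_sample`, operating on a plain slice
/// of logits instead of a `Tensor` (an assumption for illustration only).
///
/// Returns the index of the largest logit as a token id, or `None` if the
/// slice is empty or contains only NaN values.
pub fn greedy_sample_slice(logits: &[f32]) -> Option<i64> {
    // Best (index, value) pair seen so far.
    let mut best: Option<(usize, f32)> = None;

    for (i, &v) in logits.iter().enumerate() {
        // Skip NaN so it can never be selected.
        if v.is_nan() {
            continue;
        }
        // Strict `>` keeps the earliest index when values tie.
        let is_better = match best {
            Some((_, b)) => v > b,
            None => true,
        };
        if is_better {
            best = Some((i, v));
        }
    }

    best.map(|(i, _)| i as i64)
}
```

For example, calling `greedy_sample_slice(&[0.1, 2.5, -1.0])` selects index 1, the position of the largest logit. Because the choice is deterministic, the same logits always produce the same token.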