After applying softmax to the final layer output, sort the vocabulary in descending probability order. Select candidates until the cumulative probability exceeds a threshold
1 min read
After applying softmax to the final layer output, sort the vocabulary in descending probability order. Select candidates until the cumulative probability exceeds a threshold