Definition
Top-p is a sampling parameter that limits the model's choice to tokens whose cumulative probability does not exceed the value p. With top-p=0.9, the model considers only tokens covering 90% of the total probability, excluding improbable options. Together with temperature, it allows balancing creativity and reliability in AI responses.
Related terms
EXPLORE