Let’s say we ask an LLM the following query: query = “Classify the sentiment of this sentence as positive or negative: I am very excited today.” and the LLM returns: response = “positive” The goal of uncertainty estimation is to answer: How sure is the LLM in this response?In order to do this, we query the LLM with the same example k times. Then, we use the k responses to determine how dissimmilar these responses are. Three possible metrics are: