[Feature] `min_p` sampling parameter

### Motivation

The `min_p` sampling parameter is becoming quite popular. It's conceptually simple and "makes sense", and (at least anecdotally, according to opinions of many model fine-tuners and users in the LocalLlama community) it tends to perform better than the usual `top_p`+`top_k` approach. You can see the readmes of HF repositories of many new model finetunes/merges recommend to use `min_p` instead of `top_p` and `top_k`.

### Related resources

* vLLM: https://github.com/vllm-project/vllm/blob/8ea5e44a435e8731fd6f5ba4c329dd112752532a/vllm/sampling_params.py#L64C9-L66C57

> min_p: Float that represents the minimum probability for a token to be considered, **relative to** the probability of the most likely token. Must be in [0, 1]. Set to 0 to disable this.

So e.g. a `min_p` of 0.07 means that if a token's probability is less than 7% of the size *of the highest-probability token*, it will be disqualified. A `min_p` of 0.5 would mean that if a token's probability is not at least half the size of the highest-probability token, then it is disqualified. Said another way, `min_p` allows you to set a *minimum fraction* of the most likely token's probability, else the token cannot be sampled.

* https://github.com/vllm-project/vllm/pull/1642
* https://github.com/oobabooga/text-generation-webui/pull/4449
* https://github.com/ggerganov/llama.cpp/pull/3841
* (Edit - SGLang recently added it:) https://github.com/sgl-project/sglang/pull/1167

Please see the above links for more info.

![image](https://github.com/InternLM/lmdeploy/assets/1167575/44438e84-17eb-471b-b80e-01a894d85ce1)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Feature] `min_p` sampling parameter #1745

Motivation

Related resources

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Feature] min_p sampling parameter #1745

Description

Motivation

Related resources

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

[Feature] `min_p` sampling parameter #1745