Skip to content

Temperature application order non standard? #4091

@electronjoe

Description

@electronjoe

I was reading a really interesting piece on Reddit regarding samplers, and a particularly interesting exchange came up which appears to have highlighted a discrepancy between the order llama.cpp applies temperature (to probabilities) while research literature / other implementations apply temperature earlier in the chain (to logits).

I thought it would be unfortunate for this discussion to die without visibility and discussion, so I've tossed up a GH issue.

I would normally look for historical / closed issues that are related, but I'm on my phone and that's rather complex.

The interesting Reddit discussion:

https://www.reddit.com/r/LocalLLaMA/s/WonSDiMCoD

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions