Setting frequency_penalty to 0 returns a response with no content. Values greater than 0 but less than 1 cause other weird responses. 1 seems to be the only reliable value, and it's unclear whether the cause is the model or something else.
POST http://127.0.0.1:5272/v1/chat/completions
content-type: application/json

{
  "messages": [
    {
      "role": "user",
      "content": "Whats the golden ratio"
    }
  ],
  "frequency_penalty": 0,
  "model": "Phi-3-mini-4k-cpu-int4-rtn-block-32-acc-level-4-onnx"
}
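To compare several values side by side, the same request can be scripted. This is only a minimal sketch using the Python standard library: the URL, model name, and prompt are copied from the request above, while the helper names build_payload, send, and compare are made up for illustration.

```python
import json
import urllib.request

# Endpoint and model name are copied from the request in this issue.
URL = "http://127.0.0.1:5272/v1/chat/completions"
MODEL = "Phi-3-mini-4k-cpu-int4-rtn-block-32-acc-level-4-onnx"


def build_payload(frequency_penalty):
    """Return the request body from this issue with the given penalty."""
    return {
        "messages": [{"role": "user", "content": "Whats the golden ratio"}],
        "frequency_penalty": frequency_penalty,
        "model": MODEL,
    }


def send(frequency_penalty, url=URL):
    """POST the payload and return the raw response body as text."""
    request = urllib.request.Request(
        url,
        data=json.dumps(build_payload(frequency_penalty)).encode("utf-8"),
        headers={"content-type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")


def compare(values):
    """Print the raw response for each penalty value (needs the local server)."""
    for value in values:
        print(value, send(value))
```

With the local server running, calling compare((0, 0.5, 1, 1.5)) prints one raw response per value. Those four values are arbitrary picks around the range discussed here.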
As the tooltip mentions, this parameter controls the likelihood of repetition. If you set a lower value, the model is more likely to repeat itself, which is why you see weird responses when it is set between 0 and 1.
The value 1 is not the only reliable value. You can also set it to a value greater than 1, which decreases the likelihood of repetition further.
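This thread does not say how the server applies the parameter. One scheme that would match the behavior described (1 is neutral, values below 1 encourage repetition, values above 1 discourage it) is a multiplicative repetition penalty on the scores of tokens that were already generated. The sketch below assumes that scheme; the function name and the plain-list representation of scores are hypothetical and are not taken from the toolkit.

```python
def apply_repetition_penalty(logits, generated_token_ids, penalty):
    """Rescale the scores of already generated tokens.

    Assumed scheme, not confirmed for this server: a positive score is
    divided by the penalty and a negative score is multiplied by it, so a
    penalty above 1 always makes a repeated token less likely.
    """
    if penalty <= 0:
        # A penalty of 0 would divide by zero, so it is rejected here.
        raise ValueError("penalty must be greater than 0")
    adjusted = list(logits)
    for token_id in set(generated_token_ids):
        score = adjusted[token_id]
        adjusted[token_id] = score / penalty if score > 0 else score * penalty
    return adjusted
```

Under this assumption, a penalty of 1 leaves every score unchanged, a penalty between 0 and 1 raises the scores of repeated tokens, and a penalty of 0 has no defined result.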
I think we can add range validation for the input parameters, as @swatDong said.
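A range check of that kind could look roughly like the sketch below. The function name is hypothetical, and the bounds are passed in by the caller because this thread does not state the supported range.

```python
def validate_frequency_penalty(value, minimum, maximum):
    """Return value as a float, or raise if it is not a number in range.

    minimum and maximum are supplied by the caller; the actual supported
    range is not stated in this thread.
    """
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(value, bool) or not isinstance(value, (int, float)):
        raise TypeError("frequency_penalty must be a number")
    # NaN fails this comparison as well, so it is rejected too.
    if not minimum <= value <= maximum:
        raise ValueError(
            f"frequency_penalty must be between {minimum} and {maximum}, got {value}"
        )
    return float(value)
```

Rejecting an out-of-range value before it reaches the model would turn an empty or garbled completion into an explicit error message.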
You will get a response like: