Response Got Cut Off Mid-Sentence Because max_tokens Was Too Low
The model's response ends abruptly in the middle of a sentence, a JSON object, or a code block. Almost always max_tokens. How to size it, detect truncation, and recover.
Articles tagged with #truncation
The model's response ends abruptly in the middle of a sentence, a JSON object, or a code block. Almost always max_tokens. How to size it, detect truncation, and recover.