vllm.renderers.online_derenderer
_convert_chat_logprobs_to_completion_logprobs(logprobs)
Convert ChatCompletionLogProbs (per-token objects) to CompletionLogProbs (parallel flat lists) as required by the /v1/completions response schema.
Source code in vllm/renderers/online_derenderer.py
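The shape change described above can be illustrated with a minimal sketch. It does not use vLLM's actual classes: `TopLogprob`, `TokenLogprob`, and `to_completion_logprobs` are hypothetical stand-ins, and computing `text_offset` from cumulative token lengths is an assumption about how the offsets might be derived.

```python
from dataclasses import dataclass, field


@dataclass
class TopLogprob:
    """Hypothetical stand-in for one alternative token at a position."""
    token: str
    logprob: float


@dataclass
class TokenLogprob:
    """Hypothetical stand-in for one per-token chat logprob entry."""
    token: str
    logprob: float
    top_logprobs: list = field(default_factory=list)


def to_completion_logprobs(content):
    """Flatten per-token objects into the parallel lists used by /v1/completions."""
    tokens, token_logprobs, top_logprobs, text_offset = [], [], [], []
    offset = 0
    for entry in content:
        tokens.append(entry.token)
        token_logprobs.append(entry.logprob)
        # Each position maps alternative token text to its logprob.
        top_logprobs.append({t.token: t.logprob for t in entry.top_logprobs})
        # Assumption: offsets are character positions in the joined text.
        text_offset.append(offset)
        offset += len(entry.token)
    return {
        "tokens": tokens,
        "token_logprobs": token_logprobs,
        "top_logprobs": top_logprobs,
        "text_offset": text_offset,
    }


chat_content = [
    TokenLogprob("Hello", -0.1, [TopLogprob("Hello", -0.1), TopLogprob("Hi", -2.3)]),
    TokenLogprob(" world", -0.5, [TopLogprob(" world", -0.5)]),
]
flat = to_completion_logprobs(chat_content)
```

All four lists have one entry per generated token, so index `i` in each list refers to the same position.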
_correct_decoded_token(token_id, context_token_ids, tokenizer)
Use the preceding tokens as context to repair U+FFFD replacement characters that appear when a byte-fallback token is decoded on its own.
Mirrors LogprobsProcessor._correct_decoded_token in v1/engine/logprobs.py.
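The idea of decoding with context can be sketched as follows. This is not the vLLM implementation: `ToyByteTokenizer` and `correct_decoded_token` are hypothetical, the toy tokenizer treats every token ID as one raw byte, and returning an empty string for a still-incomplete sequence is an assumption.

```python
REPLACEMENT = "\ufffd"


class ToyByteTokenizer:
    """Hypothetical tokenizer in which each token ID is a single raw byte."""

    def decode(self, token_ids):
        return bytes(token_ids).decode("utf-8", errors="replace")


def correct_decoded_token(token_id, context_token_ids, tokenizer, max_context=3):
    """Decode one token, borrowing preceding tokens if it decodes to U+FFFD."""
    text = tokenizer.decode([token_id])
    if REPLACEMENT not in text:
        return text  # Token is valid on its own; no context needed.
    # Try progressively longer suffixes of the context (UTF-8 uses at most
    # 4 bytes per character, so 3 preceding byte tokens suffice here).
    for n in range(1, min(max_context, len(context_token_ids)) + 1):
        joined = tokenizer.decode(list(context_token_ids[-n:]) + [token_id])
        if not joined.endswith(REPLACEMENT):
            return joined
    # Assumption: an incomplete byte sequence is reported as empty text.
    return ""


tok = ToyByteTokenizer()
# "é" is the two bytes 0xC3 0xA9; neither byte decodes cleanly alone.
fixed = correct_decoded_token(0xA9, [0xC3], tok)
incomplete = correct_decoded_token(0xC3, [], tok)
plain = correct_decoded_token(ord("a"), [], tok)
```

In this sketch the token that completes a multi-byte character carries the whole character, while the leading byte token yields an empty string.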
_parse_token_id_placeholder(token)
Extract token ID from a 'token_id:N' placeholder string.
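A minimal sketch of this kind of parsing is shown below. The name `parse_token_id_placeholder` and the choice to return `None` for non-matching input are assumptions; the actual helper may signal a mismatch differently.

```python
from typing import Optional

PREFIX = "token_id:"


def parse_token_id_placeholder(token: str) -> Optional[int]:
    """Return N for a 'token_id:N' string, or None if the string does not match."""
    if not token.startswith(PREFIX):
        return None
    suffix = token[len(PREFIX):]
    # Accept only plain ASCII digits so that int() cannot raise.
    if suffix.isascii() and suffix.isdigit():
        return int(suffix)
    return None


parsed = parse_token_id_placeholder("token_id:1234")
not_placeholder = parse_token_id_placeholder("hello")
```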
_resolve_logprobs(logprobs, tokenizer)
Resolve token_id:N placeholders in a ChatCompletionLogProbs object.
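Placeholder resolution can be sketched roughly as below. For brevity the sketch operates on a plain list of token strings rather than a ChatCompletionLogProbs object; `ToyTokenizer`, its vocabulary, and `resolve_tokens` are hypothetical names used only for illustration.

```python
import re

# Matches the whole string 'token_id:N' where N is one or more ASCII digits.
PLACEHOLDER = re.compile(r"token_id:([0-9]+)")


class ToyTokenizer:
    """Hypothetical tokenizer with a tiny fixed vocabulary."""

    vocab = {0: "Hello", 1: " world"}

    def decode(self, token_ids):
        return "".join(self.vocab[i] for i in token_ids)


def resolve_tokens(tokens, tokenizer):
    """Replace each 'token_id:N' placeholder with the decoded text of token N."""
    resolved = []
    for token in tokens:
        match = PLACEHOLDER.fullmatch(token)
        if match:
            resolved.append(tokenizer.decode([int(match.group(1))]))
        else:
            resolved.append(token)  # Already plain text; keep unchanged.
    return resolved


resolved = resolve_tokens(["token_id:0", "token_id:1", "!"], ToyTokenizer())
```

A full version would apply the same substitution to the alternative tokens stored at each position, not only to the sampled token.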