`vllm.parser.engine`

Streaming parser engine framework for tool call and reasoning extraction.

Instead of hand-rolling a parser for every model's tool-call / reasoning format, each format is declared as a ParserEngineConfig (terminals, states, and transitions) and a shared incremental engine handles streaming, ambiguity buffering, token-ID mapping, and delta computation.
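The idea of a declarative format plus a shared incremental engine can be illustrated with a small, self-contained sketch. Everything in it is an assumption made for illustration: `ToyFormatConfig`, `ToyStreamingEngine`, and their fields are hypothetical names, not the actual `ParserEngineConfig` or engine API, and token-ID mapping is omitted. It shows only two of the behaviors named above: ambiguity buffering (holding back text that might be the start of a terminal) and per-chunk delta computation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToyFormatConfig:
    """Hypothetical stand-in for a declarative format description."""
    terminals: dict[str, str]                 # terminal name -> literal marker text
    initial_state: str
    transitions: dict[tuple[str, str], str]   # (state, terminal name) -> next state


@dataclass
class ToyStreamingEngine:
    """Hypothetical shared engine: one implementation, many configs."""
    config: ToyFormatConfig
    state: str = ""
    _buffer: str = ""

    def __post_init__(self) -> None:
        self.state = self.config.initial_state

    def _ambiguous_suffix_len(self, text: str) -> int:
        """Length of the longest suffix of `text` that is a proper prefix of a terminal."""
        longest = 0
        for marker in self.config.terminals.values():
            for n in range(1, min(len(marker) - 1, len(text)) + 1):
                if marker.startswith(text[-n:]):
                    longest = max(longest, n)
        return longest

    def feed(self, chunk: str) -> list[tuple[str, str]]:
        """Consume one text chunk and return (state, text) deltas that are safe to emit."""
        self._buffer += chunk
        deltas: list[tuple[str, str]] = []
        while True:
            # Find the earliest complete terminal currently in the buffer.
            hits = [(self._buffer.find(m), name, m)
                    for name, m in self.config.terminals.items() if m in self._buffer]
            if not hits:
                break
            pos, name, marker = min(hits)
            next_state = self.config.transitions.get((self.state, name))
            if next_state is None:
                # The marker has no meaning in this state: pass it through as text.
                deltas.append((self.state, self._buffer[:pos + len(marker)]))
            else:
                if pos > 0:
                    deltas.append((self.state, self._buffer[:pos]))
                self.state = next_state
            self._buffer = self._buffer[pos + len(marker):]
        # Ambiguity buffering: keep back a tail that could still become a terminal.
        keep = self._ambiguous_suffix_len(self._buffer)
        emit = self._buffer[:len(self._buffer) - keep]
        if emit:
            deltas.append((self.state, emit))
        self._buffer = self._buffer[len(self._buffer) - keep:]
        return deltas


# A made-up reasoning format, declared as data rather than as a custom parser.
THINK_FORMAT = ToyFormatConfig(
    terminals={"THINK_OPEN": "<think>", "THINK_CLOSE": "</think>"},
    initial_state="content",
    transitions={
        ("content", "THINK_OPEN"): "reasoning",
        ("reasoning", "THINK_CLOSE"): "content",
    },
)

engine = ToyStreamingEngine(THINK_FORMAT)
deltas = []
for chunk in ["<thi", "nk>step one</th", "ink>The answer", " is 4"]:
    deltas.extend(engine.feed(chunk))
print(deltas)
```

Note that the markers are split across chunk boundaries in the demo: the engine emits nothing for `"<thi"` because that text could still turn into `<think>`, and it only classifies `step one` as reasoning once the opening marker is complete. Supporting a different format in this sketch means writing a new `ToyFormatConfig`, not a new engine.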

Modules:

adapters – Adapters that expose `ParserEngine` through the legacy …

events – Semantic event types emitted by the streaming parser engine.

incremental_lexer – Incremental text lexer that converts text chunks into terminal …

parser_engine – Parser engine base that handles both reasoning and tool call …

parser_engine_config – Declarative configuration for model tool-call and reasoning formats.

registered_adapters – Concrete adapter classes for each registered parser engine.

streaming_parser_engine – Streaming parser engine that orchestrates token ID scanning, …

token_id_scanner – Scan delta token IDs for special tokens and split the stream into …
