vllm.v1.core.sched.journey_events
Request Journey Event tracing for lifecycle observability.
This module provides sparse lifecycle event tracking for requests as they move through the vLLM scheduler. Events are emitted at key state transitions with full progress snapshots, enabling detailed request journey analysis.
RequestJourneyEvent
Bases: Struct
A single request lifecycle event with full progress snapshot.
Each event captures the complete state of a request at a specific lifecycle transition, including accurate progress counters that survive preemption.
Progress Tracking:
- prefill_done_tokens: High-water mark of prompt tokens processed (survives preemption, tracked via scheduler-side dict)
- decode_done_tokens: Output tokens generated (from num_output_tokens)
- phase: Current processing phase (PREFILL or DECODE)

Scheduler Context:
- scheduler_step: Monotonic counter from Scheduler.scheduler_step_counter (None only for QUEUED events before first schedule)
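The high-water-mark behaviour described above can be illustrated with a small standalone sketch. This is not the module's implementation: `ProgressSnapshot`, `ProgressTracker`, and the `snapshot` parameters are hypothetical names chosen for illustration, and a plain dataclass stands in for the real `Struct` base. Only the field names `prefill_done_tokens`, `decode_done_tokens`, and `scheduler_step` come from the description above; `phase` is left out for brevity.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ProgressSnapshot:
    """Hypothetical, simplified stand-in for the progress fields of an event."""

    request_id: str
    prefill_done_tokens: int
    decode_done_tokens: int
    scheduler_step: Optional[int]  # None models a QUEUED event before first schedule


class ProgressTracker:
    """Hypothetical scheduler-side helper that keeps a per-request high-water mark."""

    def __init__(self) -> None:
        # request_id -> largest number of prompt tokens ever seen as processed
        self._prefill_hiwater: dict[str, int] = {}

    def snapshot(
        self,
        request_id: str,
        num_computed_tokens: int,
        num_prompt_tokens: int,
        num_output_tokens: int,
        scheduler_step: Optional[int],
    ) -> ProgressSnapshot:
        # Only prompt tokens count toward prefill progress.
        prompt_done = min(num_computed_tokens, num_prompt_tokens)
        # max() keeps the mark from dropping when preemption resets the live counter.
        hiwater = max(self._prefill_hiwater.get(request_id, 0), prompt_done)
        self._prefill_hiwater[request_id] = hiwater
        return ProgressSnapshot(
            request_id=request_id,
            prefill_done_tokens=hiwater,
            decode_done_tokens=num_output_tokens,
            scheduler_step=scheduler_step,
        )


tracker = ProgressTracker()
queued = tracker.snapshot("req-1", 96, 128, 0, None)
# Simulated preemption: the live computed-token counter is back at 0.
preempted = tracker.snapshot("req-1", 0, 128, 0, 7)
decoding = tracker.snapshot("req-1", 130, 128, 2, 9)
```

In this sketch `preempted.prefill_done_tokens` stays at 96 even though the live counter was reset, which is the property the docstring calls "survives preemption".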
Source code in vllm/v1/core/sched/journey_events.py
RequestJourneyEventType
Bases: IntEnum
Request lifecycle event types.
Events are emitted at key state transitions during a request's journey through the scheduler. Each event captures a full progress snapshot.
ScheduleKind
Bases: IntEnum
Type of scheduling transition for SCHEDULED events.
_map_finish_status
_map_finish_status(
status: RequestStatus,
) -> Literal[
"stopped", "length", "aborted", "ignored", "error"
]
Map RequestStatus terminal state to journey event finish_status string.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
| `status` | `RequestStatus` | Terminal `RequestStatus` (`FINISHED_*`) | required |
Returns:
| Type | Description |
|---|---|
| `Literal['stopped', 'length', 'aborted', 'ignored', 'error']` | Human-readable finish status string |
Raises:
| Type | Description |
|---|---|
| `ValueError` | If `status` is not a terminal `FINISHED_*` status |
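A minimal sketch of how such a mapping could look, under stated assumptions: the `RequestStatus` enum below is a local stand-in so the example runs on its own, its member names are assumptions rather than a confirmed list, and `map_finish_status` is an illustrative public name, not the private helper itself. Only the five return strings and the `ValueError` behaviour are taken from the signature and tables above.

```python
from enum import Enum, auto
from typing import Literal

FinishStatus = Literal["stopped", "length", "aborted", "ignored", "error"]


class RequestStatus(Enum):
    """Local stand-in for the scheduler's RequestStatus (member names assumed)."""

    WAITING = auto()
    RUNNING = auto()
    FINISHED_STOPPED = auto()
    FINISHED_LENGTH_CAPPED = auto()
    FINISHED_ABORTED = auto()
    FINISHED_IGNORED = auto()
    FINISHED_ERROR = auto()


# Lookup table from terminal status to finish_status string.
_FINISH_STATUS: dict[RequestStatus, FinishStatus] = {
    RequestStatus.FINISHED_STOPPED: "stopped",
    RequestStatus.FINISHED_LENGTH_CAPPED: "length",
    RequestStatus.FINISHED_ABORTED: "aborted",
    RequestStatus.FINISHED_IGNORED: "ignored",
    RequestStatus.FINISHED_ERROR: "error",
}


def map_finish_status(status: RequestStatus) -> FinishStatus:
    """Return the finish_status string, or raise ValueError for non-terminal states."""
    try:
        return _FINISH_STATUS[status]
    except KeyError:
        raise ValueError(f"{status!r} is not a terminal FINISHED_* status") from None
```

A dictionary lookup keeps the mapping in one place, so a newly added terminal status without an entry fails loudly with `ValueError` instead of silently returning a default.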