We provide some visualization videos and qualitatively analysis for baseline methods on Bench2Drive.For TCP, UniAD and VAD, we choose the best version of these models(TCP-traj,UniAD-Base,VAD-Base), and make visualization on 10 Scenarios as below.
Note: It might take some time to load all gifs. Please be patient.
Driving Skill | Scenario Name | Route ID | Success | |||
---|---|---|---|---|---|---|
AD-MLP | TCP-traj | UniAD-Base | VAD-Base | |||
Merging | MergerIntoSlowTraffic | 2283 | x | √ | √ | x |
SignalizedJunctionLeftTurn | 4183 | x | √ | x | √ | |
Overtaking | ParkedObstacle | 25318 | x | x | x | √ |
HazardAtSideLane | 25439 | x | x | √ | √ | |
Emergency Brake | ParkingCutIn | 18305 | x | √ | √ | x |
StaticCutIn | 26396 | x | x | √ | √ | |
Give Way | YieldToEmergencyVehicle | 3378 | x | x | x | x |
InvadingTurn | 2802 | x | √ | √ | x | |
Traffic Sign | EnterActorFlow | 3749 | x | √ | x | x |
VanillaNonSignalizedTurnEncounterStopsign | 3905 | x | √ | x | x |
Note: In this visualization, the definition of 'success' differs from the standard definition used in the Bench2Drive. A route may involve multiple actions, such as turning after passing through a traffic light. In this visualization, we only evaluate whether the selected segment's action is successful. For example, if the vehicle obeys the traffic light and passes through the intersection, but collides while turning, it is still considered a successful case in following traffic sign.
we visualize the behavior of three models on MergeIntoSlowTraffic
and SignalizedJunctionLeftTurn
scenarios to show their ability of mering . The ego vehicle should drive to off-ramp to exit the highway in MergeIntoSlowTraffic
, and it should perform a left turn in SignalizedJunctionLeftTurn
.
We visualize the behavior of three models on ParkedObstacle
and HazardAtSideLane
scenarios to show their ability of overtaking. The ego vehicle encounters a parked vehicle blocking part of the lane in ParkedObstacle
, and encounters a slow-moving hazard blocking part of the lane in HazardAtSideLane
.It should perform a lane change to avoid it.
We visualize the behavior of three models on ParkingCutIn
and StaticCutIn
scenarios to show their ability of emergency brake. In these scenarios, the ego must slow down or brake to allow a vehicle cut in.
We visualize the behavior of three models on YieldToEmergencyVehicle
and InvadingTurn
scenarios to show their ability of giving way. In YieldToEmergencyVehicle
, ego must maneuver to allow the emergency vehicle behind to pass. In InvadingTurn
,a vehicle coming from the opposite lane invades the ego’s lane, forcing the ego to move right to avoid a possible collision.
We visualize the behavior of three models on EnterActorFlow
and VanillaNonSignalizedTurnEncounterStopsign
scenarios to show their ability of following traffic sign. In EnterActorFlow
, ego should follow the traffic light. In VanillaNonSignalizedTurnEncounterStopsign
, ego should stop and start at stop signs.
-
Strategies: The three E2E-AD models implement distinct strategies. The TCP model adopts a conservative approach, favoring slower speeds to enhance obstacle avoidance and response to unexpected events, potentially at the cost of traffic flow efficiency. Conversely, the VAD model employs a more aggressive strategy, increasing the risk of excessive speed and abrupt maneuvers, which could lead to collisions. The strategy of UniAD lies between these extremes.
-
Diverse Behaviors in Identical Scenarios: It is notable that in certain scenarios, such as cases
10
,11
, and12
, different models trained on the same dataset exhibit varied behaviors, with more than one approach being reasonable, as seen in cases11
and12
. -
Perception and Planning: The interplay between perception and planning is critical. A model’s failure to detect nearby vehicles often leads to incorrect routing, as demonstrated in case
5
. However, even precise object detection does not always ensure correct planning decisions, as illustrated in case10
. -
Failure Cases nalysis: Several factors contribute to the failure cases observed. These include the inability to perceive other vehicles (cases
5
and20
), inaccurate motion predictions (case8
), actions that are either too mild (case7
) or too forceful (case24
), and misinterpretations of the scene (cases29
and30
).