Method Overview:

Given the performance logs generated from a rigorous pre-evaluation process, SERA utilizes a large language model (LLM) to analyze and interpret failure patterns. These insights guide a Failure-Aware Scenario Recommendation pipeline that retrieves semantically relevant scenarios from a structured Scenario Bank. The recommended scenarios are further refined by an LLM-based reflection module to ensure semantic alignment and diversity, producing high-quality, efficient scenarios for subsequent Self-Evolving Scenario Repair. This iterative repair process significantly enhances the robustness and safety of autonomous driving systems, particularly under challenging, rare, and safety-critical conditions.