The Parallel Domain Team
In our previous post, “Signs of Trust: Matching Real-World Performance with PD Replica Sim,” we shared our latest research qualitatively and quantitatively assessing the simulation to real gap. We found that sign-detection models trained exclusively on real data exhibit nearly identical behavior on both real and Replica Sim validation data sets, a strong indication that PD Replica Sim is a trustworthy environment for validation.
In this post, we’ll delve into how ML teams can use PD Replica Sim to push their autonomous system to its limits, interrogating model weaknesses and testing its generalization more rigorously than real-world data typically allows —covering tricky edge cases, handling occlusion and orientation sensitivity, and even evaluating model performance on unseen traffic sign types.
Capturing enough diverse, high-quality data for robust testing is expensive, time-consuming, and often incomplete. Most of the labeled data that ML teams manage to collect ends up going toward training, leaving little in the validation pool for comprehensive testing. These blind spots can be effectively addressed using simulation, where PD Replica Sim offers two key advantages:
Even high-performing models often struggle with specific failure points in real-world deployment. Some common failure points, and areas we explored in our latest research include:
These failure scenarios are challenging to capture in large numbers in real-world datasets, but PD Replica Sim allows developers to intentionally generate and study them.
If a traffic sign detection model struggles with angled signs, simulation enables precise experimentation. With PD Replica Sim, developers can:
This methodology allows teams to pinpoint the exact angle at which detection performance drops. In one of our internal tests, we observed a sharp decline in detection confidence when signs exceeded a certain tilt, despite strong front-facing performance. This angle as seen in the table below is around 30%. Identifying this failure mode in a controlled environment provided actionable insights—such as collecting additional training data at those orientations or refining the model architecture to improve viewpoint robustness.
Rotation experiment results running against a Mapillary trained perception model for sign detection:
bbox ratio width/hight) | <30% | 30-40% | 40-60% | >60% |
missed | 6 | 3 | 0 | 0 |
detected | 1 | 4 | 7 | 32 |
% detection rate | 14% | 57% | 100% | 100% |
What happens when a traffic sign is partially blocked by another object? Data Lab makes it possible to:
A surprising pattern emerged in our experiments:
This insight revealed a dataset bias—training data lacked examples of overlapping labeled signs. The solution? Either collect or synthesize more training images featuring these occlusion scenarios or refine the label ontology to better handle occluded signs.
Even if a model performs well on known traffic sign types, can it generalize to novel variants? For instance, if a training set includes only “30 mph” and “70 mph” speed limit signs but lacks “50 mph,” how does the model handle the missing class?
PD Replica Sim enables precise evaluation by generating simulation tests featuring:
By testing model behavior in these generated scenarios, ML teams can determine if their models are learning abstract traffic sign features or simply memorizing specific designs. If generalization fails, additional real-world or simulation data can be gathered to strengthen the training set.
Realistic sensor simulation is a powerful ally in testing autonomous systems. By leveraging PD Replica Sim’s trusted fidelity and flexible, iterative control, you can be more creative with your validation scenarios: pinpoint critical edge cases, validate model generalization, and continually refine your solutions before they ever hit the road or fly in the air.
If you’re looking for a reliable, multi-sensor, high-resolution simulation that’s fully controllable and repeatable, PD Replica Sim stands out from the pack. Fill out the form below, and our team will be happy to provide you with a personalized demo.