Examining the Adversarial Test Accuracy of Later Exits in NEO-KD Networks

30 Sept 2024

Authors:

(1) Seokil Ham, KAIST;

(2) Jungwuk Park, KAIST;

(3) Dong-Jun Han, Purdue University;

(4) Jaekyun Moon, KAIST.

Abstract and 1. Introduction

2. Related Works

3. Proposed NEO-KD Algorithm and 3.1 Problem Setup: Adversarial Training in Multi-Exit Networks

3.2 Algorithm Description

4. Experiments and 4.1 Experimental Setup

4.2. Main Experimental Results

4.3. Ablation Studies and Discussions

5. Conclusion, Acknowledgement and References

A. Experiment Details

B. Clean Test Accuracy and C. Adversarial Training via Average Attack

D. Hyperparameter Tuning

E. Discussions on Performance Degradation at Later Exits

F. Comparison with Recent Defense Methods for Single-Exit Networks

G. Comparison with SKD and ARD and H. Implementations of Stronger Attacker Algorithms

E. Discussions on Performance Degradation at Later Exits

As seen from the anytime prediction results in the main manuscript, the adversarial test accuracy of the later exits is sometimes lower than that of the earlier exits. This phenomenon can be explained as follows: in our experiments, we observed that adversarial examples targeting later exits generally have a higher sum of losses across all exits than adversarial examples targeting earlier exits. As a result, the max-average or average attack focuses mainly on attacking the later exits, leading to low adversarial test accuracy at those exits. The performance of the later exits can be improved by adopting the ensemble strategy used in the main manuscript for the budgeted prediction setup.
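To make this attack behavior concrete, below is a minimal PyTorch sketch, not the paper's exact implementation. It assumes a hypothetical `model` that returns a list of per-exit logits, and the PGD hyperparameters (`eps`, `alpha`, `steps`) are illustrative. The max-average attack generates one PGD candidate per exit and keeps, per sample, the candidate with the highest average loss over all exits, which is why candidates targeting later exits tend to win; `ensemble_predict` sketches the ensemble strategy referenced above.

```python
import torch
import torch.nn.functional as F

def max_average_attack(model, x, y, eps=8/255, alpha=2/255, steps=10):
    # For each exit k, run a PGD attack on exit k's loss, then keep (per
    # sample) the candidate whose average loss over *all* exits is largest.
    # Assumption: model(x) returns a list of logits, one tensor per exit.
    num_exits = len(model(x))
    best_adv = x.clone()
    best_avg = torch.full((x.size(0),), -float("inf"), device=x.device)

    for k in range(num_exits):
        # Standard PGD targeting the cross-entropy loss at exit k.
        adv = (x + torch.empty_like(x).uniform_(-eps, eps)).clamp(0, 1)
        for _ in range(steps):
            adv.requires_grad_(True)
            loss_k = F.cross_entropy(model(adv)[k], y)
            grad = torch.autograd.grad(loss_k, adv)[0]
            adv = adv.detach() + alpha * grad.sign()
            adv = torch.max(torch.min(adv, x + eps), x - eps).clamp(0, 1)

        # Per-sample average loss over all exits for this candidate.
        with torch.no_grad():
            losses = torch.stack(
                [F.cross_entropy(out, y, reduction="none") for out in model(adv)]
            )  # shape: (num_exits, batch)
            avg = losses.mean(dim=0)

        # Keep the candidate with the highest average loss, per sample.
        better = avg > best_avg
        best_adv[better] = adv[better]
        best_avg = torch.where(better, avg, best_avg)

    return best_adv


def ensemble_predict(model, x, up_to_exit):
    # Ensemble strategy: average the softmax outputs of exits 0..up_to_exit,
    # which can recover accuracy at later exits under these attacks.
    with torch.no_grad():
        probs = [F.softmax(out, dim=-1) for out in model(x)[: up_to_exit + 1]]
    return torch.stack(probs).mean(dim=0).argmax(dim=-1)
```

Because the per-sample selection maximizes the loss averaged over every exit, a candidate that simultaneously raises the losses of the deeper exits is preferred, matching the observation above that these attacks concentrate on the later exits.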

This paper is available on arxiv under CC 4.0 license.