(a) Row 1: The ranks of the models. (b) Row 2: The confidence score (worse<0.4≤fair<0.6≤good<0.8≤excellent).