Table 1

Methodological quality and risk of bias of included studies assessed with the Quality Appraisal of Reliability Studies*

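| Study | Item 1 | Item 2 | Item 3 | Item 4 | Item 5 | Item 6 | Item 7 | Item 8 | Item 9 | Item 10 | Item 11 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Ageberg et al [13] | Y | Y | Y | NA | U | U | U | NA | Y | Y | Y |
| Barker-Davis et al [56] | Y | Y | Y | U | U | NA | U | N | Y | Y | Y |
| Chmielewski et al [14] | Y | Y | U | U | NA | U | U | Y | Y | Y | Y |
| Cornell et al [66] | Y | Y | NA | U | NA | U | U | U | Y | Y | N |
| Crossley et al [19] | Y | Y | U | U | Y | U | U | Y | Y | Y | N |
| Di Mattia et al [20] | Y | Y | U | NA | U | U | U | NA | Y | Y | N |
| Edmondston et al [59] | Y | Y | U | NA | U | U | U | Y | Y | Y | N |
| Friedrich et al [61] | Y | Y | Y | NA | NA | NA | Y | U | U | Y | Y |
| Frohm et al [15] | Y | Y | Y | NA | NA | Y | U | NA | Y | Y | Y |
| Gianola et al [57] | Y | Y | Y | U | U | NA | U | U | Y | Y | Y |
| Harris-Hayes et al [16] | Y | Y | Y | U | Y | U | U | Y | Y | Y | Y |
| Herman et al [21] | Y | Y | U | NA | Y | U | U | NA | Y | Y | Y |
| Junge et al [17] | Y | Y | U | NA | NA | U | U | NA | Y | Y | Y |
| Kaukinen et al [58] | Y | Y | Y | Y | NA | Y | U | Y | Y | Y | Y |
| Kennedy et al [23] | Y | Y | U | U | NA | U | U | U | Y | Y | Y |
| Lenzlinger-Asprion et al [51] | Y | Y | Y | U | NA | Y | Y | Y | Y | Y | Y |
| McKeown et al [24] | Y | Y | U | U | NA | U | U | NA | Y | Y | N |
| Nae et al [30] | Y | Y | Y | NA | NA | U | U | U | Y | Y | Y |
| Park et al [25] | Y | Y | Y | NA | NA | U | U | Y | Y | Y | Y |
| Piva et al [26] | Y | Y | Y | NA | NA | U | U | Y | U | Y | Y |
| Poulsen et al [52] | Y | Y | Y | Y | U | U | U | U | Y | Y | Y |
| Rabin et al [64] | Y | Y | Y | NA | NA | U | U | NA | Y | Y | Y |
| Rabin et al [65] | Y | Y | Y | NA | NA | U | U | NA | Y | Y | Y |
| Räisänen et al [53] | Y | Y | U | Y | U | U | U | Y | Y | Y | N |
| Stensrud et al [27] | Y | Y | NA | U | U | U | U | U | U | Y | Y |
| Teyhen et al [60] | Y | Y | Y | NA | NA | U | U | Y | Y | Y | Y |
| Van Mastrigt et al [63] | Y | Y | Y | NA | U | Y | U | Y | Y | U | Y |
| Weeks et al [22] | Y | Y | Y | Y | Y | Y | U | Y | Y | Y | N |
| Weir et al [54] | Y | Y | U | U | NA | U | U | Y | Y | Y | Y |
| Whatman et al [18] | Y | Y | U | U | Y | U | U | Y | Y | Y | Y |
| Örtqvist et al [55] | Y | Y | Y | U | NA | U | U | Y | U | Y | Y |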
  • *Assesses study quality based on 11 items. Items:
      1. Was the test evaluated in a sample of subjects who were representative of those to whom the authors intended the results to be applied?
      2. Was the test performed by raters who were representative of those to whom the authors intended the results to be applied?
      3. Were raters blinded to the findings of other raters during the study?
      4. Were raters blinded to their own prior findings of the test under evaluation?
      5. Were raters blinded to the results of the accepted reference standard or the disease status for the target disorder (or variable) being evaluated?
      6. Were raters blinded to clinical information that was not intended to be provided as part of the testing procedure or study design?
      7. Were raters blinded to additional cues that were not part of the test?
      8. Was the order of examination varied?
      9. Was the stability (or theoretical stability) of the variable being measured taken into account when determining the suitability of the time-interval among repeated measures?
      10. Was the test applied correctly and interpreted appropriately?
      11. Were appropriate statistical measures of agreement used?

  • N, no; NA, not applicable; U, unclear; Y, yes (marked in bold).