Inter-rater reliability and concurrent validity of ROBINS-I: Protocol for a cross-sectional study

Maya M. Jeyaraman; Rasheda Rabbani; Nameer Al-Yousif; Reid C. Robson; Leslie Copstein; Jun Xia; Michelle Pollock; Samer Mansour; Mohammed T. Ansari; Andrea C. Tricco; Ahmed M. Abou-Setta

doi:10.1186/s13643-020-1271-6

Nottingham China Health Institute

Research output: Journal Publication › Article › peer-review

15 Citations (Scopus)

Abstract

Background: The Cochrane Bias Methods Group recently developed the "Risk of Bias (ROB) in Non-randomized Studies of Interventions" (ROBINS-I) tool to assess ROB for non-randomized studies of interventions (NRSI). It is important to establish consistency in its application and interpretation across review teams. In addition, it is important to understand whether specialized training and guidance will improve the reliability of the results of the assessments. Therefore, the objective of this cross-sectional study is to establish the inter-rater reliability (IRR), inter-consensus reliability (ICR), and concurrent validity of ROBINS-I. Furthermore, as this is a relatively new tool, it is important to understand the barriers to using it, in particular the evaluator burden (e.g., the time needed to conduct assessments and reach consensus).
Methods: Reviewers from four participating centers will appraise the ROB of a sample of NRSI publications using the ROBINS-I tool in two stages. For IRR and ICR, two pairs of reviewers will assess the ROB for each NRSI publication. In the first stage, reviewers will assess the ROB without any formal guidance. In the second stage, reviewers will be provided with customized training and guidance. At each stage, each pair of reviewers will resolve conflicts and arrive at a consensus. To calculate the IRR and ICR, we will use Gwet's AC₁ statistic. For concurrent validity, reviewers will appraise a sample of NRSI publications using both the Newcastle-Ottawa Scale (NOS) and ROBINS-I. We will analyze the concordance between the two tools for similar domains and for the overall judgments using Kendall's tau coefficient. To measure the evaluator burden, we will assess the time taken to apply ROBINS-I (without and with guidance) and the NOS. To assess the impact of customized training and guidance on the evaluator burden, we will use generalized linear models. We will use Microsoft Excel and SAS 9.4 to manage and analyze study data, respectively.
Discussion: The quality of evidence from systematic reviews that include NRS depends partly on the study-level ROB assessments. The findings of this study will contribute to an improved understanding of the ROBINS-I tool and how best to use it.
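
As an illustration of the agreement statistic named in the Methods, the following is a minimal Python sketch of Gwet's AC1 for two raters and nominal categories. It is not the authors' analysis code (the protocol specifies SAS 9.4). The function name gwet_ac1, the category list, and the example ratings are assumptions made up for illustration only.

```python
from collections import Counter


def gwet_ac1(rater_a, rater_b, categories):
    """Gwet's AC1 for two raters who each rate the same items once.

    Illustrative sketch only: unweighted, nominal categories, no missing data.
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must rate the same non-empty set of items")
    q = len(categories)
    if q < 2:
        raise ValueError("at least two categories are required")
    if not set(rater_a) | set(rater_b) <= set(categories):
        raise ValueError("ratings must come from the listed categories")

    n = len(rater_a)
    # Observed agreement: share of items given the same category by both raters.
    p_a = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # pi_k: average of the two raters' marginal proportions for category k.
    counts = Counter(rater_a) + Counter(rater_b)
    pi = [counts[c] / (2 * n) for c in categories]

    # Chance agreement under Gwet's model.
    p_e = sum(p * (1 - p) for p in pi) / (q - 1)
    return (p_a - p_e) / (1 - p_e)


# Hypothetical ratings for four studies (not data from the protocol).
levels = ["Low", "Serious"]
reviewer_1 = ["Low", "Low", "Low", "Serious"]
reviewer_2 = ["Low", "Low", "Serious", "Serious"]
print(gwet_ac1(reviewer_1, reviewer_2, levels))
```

In this sketch the chance-agreement term is (1/(q-1)) Σ π_k(1-π_k), where π_k averages the two raters' marginal proportions for category k. This term cannot exceed 0.5, so the denominator never reaches zero. For the concordance analysis, Kendall's tau is available in common statistics packages, for example scipy.stats.kendalltau in Python.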

Original language	English
Article number	12
Journal	Systematic Reviews
Volume	9
Issue number	1
DOIs	https://doi.org/10.1186/s13643-020-1271-6
Publication status	Published - 13 Jan 2020

Keywords

Concurrent validity
Cross-sectional study
Inter-consensus reliability
Inter-rater reliability
Non-randomized studies
ROBINS-I

ASJC Scopus subject areas

Medicine (miscellaneous)

Access to Document

10.1186/s13643-020-1271-6

Cite this

@article{a77fab9ea88a46d7807647fe4554ffd1,

title = "Inter-rater reliability and concurrent validity of ROBINS-I: Protocol for a cross-sectional study",

abstract = "Background: The Cochrane Bias Methods Group recently developed the {"}Risk of Bias (ROB) in Non-randomized Studies of Interventions{"} (ROBINS-I) tool to assess ROB for non-randomized studies of interventions (NRSI). It is important to establish consistency in its application and interpretation across review teams. In addition, it is important to understand whether specialized training and guidance will improve the reliability of the results of the assessments. Therefore, the objective of this cross-sectional study is to establish the inter-rater reliability (IRR), inter-consensus reliability (ICR), and concurrent validity of ROBINS-I. Furthermore, as this is a relatively new tool, it is important to understand the barriers to using it, in particular the evaluator burden (e.g., the time needed to conduct assessments and reach consensus). Methods: Reviewers from four participating centers will appraise the ROB of a sample of NRSI publications using the ROBINS-I tool in two stages. For IRR and ICR, two pairs of reviewers will assess the ROB for each NRSI publication. In the first stage, reviewers will assess the ROB without any formal guidance. In the second stage, reviewers will be provided with customized training and guidance. At each stage, each pair of reviewers will resolve conflicts and arrive at a consensus. To calculate the IRR and ICR, we will use Gwet's AC1 statistic. For concurrent validity, reviewers will appraise a sample of NRSI publications using both the Newcastle-Ottawa Scale (NOS) and ROBINS-I. We will analyze the concordance between the two tools for similar domains and for the overall judgments using Kendall's tau coefficient. To measure the evaluator burden, we will assess the time taken to apply ROBINS-I (without and with guidance) and the NOS. To assess the impact of customized training and guidance on the evaluator burden, we will use generalized linear models. We will use Microsoft Excel and SAS 9.4 to manage and analyze study data, respectively. 
Discussion: The quality of evidence from systematic reviews that include NRS depends partly on the study-level ROB assessments. The findings of this study will contribute to an improved understanding of the ROBINS-I tool and how best to use it.",

keywords = "Concurrent validity, Cross-sectional study, Inter-consensus reliability, Inter-rater reliability, Non-randomized studies, ROBINS-I",

author = "Jeyaraman, {Maya M.} and Rasheda Rabbani and Nameer Al-Yousif and Robson, {Reid C.} and Leslie Copstein and Jun Xia and Michelle Pollock and Samer Mansour and Ansari, {Mohammed T.} and Tricco, {Andrea C.} and Abou-Setta, {Ahmed M.}",

note = "Publisher Copyright: {\textcopyright} 2020 The Author(s).",

year = "2020",

month = jan,

day = "13",

doi = "10.1186/s13643-020-1271-6",

language = "English",

volume = "9",

journal = "Systematic Reviews",

issn = "2046-4053",

publisher = "BioMed Central Ltd.",

number = "1",

}

TY - JOUR

T1 - Inter-rater reliability and concurrent validity of ROBINS-I

T2 - Protocol for a cross-sectional study

AU - Jeyaraman, Maya M.

AU - Rabbani, Rasheda

AU - Al-Yousif, Nameer

AU - Robson, Reid C.

AU - Copstein, Leslie

AU - Xia, Jun

AU - Pollock, Michelle

AU - Mansour, Samer

AU - Ansari, Mohammed T.

AU - Tricco, Andrea C.

AU - Abou-Setta, Ahmed M.

PY - 2020/1/13

Y1 - 2020/1/13

AB - Background: The Cochrane Bias Methods Group recently developed the "Risk of Bias (ROB) in Non-randomized Studies of Interventions" (ROBINS-I) tool to assess ROB for non-randomized studies of interventions (NRSI). It is important to establish consistency in its application and interpretation across review teams. In addition, it is important to understand whether specialized training and guidance will improve the reliability of the results of the assessments. Therefore, the objective of this cross-sectional study is to establish the inter-rater reliability (IRR), inter-consensus reliability (ICR), and concurrent validity of ROBINS-I. Furthermore, as this is a relatively new tool, it is important to understand the barriers to using it, in particular the evaluator burden (e.g., the time needed to conduct assessments and reach consensus). Methods: Reviewers from four participating centers will appraise the ROB of a sample of NRSI publications using the ROBINS-I tool in two stages. For IRR and ICR, two pairs of reviewers will assess the ROB for each NRSI publication. In the first stage, reviewers will assess the ROB without any formal guidance. In the second stage, reviewers will be provided with customized training and guidance. At each stage, each pair of reviewers will resolve conflicts and arrive at a consensus. To calculate the IRR and ICR, we will use Gwet's AC1 statistic. For concurrent validity, reviewers will appraise a sample of NRSI publications using both the Newcastle-Ottawa Scale (NOS) and ROBINS-I. We will analyze the concordance between the two tools for similar domains and for the overall judgments using Kendall's tau coefficient. To measure the evaluator burden, we will assess the time taken to apply ROBINS-I (without and with guidance) and the NOS. To assess the impact of customized training and guidance on the evaluator burden, we will use generalized linear models. We will use Microsoft Excel and SAS 9.4 to manage and analyze study data, respectively. 
Discussion: The quality of evidence from systematic reviews that include NRS depends partly on the study-level ROB assessments. The findings of this study will contribute to an improved understanding of the ROBINS-I tool and how best to use it.

KW - Concurrent validity

KW - Cross-sectional study

KW - Inter-consensus reliability

KW - Inter-rater reliability

KW - Non-randomized studies

KW - ROBINS-I

UR - http://www.scopus.com/inward/record.url?scp=85077785697&partnerID=8YFLogxK

U2 - 10.1186/s13643-020-1271-6

DO - 10.1186/s13643-020-1271-6

M3 - Article

C2 - 31931871

AN - SCOPUS:85077785697

SN - 2046-4053

VL - 9

JO - Systematic Reviews

JF - Systematic Reviews

IS - 1

M1 - 12

ER -
