Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Evaluating interobserver reliability of interval data.

B L Hopkins1, J A Hermann

  • 1University of Kansas.

Journal of Applied Behavior Analysis
|April 1, 1977
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

ABA accreditation of graduate programs of study.

The Behavior analyst·2012
Same author

A critique of the usefulness of inferential statistics in applied behavior analysis.

The Behavior analyst·2012
Same author

Evaluation of the scintillation factor for laser hazard analysis.

Applied optics·2010
Same author

Dispersive two-photon optical multistability.

Optics letters·2009
Same author

Stark effect in dispersive optical bistability.

Optics letters·2009
Same author

DNA sequence and analysis of human chromosome 9.

Nature·2004
Same journal

Latency and persistence of renewal in an intensive outpatient clinic.

Journal of applied behavior analysis·2026
Same journal

The effect of varied versus constant high-probability instructional sequences on cooperation.

Journal of applied behavior analysis·2026
Same journal

Relations between heart rate and precursors: A replication and extension of prior research.

Journal of applied behavior analysis·2026
Same journal

Integrating five linear trend techniques into performance-criteria-based effect size measurements: Impressions and recommendations.

Journal of applied behavior analysis·2026
Same journal

Functional analysis and treatment of higher level restricted repetitive behavior displayed by individuals with autism.

Journal of applied behavior analysis·2026
Same journal

Contingency drives children's vocal behavior.

Journal of applied behavior analysis·2026
See all related articles

This study reviews interobserver reliability for interval data, explaining how to compare it against chance agreement. It rejects statistical significance tests due to sample size dependency, offering a more robust approach to reliability assessment.

Area of Science:

  • Psychometrics
  • Behavioral Science

Background:

  • Previous recommendations for interobserver reliability indices (occurrence, nonoccurrence, overall) for interval data are examined.
  • The importance of comparing obtained reliability to chance agreement is highlighted.

Purpose of the Study:

  • To provide a rationale for comparing obtained reliability to chance agreement.
  • To present methods for determining chance agreement for interval data reliability indices.
  • To discuss the interpretability and limitations of statistical significance testing for reliability.

Main Methods:

  • Review of existing recommendations for interobserver reliability.
  • Explanation of a rationale for chance agreement comparison.
  • Presentation of formulae and graphic functions for calculating chance agreement.

Related Experiment Videos

  • Discussion of statistical procedures for assessing reliability superiority over chance.
  • Main Results:

    • All three indices (occurrence, nonoccurrence, overall) are interpretable across all possible obtained values.
    • Chance agreement levels vary directly with the percentage of intervals with recorded responses.
    • Statistical significance testing for reliability is deemed inappropriate due to sample size influence and lack of generalizable rules.

    Conclusions:

    • The study provides a framework for evaluating interobserver reliability against chance.
    • It emphasizes the limitations of traditional statistical significance testing in this context.
    • The findings support a more nuanced interpretation of reliability indices based on chance agreement.