Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Response Surface Methodology01:16

Response Surface Methodology

195
Response Surface Methodology (RSM) is a collection of statistical and mathematical techniques used to develop, improve, and optimize processes. It is particularly valuable when many input variables or factors potentially influence a response variable.
The process of RSM involves several key steps:
195

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Reconceptualizing Scoring Reliability Through Linguistic Similarity.

Educational and psychological measurement·2025
Same author

Disentangling individual differences in cognitive response mechanisms for rating scale items: A flexible-mixture multidimensional IRTree approach.

Behavior research methods·2025
Same author

A Robust Method for Detecting Item Misfit in Large-Scale Assessments.

Educational and psychological measurement·2023
Same author

Erratum to: A Response-Time-Based Latent Response Mixture Model for Identifying and Modeling Careless and Insufficient Effort Responding in Survey Data.

Psychometrika·2022
Same author

A Response-Time-Based Latent Response Mixture Model for Identifying and Modeling Careless and Insufficient Effort Responding in Survey Data.

Psychometrika·2021
Same author

Commentary: Matching IRT Models to PRO Constructs- Modeling Alternatives, and Some Thoughts on What Makes a Model Different.

Psychometrika·2021
Same journal

A Simple Approach for Differential Test Functioning Based on Sum Scores.

Educational and psychological measurement·2026
Same journal

Evaluating Factor Retention in Large Factor Analysis Models: A Simulation Study Comparing 15 Methods.

Educational and psychological measurement·2026
Same journal

Agreement and Alignment in Binary Rating Tasks: Strategic Convergence as an Equilibrium Outcome.

Educational and psychological measurement·2026
Same journal

Interactions Between Termination Criteria and Ability Estimators in Computerized Adaptive Testing.

Educational and psychological measurement·2026
Same journal

Identification and Diagnosis of Misreporting in Surveys.

Educational and psychological measurement·2026
Same journal

The Aggregated Latent Profile Index: Measuring Person Profile Differentiation Within a Bootstrap-Validated Latent Profile Space.

Educational and psychological measurement·2026
See all related articles

Related Experiment Video

Updated: Jul 30, 2025

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques
08:05

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Published on: June 30, 2020

7.6K

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks.

Matthias von Davier1, Lillian Tyack1, Lale Khorramdel1

  • 1Boston College, Chestnut Hill, MA, USA.

Educational and Psychological Measurement
|May 15, 2023
PubMed
Summary
This summary is machine-generated.

Convolutional neural networks (CNNs) accurately score student drawings in large-scale assessments, outperforming traditional methods. This automated scoring approach offers a cost-effective and valid alternative to human raters.

Keywords:
TIMSSautomated scoringconvolutional neural networkfeed-forward neural networkimage responsesinternational large-scale assessment

More Related Videos

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

3.9K
Author Spotlight: IntelliSleepScorer — A High-Accuracy, Accessible GUI Software for Automated Sleep Stage Scoring in Mice and its Application in Psychiatric Research
04:54

Author Spotlight: IntelliSleepScorer — A High-Accuracy, Accessible GUI Software for Automated Sleep Stage Scoring in Mice and its Application in Psychiatric Research

Published on: November 8, 2024

580

Related Experiment Videos

Last Updated: Jul 30, 2025

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques
08:05

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Published on: June 30, 2020

7.6K
Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

3.9K
Author Spotlight: IntelliSleepScorer — A High-Accuracy, Accessible GUI Software for Automated Sleep Stage Scoring in Mice and its Application in Psychiatric Research
04:54

Author Spotlight: IntelliSleepScorer — A High-Accuracy, Accessible GUI Software for Automated Sleep Stage Scoring in Mice and its Application in Psychiatric Research

Published on: November 8, 2024

580

Area of Science:

  • Artificial Intelligence
  • Educational Measurement
  • Computer Science

Background:

  • Automated scoring of graphical student responses is not yet established for large-scale assessments.
  • Traditional methods for scoring complex constructed-response items can be labor-intensive and prone to variability.

Purpose of the Study:

  • To propose and compare artificial neural network approaches for automated scoring of image-based responses.
  • To evaluate the accuracy of convolutional neural networks (CNNs) against feed-forward neural networks for classifying student drawings.

Main Methods:

  • Utilized a TIMSS 2019 item with free drawing responses.
  • Compared classification accuracy between convolutional neural networks (CNNs) and feed-forward neural networks.
  • Implemented an item response theory-based method for selecting training data.

Main Results:

  • CNNs demonstrated superior performance over feed-forward networks in both loss and accuracy.
  • CNN models achieved up to 97.53% accuracy in classifying image responses, comparable to human raters.
  • The most accurate CNN models correctly identified errors made by human raters.

Conclusions:

  • CNN-based automated scoring is a highly accurate method for evaluating graphical student responses.
  • This technology can potentially reduce costs and workload associated with human raters in international large-scale assessments.
  • Automated scoring using CNNs can enhance the validity and comparability of scoring complex constructed-response items.