Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks.

Matthias von Davier¹, Lillian Tyack¹, Lale Khorramdel¹

¹Boston College, Chestnut Hill, MA, USA.

Educational and Psychological Measurement

May 15, 2023

Summary

This summary is machine-generated.

Convolutional neural networks (CNNs) accurately score student drawings from a large-scale assessment, outperforming feed-forward neural networks and reaching accuracy comparable to human raters. This automated scoring approach offers a potentially cost-effective alternative to human raters.

Keywords:

TIMSS, automated scoring, convolutional neural network, feed-forward neural network, image responses, international large-scale assessment

Area of Science:

Artificial Intelligence
Educational Measurement
Computer Science

Background:

Automated scoring of graphical student responses is not yet established for large-scale assessments.
Traditional methods for scoring complex constructed-response items can be labor-intensive and prone to variability.

Purpose of the Study:

To propose and compare artificial neural network approaches for automated scoring of image-based responses.
To evaluate the accuracy of convolutional neural networks (CNNs) against feed-forward neural networks for classifying student drawings.

Main Methods:

Used a TIMSS 2019 item with free-drawing responses.
Compared classification accuracy between convolutional neural networks (CNNs) and feed-forward neural networks.
Implemented an item response theory-based method for selecting training data.
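
One structural difference between the two network types compared above can be sketched with simple arithmetic: a fully connected layer needs one weight per input pixel per unit, whereas a convolutional layer reuses a small filter across the whole image. The image size (64 x 64 grayscale), the 128 dense units, and the 32 filters of size 3 x 3 below are illustrative assumptions only; the summary does not report the architectures used in the study.

```python
# Hypothetical sizes chosen for illustration; the summary does not report
# the image resolution or layer widths used in the study.

def dense_params(n_inputs: int, n_units: int) -> int:
    """Weights plus biases of one fully connected (feed-forward) layer."""
    return n_inputs * n_units + n_units


def conv2d_params(kernel_h: int, kernel_w: int, in_channels: int, n_filters: int) -> int:
    """Weights plus biases of one 2D convolutional layer.

    The same filters are applied at every image position, so the count
    does not depend on the image height or width.
    """
    return kernel_h * kernel_w * in_channels * n_filters + n_filters


height, width, channels = 64, 64, 1  # assumed grayscale drawing

ffn_first_layer = dense_params(height * width * channels, 128)
cnn_first_layer = conv2d_params(3, 3, channels, 32)

print(ffn_first_layer)  # 524416
print(cnn_first_layer)  # 320
```

Under these assumed sizes, the first dense layer has far more trainable parameters than the first convolutional layer. Parameter count alone does not determine accuracy, so this is context for the comparison rather than an explanation of the reported results.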

Main Results:

CNNs demonstrated superior performance over feed-forward networks in both loss and accuracy.
CNN models achieved up to 97.53% accuracy in classifying image responses, comparable to human raters.
The most accurate CNN models correctly identified errors made by human raters.
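
The results above rest on comparing model scores with human scores. A minimal sketch of that comparison follows, assuming a binary score per response and a model confidence value. The `ScoredResponse` type, the toy data, and the 0.95 review threshold are all hypothetical and are not taken from the study.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ScoredResponse:
    response_id: str
    human_score: int         # score assigned by a human rater (0 or 1 here)
    model_score: int         # class predicted by the network
    model_confidence: float  # predicted probability of model_score


def accuracy(responses: List[ScoredResponse]) -> float:
    """Share of responses where the model score matches the human score."""
    agree = sum(r.human_score == r.model_score for r in responses)
    return agree / len(responses)


def flag_for_review(responses: List[ScoredResponse], min_confidence: float = 0.95) -> List[str]:
    """IDs where a confident model prediction disagrees with the human score.

    These are candidates for a second human look; the disagreement may be a
    model error or a rater error. The threshold is an assumed value.
    """
    return [
        r.response_id
        for r in responses
        if r.human_score != r.model_score and r.model_confidence >= min_confidence
    ]


# Made-up toy data, for illustration only.
sample = [
    ScoredResponse("r1", 1, 1, 0.99),
    ScoredResponse("r2", 0, 0, 0.97),
    ScoredResponse("r3", 1, 0, 0.98),
    ScoredResponse("r4", 0, 1, 0.60),
]

print(accuracy(sample))         # 0.5
print(flag_for_review(sample))  # ['r3']
```

Accuracy measured this way treats the human score as the reference, so a response the rater scored wrongly counts against the model. Reviewing confident disagreements is one way such rater errors could surface.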

Conclusions:

CNN-based automated scoring is a highly accurate method for evaluating graphical student responses.
This technology can potentially reduce costs and workload associated with human raters in international large-scale assessments.
Automated scoring using CNNs can enhance the validity and comparability of scoring complex constructed-response items.