Crash outcomes of yellow school buses: Random parameter and correlated random parameter logit models with heterogeneity in means

  • 0Texas State University, 601 University Drive, San Marcos, TX 77866, United States.
Accident; analysis and prevention +

|

Abstract

Despite rigorous safety standards, school buses in the United States still experience around 26,000 crashes annually, resulting in approximately 10 fatalities, with a fatality rate persistently static despite advances in vehicle safety. Such crashes often have significant implications, particularly for the young passengers involved. This study utilized a novel approach by analyzing Texas Crash Records Information System (CRIS) data from 2017 to 2022 through a Random Parameter Logit Model with Heterogeneity in Means (RPLHM) and Correlated RPLHM (CRPLHM). This method allows for a detailed examination of unobserved heterogeneity and specific variances within the data, enhancing the understanding of the complex dynamics influencing crash severity. The analysis revealed that crashes on state highways typically presented a lower likelihood of fatal and severe crash outcomes. Additionally, demographic attributes such as age significantly impacted crash outcomes, with middle-aged drivers (25-54) often experiencing less severe injuries. Additionally, driver inattention was associated with an increased occurrence of no-injury crash outcomes. While daylight is associated with less moderate and possible injury crashes, clear weather was associated with higher no-injury crashes. The transferability tests revealed temporal instability in yellow school bus crash severity patterns across 2017-2022. Key variables such as intersections, daylight, and driver characteristics demonstrated varying effects over time. While morning and afternoon crashes increasingly reduced the likelihood of fatal and severe injuries in later years, factors like divided roadways and clear weather saw greater variability in their impact on no-injury and moderate injury outcomes. These findings highlight the importance of year-specific modeling and support data-driven policymaking to improve school bus safety.

Related Concept Videos

Hypothesis Test for Test of Independence 01:16

3.5K

The test of independence is a chi-square-based test used to determine whether two variables or factors are independent or dependent. This hypothesis test is used to examine the independence of the variables. One can construct two qualitative survey questions or experiments based on the variables in a contingency table. The goal is to see if the two variables are unrelated (independent) or related (dependent). The null and alternative hypotheses for this test are:
H0: The two variables (factors)...

Mechanistic Models: Compartment Models in Individual and Population Analysis 01:23

33

Mechanistic models are utilized in individual analysis using single-source data, but imperfections arise due to data collection errors, preventing perfect prediction of observed data. The mathematical equation involves known values (Xi), observed concentrations (Ci), measurement errors (εi), model parameters (ϕj), and the related function (ƒi) for i number of values. Different least-squares metrics quantify differences between predicted and observed values. The ordinary least...

Probability Histograms 01:17

11.1K

A probability histogram is a visual representation of a probability distribution. Similar a typical histogram, the probability histogram consists of contiguous (adjoining) boxes. It has both a horizontal axis and a vertical axis. The horizontal axis is labeled with what the data represents. The vertical axis is labeled with probability. Each rectangular bar in the histogram is 1 unit wide, which suggests that the area under each bar equals the probability, P(x), where x is 1, 2, 3, and so on.

Residuals and Least-Squares Property 01:11

7.3K

The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...

Determination of Expected Frequency 01:08

2.2K

Suppose one wants to test independence between the two variables of a contingency table. The values in the table constitute the observed frequencies of the dataset. But how does one determine the expected frequency of the dataset? One of the important assumptions is that the two variables are independent, which means the variables do not influence each other. For independent variables, the statistical probability of any event involving both variables is calculated by multiplying the individual...

Introduction to Test of Independence 01:21

2.2K

In statistics, the term independence means that one can directly obtain the probability of any event involving both variables by multiplying their individual probabilities. Tests of independence are chi-square tests involving the use of a contingency table of observed (data) values.
The test statistic for a test of independence is similar to that of a goodness-of-fit test:

where:

O = observed values
E = expected values (which should be at least 5)

A test of independence determines whether...