Predictive methods for CO2 emissions and energy use in vehicles at intersections
View abstract on PubMed
Summary
This summary is machine-generated.This study developed advanced models to accurately predict CO2 emissions and vehicle energy use at busy urban intersections. Extreme Gradient Boosting (XGBoost) showed the highest accuracy, improving traffic management and infrastructure planning.
Area Of Science
- Environmental Science
- Transportation Engineering
- Data Science
Background
- Current emission models struggle with complex urban intersection dynamics, particularly stop-and-go traffic.
- Accurate prediction of vehicle emissions and energy consumption is crucial for urban sustainability.
- High-traffic intersections are significant contributors to urban air pollution and energy waste.
Purpose Of The Study
- To develop and validate novel predictive models for CO2 emissions and vehicle energy consumption at urban intersections.
- To enhance the accuracy of traffic emission modeling in real-world urban driving conditions.
- To provide tools for improved road infrastructure planning and traffic management strategies.
Main Methods
- Utilized machine learning algorithms including linear regression, LASSO, Ridge, Random Forest, and Extreme Gradient Boosting (XGBoost).
- Applied Density-Based Spatial Clustering of Applications with Noise (DBSCAN) for intersection-specific data grouping.
- Collected real-world driving data using portable emissions measurement systems and the Hioki 3390 power analyzer.
Main Results
- Extreme Gradient Boosting (XGBoost) demonstrated superior accuracy in predicting emissions and energy consumption compared to other models.
- The developed models were successfully validated and applied in traffic simulation software (Vissim).
- DBSCAN effectively clustered intersection-specific data, enabling more targeted and accurate analysis.
Conclusions
- The advanced predictive models offer a significant improvement over existing methods for urban intersection traffic analysis.
- These findings support the optimization of energy use and reduction of CO2 emissions in urban transportation networks.
- The study provides a foundation for data-driven decision-making in urban planning and traffic management to mitigate environmental impact.
Related Concept Videos
The correlation coefficient, r, developed by Karl Pearson in the early 1900s, is numerical and provides a measure of strength and direction of the linear association between the independent variable x and the dependent variable y.
If you suspect a linear relationship between x and y, then r can measure how strong the linear relationship is.
What the VALUE of r tells us:
The value of r is always between –1 and +1: –1 ≤ r ≤ 1.
The size of the correlation r indicates the...
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
In statistics, correlation describes the degree of association between two variables. In the subfield of linear regression, correlation is mathematically expressed by the correlation coefficient, which describes the strength and direction of the relationship between two variables. The coefficient is symbolically represented by 'r' and ranges from -1 to +1. A positive value indicates a positive correlation where the two variables move in the same direction. A negative value suggests a...
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y.
The correlation coefficient, r, developed by Karl Pearson in the early 1900s, is numerical and provides a measure of strength and direction of the linear association between the independent variable, x, and the dependent variable, y. Hence, it is also known as the Pearson product-moment correlation coefficient. It can be calculated using the following equation:
where n = the number of data points.
The 95% critical values of the sample correlation coefficient table can be used to give you a...
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
In the equation, is the dependent...

