Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Survival Tree01:19

Survival Tree

334
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
334
Multiple Regression01:25

Multiple Regression

3.7K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.7K
Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

8.8K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
8.8K
Selected Data About Geographic Locations01:25

Selected Data About Geographic Locations

214
Geographic Information Systems (GIS) rely on two core types of data: spatial data and attribute data.Spatial DataSpatial data defines the physical location of features within a coordinate system, typically expressed in terms of latitude and longitude. It provides precise positioning for elements like roads, rivers, or buildings.Attribute DataAttribute data complements spatial data by adding descriptive information about these features. For example, a road's spatial data includes its start and...
214
Regression Analysis01:11

Regression Analysis

7.6K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
7.6K
Wald-Wolfowitz Runs Test I01:17

Wald-Wolfowitz Runs Test I

881
The Wald-Wolfowitz test, also known as the runs test, is a nonparametric statistical test used to assess the randomness of a sequence of two different types of elements (e.g., positive/negative values, successes/failures). It examines whether the order of the elements in a sequence is random or if there is a pattern or trend present. This nonparametric test applies to any ordered data despite the population and sample data distribution, even if a higher sample size is available.
The test works...
881

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Missing data in ecology: Syntheses, clarifications, and considerations.

Ecological monographs·2026
Same author

Correction: SSNdesign-An R package for pseudo-Bayesian optimal and adaptive sampling designs on stream networks.

PloS one·2026
Same author

Non-Floodplain Wetlands Are Carbon-Storage Powerhouses Across the United States.

Earth's future·2025
Same author

An integrated data model to estimate abundance from counts with temporal dependence and imperfect detection.

Ecology·2025
Same author

Modeling lake conductivity in the contiguous United States using spatial indexing for big spatial data.

Spatial statistics·2025
Same author

Marginal inference for hierarchical generalized linear mixed models with patterned covariance matrices using the Laplace approximation.

Environmetrics·2024
Same journal

Invaders taking over-Mollusc faunal change in volcanic barrier lakes of the Albertine Rift biodiversity hotspot.

PloS one·2026
Same journal

AI-driven molecular diversification and ligand-based optimization of macitentan derivatives targeting VEGFR1 and endothelin signaling pathways.

PloS one·2026
Same journal

Performance patterns and records in the world aquatics masters championships: Where do the most frequently represented nations among the top-ten masters swimmers come from?

PloS one·2026
Same journal

Modeling diurnal Temperature-Rainfall relationships under multicollinearity using PLS-SEM: A case study of Ghana.

PloS one·2026
Same journal

Organizational culture, social capital, and emergency capacity in primary healthcare institutions: A cross-sectional structural equation modeling study comparing ordinary and older communities.

PloS one·2026
Same journal

Impact of kidney function on the metabolome in the general population.

PloS one·2026
See all related articles

Related Experiment Video

Updated: Dec 25, 2025

Watershed Planning within a Quantitative Scenario Analysis Framework
12:44

Watershed Planning within a Quantitative Scenario Analysis Framework

Published on: July 24, 2016

8.4K

Comparing spatial regression to random forests for large environmental data sets.

Eric W Fox1, Jay M Ver Hoef2, Anthony R Olsen3

  • 1Department of Statistics and Biostatistics, California State University East Bay, Hayward, California, United States of America.

Plos One
|March 24, 2020
PubMed
Summary
This summary is machine-generated.

Spatial regression and random forests offer comparable predictive performance for environmental data. Spatial regression provides narrower prediction intervals, making it advantageous for large-scale ecological mapping and analysis.

More Related Videos

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.6K
Use of Principal Components for Scaling Up Topographic Models to Map Soil Redistribution and Soil Organic Carbon
09:44

Use of Principal Components for Scaling Up Topographic Models to Map Soil Redistribution and Soil Organic Carbon

Published on: October 16, 2018

10.6K

Related Experiment Videos

Last Updated: Dec 25, 2025

Watershed Planning within a Quantitative Scenario Analysis Framework
12:44

Watershed Planning within a Quantitative Scenario Analysis Framework

Published on: July 24, 2016

8.4K
Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.6K
Use of Principal Components for Scaling Up Topographic Models to Map Soil Redistribution and Soil Organic Carbon
09:44

Use of Principal Components for Scaling Up Topographic Models to Map Soil Redistribution and Soil Organic Carbon

Published on: October 16, 2018

10.6K

Area of Science:

  • Environmental Science
  • Ecology
  • Statistical Modeling

Background:

  • Large environmental datasets present challenges due to numerous records and covariates.
  • Random forests excel with many covariates and nonlinear relationships.
  • Spatial regression performs well with many records and spatial autocorrelation.

Purpose of the Study:

  • Compare random forests and spatial regression for predicting the macroinvertebrate multimetric index (MMI).
  • Evaluate model performance for mapping MMI across the conterminous United States.
  • Assess prediction intervals and variability between the two methods.

Main Methods:

  • Utilized a dataset of MMI at 1859 stream sites with over 200 landscape covariates.
  • Developed a novel transformation procedure for spatial regression, including Box-Cox transformations and handling zero-inflated data.
  • Employed cross-validation and a simulation study to compare model performance.

Main Results:

  • Spatial regression with transformations achieved cross-validation performance comparable to random forests.
  • Both methods demonstrated prediction interval coverage close to nominal.
  • Spatial regression yielded narrower and less variable prediction intervals compared to quantile regression forests.

Conclusions:

  • Spatial regression, enhanced with transformations, is a viable alternative to random forests for large environmental datasets.
  • The choice between methods depends on specific data characteristics and the need for narrow prediction intervals.
  • The study provides insights for accurate ecological mapping and uncertainty quantification.