The NOAA IR serves as an archival repository of NOAA-published products including scientific findings, journal articles, guidelines, recommendations, or other information authored or co-authored by NOAA or funded partners.
As a repository, the NOAA IR retains documents in their original published format to ensure public access to scientific information.
Random forest regression models in ecology: Accounting for messy biological data and producing predictions with uncertainty
2024
Source: Fisheries Research, 280, 107161
Details:
Journal Title: Fisheries Research
Personal Author:
NOAA Program & Office:
Description: Machine learning methods such as random forest regression models are useful tools in ecology when applied correctly, but features inherent to ecological data sets can lead to over-fitting or uncertain predictions. Here, a set of methods is outlined for making random forest predictions that account for temporal autocorrelation and for sparse, short, or missing data. Methods are also provided for estimating the prediction uncertainty that arises from the combination of inherent randomness in the random forest algorithm and sparse input data. This suite of methods was used to generate pre-season predictions of total catches, with uncertainty, for California market squid (Doryteuthis opalescens), which supports the most valuable fishery in California (by ex-vessel value). The methodology presented in this analysis is not only robust, incorporating key cross-validation and hyperparameter tuning techniques from across disciplines, but also flexible, making it applicable to various ecological and fisheries datasets beyond market squid.
DOI:
ISSN: 0165-7836
Format:
Publisher:
Document Type:
License:
Rights Information: CC0 Public Domain
Compliance: Submitted
Main Document Checksum:
Download URL:
File Type:
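
The Description mentions two ideas that a short sketch may make more concrete: cross-validation that respects temporal ordering, and prediction uncertainty that stems from the algorithm's own randomness. The Python below is only an illustration under stated assumptions and is not the authors' code. The function names `expanding_window_splits` and `seed_interval`, the expanding-window scheme, the 5%/95% bounds, and the toy stand-in model are all hypothetical choices made for this example; the record does not say which specific procedures the paper uses.

```python
import random
import statistics


def expanding_window_splits(n_obs, min_train, horizon=1):
    """Yield (train_idx, test_idx) pairs for time-ordered data.

    Every test index comes after all training indices, so a model is
    never evaluated on observations that precede its training data.
    """
    start = min_train
    while start + horizon <= n_obs:
        yield list(range(start)), list(range(start, start + horizon))
        start += horizon


def seed_interval(predict_with_seed, seeds, lower=0.05, upper=0.95):
    """Summarize predictions from repeated fits that differ only by seed.

    Returns (lower quantile, median, upper quantile) of the predictions.
    """
    preds = sorted(predict_with_seed(s) for s in seeds)

    def quantile(q):
        # Linear interpolation between neighbouring sorted predictions.
        pos = q * (len(preds) - 1)
        lo = int(pos)
        hi = min(lo + 1, len(preds) - 1)
        return preds[lo] + (pos - lo) * (preds[hi] - preds[lo])

    return quantile(lower), statistics.median(preds), quantile(upper)


def toy_predict(seed):
    # Hypothetical stand-in for "fit a random forest with this seed and
    # predict next season's catch"; it is NOT a random forest.
    return random.Random(seed).gauss(100.0, 10.0)


if __name__ == "__main__":
    for train, test in expanding_window_splits(n_obs=6, min_train=3):
        print("train:", train, "test:", test)
    print(seed_interval(toy_predict, seeds=range(200)))
```

In practice `toy_predict` would be replaced by a function that refits the actual model with the given seed. Libraries such as scikit-learn offer a comparable time-ordered splitter (`TimeSeriesSplit`), which may be preferable to a hand-rolled generator.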