Return to search results
💡 Advanced Search Tip
Search by organization or tag to find related datasets
Random Forest Regression Model Archive for Estimating Low Streamflow Statistics at Ungaged Locations in New York, excluding Long Island
This USGS data release contains R models (R Core Team, 2020) for estimating 7Q10 and 30Q10 [lowest annual 7-day and 30-day average streamflow that occurs (on average) once every 10 years] statistics at ungaged locations in or adjacent to New York State excluding Long Island. Details on model development are available in Stagnitta and others (2025).
basin_characteristics.csv - 224 basin characteristics represented by variables that describe basin geometry, climatic conditions, land cover, soils and surficial geology, and other characteristics for 213 unaltered gaged locations determined from (Stagnitta and others, 2024). Basin characteristics were used as predictor variables within the models to estimate low-streamflow statistics for ungaged locations.
R scripts used to flag unaltered streamgage datasets for redundant basins, and to train, tune, test, and bias-correct random forest regression models for estimating 7Q10 and 30Q10 are included in model.zip. Users are encouraged to read the readme file in this zipped file for details on the scripts and associated files used to generate the statistics.
Complete Metadata
| @id | http://datainventory.doi.gov/id/dataset/caf3dc632d1878249fc32919252b3da8 |
|---|---|
| bureauCode |
[ "010:12" ] |
| identifier | USGS:65f1e562d34e1403329845e0 |
| spatial | -79.9202,40.5222,-72.6746,45.0037 |
| theme |
[ "geospatial" ] |