Add codes to extract SoilGrids soil texture data and derive ensemble … #3406

Qianyuxuan · 2025-01-02T19:09:49Z

Add codes to extract SoilGrids soil texture data and derive ensemble soil parameter files

Description

Add a script "soilgrids_texture_extraction.R" that can extract and save three types of soil texture data in parallel for a single or group of lat/long locations based on user-defined site location from SoilGrids250m data.
Add a script "soil_params_ensemble.R" that can estimate the soil parameters based on soil texture data and write the ensemble parameter paths into settings.
Modify "soil_utils.R" and "extract_soil_nc.R" to fix the bugs when generating soil parameter files.
Modify "write.config.SIPNET.R" to include codes that can read and write soil water holding capacity into parameter settings.
Fix the bug that the met ensemble members for t=1 and t>1 in SDA codes differ.

Motivation and Context

Provide ensemble soil parameter files based on SoilGrids soil texture data for SDA.

Review Time Estimate

Immediately
Within one week
When possible

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)

Checklist:

My change requires a change to the documentation.
My name is in the list of CITATION.cff
I agree that PEcAn Project may distribute my contribution under any or all of
- the same license as the existing code,
- and/or the BSD 3-clause license.
I have updated the CHANGELOG.md.
I have updated the documentation accordingly.
I have read the CONTRIBUTING document.
I have added tests to cover my changes.
All new and existing tests passed.

…soil parameter files

mdietze · 2025-01-02T21:11:05Z

models/sipnet/R/write.configs.SIPNET.R

+  #working on reading soil file (only working for soilWHC)
+  if (length(settings$run$inputs$soilinitcond$path) > 0) {
+    soil_ICs_num <- length(settings$run$inputs$soilinitcond$path)
+    soil_ICs_path <- settings$run$inputs$soilinitcond$path[[sample(1:soil_ICs_num, 1)]]


I'm guessing the logic of this bit is that if an ensemble input is passed in then you choose one at random? Personally, I'd prefer to throw an error at this point, as by this point in the workflow ensemble inputs should have already been converted to specific inputs for each run. If they're still ensemble at this point it's an error and we should stop. Proceeding on with a random choice means (a) we don't catch an error and (b) we don't know/record which ensemble member mapped to which run.

Yeah I followed the logic as choosing one from ensemble initial conditions here:https://github.com/PecanProject/pecan/blob/develop/models/sipnet/R/write.configs.SIPNET.R#L546. So you suggested we should just randomly choose and write one ensemble member in the script "soil_params_ensemble.R" and read that specific path in "write.configs.SIPNET.R"?

So your soil_params_ensemble should generate a bunch of ensembles and load them ALL into the overall settings object. What I'm saying is that write.configs.SIPNET should only end up seeing ONE of those ensembles because run.write.configs should do the joint sampling of all the different types of ensembles (met, soils, IC, etc), record which ensemble member was assigned which inputs, and then ensure that the settings object reaching write.configs.SIPNET is only for a specific ensemble member. If you're finding that write.configs.SIPNET is being handed all of the soil ensemble members then there's something wrong upstream in the pecan.xml (e.g. not including soils in the section) or in run.write.configs

@mdietze It seems that "run.write.configs" does not play a role in our current SDA workflow but I did find ways to do the sampling of soil ensemble members through the codes in "ensemble.R" by providing "soilinitcond" tag under "ensemble" in the XML file: https://github.com/PecanProject/pecan/blob/develop/modules/uncertainty/R/ensemble.R#L401-L406. However, it is only for a new fresh run and seems not applicable when it is a restart. The current restart components only include met inputs and parameters as https://github.com/PecanProject/pecan/blob/develop/modules/uncertainty/R/ensemble.R#L422C5-L424. What is your suggestion on that?

write.ensemble.configs is what's called by run.write.configs, so if that's what SDA is calling too we should be fine. But as noted earlier, it's important for the code to confirm that it's only receiving on ensemble member for each input, and to throw an error if it's not rather than resampling.

Restart is grabbing inputs and soils should be part of inputs (as should phenology, initial conditions, etc), not just met. You should verify this is working correctly, as it is is critical that ensemble sampling is preserved (not repeated) both iteratively within a SDA run and across SDA runs (e.g., when running a forecast today that starts from yesterday's forecast). Resampling things (params, inputs, IC, etc) that have already been samples is going to seriously mess up the covariance structures in our products

@mdietze Based on the current argument passing of restart in "sda.enkf_MultiSite.R": https://github.com/PecanProject/pecan/blob/develop/modules/assim.sequential/R/sda.enkf_MultiSite.R#L358-L368, it seems we didn't grab soil, phenology and IC other than met now for inputs (note only "met" is specified in "input.ens.gen"). There is also a comment: #TODO: incorporate Phyllis's restart work. #sample all inputs specified in the settings$ensemble not just met". So I guess it is what we should add here?

I added soil parameter files to inputs for the restart list. Phenology parameters are sampled within parameter sampling and initial conditions don't apply for t>1 situation, so I guess the current solution seems fine.

Phenology parameters are sampled within parameter sampling

Not sure what you mean here, phenology files are separate from the parameter posterior files.

models/sipnet/R/write.configs.SIPNET.R

modules/data.land/R/extract_soil_nc.R

modules/data.land/R/soil_params_ensemble.R

modules/data.land/R/soil_utils.R

modules/data.land/R/soil_params_ensemble.R

modules/data.land/R/soilgrids_texture_extract.R

modules/data.land/R/soil_params_ensemble.R

modules/data.land/R/soil_utils.R

…an into sda_soil_params_soilgrids

DongchenZ

So the general script is well established followed by a similar style as the previous SOC extraction function. The questions that I raised here are: 1) for the parallel design when over the site locations and data products, and 2) for the usage of downloaded local raster files (if they exist).

modules/data.land/R/soilgrids_texture_extract.R

DongchenZ · 2025-01-18T17:33:44Z

modules/data.land/R/soilgrids_texture_extract.R

+    name_tag <- expand.grid(depths, data_tag, stringsAsFactors = F) #find the combinations between data and depth tags.
+    L <- split(as.data.frame(name_tag), seq(nrow(as.data.frame(name_tag))))#convert tags into lists.
+
+    if ("try-error" %in% class(try(soilt_real <- L %>% furrr::future_map(function(l){


When the site number goes up, this parallel will be more likely to produce errors due to the limitation of the SoilGrids server. The extraction speed is similar with or without the parallel in my previous 6400 site runs. Thus, it's recommended to just delete the parallel extraction part from the script.

modules/data.land/R/soilgrids_texture_extract.R

mdietze · 2025-01-20T13:07:00Z

models/sipnet/R/write.configs.SIPNET.R

-      #LitterWHC
-      #param[which(param[, 1] == "litterWHC"), 2] <- unlist(soil_IC_list$vals["volume_fraction_of_water_in_soil_at_saturation"])[1]*100
+  #working on reading soil file
+  if (length(settings$run$inputs$soilinitcond$path) > 0) {


Still no clear how you're handling the case if >1 file is passed in

So the current design is when t=1, only one file path is sampled and passed into the setting based on codes in "write.ensemble.configs". When t>1, file paths of total ensemble size will be passed in but then "template.soilinit" will be assigned with the specific path in the inputs.

@Qianyuxuan I was asking about n=1 versus n>1, not t=1 vs t>1, and I'm still confused -- if you're passed in a whole vector of paths instead of a single one, you should throw an error not continue on, but I'm not seeing that check anywhere.

I followed the steps for the pass of met files: https://github.com/PecanProject/pecan/blob/develop/models/sipnet/R/write.configs.SIPNET.R#L21-L29. I did find that a whole vector of met paths (10 ERA5 ensemble met paths defined in the settings) was passed in when t>1. But I don't think it is an error as the specific path will be overwritten by the one listed in the restart met input later.

per previous conversation, I recommend adding a check to make sure that length(settings$run$inputs$soilinitcond$path) == 1 and throwing an error otherwise

mdietze · 2025-01-20T13:08:15Z

models/sipnet/R/write.configs.SIPNET.R

+    soil_IC_list <- PEcAn.data.land::pool_ic_netcdf2list(template.soilinit)
+    # Calculate the thickness of soil layers based on the assumption that the depth values are at bottoms and the first layer top is at 0
+    if ("depth" %in% names(soil_IC_list$dims)) {
+      thickness<-c(soil_IC_list$dims$depth[1],diff(soil_IC_list$dims$depth))


Does this line still work if the product only has one layer?

Also, should/could there be an "else" here? A bunch of code below continues to check for "depth" being defined, but doesn't ever use depth again -- instead you're just using this to ensure thickness is defined.

@Qianyuxuan sorry for not clarifying, the request isn't to have the code fail if thickness is missing, but to to only have depth/thickness within the if, to take all the other variable out of the if/else, and to use a default or user-specified soil depth if one isn't present in the file.

mdietze · 2025-01-20T13:11:41Z

models/sipnet/R/write.configs.SIPNET.R

+      soilWHC_total <- sum(unlist(soil_IC_list$vals["volume_fraction_of_water_in_soil_at_saturation"])*thickness)
+      param[which(param[, 1] == "soilWHC"), 2] <- soilWHC_total
+    #LitterWHC
+      param[which(param[, 1] == "litterWHC"), 2] <- soilWHC_total


This is incorrect. If there's a litter layer it would have a much much smaller WHC than the entire soil pool.

mdietze · 2025-01-20T13:20:18Z

models/sipnet/R/write.configs.SIPNET.R

      #litwaterDrainrate
-      param[which(param[, 1] == "litWaterDrainRate"), 2] <- unlist(soil_IC_list$vals["soil_hydraulic_conductivity_at_saturation"])[1]*100/(3600*24)
+      param[which(param[, 1] == "litWaterDrainRate"), 2] <- soilHC_total


First, I'm not sure why you're using the average hydraulic conductivity of the entire soil column to control the rate at which moisture leaves the litter and enters the first soil layer. Second, we should check that the way SIPNET uses the conductivity (as a proportional decay rate, see https://github.com/PecanProject/sipnet/blob/becec4d2d6d857fa25dc47f974c48621a0b9044b/sipnet.c#L1516) is compatible with the way the parameter is defined in the soils file (Darcy's law) or whether additional assumptions or unit conversions are needed

Fixed. Only the conductivity for first soil layer is used for litWaterDrainRate and the equation complies to Darcy's law.

mdietze · 2025-01-20T13:21:23Z

models/sipnet/R/write.configs.SIPNET.R

+    #LitterWHC
+      param[which(param[, 1] == "litterWHC"), 2] <- soilWHC_total
+    }
+    if ("depth" %in% names(soil_IC_list$dims) && "soil_hydraulic_conductivity_at_saturation" %in% names(soil_IC_list$vals)) {


per comment below about this being for surface litter, I'm not sure you need "depth" here

mdietze · 2025-01-20T13:23:46Z

models/sipnet/R/write.configs.SIPNET.R

+  #working on reading soil file (only working for soilWHC)
+  if (length(settings$run$inputs$soilinitcond$path) > 0) {
+    soil_ICs_num <- length(settings$run$inputs$soilinitcond$path)
+    soil_ICs_path <- settings$run$inputs$soilinitcond$path[[sample(1:soil_ICs_num, 1)]]


Phenology parameters are sampled within parameter sampling

Not sure what you mean here, phenology files are separate from the parameter posterior files.

mdietze · 2025-01-20T13:31:20Z

modules/assim.sequential/R/metSplit.R

@@ -32,17 +32,18 @@ metSplit <- function(conf.settings, inputs, settings, model, no_split = FALSE, o
      # Loading the model package - this is required bc of the furrr
      library(paste0("PEcAn.",model), character.>

            

-      inputs.split <- list()
+      # Initialize inputs.split with inputs to keep the sub-list of soil parameters "soilinitcond"


I'm not following the logic here. The metSplit code should only ever be touching the met. The comment here about soil parameters is confusing, as metSplit shouldn't be touching ANY of the other inputs, not just soils. I'm having a hard time telling if the code below is incorrect, or if it's fixing a long standing bug, or if the larger issue is that metSplit shouldn't ever have been passed the entirety of inputs to begin with.

I didn't edit "metSplit" this time, instead I added similar modified codes in "sda.enkf_MultiSite.R". In my own opinion, the function "metSplit" lacks some flexibility as the formats "inputs = inputs$samples[[I]]", but we need to add another sublist such as "soilintcond" under inputs soil we need to specify "met".

mdietze · 2025-01-20T13:32:21Z

modules/assim.sequential/R/sda.enkf_MultiSite.R

@@ -355,17 +355,28 @@ sda.enkf.multisite <- function(settings,
      #reformatting params
      new.params <- sda_matchparam(settings, ensemble.samples, site.ids, nens)
    }
-      #sample met ensemble members
+      #sample met and soil parameter ensemble members 


should be sampling ALL input ensemble members, not just met, not just (met + soils)

#TODO: incorporate Phyllis's restart work # sample all inputs specified in the settings$ensemble not just met

FYI Phyllis's work lives on the BU server at /projectnb/dietzelab/yixuanli and still needs going through to make sure (a) write.ensemble.configs is saving ALL the info it needs about ensembles, not just params and (b) that you can have the option to pass in a specific already-sampled ensemble of params, met, soil, IC, etc. without having the code resample it. This is important in the general case to allow for sensitivity analysis designs. It's also important for the SDA case as the sample from t==1 needs to be the same sample that's used for t>1.

I did find the previous design had two separate sampling in the SDA workflow, leading to met file for t==1 differs from that for t>1. I kept the sampling in SDA codes instead, saved the inputs as "samples", and passed it into "write.ensemble.configs" as an additional argument. If it is a fresh new run, those samples will be read and recorded directly. And those samples will be passed into the restart list when t>1 as well. Although it might not be the perfect solution, but I did test it and found it work well to make sure the ensemble member keeps the same through time.

mdietze · 2025-01-20T13:34:49Z

modules/assim.sequential/R/sda.enkf_MultiSite.R

-          parent_ids = NULL 
-        )
-       })
+      input_name <- c("met","soilinitcond")


should not be hard coded. The set of inputs that are ensemble-based needs to be detected from the settings

modules/assim.sequential/R/sda.enkf_MultiSite.R

mdietze · 2025-01-20T13:44:17Z

modules/assim.sequential/R/sda.enkf_MultiSite.R

+      input_name <- c("met","soilinitcond")
+      inputs <- list()
+      new_inputs <- list()
+      for (i_input in seq_along(input_name)){


Inputs can't be sampled in an arbitrary order. See the logic in write.ensemble.configs. Indeed, if the SDA can't call write.ensemble.configs directly, then there's a lot of code there than needs to be refactored into a function that can be called by both forward and SDA code, since maintaining two separate versions of the sampling code is clearly resulting in code divergence.

In my new edits, the ids for samples that were generated in SDA codes and passed into the "write.ensemble.configs" will be recorded in "README.txt".

mdietze · 2025-01-20T13:46:55Z

modules/assim.sequential/inst/MultiSite-Exs/SDA/Create_Multi_settings.R

@@ -225,6 +225,7 @@ template <- PEcAn.settings::Settings(list(
  ensemble = structure(list(size = 25, variable = "NPP", 
                            samplingspace = structure(list(
                              parameters = structure(list(method = "lhc")),
+                              soilinitcond = structure(list(method = "sampling")),


Check to make sure there aren't other inputs that are being left out

Also, is the operational SDA using a LHC sampling of the posterior parameters? That's not random.

I think so. The current SDA workflow used "get.parameter.samples(settings, ens.sample.method = settings$ensemble$samplingspace$parameters$method)" to sample parameters and the method is "lhc".

Code still retains lhc instead of sampling

mdietze · 2025-02-05T10:57:30Z

models/sipnet/R/write.configs.SIPNET.R

-      #LitterWHC
-      #param[which(param[, 1] == "litterWHC"), 2] <- unlist(soil_IC_list$vals["volume_fraction_of_water_in_soil_at_saturation"])[1]*100
+  #working on reading soil file
+  if (length(settings$run$inputs$soilinitcond$path) > 0) {


@Qianyuxuan I was asking about n=1 versus n>1, not t=1 vs t>1, and I'm still confused -- if you're passed in a whole vector of paths instead of a single one, you should throw an error not continue on, but I'm not seeing that check anywhere.

mdietze · 2025-02-05T11:06:11Z

models/sipnet/R/write.configs.SIPNET.R

+    if (!is.null(inputs)) {
+      ## override if specified in inputs
+      if ("soilinitcond" %in% names(inputs)) {
+        template.soilinit <- inputs$soilinit$path


I'm now realizing that I'm also confused by this bit. Why do settings$run$inputs$soilinitcond and settings$run$inputs$soilinit both exist?

"soilinit" is only a typo and will be corrected.

…an into sda_soil_params_soilgrids

dlebauer · 2025-02-27T21:46:03Z

modules/data.land/R/soil_params_ensemble.R

+##'  @return The individual alphas that work best to fit the observed quantiles
+##'  @export 
+##'  @author Qianyu Li
+estimate_dirichlet_parameters <- function(means, quantiles) {


I don't think it should be combined with the existing function prior_fn (that is similar in approach but for univariate distributions though it has its own limitations). But perhaps it should be included in the same file in the priors package?

pecan/modules/priors/R/priors.R

Line 90 in edc14b8

prior.fn <- function(parms, x, alpha, distn, central.tendency = NULL, trait = NULL) {

models/sipnet/R/write.configs.SIPNET.R

modules/uncertainty/R/ensemble.R

Co-authored-by: DongchenZ <82834956+DongchenZ@users.noreply.github.com>

mdietze · 2025-04-30T13:57:37Z

models/sipnet/R/write.configs.SIPNET.R

-      #LitterWHC
-      #param[which(param[, 1] == "litterWHC"), 2] <- unlist(soil_IC_list$vals["volume_fraction_of_water_in_soil_at_saturation"])[1]*100
+  #working on reading soil file
+  if (length(settings$run$inputs$soilinitcond$path) > 0) {


per previous conversation, I recommend adding a check to make sure that length(settings$run$inputs$soilinitcond$path) == 1 and throwing an error otherwise

mdietze · 2025-04-30T14:00:50Z

modules/assim.sequential/R/sda.enkf_MultiSite.R

@@ -355,17 +355,33 @@ sda.enkf.multisite <- function(settings,
      #reformatting params
      new.params <- sda_matchparam(settings, ensemble.samples, site.ids, nens)
    }
-      #sample met ensemble members
-      #TODO: incorporate Phyllis's restart work


please retain this comment, which will be helpful to the folks working on uncertainty partitioning.

mdietze · 2025-04-30T15:36:16Z

modules/assim.sequential/R/sda.enkf_MultiSite.R

+      }
+     for (i in seq_along(samp.ordered)){
+         #call the function responsible for generating the ensemble
+        inputs[[s]][[names(samp.ordered)[i]]] <- input.ens.gen(settings=conf.settings[[s]],


is this sampling occurring for each site independently or across all sites simultanously?

For each site independently.

Is independent sampling (which is NOT what we want) something new here or are you just continuing the status quo. If it's the status quo I can see punting the fix to a new PR, but if you moved from joint to independent sampling you'll need to revert it back to joint.

I didn't do anything new here and just continued the status quo. The ensemble size for met is 10 at each site, does it mean we may need to sample from the joint distribution using bootstrapping?

No, what I mean is that if model ensemble member 1 at site 1 samples met ensemble k (e.g. 6) then model ensemble member 1 needs to also use met ensemble member k at sites 2 through 6400. Even though the specific met data is different from site to site, the spatiotemporal values within a specific met ensemble member share a covariance structure. Similarly if ensemble member 1 at site 1 samples a particular parameter vector with a specific row number, then all other sites need to use the same posterior row number (though row k for one PFT will obviously point to a different set of parameters than row k of another PFT). By having the same parameters, met, etc across sites we induce the correct spatiotemporal covariance structure across sites.

mdietze · 2025-04-30T15:42:37Z

modules/assim.sequential/R/sda.enkf_MultiSite.R

@@ -379,7 +395,42 @@ sda.enkf.multisite <- function(settings,
      if (t>1){
        #for next time step split the met if model requires
        #-Splitting the input for the models that they don't care about the start and end time of simulations and they run as long as their met file.
-        inputs.split <- metSplit(conf.settings, inputs, settings, model, no_split = FALSE, obs.times, t, nens, restart_flag = FALSE, my.split_inputs)
+        #set start and end date for splitting met


Conceptually, I'm not seeing why you're replacing a simple call to a function with a big chunk of code, which undo's the concept of modularization and refactoring. If there's something wrong within the function, I'd fix the function.

Also, conceptually note that MOST models don't need their met split, so make sure that any implementation of metSplit

Previously, I directly modified "metSplit" but it did not seem to be suggested. The challenge is it that we only need to do the split function for "met" sublist but now it lacks the specification: https://github.com/PecanProject/pecan/blob/develop/modules/assim.sequential/R/metSplit.R#L39. Do you suggest we change it to:" inputs.split$met$samples[i] <- do.call(
my.split_inputs,
args = list(
settings = settings,
start.time = (lubridate::ymd_hms(start.time, truncated = 3) + lubridate::second(lubridate::hms("00:00:01"))),
stop.time = lubridate::ymd_hms(stop.time, truncated = 3),
inputs = inputs$met$samples[[i]])

mdietze · 2025-04-30T15:44:16Z

modules/assim.sequential/R/sda.enkf_MultiSite.R

+        inputs.split <- 
+          furrr::future_pmap(list(conf.settings %>% `class<-`(c("list")), inputs, model), function(settings, inputs, model) {
+            # Loading the model package - this is required bc of the furrr
+            library(paste0("PEcAn.",model), character.>

            


We shouldn't have to load library's like this. But if you do, load it once, before the pmap, not repeatedly.

Has removed this block of code.

mdietze · 2025-04-30T15:45:51Z

modules/assim.sequential/inst/MultiSite-Exs/SDA/Create_Multi_settings.R

@@ -225,6 +225,7 @@ template <- PEcAn.settings::Settings(list(
  ensemble = structure(list(size = 25, variable = "NPP", 
                            samplingspace = structure(list(
                              parameters = structure(list(method = "lhc")),
+                              soilinitcond = structure(list(method = "sampling")),


Code still retains lhc instead of sampling

mdietze · 2025-04-30T15:54:14Z

modules/uncertainty/R/ensemble.R

@@ -393,6 +394,7 @@ write.ensemble.configs <- function(defaults, ensemble.samples, settings, model,
          "site        : ", settings$run$site$name, "\n",
          "site  id    : ", format(settings$run$site$id, scientific = FALSE), "\n",
          "met data    : ", samples$met$samples[[i]], "\n",
+          "soil parameter   : ", samples$soilinitcond$samples[[i]], "\n",


This line will fail for any model that doesn't have soilinitcond. You should dynamically detect the set of sampled inputs present

modules/uncertainty/R/ensemble.R

mdietze · 2025-04-30T15:56:56Z

modules/uncertainty/R/ensemble.R

                           RENAME = rename)#for restart from previous model runs, not sharing the same outdir
      )
    }
    params<-new.params
-    return(invisible(list(runs = data.frame(id=run.id), ensemble.id = ensemble.id, samples=list(met=inputs)
+    return(invisible(list(runs = data.frame(id=run.id), ensemble.id = ensemble.id, samples=list(met=inputs$met,soilinitcond=inputs$soilinitcond)


same need to generalize to any set of inputs

…an into sda_soil_params_soilgrids

mdietze · 2025-05-07T10:11:58Z

models/sipnet/R/write.configs.SIPNET.R

+    soil_IC_list <- PEcAn.data.land::pool_ic_netcdf2list(template.soilinit)
+    # Calculate the thickness of soil layers based on the assumption that the depth values are at bottoms and the first layer top is at 0
+    if ("depth" %in% names(soil_IC_list$dims)) {
+      thickness<-c(soil_IC_list$dims$depth[1],diff(soil_IC_list$dims$depth))


@Qianyuxuan sorry for not clarifying, the request isn't to have the code fail if thickness is missing, but to to only have depth/thickness within the if, to take all the other variable out of the if/else, and to use a default or user-specified soil depth if one isn't present in the file.

models/sipnet/R/write.configs.SIPNET.R

mdietze · 2025-05-07T10:18:37Z

modules/assim.sequential/R/sda.enkf_MultiSite.R

+      }
+     for (i in seq_along(samp.ordered)){
+         #call the function responsible for generating the ensemble
+        inputs[[s]][[names(samp.ordered)[i]]] <- input.ens.gen(settings=conf.settings[[s]],


Is independent sampling (which is NOT what we want) something new here or are you just continuing the status quo. If it's the status quo I can see punting the fix to a new PR, but if you moved from joint to independent sampling you'll need to revert it back to joint.

modules/uncertainty/R/ensemble.R

…an into sda_soil_params_soilgrids

mdietze

Thanks @Qianyuxuan for all your hard work on this PR!

mdietze · 2025-05-23T13:16:00Z

modules/assim.sequential/R/sda.enkf_MultiSite.R

+        }
+        input_path <- conf.settings[[s]]$run$inputs[[tolower(input_tag)]]$path
+        inputs[[s]][[input_tag]]$ids<-inputs[[random_site]][[input_tag]]$ids
+        inputs[[s]][[input_tag]]$samples<- input_path[inputs[[random_site]][[input_tag]]$ids]


This is a good start. Tagging @blesson07asd so he's aware of these changes and can account for them when he further generalizes ensemble sampling. Eventually this will need to be expanded to account for parameter sampling too, but we can do that later.

DongchenZ

Great work @Qianyuxuan

modules/assim.sequential/R/sda.enkf_MultiSite.R

mdietze · 2025-05-23T14:50:09Z

@Qianyuxuan there's a small fix that needs to occur in data.land/DESCRIPTION to add the following new libraries to Suggests: ‘doSNOW’ ‘foreach’ ‘MCMCpack’. After that you'll probably need to run scripts/generate_dependencies.R and add any updated files to the PR (e.g., docker/depends/pecan_package_dependencies.csv)

Add codes to extract SoilGrids soil texture data and derive ensemble …

bbb795d

…soil parameter files

github-actions bot added Modules Models labels Jan 2, 2025

mdietze requested changes Jan 2, 2025

View reviewed changes

infotroph reviewed Jan 6, 2025

View reviewed changes

modules/data.land/R/soilgrids_texture_extract.R Outdated Show resolved Hide resolved

infotroph reviewed Jan 6, 2025

View reviewed changes

modules/data.land/R/soil_params_ensemble.R Outdated Show resolved Hide resolved

dlebauer reviewed Jan 15, 2025

View reviewed changes

modules/data.land/R/soil_utils.R Outdated Show resolved Hide resolved

dlebauer mentioned this pull request Jan 17, 2025

Refactor SoilGrids functions #3416

Open

Qianyu Li and others added 4 commits January 17, 2025 19:57

Revise per comments

968ccca

Merge branch 'PecanProject:develop' into sda_soil_params_soilgrids

c5f51f8

Merge branch 'sda_soil_params_soilgrids' of github.com:Qianyuxuan/pec…

326e18d

…an into sda_soil_params_soilgrids

Update the script for generating xml file

14be6bf

DongchenZ suggested changes Jan 18, 2025

View reviewed changes

mdietze reviewed Jan 20, 2025

View reviewed changes

Bug fix

b280f00

mdietze reviewed Feb 5, 2025

View reviewed changes

Qianyuxuan and others added 6 commits February 7, 2025 11:24

Merge branch 'PecanProject:develop' into sda_soil_params_soilgrids

f0319d5

Revise per comments

53283f5

Merge branch 'sda_soil_params_soilgrids' of github.com:Qianyuxuan/pec…

3d68b74

…an into sda_soil_params_soilgrids

Remove unintended testing codes

a07004f

Add document for additional argument in a function

c0c557c

Adopt Dongchen's suggestion for parallel design

6cbf62b

dlebauer reviewed Feb 27, 2025

View reviewed changes

DongchenZ reviewed Mar 7, 2025

View reviewed changes

models/sipnet/R/write.configs.SIPNET.R Outdated Show resolved Hide resolved

DongchenZ reviewed Mar 12, 2025

View reviewed changes

modules/uncertainty/R/ensemble.R Outdated Show resolved Hide resolved

Update models/sipnet/R/write.configs.SIPNET.R

b731ed2

Co-authored-by: DongchenZ <82834956+DongchenZ@users.noreply.github.com>

mdietze requested changes Apr 30, 2025

View reviewed changes

Qianyu Li and others added 2 commits May 7, 2025 01:43

Revise according to comments

cca24fd

Merge branch 'PecanProject:develop' into sda_soil_params_soilgrids

844042a

Merge branch 'sda_soil_params_soilgrids' of github.com:Qianyuxuan/pec…

54babea

…an into sda_soil_params_soilgrids

mdietze reviewed May 7, 2025

View reviewed changes

Qianyu Li and others added 4 commits May 13, 2025 13:22

Fix unit bug

d17a239

Enable the joint sampling of inputs across sites

b26c47f

Merge branch 'PecanProject:develop' into sda_soil_params_soilgrids

a85b9bd

Merge branch 'sda_soil_params_soilgrids' of github.com:Qianyuxuan/pec…

de04744

…an into sda_soil_params_soilgrids

mdietze approved these changes May 23, 2025

View reviewed changes

DongchenZ self-requested a review May 23, 2025 14:13

DongchenZ approved these changes May 23, 2025

View reviewed changes

mdietze reviewed May 23, 2025

View reviewed changes

modules/assim.sequential/R/sda.enkf_MultiSite.R Outdated Show resolved Hide resolved

mdietze added 2 commits May 23, 2025 10:23

Update modules/assim.sequential/R/sda.enkf_MultiSite.R

75987ce

Merge branch 'develop' into sda_soil_params_soilgrids

73d1098

mdietze enabled auto-merge May 23, 2025 14:27

robkooper mentioned this pull request May 23, 2025

Release/v1.9.0 #3526

Merged

3 tasks

robkooper changed the base branch from develop to release/v1.9.0 May 23, 2025 14:51

Add package info

7ff6c05

github-actions bot added the Dockerfile label May 23, 2025

mdietze merged commit 2d724b8 into PecanProject:release/v1.9.0 May 23, 2025
12 of 15 checks passed

Add codes to extract SoilGrids soil texture data and derive ensemble … #3406

Add codes to extract SoilGrids soil texture data and derive ensemble … #3406

Uh oh!

Conversation

Uh oh!

Description

Motivation and Context

Review Time Estimate

Types of changes

Checklist:

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!