Add parallel I/O output system by SeanBryan51 · Pull Request #713 · CABLE-LSM/CABLE

SeanBryan51 · 2026-04-07T16:03:42Z

This change brings in a new output system based on the parallel I/O infrastructure introduced in #706, and is a direct replacement of the previous output module used for offline CABLE. The main motivation behind this is to add MPI support to the serial offline driver, and eventually, to replace the legacy MPI implementation (#358). This also makes progress towards the proposed output redesign (#715) by introducing the underlying data structures needed for its implementation.

The new output system brings in the following enhancements:

Optional parallel I/O support via PIO
A new "aggregator" class for calculating time aggregations throughout a simulation
New data structures for describing output variables which allows for easily adding new diagnostics and/or restarts variables

This change should be brought in after #712.

Type of change

Please delete options that are not relevant.

Enhancement
New or updated documentation

Checklist

The new content is accessible and located in the appropriate section
I have checked that links are valid and point to the intended content
I have checked my code/text and corrected any misspellings

Testing

Are the changes bitwise-compatible with the main branch? If working on an optional feature, are the results bitwise-compatible when this feature is off? If yes, copy benchcab output showing successful completion of the bitwise compatibility tests or equivalent results below this line.

CABLE benchcab runs tested using ifort 2021.10.0.

2026-04-28 06:43:59,566 - INFO - benchcab.benchcab.py:380 - Running comparison tasks...
2026-04-28 06:43:59,593 - INFO - benchcab.benchcab.py:381 - tasks: 168 (models: 2, sites: 42, science configurations: 4)
2026-04-28 06:46:47,366 - INFO - benchcab.benchcab.py:391 - 0 failed, 168 passed

Please add a reviewer when ready for review.

📚 Documentation preview 📚: https://cable--713.org.readthedocs.build/en/713/

Co-authored-by: Lachlan Whyborn <lachlan.s.whyborn@gmail.com>

…temperature

This is done to allow access to private components of derived types in child submodules. See this bug report for more details: https://community.intel.com/t5/Intel-Fortran-Compiler/Intel-oneAPI-bug-with-submodules/td-p/1347530

SeanBryan51 · 2026-04-27T20:19:34Z

Hi @abhaasgoyal @Whyborn, thank you for your time in reviewing these PRs! This one is now ready for review, and no rush of course, it's the heaviest one so far!

As an introduction, it might be useful to read these developer documentation pages:

Benchcab is running currently - I had to make a small change to achieve bitwise compatibility, I will update the status soon.

Thank you again!

Whyborn

Done a first pass. My main issue at the moment is with the structure of the output_variable_t definition. The current structure is amenable with what already existed within CABLE, but I don't see it being amenable to the planned user output API. The current structure will lead to a combinatory explosion of the number of output variables defined, e.g. a variable with 2 allowed reductions would require 15 output_variable_t, for each combination of reduction (3 including no reduction) and aggregation method (5). These would be allocated regardless of whether the variable is written or not.

I think it would make more sense to have a variable definition, which has only one instance per internal variable. It would contain a reference to the target variable, dimensionality and attributes for the variable. Then we use this and the information supplied by the user's output definition to only define the output variables that are actually going to be written.

Whyborn · 2026-05-04T22:50:53Z

+    real(kind=real64), dimension(:,:,:), pointer :: source_data => null()
+  end type aggregator_real64_3d_t
+
+  interface new_aggregator


What's the reason for separating the creation of a new aggregator into new and init phases? Looking at the code, I don't see anything that would stop an aggregator being initialised with new_aggregator(source_data=..., method=...)? Within the aggregator_mod itself the handling of each bit may be functionally separate, but from the "user" POV it seems logical to group this into a single call.

Whyborn · 2026-05-04T22:53:48Z

This is a sort of temporary file right? To replicate the current output behaviour on main, while the new API for defining the output is developed?

Whyborn · 2026-05-04T23:10:19Z

+    real :: scale_by = 1.0
+      !* A multiplicative factor to apply to the native diagnostic values when
+      ! writing output.
+    real :: divide_by = 1.0


Why provide this if we already provide a scale_by? If they want a division, they can just supply 1 / divide_by (and it's a tiny bit more efficient to do a division once then apply multiplications, than to do repeated divisions)

Whyborn · 2026-05-04T23:15:23Z

+    character(256) :: value !! Value of the attribute
+  end type
+
+  type, public :: cable_output_variable_t


I feel like it would make our lives easier in the long run to separate the definitions of output variables, output parameters and restart variables, rather than handle them in one big blob and separate via logical flags on the type. I'm guessing this was something you thought about- what made you decide to do it this way?

Whyborn · 2026-05-04T23:24:09Z

+  character(64), parameter :: NATIVE_DIM_NAME_PATCH           = "patch_native"
+  character(64), parameter :: NATIVE_DIM_NAME_PATCH_GLOBAL    = "patch_global_native"
+  character(64), parameter :: NATIVE_DIM_NAME_PATCH_GRID_CELL = "patch_grid_cell_native"
+  character(64), parameter :: NATIVE_DIM_NAME_LAND            = "land_native"
+  character(64), parameter :: NATIVE_DIM_NAME_LAND_GLOBAL     = "land_global_native"


What is the point of these parameters? I might be missing something, but I didn't see anywhere that using these parameters was functionally different to just using the associated strings, and the strings are already intuitive enough names so I don't think it adds clarity.

SeanBryan51 changed the title ~~Add parallelio output module~~ Add new parallel I/O output system Apr 7, 2026

SeanBryan51 changed the title ~~Add new parallel I/O output system~~ Add parallel I/O output system Apr 7, 2026

SeanBryan51 force-pushed the add-parallelio-output-module branch from 4c009fd to b226a95 Compare April 8, 2026 15:29

SeanBryan51 mentioned this pull request Apr 8, 2026

Parallel I/O output module enhancements #655

Closed

SeanBryan51 force-pushed the add-parallelio-output-module branch 8 times, most recently from beefeab to 0d53f58 Compare April 13, 2026 19:05

SeanBryan51 and others added 7 commits April 28, 2026 03:15

src/offline/file.txt: Delete unused file

60e551b

Move energy balance reporting to drivers

f5a67a7

Move energy and mass balance checks to drivers

e56e133

src/util/cable_array_utils.F90: Add array_eq

5fc9c60

Add aggregator implementation

a147968

Co-authored-by: Lachlan Whyborn <lachlan.s.whyborn@gmail.com>

Add working aggregator variables to compute daily max and min screen …

9e9b71b

…temperature

Bump intel compiler version to 2021.6.0

d448f3b

This is done to allow access to private components of derived types in child submodules. See this bug report for more details: https://community.intel.com/t5/Intel-Fortran-Compiler/Intel-oneAPI-bug-with-submodules/td-p/1347530

SeanBryan51 force-pushed the add-parallelio-output-module branch from 0d53f58 to cfb572c Compare April 27, 2026 17:16

SeanBryan51 mentioned this pull request Apr 27, 2026

config.yaml: Update intel compiler to 2021.10.0 CABLE-LSM/bench_example#26

Merged

SeanBryan51 force-pushed the add-parallelio-output-module branch from cfb572c to 458d67d Compare April 27, 2026 19:23

Add parallel I/O output module implementation

df516f5

SeanBryan51 force-pushed the add-parallelio-output-module branch from 458d67d to df516f5 Compare April 27, 2026 19:31

SeanBryan51 marked this pull request as ready for review April 27, 2026 19:49

SeanBryan51 requested review from Whyborn and abhaasgoyal April 27, 2026 20:19

Whyborn reviewed May 4, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add parallel I/O output system#713

Add parallel I/O output system#713
SeanBryan51 wants to merge 8 commits into
mainfrom
add-parallelio-output-module

SeanBryan51 commented Apr 7, 2026 •

edited by abhaasgoyal

Loading

Uh oh!

SeanBryan51 commented Apr 27, 2026

Uh oh!

Whyborn left a comment

Uh oh!

Whyborn May 4, 2026

Uh oh!

Whyborn May 4, 2026

Uh oh!

Whyborn May 4, 2026

Uh oh!

Whyborn May 4, 2026

Uh oh!

Whyborn May 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

SeanBryan51 commented Apr 7, 2026 • edited by abhaasgoyal Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Type of change

Checklist

Testing

Uh oh!

SeanBryan51 commented Apr 27, 2026

Uh oh!

Whyborn left a comment

Choose a reason for hiding this comment

Uh oh!

Whyborn May 4, 2026

Choose a reason for hiding this comment

Uh oh!

Whyborn May 4, 2026

Choose a reason for hiding this comment

Uh oh!

Whyborn May 4, 2026

Choose a reason for hiding this comment

Uh oh!

Whyborn May 4, 2026

Choose a reason for hiding this comment

Uh oh!

Whyborn May 4, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

SeanBryan51 commented Apr 7, 2026 •

edited by abhaasgoyal

Loading