Fix formatting by claude-marie · Pull Request #79 · BLSQ/snt_development

claude-marie · 2026-04-16T08:26:27Z

It is not related to any ticket, this fix come from a bug I find helping Giulia in the workspace.

After running the DRC formatting pipeline I find out a bug in one of the code file which needed a tiny fix
The formatting report totally was crashing in DRC due to too many one to many merge with millions of line which lead to a RAM explosion. After many try to optimize it I decided to completrly rework it using the same plot but slighlty different data (aggregation). You can find the new report here and for comparaison an old one here

EstebanMontandon · 2026-04-20T12:57:38Z

Hey, let's continue to improve our workflow by reusing code from previous notebooks. Moving forward, let's keep an eye on how we can generalize these functions to work across different pipelines.

claude-marie · 2026-04-20T13:23:19Z

I'll let it in stand by until next week, waiting for @sPuntinG to end up with the workshop

EstebanMontandon · 2026-04-20T12:19:27Z

+   },
+   "outputs": [],
+   "source": [
+    "if (!exists(\"SNT_ROOT_PATH\") || length(SNT_ROOT_PATH) == 0L || !nzchar(as.character(SNT_ROOT_PATH)[1])) {\n",


I would say that at least in the case of "notebooks", we can ignore validations of the root path.. let's just assume we are always working in OpenHexa where the root will never change.

so, to load the functions you can just do:
source("~workspace/pipelines/snt_dhis2_formatting/utils/snt_dhis2_formatting_report.r") # hardcoded path...

EstebanMontandon · 2026-04-20T12:28:17Z

+    "  SNT_ROOT_PATH <- \"~/workspace\"\n",
+    "}\n",
+    "source(file.path(SNT_ROOT_PATH, \"pipelines/snt_dhis2_formatting/utils/snt_dhis2_formatting_report.r\"))\n",
+    "formatting_report_paths_and_outputs()\n",


I would go for a generic "bootstrap" option. I like the option of calling a single function that handles all the necessary setup for me, so as a user I don't have to worry about it anymore, makes sense?

If I'm building a report in the future, and I forget to call any of these functions, I might have to start debugging what's going on...
just call 1 "bootstrap" function for example: snt_setup_report() Could be a wrapper function that calls get_setup_variables() and additionally includes specific report configurations, instead of :
formatting_report_paths_and_outputs()
formatting_report_apply_streamlined_defaults()
formatting_report_source_core_and_helpers()
formatting_report_openhexa_sdk()
...etc

Now that I think about it, the naming snt_setup() makes more sense than get_setup_variables()...

EstebanMontandon · 2026-04-20T12:33:57Z

+
+
+# Simplifie `shapes_data` en place (utilise `REPORT_SHAPE_SIMPLIFY_TOL`).
+formatting_report_simplify_shapes_inplace <- function() {


Please never search for global R objects in functions that are applying a transformation. This could lead to very confusing results.

The way we use functions (at least in a notebook) should always be:
transformed_object = may_transformation(object)

This allow us to know what and when a variable is being transform.

EstebanMontandon · 2026-04-20T12:35:50Z

+
+# Routine parquet + filtre années + printdim.
+formatting_report_load_routine_data <- function() {
+    if (!exists("openhexa", inherits = TRUE)) {


Please remember: all these loading functions can be replaced by the existing function . You can copy paste this function in the *_reporting.r (let's keep things separated for now)

Nice addition: This "openhexa" check could be included in the function I shared with you, but first I would just change the message to something generic.
"OpenHEXA SDK is not loaded or available ...or something"

Note: the "bootstrap" function should load the sdk and raise a warning if not available.

EstebanMontandon · 2026-04-20T12:53:59Z

+        stop("Définir SNT_ROOT_PATH (cellule parameters Papermill) avant formatting_report_paths_and_outputs().")
+    }
+    root <- path.expand(as.character(SNT_ROOT_PATH)[1])
+    .report_nb_assign("SNT_ROOT_PATH", root)


I think there's a bit of duplication here. We store the paths in variables and then we return the same paths in the result, I would avoid the former.
From now on, let's try to enforce notebook developments using only the paths provided by the "setup variable" returned by this "bootstrap" function.
(Using the list you are returning at the end of this function)

claude-marie added 5 commits April 14, 2026 10:30

milestone

9ea51d2

other fix

757c522

working in snt_dev

95fb956

changes

1e750a2

final fix, everything is working fine on snt_dev and drc_snt

6f77b99

claude-marie requested a review from EstebanMontandon April 16, 2026 08:26

EstebanMontandon reviewed Apr 20, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix formatting#79

Fix formatting#79
claude-marie wants to merge 5 commits intomainfrom
fix_formatting

claude-marie commented Apr 16, 2026

Uh oh!

EstebanMontandon commented Apr 20, 2026

Uh oh!

claude-marie commented Apr 20, 2026

Uh oh!

EstebanMontandon Apr 20, 2026

Uh oh!

EstebanMontandon Apr 20, 2026

Uh oh!

EstebanMontandon Apr 20, 2026

Uh oh!

EstebanMontandon Apr 20, 2026

Uh oh!

EstebanMontandon Apr 20, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants



		# Simplifie `shapes_data` en place (utilise `REPORT_SHAPE_SIMPLIFY_TOL`).
		formatting_report_simplify_shapes_inplace <- function() {

Conversation

claude-marie commented Apr 16, 2026

Uh oh!

EstebanMontandon commented Apr 20, 2026

Uh oh!

claude-marie commented Apr 20, 2026

Uh oh!

EstebanMontandon Apr 20, 2026

Choose a reason for hiding this comment

Uh oh!

EstebanMontandon Apr 20, 2026

Choose a reason for hiding this comment

Uh oh!

EstebanMontandon Apr 20, 2026

Choose a reason for hiding this comment

Uh oh!

EstebanMontandon Apr 20, 2026

Choose a reason for hiding this comment

Uh oh!

EstebanMontandon Apr 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

EstebanMontandon Apr 20, 2026 •

edited

Loading