Geotransolver Model #1297

coreyjadams · 2025-12-22T18:32:05Z

PhysicsNeMo Pull Request

This PR brings GeoTransolver to physicsnemo and unifies the training recipe with Transolver.

Description

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.
The CHANGELOG.md is up to date with these changes.
An issue is linked to this pull request.
If I am implementing a new model or modifying any existing model, I have followed the Models Implementation Coding Standards.

Dependencies

Review Process

All PRs are reviewed by the PhysicsNeMo team before merging.

Depending on which files are changed, GitHub may automatically assign a maintainer for review.

We are also testing AI-based code review tools (e.g., Greptile), which may add automated comments with a confidence score.
This score reflects the AI’s assessment of merge readiness and is not a qualitative judgment of your work, nor is
it an indication that the PR will be accepted / rejected.

AI-generated feedback should be reviewed critically for usefulness.
You are not required to respond to every AI comment, but they are intended to help both authors and reviewers.
Please react to Greptile comments with 👍 or 👎 to provide feedback on their accuracy.

…erence scripts

greptile-apps · 2025-12-22T18:38:50Z

Greptile Summary

Brings GeoTransolver to physicsnemo as an experimental model and unifies its training recipe with Transolver under a shared transformer_models directory structure
Introduces new model components including GALE attention mechanism, context projectors, and geometric feature processors alongside comprehensive test coverage
Reorganizes external aerodynamics examples by consolidating Transolver and GeoTransolver configurations, refactoring datapipes to support combined surface/volume processing modes

Important Files Changed

Filename	Overview
physicsnemo/experimental/models/geotransolver/geotransolver.py	New GeoTransolver model class missing required tensor shape validation logic (MOD-005) and has parameter documentation inconsistencies
physicsnemo/experimental/models/geotransolver/gale.py	GALE attention mechanism implementation with missing base class inheritance, commented debug code, and incomplete shape validation
examples/cfd/external_aerodynamics/transformer_models/src/train.py	Unified training script with complex `@tensorwise` decorator logic and hard-coded model switching that may introduce maintenance issues
physicsnemo/datapipes/cae/transolver_datapipe.py	Modified to support combined surface/volume mode with proper backward compatibility for normalization factors
test/models/geotransolver/test_geotransolver.py	Comprehensive test suite with inconsistent tensor indexing and non-deterministic randomness in checkpoint tests

greptile-apps

Additional Comments (42)

examples/cfd/external_aerodynamics/transformer_models/src/conf/data/surface.yaml, line 23 (link)

syntax: Typo: 'Surface-speficic' should be 'Surface-specific'
examples/cfd/external_aerodynamics/transformer_models/src/conf/transolver_volume.yaml, line 24 (link)

style: Run ID references 'bfloat16' but precision is set to 'float32' on line 27. Should the run_id match the actual precision setting?

_{Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!}
examples/cfd/external_aerodynamics/transformer_models/src/conf/data/volume.yaml, line 23 (link)

syntax: Typo: 'speficic' should be 'specific'
examples/cfd/external_aerodynamics/transformer_models/src/conf/data/volume.yaml, line 29 (link)

style: Missing newline at end of file

_{Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!}
examples/cfd/external_aerodynamics/transformer_models/src/conf/transolver_surface.yaml, line 24 (link)

style: Run ID contains 'bfloat16' but precision is set to float32 on line 27 - consider updating for consistency

_{Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!}
examples/cfd/external_aerodynamics/transformer_models/src/conf/transolver_surface.yaml, line 37 (link)

style: Missing newline at end of file

_{Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!}
test/models/geotransolver/test_context_projector.py, line 60 (link)

style: Consider adding CPU testing for plus mode as well - limiting to CUDA-only may miss CPU-specific issues. Is there a specific reason plus mode only needs CUDA testing?

_{Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!}
examples/cfd/external_aerodynamics/transformer_models/src/compute_normalizations.py, line 92-94 (link)

logic: Bug in Welford's algorithm: N is incremented twice (lines 77 and 92), causing incorrect variance calculation. Remove line 92 since N is already updated on line 77.
examples/cfd/external_aerodynamics/transformer_models/src/compute_normalizations.py, line 77 (link)

style: Variable n is assigned but never used. Consider removing this line since batch_n on line82serves the same purpose.

_{Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!}
test/models/geotransolver/test_geotransolver.py, line 192 (link)

logic: Incorrect tensor indexing - outputs is a single tensor, not a tuple
test/models/geotransolver/test_geotransolver.py, line 411 (link)

logic: Incorrect tensor indexing - outputs is a single tensor, not a tuple
test/models/geotransolver/test_geotransolver.py, line 460 (link)

logic: Non-deterministic randomness breaks test reproducibility
test/models/geotransolver/test_geotransolver.py, line 740 (link)

logic: Incorrect tensor indexing - outputs is a single tensor, not a tuple
examples/cfd/external_aerodynamics/transformer_models/src/metrics.py, line 172 (link)

logic: Inconsistent tensor dimensionality - mae_num is 3D but indexed as 2D here
examples/cfd/external_aerodynamics/transformer_models/README.md, line 46 (link)

style: The API reference mentions physicsnemo.experimental.models.typhon but the model is described as 'Typhon' throughout the document while the API suggests 'geotransolver'. Consider consistency in naming. Should the documentation consistently use 'Typhon' or 'GeoTransolver' as the model name to match the API structure?

_{Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!}
examples/cfd/external_aerodynamics/transformer_models/README.md, line 34 (link)

logic: The configuration examples mention typhon_surface and typhon_volume but based on the file structure, these might actually be geotransolver_surface and geotransolver_volume. Do the configuration names match the actual config files in the codebase?
examples/cfd/external_aerodynamics/transformer_models/src/inference_on_vtk.py, line 662 (link)

logic: Potential IndexError if no matching files found
examples/cfd/external_aerodynamics/transformer_models/src/inference_on_vtk.py, line 677 (link)

logic: Potential IndexError if no matching files found
examples/cfd/external_aerodynamics/transformer_models/deprecated/inference_on_vtp.py, line 73-74 (link)

logic: Bug: surface_normals is undefined. Should be normals on line 73.
examples/cfd/external_aerodynamics/transformer_models/src/inference_on_zarr.py, line 543 (link)

logic: Potential directory creation issue - path may not exist before writing file
examples/cfd/external_aerodynamics/transformer_models/src/inference_on_zarr.py, line 543 (link)

logic: Datetime formatting could create invalid filenames due to colons and spaces
examples/cfd/external_aerodynamics/transformer_models/src/inference_on_zarr.py, line 568 (link)

logic: Same directory and datetime filename issues as surface mode
examples/cfd/external_aerodynamics/transformer_models/src/utils.py, line 41-48 (link)

style: Missing required docstring sections per MOD-003d. Add Parameters and Returns sections following NumPy style.

_{Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!}

Context Used: File from greptile.json - CODING_STANDARDS/MODELS_IMPLEMENTATION.md (source)
physicsnemo/experimental/models/geotransolver/context_projector.py, line 39 (link)

logic: Class should inherit from physicsnemo.Module instead of nn.Module to follow MOD-001 coding standard

Context Used: File from greptile.json - CODING_STANDARDS/MODELS_IMPLEMENTATION.md (source)
physicsnemo/experimental/models/geotransolver/context_projector.py, line 313-314 (link)

style: Remove commented debug print statements before merging
physicsnemo/experimental/models/geotransolver/context_projector.py, line 389 (link)

logic: Argument order inconsistency - spatial_coords and geometry order differs from line 398. Should the argument order be consistent between extract_context_features and extract_local_features method calls?
physicsnemo/experimental/models/geotransolver/context_projector.py, line 126-128 (link)

style: Missing jaxtyping tensor shape annotations in method signature per MOD-006

Context Used: File from greptile.json - CODING_STANDARDS/MODELS_IMPLEMENTATION.md (source)
physicsnemo/experimental/models/geotransolver/geotransolver.py, line 93-95 (link)

syntax: Parameter type documentation is inconsistent. Docstring says 'int' but type annotation shows 'int | tuple[int, ...]'

Context Used: File from greptile.json - CODING_STANDARDS/MODELS_IMPLEMENTATION.md (source)
physicsnemo/experimental/models/geotransolver/geotransolver.py, line 128 (link)

syntax: Default value documented as 512 but line 221 shows 32

Context Used: File from greptile.json - CODING_STANDARDS/MODELS_IMPLEMENTATION.md (source)
physicsnemo/experimental/models/geotransolver/geotransolver.py, line 329-381 (link)

logic: Forward method missing tensor shape validation logic as required by MOD-005. Should validate shapes of local_embedding, geometry, and global_embedding tensors with torch.compiler.is_compiling() guard

Context Used: File from greptile.json - CODING_STANDARDS/MODELS_IMPLEMENTATION.md (source)
physicsnemo/experimental/models/geotransolver/geotransolver.py, line 331-335 (link)

syntax: Missing jaxtyping tensor annotations for forward method parameters as required by MOD-006

Context Used: File from greptile.json - CODING_STANDARDS/MODELS_IMPLEMENTATION.md (source)
examples/cfd/external_aerodynamics/transformer_models/deprecated/datapipe.py, line 64-78 (link)

style: Function missing required docstring sections per MOD-003d. Should have Parameters and Returns sections.

Context Used: File from greptile.json - CODING_STANDARDS/MODELS_IMPLEMENTATION.md (source)
physicsnemo/datapipes/cae/transolver_datapipe.py, line 509 (link)

logic: Potential KeyError if outputs_surf doesn't contain 'fx' key when air_density/stream_velocity are missing. Should there be a check to ensure both outputs_surf and outputs_vol have the same keys before accessing fx?
physicsnemo/datapipes/cae/transolver_datapipe.py, line 568-569 (link)

logic: The ValueError message doesn't handle the 'combined' model type case in the auto factor selection
examples/cfd/external_aerodynamics/transformer_models/src/train.py, line 21-25 (link)

syntax: Duplicate import - Sequence is imported from both typing (line 21) and collections.abc (line 25). Remove one of these imports.
examples/cfd/external_aerodynamics/transformer_models/src/train.py, line 320 (link)

logic: Potential bug: loss.item() called on tensor that may be a list/multi-dimensional. This will fail if loss is not a scalar. Is loss guaranteed to be a scalar tensor here, or could it be multi-dimensional when handling multiple point clouds?
physicsnemo/experimental/models/geotransolver/gale.py, line 82 (link)

syntax: Parameter dim missing type annotation. Should be dim: int per MOD-006 for jaxtyping tensor annotations in public function signatures.

Context Used: File from greptile.json - CODING_STANDARDS/MODELS_IMPLEMENTATION.md (source)
physicsnemo/experimental/models/geotransolver/gale.py, line 105-107 (link)

logic: Method signature doesn't match docstring. Docstring describes slice_tokens as single tensor but method expects list. Line 125 calls torch.cat(slice_tokens, dim=-2) which will fail if slice_tokens is a single tensor.
physicsnemo/experimental/models/geotransolver/gale.py, line 149-151 (link)

syntax: Return type annotation is incorrect. Method returns outputs which is a list (line 211-213), not a single torch.Tensor as annotated.
physicsnemo/experimental/models/geotransolver/gale.py, line 156-158 (link)

syntax: Docstring parameter description doesn't match signature. Docstring describes x as single tensor but signature expects tuple[torch.Tensor, ...].

Context Used: File from greptile.json - CODING_STANDARDS/MODELS_IMPLEMENTATION.md (source)
physicsnemo/experimental/models/geotransolver/gale.py, line 315 (link)

syntax: Return type annotation is incorrect. Method returns fx which is a list (line 338), not a single torch.Tensor as annotated.
physicsnemo/experimental/models/geotransolver/gale.py, line 320-322 (link)

syntax: Docstring parameter descriptions don't match method signature. Both fx and global_context are described as single tensors but signature expects tuple[torch.Tensor, ...] for both.

Context Used: File from greptile.json - CODING_STANDARDS/MODELS_IMPLEMENTATION.md (source)

_{40 files reviewed, 42 comments}

_{Edit Code Review Agent Settings | Greptile}

RishikeshRanade · 2025-12-23T20:17:01Z

examples/cfd/external_aerodynamics/transformer_models/src/inference_on_vtk.py

+
+            # Run batched inference using imported function from inference_on_zarr
+            with torch.no_grad():
+                _, _, (predictions, _) = batched_inference_loop(


There is a shape mismatch here in the global_parameters setting. Global parameters have an extra dimension (1, 1, 2) in this case. We need to modify the datapipe to squeeze the first dimension.

examples/cfd/external_aerodynamics/transformer_models/README.md

coreyjadams and others added 27 commits December 19, 2025 14:39

Add typhon model arch.

05fdcbe

Add typhon example configs.

7a473e5

Enable typhon to work with multiple streams of data.

c0983f8

Clean up configs and attempt to remove duplications

da69cb6

Enable surface / volume combined training.

025b843

deprecate old files

fe60296

typhon bq changes

341343e

adding bq to combined pipeline (being validated)

561ede5

updating typhon model, removing combined and new typhon example

9575c86

updating transolver recipe configs

158376d

fixing errors in inference_on_zarr and compute_norms

ae9e5b3

Starting to add tests to Typhon with BQ. Not yet fully functional.

c7b92a5

Add tests for typhon model

5035fdb

More robust attributes

3935234

Add runtime error passing too

22177f8

Remove printout

1e41e42

combining typhon/transolver to transformer models and cleaning up inf…

f420242

…erence scripts

Refactor typhon to improve readability and maintainability of BW path

55ab1eb

Snapshot before integrating BQ

828ed7d

Fix data dir name for transformer model configs

fc2e56d

fix minor bugs

da6b710

minor bug fix

e08f243

fixing bug in val_epoch

8308a7a

fixing minor bug in inference_on_zarr

6422a66

Rename to geotransolver

9e0bcc0

Rename and add inference script

5bc5375

Merge branch 'NVIDIA:main' into geotransolver

d1780b3

coreyjadams self-assigned this Dec 22, 2025

greptile-apps bot reviewed Dec 22, 2025

View reviewed changes

coreyjadams added 2 commits December 23, 2025 08:07

Merge branch 'main' into geotransolver

4a09881

Fix precommit

d2d6172

RishikeshRanade self-requested a review December 23, 2025 20:14

RishikeshRanade approved these changes Dec 23, 2025

View reviewed changes

RishikeshRanade approved these changes Dec 24, 2025

View reviewed changes

examples/cfd/external_aerodynamics/transformer_models/README.md Outdated Show resolved Hide resolved

Update geotransolver naming

75997a2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Geotransolver Model #1297

Geotransolver Model #1297

Uh oh!

coreyjadams commented Dec 22, 2025

Uh oh!

greptile-apps bot commented Dec 22, 2025

Uh oh!

greptile-apps bot left a comment •

edited

Loading

Uh oh!

RishikeshRanade Dec 23, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Geotransolver Model #1297

Are you sure you want to change the base?

Geotransolver Model #1297

Uh oh!

Conversation

coreyjadams commented Dec 22, 2025

PhysicsNeMo Pull Request

Description

Checklist

Dependencies

Review Process

Uh oh!

greptile-apps bot commented Dec 22, 2025

Greptile Summary

Important Files Changed

Uh oh!

greptile-apps bot left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Additional Comments (42)

Uh oh!

RishikeshRanade Dec 23, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

greptile-apps bot left a comment •

edited

Loading