
Conversation

@timcreatedit

As you can see from #70, I expected to be able to use this package directly from my normal testWidgets tests, just like I would with matchesGoldenFile.

As a POC, I would propose something like this to make usage of this package as frictionless as possible. Things that are still TBD:

  • What do we do with the runner variants (e.g. testGoldenSceneOnIOS)? I personally would've expected them to be part of the Gallery functionality itself, so that I can just create the same gallery on different platforms.
  • While I think it probably works in all cases like this, I can't verify 100% accuracy, since the goldens don't pass right now. The failure scenes look equivalent to me, but it would of course be better if we first make sure the goldens pass on main.
  • Probably more; for now, this PR's main purpose is to gather feedback.

@matthew-carroll
Contributor

I don't think you "need" to use testGoldenScene(). You can replicate all the code that's in it: https://github.com/Flutter-Bounty-Hunters/flutter_test_goldens/blob/main/lib/src/test_runners.dart#L241

But now you've just moved that responsibility onto yourself for every test.

What do we do with the runner variants (e.g. testGoldenSceneOnIOS)? I personally would've expected them to be part of the Gallery functionality itself, so that I can just create the same gallery on different platforms.

There are some tests that you only want to run on iOS. That runner is a convenience to configure the platform to iOS. If you want a gallery with multiple platforms, you don't need to use that runner.
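(For illustration only, here is a minimal sketch of configuring the platform yourself inside a plain testWidgets test. debugDefaultTargetPlatformOverride is the standard Flutter mechanism; it's an assumption that this is roughly equivalent to what testGoldenSceneOnIOS does internally.)

import 'package:flutter/foundation.dart';
import 'package:flutter_test/flutter_test.dart';

void main() {
  testWidgets('gallery golden on iOS', (tester) async {
    // Force iOS behavior for this test only, and restore the default
    // platform when the test finishes.
    debugDefaultTargetPlatformOverride = TargetPlatform.iOS;
    addTearDown(() {
      debugDefaultTargetPlatformOverride = null;
    });

    // ... pump the gallery and run the golden comparison here ...
  });
}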

Contributor

@matthew-carroll left a comment

I did some review. I can't tell how much of this is actually intentional, because it looks like you auto-reformatted the files you opened, so it's mostly noise. Please try to revert and apply just the changes you intend to make.

as golden_toolkit;

/// Remember if fonts have already been loaded in this isolate.
bool _fontsLoaded = false;
Contributor

Is this actually needed? What's the current behavior that you're trying to fix with this?

Author

It seems to me that, right now, if I have multiple testGoldenScene tests in the same suite, they will all read the font manifest, even though the fonts have already been loaded. The same thing would happen with my change, but even if we don't move font loading somewhere else, it seems like wasted work?
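(A minimal sketch of the intent behind the _fontsLoaded flag above. The helper name _ensureFontsLoaded is hypothetical; the actual loading still goes through the package's existing TestFonts.loadAppFonts().)

/// Remember if fonts have already been loaded in this isolate.
bool _fontsLoaded = false;

/// Hypothetical helper: loads the app fonts at most once per isolate,
/// skipping the manifest read on subsequent calls.
Future<void> _ensureFontsLoaded() async {
  if (_fontsLoaded) {
    return;
  }
  await TestFonts.loadAppFonts();
  _fontsLoaded = true;
}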

Contributor

I'm just looking for clarity as to whether this actually prevents additional work, or whether we're just repeating something that's already tracked inside the font-loading behavior. So without this, we're re-reading the manifest on every call even if the fonts are already loaded? (I haven't looked at that in a while.)

// TODO: Return a success/failure report that we can publish to the test output.
await _compareGoldens(tester, _fileName, screenshots);
FtgLog.pipeline.finer("Done comparing goldens.");
await TestFonts.loadAppFonts();
Contributor

We don't want to force this on everyone. If people want standard Ahem goldens, they need to be able to get them.

FtgLog.pipeline.finer("Done comparing goldens.");
await TestFonts.loadAppFonts();

tester.view
Contributor

We don't wanna force this either.

I see that you're trying to merge things together, but that's actually the opposite of what a toolkit should do. Higher-level conveniences that add default behaviors are fine. But forcing decisions on everyone at the center of the tool will lead to angry devs who get to a certain point of adoption and then realize they can't control something that they need to control.

@timcreatedit
Author

I can't tell how much of this is actually intentional, because it looks like you auto-reformatted the files you opened, so it's mostly noise. Please try to revert and apply just the changes you intend to make.

Sorry for that, I didn't notice. I did this quickly at the end of my workday so we can discuss the idea, hence the draft status. Thank you for taking the time to go through it anyway!

We don't wanna force this either.

I see that you're trying to merge things together, but that's actually the opposite of what a toolkit should do. Higher-level conveniences that add default behaviors are fine. But forcing decisions on everyone at the center of the tool will lead to angry devs who get to a certain point of adoption and then realize they can't control something that they need to control.

I think I see where you are coming from, but then – at least to me – the current API isn't really intuitive. You are saying that we don't need testGoldenScene, but when you use the core functionality without it, it breaks.

Before this change, users would wrap their galleries in testGoldenScene, which sets up the view so that galleries actually lay out properly. My intent here was to move this setup to the gallery itself, since it needs it for its run.

I think on the flip side, it is more confusing to have the test function set up a different view for the entirety of its run, since it affects all other test code as well.

For font loading, I understand why moving it to the gallery and timeline run() is too restrictive, but it also seems weird to have it in the test runner. If I want to render a gallery using Ahem, I have to just know to set up the View correctly myself, but not to load the fonts, since testGoldenScene implicitly did both for me in other cases.

Expected behavior for me would be to set up tester.view only for the run() of the Gallery/Timeline, and have font rendering behavior be opt-in in another way (a function call beforehand, or maybe a method on the Gallery while you build it). It's a bit trickier, of course, since you can't unload fonts once they're loaded 🥲
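(For illustration, a sketch of what the explicit opt-in could look like from a plain testWidgets test. This assumes TestFonts is imported from the package; skipping the call would keep the default Ahem rendering.)

testWidgets('my gallery golden', (tester) async {
  // Opt in to real fonts explicitly; omit this call to keep Ahem goldens.
  await TestFonts.loadAppFonts();

  // ... build the Gallery/Timeline and call run() here ...
});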

Let me know what you think.

Author

I took the liberty to commit this, but I understand if you don't want this here. I thought it might help other VS Code users like me who have the "format on save" setting turned on.

@timcreatedit
Author

But now you've just moved that responsibility onto yourself for every test.

I suppose another way to deal with this is for the gallery layouts to actually work with different DPRs. Then setting up the view correctly wouldn't be as essential for using them.
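(In the meantime, a minimal sketch of doing that setup yourself in a plain testWidgets test. It just mirrors the view configuration that testGoldenScene() applies, as shown in the implementation quoted below.)

testWidgets('gallery golden', (tester) async {
  // Match the view configuration that testGoldenScene() applies, and make
  // sure it gets reset even if the test fails.
  tester.view
    ..devicePixelRatio = 1.0
    ..platformDispatcher.textScaleFactorTestValue = 1.0;
  addTearDown(tester.view.reset);

  // ... build the Gallery and run the golden comparison here ...
});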

@matthew-carroll
Contributor

I think I see where you are coming from, but then – at least to me – the current API isn't really intuitive. You are saying that we don't need testGoldenScene, but when you use the core functionality without it, it breaks.

Unless I'm missing something, I don't think this is true. I don't think the core functionality is broken - it looked like you just didn't account for the standard behavior of testWidgets(). If you're going to use testWidgets() then you've got to account for that choice.

Is there actually something broken in there, or are you just dealing with the consequences of Flutter setting up 3.0 DPI, etc?

Before this change, users would wrap their galleries in testGoldenScene, which sets up the view so that galleries actually lay out properly. My intent here was to move this setup to the gallery itself, since it needs it for its run.

testGoldenScene() has nothing to do with layout. This is the entire implementation of that method:

@isGoldenScene
@isTest
void testGoldenScene(
  String description,
  WidgetTesterCallback test, {
  bool? skip,
  Timeout? timeout,
  bool semanticsEnabled = true,
  TestVariant<Object?> variant = const DefaultTestVariant(),
  dynamic tags,
  int? retry,
}) {
  testWidgets(
    description,
    (tester) async {
      await TestFonts.loadAppFonts();

      tester.view
        ..devicePixelRatio = 1.0
        ..platformDispatcher.textScaleFactorTestValue = 1.0;

      try {
        await test(tester);
      } finally {
        tester.view.reset();
      }
    },
    skip: skip,
    variant: variant,
    timeout: timeout,
    semanticsEnabled: semanticsEnabled,
    tags: tags,
    retry: retry,
  );
}

I think on the flip side, it is more confusing to have the test function set up a different view for the entirety of its run, since it affects all other test code as well.

Please see the implementation above. We call tester.view.reset() after every test run.

For font loading, I understand why moving it to the gallery and timeline run() is too restrictive, but it also seems weird to have it in the test runner. If I want to render a gallery using Ahem, I have to just know to set up the View correctly myself, but not to load the fonts, since testGoldenScene implicitly did both for me in other cases.

I'm not sure there's any situation here that isn't a problem, because Flutter has made working with fonts in a test a problem. The question is, what's the least bad option we can provide? I'm open to options, but I think it's critical that we not put developers in a position where they can't make a choice about when/if a non-undoable thing is done.

Expected behavior for me would be to set up tester.view only for the run() of the Gallery/Timeline, and have font rendering behavior be opt-in in another way (a function call beforehand, or maybe a method on the Gallery while you build it). It's a bit trickier, of course, since you can't unload fonts once they're loaded 🥲

Why do you want to control tester.view with run()? This would seem to imply that you want to stick multiple golden tests in a single test run. I think that's probably not advisable for multiple reasons. On the other hand, if there's one golden generation/comparison per test, then I'm not sure what's gained by moving that control into run()?
