test: optimize example test discovery and execution speed #372
Conversation
Merge Protections: Your pull request matches the following merge protections and will not be merged until they are valid.
🟢 Enforce conventional commit: wonderful, this rule succeeded. Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/
The PR description has been updated. Please fill out the template for your PR to be reviewed.
There are a few things to clarify from this PR.
Here's an example output from a run after these fixes, just using the defaults. This was run on a 32GB MacBook M1 Max, and it takes around 4 minutes.
Here are the results from a run on my 32GB RAM / 4GB VRAM ThinkPad:
I don't think my issues are blocking, so I'm inclined to approve / merge but will wait a bit in case there are other opinions.
-> A parsing error in the check; I'll fix it. The URL is set in my environment.

We have a number of models used in our tests. I looked at this one for validation as I didn't have it, but actually we'd need to do something in every test to check the model exists. A docs/upfront check can easily get out of date unless we can do it programmatically.
-> Suggest discussion, then an additional issue/PR if action is needed.

Multiple failures like these are down to insufficient resources. 4GB VRAM is low for AI dev (though we'd always want to drive down minimum requirements for usage). The error at least is clear. A possible mitigation would be more fine-grained markers to document memory usage (sketched below), perhaps resulting in skipping all LLM tests. Another alternative might be to allow CPU-only (not GPU) execution.
-> Similar: let's discuss and plan.
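For illustration, the fine-grained marker idea could look something like this in a conftest.py. This is a minimal sketch assuming psutil is available; the 8 GiB threshold is a placeholder, not a measured requirement:

```python
# conftest.py sketch: skip tests marked `requires_heavy_ram` when free memory
# is low. Assumes psutil is installed; the threshold is a placeholder value.
import psutil
import pytest

HEAVY_RAM_BYTES = 8 * 1024**3  # placeholder: 8 GiB of available RAM

def pytest_collection_modifyitems(config, items):
    available = psutil.virtual_memory().available
    if available >= HEAVY_RAM_BYTES:
        return
    skip_heavy = pytest.mark.skip(
        reason=f"requires_heavy_ram: only {available / 1024**3:.1f} GiB available"
    )
    for item in items:
        if "requires_heavy_ram" in item.keywords:
            item.add_marker(skip_heavy)
```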
Force-pushed from c43a53b to 1540a3c.
The issue with watsonx should be fixed now.

I did not have the qwen model downloaded. Pulled it and am trying again just to see what happens.

Ollama probably intelligently split the layers across GPU and CPU. I have marked the test as large-RAM, as when I ran it the system was struggling somewhat; the model was ~16GB of RAM by itself.

qwen2.5vl:7b? For me it's only 6GB (we're starting to approach building cross-platform container images levels of fun here 😄)
…s by default

Addresses PR generative-computing#372 review feedback:

1. Keep examples clean by using comment-based pytest markers
   - Examples now use single-line `# pytest: marker1, marker2` format
   - No pytest imports or pytestmark assignments in example files
   - Added pytest hooks to parse markers and skip heavy examples early
   - Prevents memory exhaustion from HuggingFace imports during collection

2. Run qualitative tests by default
   - Removed `-m "not qualitative"` from pytest addopts
   - Users can opt out with: `pytest -m "not qualitative"` (~2 min)
   - Default `pytest` now runs the full suite including quality checks

Changes:
- docs/examples/conftest.py: +266 lines for comment marker parsing
- docs/examples/*.py: all 61 examples restored clean with a single comment marker
- pyproject.toml: removed qualitative exclusion, added explanatory comments
- AGENTS.md: updated documentation to reflect new defaults

All examples remain pure Python without test infrastructure pollution.
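For context, here is a minimal sketch of how a `# pytest: marker1, marker2` comment can be parsed in a collection hook. This is illustrative only; the actual docs/examples/conftest.py implementation is larger and differs in detail:

```python
# Sketch: attach markers declared as a `# pytest: marker1, marker2` comment so
# example files need no pytest imports. Illustrative, not the PR's exact code.
import re
from pathlib import Path

import pytest

MARKER_RE = re.compile(r"^#\s*pytest:\s*(.+)$")

def parse_comment_markers(path: Path) -> list[str]:
    """Return marker names from a `# pytest: ...` comment near the top of a file."""
    for line in path.read_text(encoding="utf-8").splitlines()[:10]:
        match = MARKER_RE.match(line.strip())
        if match:
            return [name.strip() for name in match.group(1).split(",")]
    return []

def pytest_collection_modifyitems(config, items):
    # Examples stay free of test infrastructure; markers live in comments.
    for item in items:
        for name in parse_comment_markers(Path(item.fspath)):
            item.add_marker(getattr(pytest.mark, name))
```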
@planetf1 I think you accidentally committed a bunch of temp files in your last commit |
- Add 'slow' marker for tests >5 minutes (e.g., dataset loading)
- Skip slow tests by default via pyproject.toml addopts
- Mark generative_gsm8k.py with slow marker
- Exclude conftest.py from test collection
- Update all documentation (AGENTS.md, README.md, test/MARKERS_GUIDE.md, PR_372_SUMMARY.md)

Addresses PR generative-computing#372 review feedback:
- Slow tests (like GSM8K) now skipped by default for a fast experience
- Qualitative tests still run by default (comprehensive)
- conftest.py no longer collected as a test

Default: pytest runs qualitative tests, skips slow (~4-6 min)
Fast: pytest -m 'not qualitative' (~2 min)
Slow: pytest -m slow
All: pytest --co -q (bypass config)
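The commit achieves this via pyproject.toml addopts; for readers unfamiliar with the pattern, an equivalent effect can be sketched as a conftest hook. The `--runslow` flag below is an illustrative alternative from the standard pytest recipe, not part of this PR:

```python
# Sketch: skip `slow`-marked tests unless explicitly requested. The PR uses
# pyproject.toml addopts instead; `--runslow` here is an illustrative option.
import pytest

def pytest_addoption(parser):
    parser.addoption(
        "--runslow", action="store_true", default=False,
        help="run tests marked slow (>5 minutes, e.g. dataset loading)",
    )

def pytest_collection_modifyitems(config, items):
    if config.getoption("--runslow"):
        return
    skip_slow = pytest.mark.skip(reason="slow test: pass --runslow to include")
    for item in items:
        if "slow" in item.keywords:
            item.add_marker(skip_slow)
```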
Ah, fixed. Still refining the execution to ensure we can run all tests.
Test Suite Timing Summary (timing tables omitted): Quick Reference, Recommended Workflows, Key Insights
Force-pushed from 1cec30c to 4b1b773.
…ples with optional dependency detection

- Add automatic detection for optional dependencies (langchain_core)
- Fix import error in context_example.py (stdlib.base -> stdlib.context)
- Add requires_heavy_ram marker to sofai_graph_coloring.py
- Add 15-minute timeout to pytest configuration
- Remove langchain_messages.py from manual skip list (now auto-detected)

Resolves test failures while keeping examples clean with only marker comments.
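A rough sketch of how such optional-dependency detection might work; the `OPTIONAL_MODULES` set and the helper name are assumptions for illustration, not the actual implementation:

```python
# Sketch: skip an example when one of its optional dependencies is missing.
# OPTIONAL_MODULES and the helper name are illustrative assumptions.
import ast
import importlib.util
from pathlib import Path

import pytest

OPTIONAL_MODULES = {"langchain_core"}

def missing_optional_imports(path: Path) -> set[str]:
    """Top-level optional modules imported by the file but not installed."""
    tree = ast.parse(path.read_text(encoding="utf-8"))
    imported: set[str] = set()
    for node in ast.walk(tree):
        if isinstance(node, ast.Import):
            imported.update(alias.name.split(".")[0] for alias in node.names)
        elif isinstance(node, ast.ImportFrom) and node.module:
            imported.add(node.module.split(".")[0])
    return {m for m in imported & OPTIONAL_MODULES
            if importlib.util.find_spec(m) is None}

def pytest_collection_modifyitems(config, items):
    for item in items:
        missing = missing_optional_imports(Path(item.fspath))
        if missing:
            item.add_marker(pytest.mark.skip(
                reason=f"optional dependency missing: {', '.join(sorted(missing))}"
            ))
```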
Ran the tests myself and thought I'd share my version of your table:
I updated my table above. I'm unclear on what is causing the skipped, deselected, and xpassed tests, and whether the warnings are just me or happening to everyone. I'll take a final look through the code changes after lunch.
Mine:
ajbozarth left a comment:

Tested everything locally and the tests run well (as documented in my comment above).

Looking at the code, everything looks solid; just a few comments on potential merge conflicts with my open PRs, depending on which gets merged first:
- You can then run all tests by running `pytest`, or only the CI/CD tests by
- running `CICD=1 pytest`. See [test/MARKERS_GUIDE.md](../test/MARKERS_GUIDE.md) for
- details on running specific test categories (e.g., by backend, resource requirements).
+ You can then run tests:
#369 merge conflict note: this section is removed entirely
  ollama serve                         # Start Ollama (required for most tests)
- uv run pytest -m "not qualitative"   # Skips LLM quality tests (~2 min)
- uv run pytest                        # Full suite (includes LLM quality tests)
+ uv run pytest                        # Default: qualitative tests, skip slow tests
#369 merge conflict note: the ruff/mypy steps are edited here as well
- You can then run all tests by running `pytest`, or only the CI/CD tests by
- running `CICD=1 pytest`. See [test/MARKERS_GUIDE.md](test/MARKERS_GUIDE.md) for
- details on running specific test categories (e.g., by backend, resource requirements).
+ You can then run tests:
#369 merge conflict note: This section is moved to CONTRIBUTING.md
Force-pushed from 4b1b773 to 607953d.
- Add comment-based pytest markers for examples (cleaner than decorators)
- Enable qualitative tests by default, add 'slow' marker for >5min tests
- Improve test infrastructure with better skip logic and collection
- Fix test failures: watsonx credentials, pytest imports, heavy RAM markers
- Fix mypy errors in tools.py from upstream changes

Resolves test discovery performance issues and improves CI reliability.
Force-pushed from 607953d to 20fb553.
Opened two new issues following a final test after rebase.

Will merge. @ajbozarth: the overlap in docs is small; thanks for the review. I can review your PR after rebase, but you've probably looked more closely at the text/guidance. This PR improves default behaviour and adds a few more workable options, but is compatible with the original instructions.
Type of PR: Misc

Description
This PR optimizes the test infrastructure to improve test execution speed, reliability, and developer experience. The changes address test collection hangs, improve skip handling, and establish proper test categorization.
Key Changes:
Example Test Discovery Optimization (e1701c1)
- Added markers (`@pytest.mark.ollama`, `@pytest.mark.qualitative`, etc.) to 66 example files
- `docs/examples/conftest.py`: import failure handling
- `pytest -m ""` runs the full suite
- Documentation updates: `AGENTS.md`, `README.md`, `docs/tutorial.md`, `test/MARKERS_GUIDE.md`

Standalone Example Execution (1cd9c7b)
Qualitative Test Marking (c5e36ef)
- Marked `docs/examples/rag/mellea_pdf.py` as qualitative due to external PDF dependency

Test Failure Fixes (f03581e)
- `vision_openai_examples.py`: simplified skip logic, added requirements docstring
- `docs/examples/conftest.py`: detect `pytest.skip()` exceptions in subprocess stderr (see the sketch after this list)
- `test_vision_openai.py::test_image_block_in_chat`: added `@pytest.mark.qualitative` decorator
- `testpaths = ["test", "docs"]` in `pyproject.toml` for fail-fast behavior
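A minimal sketch of that subprocess skip detection; the `Skipped:` stderr sentinel and the `run_example` helper name are assumptions, not the actual conftest code:

```python
# Sketch: run an example in a subprocess and propagate a pytest.skip() raised
# inside it. The "Skipped:" stderr sentinel and helper name are assumptions.
import subprocess
import sys

import pytest

def run_example(path: str) -> None:
    result = subprocess.run(
        [sys.executable, path], capture_output=True, text=True, timeout=900
    )
    if "Skipped:" in result.stderr:  # pytest.skip() raised inside the example
        pytest.skip(f"example skipped itself: {path}")
    if result.returncode != 0:
        pytest.fail(f"example failed:\n{result.stderr}")
```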
Impact:
- Full suite available via `pytest -m ""`

Testing