update the browser-use agent example #705

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Open

cuiyuebing wants to merge 14 commits into agentscope-ai:main from cuiyuebing:browser_agent_dev

+1,275 −139

Member

cuiyuebing commented Sep 2, 2025

AgentScope Version

[The version of AgentScope you are working on, e.g. import agentscope; print(agentscope.__version__)]

Description

[Please describe the background, purpose, changes made, and how to test this PR]

Checklist

Please check the following items before code is ready to be reviewed.

Code has been formatted with pre-commit run --all-files command
All tests are passing
Docstrings are in Google style
Related documentation has been updated (e.g. links, examples, etc.)
Code is ready for review

cuiyuebing and others added 2 commits

September 2, 2025 11:12


          update the browser-use agent with advanced functions

2c55aa8


          Merge branch 'agentscope-ai:main' into browser_agent_dev

0dfb374

DavdGao requested a review from Copilot

September 2, 2025 03:49

Copilot AI reviewed

View reviewed changes

Contributor

Copilot AI left a comment

Pull Request Overview

This PR updates the browser-use agent example with significant improvements to functionality and structure. The changes enhance the agent's task decomposition capabilities, add structured output support, and improve the overall architecture.

Replaces hardcoded prompts with external markdown files for better maintainability
Adds task decomposition functionality to break complex tasks into manageable subtasks
Introduces structured output support using Pydantic models

Reviewed Changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 5 comments.

Show a summary per file

File	Description
examples/agent_browser/main.py	Updates agent configuration, adds structured output model, and fixes grammar in documentation
examples/agent_browser/build_in_prompt/browser_agent_task_decomposition_prompt.md	New prompt template for decomposing browser automation tasks into subtasks
examples/agent_browser/build_in_prompt/browser_agent_sys_prompt.md	New system prompt defining the browser agent's behavior and guidelines
examples/agent_browser/build_in_prompt/browser_agent_summarize_task.md	New prompt template for generating comprehensive task summary reports
examples/agent_browser/build_in_prompt/browser_agent_reasoning_prompt.md	New reasoning prompt for chunk-based webpage analysis
examples/agent_browser/browser_agent.py	Major refactor adding task decomposition, chunk-based observation, structured output, and multimodal support

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

examples/agent_browser/browser_agent.py

Comment on lines +30 to +35

    
              with open(

                  "examples/agent_browser/build_in_prompt/browser_agent_sys_prompt.md",

                  "r",

                  encoding="utf-8",

              ) as f:

                  _BROWSER_AGENT_DEFAULT_SYS_PROMPT = f.read()

Copilot AI Sep 2, 2025

The hardcoded file paths make the code fragile and dependent on the current working directory. Consider using os.path.dirname(__file__) or pathlib.Path(__file__).parent to construct relative paths from the current module's location.

Copilot uses AI. Check for mistakes.

examples/agent_browser/browser_agent.py

    
              ) as f:

                  _BROWSER_AGENT_DEFAULT_REASONING_PROMPT = f.read()

              with open(

                  "examples/agent_browser/build_in_prompt/browser_agent_task_decomposition_prompt.md",  # noqa: E501 pylint: disable=C0301

Copilot AI Sep 2, 2025

Using both noqa and pylint disable comments for the same line length issue is redundant. Choose one consistent approach throughout the codebase.

Suggested change

      
                "examples/agent_browser/build_in_prompt/browser_agent_task_decomposition_prompt.md",  # noqa: E501 pylint: disable=C0301
          
                "examples/agent_browser/build_in_prompt/browser_agent_task_decomposition_prompt.md",  # pylint: disable=C0301

Copilot uses AI. Check for mistakes.

examples/agent_browser/browser_agent.py

    
                      self.iter_n = 0

                      self.finish_function_name = "browser_generate_final_response"

                      self.init_query = ""

                      self._required_structured_model: Type[BaseModel] | None = None

Copilot AI Sep 2, 2025

Use Optional[Type[BaseModel]] instead of Type[BaseModel] | None for better compatibility with older Python versions and consistency with other type hints in the codebase.

Suggested change

      
                    self._required_structured_model: Type[BaseModel] | None = None
          
                    self._required_structured_model: Optional[Type[BaseModel]] = None

Copilot uses AI. Check for mistakes.

examples/agent_browser/browser_agent.py

    
                          text = re.sub(r"```yaml.*?```", "", text, flags=re.DOTALL)

                      # Remove JavaScript code blocks

                      text = re.sub(r"```js.*?```", "", text, flags=re.DOTALL)

                      # # Remove JavaScript code blocks

Copilot AI Sep 2, 2025

This commented-out code should be removed entirely rather than left as a comment. If the functionality might be needed later, consider documenting the reason for removal or create a proper TODO comment.

Suggested change

# # Remove JavaScript code blocks

Copilot uses AI. Check for mistakes.

examples/agent_browser/browser_agent.py

    
                          for i in range(0, len(snapshot_str), max_length)

                      ]

                  def observe_by_chunk(self, image_path: str | None = "") -> Msg:

Copilot AI Sep 2, 2025

Use Optional[str] instead of str | None for consistency with other type hints in the codebase, and consider using None as the default value instead of an empty string for an optional parameter.

Suggested change

      
                def observe_by_chunk(self, image_path: str | None = "") -> Msg:
          
                def observe_by_chunk(self, image_path: Optional[str] = None) -> Msg:

Copilot uses AI. Check for mistakes.

cuiyuebing and others added 12 commits

September 2, 2025 17:58


          fix finish function bug

ad3ed07


          Add a dfs search example in Browser Agent

e1dd0a5


          Update search prompt

73ef3c5


          Fix type annotation

cc7f2ce


          Add Docstring

06a7d8d


          Fix pre-commit warnings

822b152


          Fix comments

d7766af


          Fix init order in browser agent

b505a84


          Merge pull request #2 from TCtower/browser_agent_dev

0d22554

Add a dfs search example in Browser Agent


          change format

bce733b


          Merge remote-tracking branch 'origin/main' into browser_agent_dev


          Merge branch 'agentscope-ai:main' into browser_agent_dev

4a095d1

github-actions bot commented Dec 2, 2025

This PR is marked as stale because there has been no activity for 30 days. Remove stale label or add new comments or this PR will be closed in 5 day.

github-actions bot added the stale-pr label

cla-assistant bot commented Dec 2, 2025 •

edited

Loading

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
0 out of 2 committers have signed the CLA.

❌ TCtower
❌ cuiyuebing
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

github-actions bot removed the stale-pr label

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet