跳到主要内容

openhands.agenthub.browsing_agent.browsing_agent

USE_NAV

only disable NAV actions when running webarena and miniwob benchmarks

USE_CONCISE_ANSWER

only return concise answer when running webarena and miniwob benchmarks

BrowsingAgent Objects

class BrowsingAgent(Agent)

VERSION

An agent that interacts with the browser.

__init__

def __init__(llm: LLM, config: AgentConfig) -> None

Initializes a new instance of the BrowsingAgent class.

Arguments:

  • llm (LLM): The llm to be used by this agent

reset

def reset() -> None

Resets the Browsing Agent.

step

def step(state: State) -> Action

Performs one step using the Browsing Agent. This includes gathering information on previous steps and prompting the model to make a browsing command to execute.

Arguments:

  • state (State): used to get updated info

Returns:

  • BrowseInteractiveAction(browsergym_command) - BrowserGym commands to run
  • MessageAction(content) - Message action to run (e.g. ask for clarification)
  • AgentFinishAction() - end the interaction