openhands.agenthub.visualbrowsing_agent.visualbrowsing_agent

VisualBrowsingAgent Objects

class VisualBrowsingAgent(Agent)

VisualBrowsing agent that can use webpage screenshots during browsing.

def __init__(llm: LLM, config: AgentConfig) -> None

Initializes a new instance of the VisualBrowsingAgent class.

Arguments:

- llm (LLM): The LLM used by this agent.
- config (AgentConfig): The configuration for this agent.

def reset() -> None

Resets the VisualBrowsingAgent.

def step(state: State) -> Action

Performs one step using the VisualBrowsingAgent.

This includes gathering information about previous steps and prompting the model to produce a browsing command to execute.

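Arguments:

- state (State): The current state, used to obtain information about previous steps.

Returns:

- Action: The next action to execute, such as a browsing command.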
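
The reset/step contract described above can be sketched with a minimal stand-in. This is an illustration under stated assumptions, not the VisualBrowsingAgent implementation: the Action and State classes here are simplified hypothetical stand-ins for the OpenHands types, SketchBrowsingAgent and fake_llm are invented names, and the model is assumed to be a plain callable that maps a prompt string to a command string. Screenshots are omitted entirely.

```python
from dataclasses import dataclass, field


@dataclass
class Action:
    """Hypothetical stand-in for an agent action."""
    command: str


@dataclass
class State:
    """Hypothetical stand-in holding the history of executed commands."""
    history: list[str] = field(default_factory=list)


class SketchBrowsingAgent:
    """Illustrative only; not the VisualBrowsingAgent implementation."""

    def __init__(self, llm) -> None:
        # Assumption: `llm` is any callable mapping a prompt string
        # to a browsing command string.
        self.llm = llm
        self.step_count = 0

    def reset(self) -> None:
        # Clear per-episode bookkeeping.
        self.step_count = 0

    def step(self, state: State) -> Action:
        # Gather information about previous steps.
        previous = "\n".join(state.history) or "(none)"
        prompt = f"Previous commands:\n{previous}\nNext command:"
        # Prompt the model for the next browsing command.
        command = self.llm(prompt)
        self.step_count += 1
        return Action(command=command)


def fake_llm(prompt: str) -> str:
    # Deterministic placeholder model: navigate first, then do nothing.
    return "goto('https://example.com')" if "(none)" in prompt else "noop()"


agent = SketchBrowsingAgent(fake_llm)
state = State()

first = agent.step(state)
state.history.append(first.command)
second = agent.step(state)
```

In this sketch the caller owns the State and appends each returned command to its history, so every call to step sees what came before. Calling reset between episodes clears the agent's own counters without touching the state.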