Main Agent and Capabilities

CodeActAgent

Description

This agent implements the CodeAct idea (paper, tweet) that consolidates LLM agents’ actions into a unified code action space for both simplicity and performance.

The conceptual idea is illustrated below. At each turn, the agent can:

Converse: Communicate with humans in natural language to ask for clarification, confirmation, etc.
CodeAct: Choose to perform the task by executing code

Execute any valid Linux bash command
Execute any valid Python code with an interactive Python interpreter. This is simulated through a bash command; see the plugin system below for more details.
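
As a rough illustration of the unified action space described above, the following sketch dispatches three kinds of action (a natural-language message, a bash command, and Python code) through a single `step` method. It is not the OpenHands implementation: the names `Action` and `ToyExecutor` are hypothetical, and the Python branch uses a shared in-process namespace to imitate an interactive interpreter rather than routing through bash as the real agent does.

```python
import contextlib
import io
import subprocess
from dataclasses import dataclass, field


@dataclass
class Action:
    """One agent turn (hypothetical type, for illustration only)."""
    kind: str     # "message", "bash", or "python"
    content: str  # text to say, or code to execute


@dataclass
class ToyExecutor:
    """Runs actions and returns an observation string (hypothetical)."""
    # Namespace shared across Python actions, so state persists between
    # turns the way it would in an interactive interpreter.
    namespace: dict = field(default_factory=dict)

    def step(self, action: Action) -> str:
        if action.kind == "message":
            # Converse: the text is simply passed back to the human.
            return action.content
        if action.kind == "bash":
            # CodeAct via bash: run the command and capture its output.
            result = subprocess.run(
                ["bash", "-c", action.content],
                capture_output=True,
                text=True,
                timeout=30,
            )
            return result.stdout + result.stderr
        if action.kind == "python":
            # CodeAct via Python: execute in the shared namespace and
            # capture anything printed to stdout.
            buffer = io.StringIO()
            with contextlib.redirect_stdout(buffer):
                exec(action.content, self.namespace)
            return buffer.getvalue()
        raise ValueError(f"unknown action kind: {action.kind!r}")


if __name__ == "__main__":
    executor = ToyExecutor()
    executor.step(Action("python", "x = 21"))
    print(executor.step(Action("python", "print(x * 2)")), end="")  # prints 42
```

Because every action yields a plain-text observation, a model driving this loop sees the same kind of feedback whether it asked a question, ran a shell command, or executed Python. A real deployment would run the code inside a sandbox rather than in the host process.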

Demo

https://github.com/All-Hands-AI/OpenHands/assets/38853559/f592a192-e86c-4f48-ad31-d69282d5f6ac

Example of CodeActAgent with gpt-4-turbo-2024-04-09 performing a data science task (linear regression).
