The unstructured action format is misleading LLM #3472

Marven11 · 2025-10-30T03:49:34Z

Marven11
Oct 30, 2025

I'm trying browser-use with qwen-flash because I want to have an automated spider that does some testing jobs for me.

But the model can hardly call an action even for simple ones like wait for some seconds. I tried qwen-max and LLM of some other families like deepseek and they don't work. They all call the wait function with a wrong input like {'wait': 5}:

action.0.DoneActionModel.done
  Field required [type=missing, input_value={'wait': 5}, input_type=dict]
    For further information visit https://errors.pydantic.dev/2.12/v/missing
action.0.DoneActionModel.wait
  Extra inputs are not permitted [type=extra_forbidden, input_value=5, input_type=int]
    For further information visit https://errors.pydantic.dev/2.12/v/extra_forbidden

After digging the source code for a while I found that an action

@self.registry.action('')
async def wait(seconds: int = 3):
    ...

is prompted as this:

wait: . (seconds=integer)

Which marked as the "unstructured format" for action here

It's a weird format related to neither python code, JSON, or anything that LLM was trained on. It didn't say anything about kwargs or something like that. So the LLM will try something like {'wait': 5} and failed (on my machine).

mitej23 · 2025-10-30T17:19:56Z

mitej23
Oct 30, 2025

We internally use qwen based model for browser use. It won't directly work because due to json based structure. We created a wrapper using pydantic-ai which include agent with tool calls. It runs absolutely beautifully. Ours is 32b model. It almost runs at 1/10th of the cost

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

The unstructured action format is misleading LLM #3472

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

The unstructured action format is misleading LLM #3472

Uh oh!

Marven11 Oct 30, 2025

Replies: 1 comment

Uh oh!

mitej23 Oct 30, 2025

Marven11
Oct 30, 2025

mitej23
Oct 30, 2025