- a list of actions that can be taken on the page (e.g. click on a button, scroll, etc.)
- a screenshot of the page (base64 encoded)
- some metadata about the page (title, url, etc.)
perception_type='fast') to make the query fast.
If you want a more powerful and LLM-ready action space, you can use:
instructions parameter to narrow down the action space to a specific intent on a website. This is useful if you want to quickly create a workflow using natural language:
Parameters
The minimum number of actions to list before stopping. If not provided, the listing will continue until the maximum number of actions is reached.
The maximum number of actions to list after which the listing will stop. Used when min_nb_actions is not provided.
The URL to observe. If not provided, uses the current page URL.
Additional instructions to use for the observation.
Returns
UnionType[ObserveResponse, list[Annotated[UnionType[ClickAction, FillAction, MultiFactorFillAction, FallbackFillAction, CheckAction, SelectDropdownOptionAction, UploadFileAction, DownloadFileAction], annotation=NoneType required=True discriminator='type']]]: The formatted observation result from the API response when no instructions provided. list[InteractionActionUnion]: The filtered list of actions when instructions is provided.
