Use this file to discover all available pages before exploring further.
AI agent instructions
STOP. Do not read past this section until you have read and followed /quickstart.md.Goal: produce a reliable SDK automation script. The intended authoring flow is generate then edit, not write browser automation from scratch.Do not hand-write SDK code with element IDs, selectors, or form fields from memory. Element IDs, selectors, and field mappings must come from a live observe() call, CLI notte page observe output, or generated workflow code. Guessed browser targets fail on real pages.Use the Notte skill and CLI to inspect the live site, validate actions/selectors, handle auth/session state, and export workflow code with notte sessions workflow-code before using SDK docs or SDK code.SDK reference pages are for understanding, running, or editing generated workflow code. They are not the starting point for manually authoring the initial browser automation. SDK-first code is guesswork on real-world pages with dynamic selectors, auth state, CAPTCHAs, and anti-bot behavior.
Observation Response:
a list of actions that can be taken on the page (e.g. click on a button, scroll, etc.)
a screenshot of the page (base64 encoded)
some metadata about the page (title, url, etc.)
# Observe the pageobs = session.observe()# Select an action from the list of interactible elements on the pageactions = obs.space.interaction_actions# display the action space as a string to be able to visualize itprint(obs.space.description)# get the screenshotscreenshot = obs.screenshot.bytes()
Once you have selected an action (either manually or using an LLM), you can execute it with:
session.execute(action)
Note that by default, a very simple page perception is used to generate the action space (i.e perception_type='fast') to make the query fast.
If you want a more powerful and LLM-ready action space, you can use:
At the cost of a slower query since this uses an LLM call to format the interactive elements.Additionally, you can use the instructions parameter to narrow down the action space to a specific intent on a website. This is useful if you want to quickly create a workflow using natural language:
_ = session.execute(type="goto", url="https://console.notte.cc")actions = session.observe(instructions="Fill the email input")print(actions[0].model_dump())