Session
scrape
Scrapes a page using provided parameters via the Notte API
Signature
Parameters
Whether to scrape links from the page. Links are scraped by default.
Whether to scrape images from the page. Images are scraped by default.
Whether to only scrape the main content of the page. If True, navbars, footers, etc. are excluded.
The response format to use for the scrape. You can use a Pydantic model or a JSON Schema dict.
Additional instructions to use for the scrape.
Whether to use an LLM for the extraction process.
Whether to use link/image placeholders to reduce the number of tokens in the prompt and hallucinations.
The URL to scrape. If not provided, uses the current page URL.
Returns
ScrapeResponse
: An Observation object containing metadata, screenshot, action space, and data space.