instructions is null, then the whole page is scraped. Otherwise, only the data that matches the instructions is scraped. Give instructions in natural language, e.g. ‘Extract the title and the price of the product’.
Example:
Fields
Whether to only scrape the main content of the page. If True, navbars, footers, etc. are excluded.
Playwright selector to scope the scrape to. Only content inside this selector will be scraped.
Whether to only scrape images from the page. If True, the page content is excluded.
Whether to scrape links from the page. Links are scraped by default.
Whether to scrape images from the page.
HTML tags to exclude from the scrape.
JSON schema dict for structured output. The agent can provide a schema to extract structured data.
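As a rough illustration of the JSON schema dict described above, here is a minimal sketch in plain Python. It builds a schema for the "title and price" instruction and bundles it with a few scrape options. Apart from `instructions`, the field names are not shown in this section, so the keys `only_main_content`, `selector`, and `schema`, the selector value, and the helper `missing_required` are all hypothetical placeholders and not the library's actual API.

```python
import json

# JSON Schema describing the structured output we would like back.
product_schema = {
    "type": "object",
    "properties": {
        "title": {"type": "string"},
        "price": {"type": "number"},
    },
    "required": ["title", "price"],
}

# Hypothetical parameter names: every key except "instructions" is an
# illustrative placeholder, not a confirmed field name.
scrape_params = {
    "instructions": "Extract the title and the price of the product",
    "only_main_content": True,   # skip navbars, footers, etc.
    "selector": "div.product",   # hypothetical Playwright selector
    "schema": product_schema,
}


def missing_required(data: dict, schema: dict) -> list[str]:
    """Return the required schema keys that are absent from `data`."""
    return [key for key in schema.get("required", []) if key not in data]


print(json.dumps(scrape_params, indent=2))
```

A check like `missing_required(result, product_schema)` is one simple way to confirm that a scraped result contains every required key before using it; full validation would need a JSON Schema validator.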
Module
notte_core.actions.actions
