POST
/
env
/
scrape

Authorizations

Authorization
string
header
required

The access token received from the authorization server in the OAuth 2.0 flow.

Body

application/json
keep_alive
boolean
default:
false

If True, the session will not be closed after the operation is completed.

max_nb_actions
integer
default:
100

The maximum number of actions to list after which the listing will stop. Used when min_nb_actions is not provided.

min_nb_actions
integer | null

The minimum number of actions to list before stopping. If not provided, the listing will continue until the maximum number of actions is reached.

only_main_content
boolean
default:
true

Whether to only scrape the main content of the page. If True, navbars, footers, etc. are excluded.

scrape_images
boolean
default:
false

Whether to scrape images from the page. Images are not scraped by default.

screenshot
boolean | null

Whether to include a screenshot in the response.

session_id
string | null

The ID of the session. A new session is created when not provided.

session_timeout_minutes
integer
default:
5

Session timeout in minutes. Cannot exceed the global timeout.

Required range: 0 < x < 30
url
string | null

The URL to observe. If not provided, uses the current page URL.

Response

200 - application/json
metadata
object
required

Metadata of the current page, i.e url, page title, snapshot timestamp.

session
object
required

Browser session information

data
object | null

Extracted data from the page

screenshot
file | null

Base64 encoded screenshot of the current page

space
object | null

Available actions in the current state