Skip to main contentNavigation & Browser Control
search - Search queries (DuckDuckGo, Google, Bing)
navigate - Navigate to URLs
go_back - Go back in browser history
wait - Wait for specified seconds
Page Interaction
click - Click elements by their index
input - Input text into form fields
upload_file - Upload files to file inputs
scroll - Scroll the page up/down
find_text - Scroll to specific text on page
send_keys - Send special keys (Enter, Escape, etc.)
JavaScript Execution
evaluate - Execute custom JavaScript code on the page (for advanced interactions, shadow DOM, custom selectors, data extraction)
Tab Management
switch - Switch between browser tabs
close - Close browser tabs
extract - Extract data from webpages using LLM
Visual Analysis
screenshot - Request a screenshot in your next browser state for visual confirmation
dropdown_options - Get dropdown option values
select_dropdown - Select dropdown options
File Operations
write_file - Write content to files
read_file - Read file contents
replace_file - Replace text in files
Task Completion
done - Complete the task (always available)