
browser_solve_image_captcha

Browser Solve Image Captcha

Solves an image-selection CAPTCHA (e.g., 'select all images with traffic lights'). The tool uses a vision model with numbered overlays to analyze the challenge in one shot. Supports reCAPTCHA, Cloudflare Turnstile, and hCaptcha.

Usage Example

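import asyncio
from owl_browser import OwlBrowser, RemoteConfig

async def main(config: RemoteConfig) -> None:
    # Async usage: the browser is closed when the block exits
    async with OwlBrowser(config) as browser:
        context = await browser.create_context()
        context_id = context["context_id"]
        # Solve the image CAPTCHA currently shown in this context
        await browser.solve_image_captcha(
            context_id=context_id
        )

# Build `config` (a RemoteConfig) for your deployment, then run:
# asyncio.run(main(config))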

Parameters

Required

context_id (string, required)

The unique identifier of the browser context (e.g., 'ctx_000001')

Optional

max_attempts (string)

Maximum number of attempts to solve the CAPTCHA. For image selection, each attempt may select different images. Default: 3

provider (enum)
Values: auto, owl, recaptcha, cloudflare, hcaptcha

CAPTCHA provider hint for optimized solving: 'auto' (detect automatically), 'owl' (Owl test CAPTCHAs), 'recaptcha' (Google reCAPTCHA), 'cloudflare' (Turnstile), 'hcaptcha'. Default: 'auto'
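The optional parameters above can be checked before the call is made. The following is a minimal sketch, not part of the library: `build_captcha_args` and `ALLOWED_PROVIDERS` are hypothetical names, and serializing `max_attempts` as a string is an assumption based on the type listed above.

```python
from typing import Dict, Optional

# Provider values listed in the parameter description above.
ALLOWED_PROVIDERS = {"auto", "owl", "recaptcha", "cloudflare", "hcaptcha"}


def build_captcha_args(
    context_id: str,
    max_attempts: Optional[int] = None,
    provider: Optional[str] = None,
) -> Dict[str, str]:
    """Hypothetical helper: assemble keyword arguments for the solve call."""
    if not context_id:
        raise ValueError("context_id is required")
    args = {"context_id": context_id}
    if max_attempts is not None:
        if max_attempts < 1:
            raise ValueError("max_attempts must be at least 1")
        # Assumption: the parameter is documented as a string, so serialize it.
        args["max_attempts"] = str(max_attempts)
    if provider is not None:
        if provider not in ALLOWED_PROVIDERS:
            raise ValueError(f"unknown provider: {provider!r}")
        args["provider"] = provider
    return args
```

Assuming the client method accepts the optional parameters as keyword arguments under the names documented here, the result could be passed as `await browser.solve_image_captcha(**build_captcha_args(context_id, max_attempts=5, provider="hcaptcha"))`. Omitted options are left out of the dictionary so the documented defaults (3 attempts, 'auto') apply.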

Response

Returns a JSON object with the operation result.

{
  "success": true,
  "result": <value>
}
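
A caller will usually want to branch on the `success` field before using `result`. The sketch below shows one way to do that, assuming the response arrives either as a JSON string or as an already-decoded dictionary with the shape shown above; `parse_captcha_response` is a hypothetical helper, not a library function.

```python
import json
from typing import Any, Tuple, Union


def parse_captcha_response(raw: Union[str, dict]) -> Tuple[bool, Any]:
    """Hypothetical helper: return (success, result) from the response."""
    data = json.loads(raw) if isinstance(raw, str) else raw
    if not isinstance(data, dict):
        raise ValueError("response is not a JSON object")
    # Treat a missing "success" field as failure rather than guessing.
    return bool(data.get("success", False)), data.get("result")
```

Returning a tuple keeps the success check explicit at the call site, for example `ok, result = parse_captcha_response(response)`, so a failed solve is not mistaken for an empty result.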