
browser_solve_text_captcha

Browser Solve Text Captcha

Solve a text-based CAPTCHA by using a vision model to read the distorted characters. Automatically finds the CAPTCHA image, extracts the text, enters it in the input field, and optionally submits it.

Usage Example

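import asyncio
from owl_browser import OwlBrowser, RemoteConfig

# Async usage; `config` is a RemoteConfig set up for your deployment
async def main(config: RemoteConfig) -> None:
    async with OwlBrowser(config) as browser:
        # Create a browser context and keep its ID for later calls
        context = await browser.create_context()
        context_id = context["context_id"]
        # Locate the CAPTCHA image, read it, and fill in the answer
        await browser.solve_text_captcha(
            context_id=context_id
        )

# Run with: asyncio.run(main(config))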

Parameters

Required

context_id (string, required)

The unique identifier of the browser context (e.g., 'ctx_000001')

Optional

max_attempts (string)

Maximum number of attempts to solve the CAPTCHA before giving up. Each attempt re-analyzes the image and tries a new solution. Default: 3
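
The optional parameter can be passed alongside context_id. The following is a minimal sketch, not the library's documented usage: the helper name solve_with_attempts is hypothetical, the browser argument is assumed to be an already-open OwlBrowser instance, and converting the value to a string is an assumption based only on the type listed above.

```python
import asyncio
from typing import Any, Optional


async def solve_with_attempts(
    browser: Any,
    context_id: str,
    max_attempts: Optional[int] = None,
) -> Any:
    """Call solve_text_captcha, adding max_attempts only when it is given.

    `browser` is assumed to expose the async solve_text_captcha method
    shown in the usage example; this helper itself is hypothetical.
    """
    kwargs = {"context_id": context_id}
    if max_attempts is not None:
        # Assumption: the parameter is listed as a string, so the number
        # is sent in string form. Adjust if your client expects an int.
        kwargs["max_attempts"] = str(max_attempts)
    # When max_attempts is omitted, the documented default of 3 applies.
    return await browser.solve_text_captcha(**kwargs)
```

Inside the async with block from the usage example, this could be called as await solve_with_attempts(browser, context_id, max_attempts=5).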

Response

Returns a JSON object with the operation result.

{
  "success": true,
  "result": <value>
}
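
A caller will usually want to check the success flag before using the result. The sketch below assumes the client returns the response as a dict-like object with the two keys shown above; the helper name unwrap_result is hypothetical, and the shape of a failed response is not documented here, so anything other than "success": true is treated as a failure.

```python
from typing import Any, Mapping


def unwrap_result(response: Mapping[str, Any]) -> Any:
    """Return the "result" field, or raise if the call did not succeed.

    Assumes the response has the {"success": ..., "result": ...} shape
    shown above; the failure shape is an assumption.
    """
    if response.get("success") is not True:
        raise RuntimeError(f"CAPTCHA solve did not succeed: {dict(response)!r}")
    return response.get("result")
```

For example, result = unwrap_result(await browser.solve_text_captcha(context_id=context_id)) raises early instead of letting a failed solve pass unnoticed.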