Tool | Package | Local | Serializable | JS support |
---|---|---|---|---|
Crawl Tool | langchain-hyperbrowser | ❌ | ❌ | ❌ |
Scrape Tool | langchain-hyperbrowser | ❌ | ❌ | ❌ |
Extract Tool | langchain-hyperbrowser | ❌ | ❌ | ❌ |
langchain-hyperbrowser
integration package, and create a Hyperbrowser account and get an API key.
HyperbrowserCrawlTool
is a powerful tool that can crawl entire websites, starting from a given URL. It supports configurable page limits and scraping options.
HyperbrowserScrapeTool
is a tool that can scrape content from web pages. It supports both markdown and HTML output formats, along with metadata extraction.
HyperbrowserExtractTool
is a powerful tool that uses AI to extract structured data from web pages. It can extract information based predefined schemas.
url
: The URL to processsession_options
: Browser session configuration
use_proxy
: Whether to use a proxysolve_captchas
: Whether to automatically solve CAPTCHAsaccept_cookies
: Whether to accept cookiesmax_pages
: Maximum number of pages to crawlscrape_options
: Options for scraping each page
formats
: List of output formats (markdown, html)scrape_options
: Options for scraping the page
formats
: List of output formats (markdown, html)schema
: Pydantic model defining the structure to extractextraction_prompt
: Natural language prompt for extraction