Extract content from web pages and PDFs for analysis and processing
Parameter | Type | Required | Description |
---|---|---|---|
url | string | Yes | The URL of the webpage or PDF to scrape |
tables_only | boolean | No | Extract only tables from the page (default: false) |
raw_html | boolean | No | Return raw HTML instead of parsed text (default: false) |
tables_only: true
for data extraction