Batch Data Scraping (Web)
Function Description
TIP
Scrape batch data from web pages, commonly used for scraping list and detail pages, and can scrape multiple pages of data by setting pagination buttons
Configuration Item Description
General
Command Input
Web Page Object
TBrowser
: Input a web page object that is obtained or created by the 'Open Web Page' functionOperation Target
TTarget
: The table element to be scrapedNumber of Pages to Extract
Integer
: Number of Pages to ExtractNext Page Button
TTarget
: Pagination elementNumber of Pages
Integer
: The maximum number of pages to scrapeSave to Data Table
Boolean
: Whether to save the results to the data tablePreview in Data Table
Boolean
: Preview in Data Table
Command Output
- Save Batch Data To
TDataTable
: Save the batch data obtained from the web page
Advanced
Custom Column Names
string
: You can customize the column names, separated by commas.Page Interval (milliseconds)
Integer
: Time interval between page turnsSimulate Manual Click on Pagination Button
Boolean
: Simulate a manual click eventSend Window Messages
Boolean
: Send window messages to clickAutomatically Scroll to Bottom of Page
Boolean
: Automatically scroll to load the page before scraping, can be cancelledSimulate Pause During Page Turning
Boolean
: Simulate a real person's pause during page turningMinimum Pause Duration (milliseconds)
Integer
: The shortest waiting time after execution is completeMaximum Pause Duration (milliseconds)
Integer
: The longest waiting time after execution is completeDelay Before (ms)
Integer
: The waiting time before instruction executionTimeOut
Integer
:
Error Handling
Print Error Logs
Boolean
: Whether to print error logs to the "Logs" panel when the command fails. Default is checked.Handling Method
Integer
:Terminate Process: If the command fails, terminate the process.
Ignore Exception and Continue Execution: If the command fails, ignore the exception and continue the process.
Retry This Command: If the command fails, retry the command a specified number of times with a specified interval between retries.
Usage Example
Process logic description:
Common Errors and Handling
None
Frequently Asked Questions
None