the content within partial_data. Once the job completes, the content will be available under data.
We recommend tracking crawl jobs yourself, because crawl status results may expire after 24 hours.
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
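As a minimal sketch of the header format described above, the following Python helper builds the request headers. The function name build_auth_headers is hypothetical and only for illustration; the only thing taken from this page is the Bearer <token> form.

```python
def build_auth_headers(token: str) -> dict:
    """Build headers carrying a Bearer token (hypothetical helper, not part of any SDK)."""
    if not token:
        raise ValueError("an auth token is required")
    # The header value is the literal word "Bearer", a space, then the token.
    return {"Authorization": f"Bearer {token}"}


# Example with a placeholder token; substitute your own auth token.
headers = build_auth_headers("YOUR_TOKEN")
```

The resulting dictionary can be passed as the headers argument of whichever HTTP client you use.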
Path Parameters
ID of the crawl job
Response
Successful response
Status of the job (completed, active, failed, paused)
Current page number
Total number of pages
Data returned from the job (null when it is in progress)
Partial documents returned as the site is being crawled (streaming). This feature is currently in alpha; expect breaking changes. When a page is ready, it is appended to the partial_data array, so there is no need to wait for the entire website to be crawled. When the crawl is done, partial_data becomes empty and the result is available in data. The array response holds at most 50 items: the oldest item (top of the array) is removed when a new item is added.
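The relationship between data and partial_data described above can be sketched in Python as follows. This is an illustration under stated assumptions, not the service's implementation: the key name "status" is assumed (this page lists only the status values), the sample payload is made up, and both function names are hypothetical. The second helper models the 50-item cap locally with a bounded deque.

```python
from collections import deque


def documents_so_far(payload: dict) -> list:
    """Return the documents currently available in a status response.

    Assumes the response is a dict with "status" (assumed key name),
    "data", and "partial_data" entries.
    """
    if payload.get("status") == "completed":
        # After completion the full result lives under data.
        return payload.get("data") or []
    # While the crawl is in progress, data is null and pages stream into partial_data.
    return payload.get("partial_data") or []


def rolling_window(items, limit: int = 50) -> list:
    """Model the cap: keep only the newest `limit` items, dropping the oldest first."""
    return list(deque(items, maxlen=limit))


# Made-up payload for illustration only.
in_progress = {"status": "active", "data": None, "partial_data": [{"page": 1}]}
available = documents_so_far(in_progress)
```

Because older entries drop out of partial_data once the cap is reached, a client that polls infrequently may want to store the documents it has already seen rather than rely on the array alone.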