1. Basic parameters
Each crawler takes its own set of parameters, which must be passed inside the `context` object.

| Parameter name | Description | Type | Required |
| --- | --- | --- | --- |
| locale | Language/region setting (such as "en-US") | string | no |
| source | Data source type | string | yes |
| geo | Geographic region | string | no |
| format | Required return format | []string | no |
| js_render | Whether to enable JS rendering. false: disable, true: enable. Default is false | bool | no |
| context | Query context container | object | yes |

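The basic parameters can be assembled into a request body along the following lines. This is a minimal sketch, not the service's official client: the helper name `build_request` and the values `"example_source"` and `"json"` are assumptions for illustration, and only the field names come from the table.

```python
import json


def build_request(source, context, locale=None, geo=None, fmt=None, js_render=False):
    """Assemble a request body from the basic parameters (hypothetical helper)."""
    # source and context are the two required fields.
    if not isinstance(source, str) or not source:
        raise ValueError("source is required and must be a non-empty string")
    if not isinstance(context, dict):
        raise ValueError("context is required and must be an object")

    body = {"source": source, "context": context, "js_render": bool(js_render)}
    # Optional fields are included only when supplied.
    if locale is not None:
        body["locale"] = locale
    if geo is not None:
        body["geo"] = geo
    if fmt is not None:
        body["format"] = list(fmt)
    return body


request_body = build_request(
    source="example_source",        # placeholder, real source names are not listed here
    context={"keyword": "laptop"},  # crawler-specific parameters go inside context
    locale="en-US",
    fmt=["json"],                   # placeholder format name
)
print(json.dumps(request_body, indent=2))
```

Keeping the crawler-specific parameters in their own `context` dictionary means the same helper can serve every crawler described in the sections that follow.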
2. YouTube crawling parameters

| Function name | Parameter name | Description | Type | Required |
| --- | --- | --- | --- | --- |
| YouTube-Video Details | url_list | Video URL list; an array containing multiple video objects | array<object> | yes |
| | ├─ url | Single YouTube video link | string | yes |
| YouTube-Video Search | keyword | Search keywords | string/null | yes |
| | continuation | Search results paging cursor (for getting more results) | string/null | no |
| YouTube-Video Subtitles | url | Full YouTube video link | string/null | yes |
| | output_format | Subtitle output format; currently supports "srt" | string | no |
| YouTube-User Information Acquisition | url_list | List of user homepage URLs; an array containing multiple user objects | array<object> | yes |
| | | Single YouTube user homepage link | string | yes |

The following example uses YouTube to grab subtitles:
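A minimal sketch of a subtitle request body, assuming the crawler parameters sit under `context` as described in section 1. The `source` value `"youtube_subtitles"` and the video URL are placeholders, not identifiers confirmed by this document.

```python
import json

# Hypothetical request body for the YouTube-Video Subtitles function.
subtitles_request = {
    "source": "youtube_subtitles",  # placeholder source name (assumption)
    "context": {
        "url": "https://www.youtube.com/watch?v=VIDEO_ID",  # placeholder video link
        "output_format": "srt",  # the only format the table lists as supported
    },
}
print(json.dumps(subtitles_request, indent=2))
```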
3. Facebook crawling parameters

| Function name | Parameter name | Description | Type | Required |
| --- | --- | --- | --- | --- |
| Facebook-User Profile | url_list | List of Facebook user or page URLs to extract profile information from | list | yes |
| | list | A valid Facebook profile or page URL | string | yes |

The following example grabs Facebook user profile information:
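A minimal sketch of the `context` for a profile request. The table does not clearly name the field that holds each URL, so this sketch assumes every entry is an object keyed by `url`, mirroring the YouTube `url_list` layout. The helper name and the profile URL are also illustrative.

```python
import json


def facebook_profile_context(urls, item_key="url"):
    """Wrap each profile or page URL in an object (item_key is an assumption)."""
    if not urls:
        raise ValueError("url_list is required and must not be empty")
    return {"url_list": [{item_key: u} for u in urls]}


fb_context = facebook_profile_context(["https://www.facebook.com/example.page"])
print(json.dumps(fb_context, indent=2))
```

If the service expects a different key, pass it through `item_key` rather than changing the helper.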
4. Amazon crawling parameters

| Function name | Parameter name | Description | Type | Required |
| --- | --- | --- | --- | --- |
| Amazon-Product Search | keyword_list | Search keyword list; supports crawling multiple keywords | list (object[]) | yes |
| | ├─ keyword | A single search keyword to crawl | string | yes |
| | start_pages | Page number to start crawling from (the first page is 1) | number | no |
| | pages | Total number of pages to crawl | number | no |
| Amazon-Best Seller | category_list | Category ID list; an array containing multiple category objects | array<object> | yes |
| | ├─ category_id | Unique category identifier | string | yes |
| | start_page | Page number to start crawling from | integer | no |
| | pages | Total number of pages to crawl | integer | no |
| Amazon-Product Reviews | product_url_list | Product URL list; an array containing multiple product objects | array<object> | yes |
| | ├─ product_url | URL of a single product | string | yes |
| Amazon-Product Details | product_url_list | Product URL list; an array containing multiple product objects | array<object> | yes |
| | ├─ product_url | URL of a single product | string | yes |

The following example crawls Amazon products by keyword:
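A minimal sketch of the `context` for Amazon-Product Search, built from the parameter names in the table (`keyword_list`, `keyword`, `start_pages`, `pages`). The helper name, the defaults of 1, and the page-number checks are assumptions made for illustration.

```python
import json


def amazon_search_context(keywords, start_pages=1, pages=1):
    """Build the context for a keyword search (hypothetical helper)."""
    if not keywords:
        raise ValueError("keyword_list is required and must not be empty")
    if start_pages < 1 or pages < 1:
        raise ValueError("start_pages and pages must be at least 1")
    return {
        # Each keyword is wrapped in its own object, as the table describes.
        "keyword_list": [{"keyword": k} for k in keywords],
        "start_pages": start_pages,
        "pages": pages,
    }


amazon_context = amazon_search_context(["wireless mouse", "usb hub"], start_pages=1, pages=2)
print(json.dumps(amazon_context, indent=2))
```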
5. TikTok crawling parameters

| Function name | Parameter name | Description | Type | Required |
| --- | --- | --- | --- | --- |
| TK Video Search | keyword | Search keyword; the system searches for videos matching it | string | yes |
| TK Comments | video_id | TikTok video ID, used to locate the video | string | yes |
| | cursor | Cursor used to page through additional responses | string | no |
| | count | Maximum number of responses returned per page | integer | no |
| TK User Information Search | url_list | User URL list; an array containing multiple user objects | array<object> | yes |
| | ├─ user_url | Single TikTok user homepage URL | string | yes |

The following example grabs TikTok video information:
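A minimal sketch of a TK Video Search request body, assuming the single required parameter `keyword` goes inside `context`. The `source` value `"tiktok_video_search"` and the keyword are placeholders.

```python
import json

# Hypothetical request body for the TK Video Search function.
tiktok_request = {
    "source": "tiktok_video_search",  # placeholder source name (assumption)
    "context": {
        "keyword": "street food",  # the only parameter this function requires
    },
}
print(json.dumps(tiktok_request, indent=2))
```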
6. X crawling parameters

| Function name | Parameter name | Description | Type | Required |
| --- | --- | --- | --- | --- |
| X Tweet Information | user_id | The ID of an X (Twitter) user, used to get that user's tweets | string | yes |

The following example grabs X tweet information:
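A minimal sketch of the `context` for a tweet request. The table types `user_id` as a string, so this hypothetical helper converts numeric IDs before sending. The ID shown is a placeholder.

```python
import json


def x_tweets_context(user_id):
    """Build the context for X tweet information (hypothetical helper)."""
    if user_id is None or str(user_id) == "":
        raise ValueError("user_id is required")
    # Convert to str so an integer ID is not serialized as a JSON number.
    return {"user_id": str(user_id)}


x_context = x_tweets_context(1234567890)
print(json.dumps(x_context))
```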
7. Google crawling parameters

| Function name | Parameter name | Description | Type | Required | Optional values |
| --- | --- | --- | --- | --- | --- |
| Google Job Search | job_name | Job title keywords | string | yes | - |
| | job_location_list | Location list | array<object> | no | - |
| | ├─ job_location | | | | |
| | experiences | Work experience filter values | array<string> | no | INTERN_AND_APPRENTICE, EARLY, MID, ADVANCED, DIRECTOR_PLUS |
| | has_remote | Whether the position is remote | boolean | no | - |
| | skills | Skill keywords (comma separated) | string | no | - |
| | degree | Education filter values | array<string> | no | PURSUING_DEGREE, ASSOCIATE, BACHELORS, MASTERS, DOCTORATE |
| | employment_type | Position type | array<string> | no | FULL_TIME, PART_TIME, TEMPORARY, INTERN |
| | sort_by | Sort field (relevance/time) | string | no | relevance, date |
| | page | Pagination page number (starting from 1) | integer (recommended) / string | no | - |
| Google Events | keywords_list | Keyword list (an array of objects, each containing a keyword field) | array<object> | yes | - |
| | start_page | Page number to start crawling from | integer | no | - |
| | end_page | Last page number to crawl | integer | no | - |
| Google Image | keywords_list | An array of keyword objects, used to specify search keywords | array<object> | yes | - |
| | ├─ keyword | Search keywords | string | yes | - |
| | start_page | Page number to start crawling from | integer / null | no | - |
| | end_page | Last page number to crawl | integer / null | no | - |
| Google Autocomplete | keywords_list | Keyword object array | array<object> | yes | - |
| | ├─ keyword | Search keywords to get auto-complete suggestions for | string | yes | - |
| Google Finance | keywords_list | Keyword object array | array<object> | yes | - |
| | ├─ keyword | Search keywords (such as company name, stock code, etc.) | string | yes | - |
| Google Search | keywords_list | Search keyword object array | array<object> | yes | - |
| | ├─ keyword | Search keyword strings (such as product names, brand names, etc.) | string | yes | - |
| | start_page | First results page to search (such as 1) | int/null | no | - |
| | end_page | Last results page to search (such as 3) | int/null | yes | - |

The following example crawls Google job listings:
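A minimal sketch of the `context` for Google job search, checking filter values against the allowed values listed in the table before the request is sent. The helper name `google_jobs_context`, its validation behavior, and the sample job title and location are assumptions. Only the parameter names and allowed values come from the table.

```python
import json

# Allowed values copied from the Google job search table.
ALLOWED = {
    "experiences": {"INTERN_AND_APPRENTICE", "EARLY", "MID", "ADVANCED", "DIRECTOR_PLUS"},
    "degree": {"PURSUING_DEGREE", "ASSOCIATE", "BACHELORS", "MASTERS", "DOCTORATE"},
    "employment_type": {"FULL_TIME", "PART_TIME", "TEMPORARY", "INTERN"},
    "sort_by": {"relevance", "date"},
}


def google_jobs_context(job_name, locations=(), page=1, **filters):
    """Build and validate the job search context (hypothetical helper)."""
    if not job_name:
        raise ValueError("job_name is required")
    context = {"job_name": job_name, "page": page}
    if locations:
        # Each location is wrapped in an object keyed by job_location.
        context["job_location_list"] = [{"job_location": loc} for loc in locations]
    for name, value in filters.items():
        if name in ALLOWED:
            # sort_by is a single string; the other filters are lists of strings.
            values = [value] if isinstance(value, str) else list(value)
            unknown = [v for v in values if v not in ALLOWED[name]]
            if unknown:
                raise ValueError(f"{name}: unsupported value(s) {unknown}")
        elif name not in ("has_remote", "skills"):
            raise ValueError(f"unknown parameter: {name}")
        context[name] = value
    return context


jobs_context = google_jobs_context(
    "data engineer",
    locations=["New York"],
    experiences=["MID", "ADVANCED"],
    employment_type=["FULL_TIME"],
    has_remote=True,
    sort_by="date",
)
print(json.dumps(jobs_context, indent=2))
```

Validating the enumerated filters locally surfaces a misspelled value such as `"FULLTIME"` as a `ValueError` before any request is made.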