A ready-to-use web spider that works with proxies, asynchrony, rate limit, configurable request pools, jQuery, and HTTP/2 support.
94
Duplicate URL detection and skipping
Crawler instantiation
100%
100%
skipDuplicates enabled
0%
100%
Cheerio title extraction
100%
100%
Cheerio price extraction
100%
100%
Callback and done()
100%
100%
drain event
100%
100%
Request queue management with priority-based scheduling
Crawler instantiation
100%
100%
maxConnections option
100%
100%
Priority option usage
100%
100%
Callback implementation
100%
100%
Cheerio extraction
100%
100%
done() invocation
100%
100%
Drain event handling
100%
100%
Advanced proxy management with per-proxy rate limiters
Crawler instantiation
100%
100%
Rate limiter assignment
0%
100%
Rate limit configuration
100%
100%
Task queueing
66%
100%
HTML parsing
100%
100%
Completion handling
50%
70%
User agent handling and configuration
Crawler instantiation
100%
100%
userAgents configuration
100%
100%
Request queuing
100%
100%
Callback implementation
100%
100%
Done function call
100%
100%
Completion handling
100%
100%
Result aggregation
100%
100%
Content-Type based intelligent response processing
Crawler instantiation
100%
100%
HTML parsing with jQuery
15%
0%
Charset handling
0%
30%
JSON content processing
25%
100%
Binary data handling
75%
100%
Queue-based URL processing
100%
100%
Callback flow control
100%
100%
Advanced Got options passthrough for full HTTP control
Crawler instantiation
100%
100%
SSL verification control
100%
100%
Redirect configuration
100%
100%
Decompression handling
100%
100%
JSON parsing
0%
100%
Charset detection and encoding conversion
Crawler instantiation
100%
100%
forceUTF8 option
0%
0%
jQuery parsing
0%
100%
Callback structure
100%
100%
Queue management
100%
100%
Encoding information
100%
100%
drain event
0%
0%
Silence mode for production logging
Silence configuration
100%
100%
Crawler instantiation
100%
100%
Queue management
100%
100%
Callback implementation
100%
100%
Data extraction
100%
100%
Automatic retry mechanism for failed requests
Crawler instantiation
100%
100%
Retry attempts configuration
100%
100%
Retry interval configuration
100%
100%
Timeout configuration
100%
100%
Request queueing
100%
100%
Callback handling
100%
100%
Error detection
100%
100%
Cookie jar integration with tough-cookie
CookieJar instantiation
100%
100%
Crawler instantiation
100%
100%
CookieJar configuration
100%
100%
Request queueing
66%
100%
Callback handling
100%
100%
Install with Tessl CLI
npx tessl i tessl/npm-crawlerTable of Contents