Notes

Home

❯

Projects

❯

Search Engine

❯

Rust-Based Search Engine

Rust-Based Search Engine

Jun 29, 20241 min read

Data Collection/Web Scraping

Puppeteer

Running Node.JS

Need to randomize the user agent of the browser so that websites don’t block the scraping or ask for captchas. https://www.zenrows.com/blog/puppeteer-user-agent#use-a-random-ua

Without CloudFlare or other scraping-protection systems blocking requests https://scrapfly.io/blog/how-to-scrape-without-getting-blocked-tutorial/


Graph View

  • Notes Source