Find out what a headless browser is, what you can use it for, and how it will help you with scraping websites. Learn about the best headless browsers and libraries.
Find out how to use Puppeteer to handle forms, buttons, and inputs. Learn about the type and click methods, and how to deal with text fields, dropdowns, and checkboxes.
Learn how to start a browser with Puppeteer, click buttons and wait for actions, and how to extract data from websites. From building a basic scraper to large-scale crawling.
What if a single decision could cut your scraping costs by 90% while improving efficiency by up to 60%?
That's precisely what the retail data company Daltix experienced by migrating their scrapers from Scrapy to Apify.
Since the launch of Apify actors last autumn, Apify is no longer just a tool to extract data from websites. It has become a full-featured serverless computing platform that enables people to automate workflows on the web, run data processing pipelines, or integrate with third-party systems.
We have released a new open-source package called proxy-chain on npm that lets you run headless Chrome and Puppeteer over a proxy server that requires authentication.