How to scrape Goodreads books and reviews without using Goodreads API

Tuğkan Cengiz
Tuğkan Cengiz

The Goodreads API stopped issuing new keys to developers in 2020 and the future of APIs at Goodreads is uncertain. But if you still need to fetch data on books and reviews from Goodreads, you can now use Goodreads Scraper to get them.

TL;DR

Goodreads Scraper 📚 · Apify
Scrape goodreads.com for data on millions of books. Crawl book details for images, ISBN, author, description, title, buy links, number of reviews, page number, language, and all other details. You can specify search terms, filters, and much more.
Goodreads Scraper detail link on the Apify platform

Features

The Goodreads data scraper supports the following features:

  • Search any keyword: search any keyword you want.
  • Scrape lists: scrape any list from Goodreads.
  • Scrape shelf: check the shelves and scrape the information from the newest updates.
  • Scrape genres: to get the most read books on a certain category or anything related to genres, just use the URL.
  • Scrape book detail: scrape very detailed information for each book.
  • Scrape reviews: scrape all reviews of a book.

Setup & usage

You can see how this actor works in these videos:

Using Goodreads Scraper with Search mode

You can check the output of this example here.

Using start URLs

Using Goodreads Scraper with Start URLs

You can check the output of this example here.

When you want to scrape a specific listing URL, just copy and paste the link as one of the start URLs.

If you would like to scrape only the first page of a list then put the link for the page and have the endPage as 1.

With the last approach explained above, you can also fetch any interval of pages. If you provide the 5th page of a list and define the endPage parameter as 6 then you’ll have the 5th and 6th pages only.

Final words

Like my all other actors, there are lots of new features on the roadmap and I am always open to new ideas. Please don’t hesitate to contact me if you have any feedback, feature requests, or totally new ideas that might be interesting to implement.

P.S. You should always use a proxy to get the best results.



Great! Next, complete checkout for full access to Apify
Welcome back! You've successfully signed in
You've successfully subscribed to Apify
Success! Your account is fully activated, you now have access to all content
Success! Your billing info has been updated
Your billing was not updated