Skip to main content
All CollectionsLeads and Data
Using web scrapers for lead generation
Using web scrapers for lead generation

How to scrape the online data source with a web scraper to get more leads

Updated over 4 months ago

Web scraping can be used to gather contact information, such as email addresses and phone numbers, from websites. This information can then be used for lead generation or enrichment.

In the following example, I am going to demonstrate how to use one of the web scrapers to easily get thousands of contact data. We will need the following:

  • Data source (Clutch.co, TheManifest.com, etc.)

  • Web Scraper (Octoparse)

  • Enrichment tool (Findymail, Apollo)

You can watch the full video through this link.

Here is the process to scrape leads from Clutch.co using Octoparse:

  1. Find a database that is easy to scrape. Check if there are enough results for your search and that the database includes both the company name and the website. We will need this info for enrichment.

  2. Some of the free data sources for company search:

  3. On Clutch.co is a free data source and it is easy to scrape it because all the main info is on the first page.

    • Choose the industry, location, and size of your niche.

    • For example, Human Resources companies in the United States, size 1-249.

    • Click on All filters to get more granular.

    • Grab the URL once you are happy with the results.

  4. Open Octoparse and on the first page, enter the URL. Click Start.

  5. Turn on the Browser mode to get a better view of the page (accept cookies, close ads, etc), and then turn it off.

  6. Click "Autodetect Webpage Data" to scan the page.

  7. Keep the Pagination feature on so it automatically opens and scrapes the next page. Click Create workflow.

  8. In the Data Preview window, delete unnecessary columns by clicking the three dots and selecting Delete Field.

  9. Click Run to start scraping.

  10. Click Run on your device (Standard Mode).

  11. The scraping process starts. Free version of Octoparse allows you to export 10k rows per session so when you reach the desired amount of leads you can stop the process.

  12. Click Export Data to get the CSV File.

  13. Upload it to your Drive, and delete unnecessary data columns (but keep the Company name and domains).


How to clean the file

  1. Select the Title column, click Data, Data Cleanup, and choose Clean whitespace to remove extra spaces in the Company name.

  2. Now your CSV File is ready to be enriched with other tools to get the leads from these companies.

Did this answer your question?