Post Description
WebHarvy is a handy application designed to enable you to automatically extract data from web pages and save the extracted content in different formats. With WebHarvy, capturing data from web pages is as easy as navigating to the pages which contain data and clicking on the data to be captured. WebHarvy will intelligently identify patterns of data occurring in web pages. Using WebHarvy, you can extract data such as product catalogues or search results from a variety of websites which fall in to different categories like Real Estate, Ecommerce, Academic Research, Entertainment, Technology etc. The data extracted from web pages can be saved in a variety of formats. Often web pages display data such as search results in multiple pages. WebHarvy can automatically crawl and extract data from multiple pages.
Simple Point & Click Interface
WebHarvy is a visual web scraper. There is absolutely no need to write any scripts or code to scrape data. You will be using WebHarvy's in-built browser to navigate to web pages. You can select the data to be scraped with mouse clicks. It's that easy !
Intelligently Identifies Patterns
WebHarvy automatically identifies patterns of data occurring in web pages. So if you need to scrape a list of items (name, address, email, price etc) from a web page, you need not do any additional configuration. If the data repeats, WebHarvy will scrape it automatically.
Export Captured Data
You can save the data scraped from web pages in a variety of formats. The current version of WebHarvy allows you to save the captured data as a CSV, TSV or an XML file. You can also export the scraped data to an SQL database.
Capture Data From Multiple Pages
Often web pages display data such as search results in multiple pages. WebHarvy can automatically crawl and extract data from multiple pages. Just point out the 'link to the next page' to WebHarvy and WebHarvy will automatically scrape data from all pages.
Comments # 0