Q: What is a configuration rule/extraction task in Octoparse?

 

 

     A:

Crawlers run in Octoparse are determined by the rules configured, and the data extracted is structured data. Octoparse does not understand the web content with advanced algorithms, but it grabs the exact web content to you perfectly.

The rule configured would tell Octoparse: which website is to be open; where is the data you plan to crawl; what kind of data you want, etc.

Octoparse has a visible workflow designer to show how the rule is created. You can configure the rule by simply point and click the element on the web page. Octoparse can scrape multiple pages (pagination),  scrape a website behind a login, deal with web pages loaded with AJAX, scrape a website with infinite scrolling, etc.

 

Check out this article to better understand a task in Octoparse. 

A task in Octoparse means a crawler

 

 

If you have any question, we'd happy to help.

Octoparse Support Team

btn_sidebar_use.png
btn_sidebar_form.png
当社ウェブサイトは、利便性、品質維持・向上を目的に、Cookieを使用しております。詳しくはプロキシーをご確認ください。Cookieの利用に同意頂ける場合は、「同意する」ボタンを押してください。同意頂けない場合は、ブラウザを閉じて閲覧を中止してください。
同意する 閉じる