The scraping bot is the alter ego of the Website Crawler.
He’s absolutely +not resourceful! You have to take him by the hand and tell him where he can find the title of an article, its image, etc. The Scraping bot will go to the page you’re interested in.
The Scraping bot will go +to the page you’ve told it to. It won’t be interested in the whole site like the crawler.
If you launch it on the press release page, it will stay in that category.
*What’s the advantage then?
The Scraping bot, if properly set up, offers +impeccable collection and is fully customizable.
Creating a scrapping bot
Resources > Sources > Create > Enter URL > No RSS available for this site or the part you want to scrape? > Click on create.
You can choose the type of bot scraping you want, depending on the page style you want to follow.
Choose the right bot
Textual changes
Want to track textual changes in a specific part of a web page?
The Textual Changes robot is the one for you.
It will let you know if changes have been made to any part of a given page.
Single Page Newsfeed
Do you want to collect new content appearing on a given web page?
The Single Page Newsfeed robot is the one for you.
It will allow you to collect articles on a single level only. It will stay on the given page and not go beyond it.
Newsfeed “Read More
Do you want to collect the full content of new summaries appearing on a given web page?
The Newsfeed “Read More “ robot is the one for you.
It will allow you to collect publications with content published on another page (a second level).
News aggregator
Do you want to collect the complete content of different websites whose new summaries appear on a given web page?
Then the News aggregator is the robot for you.
It will collect publications on two levels (even if the second level has a different domain name).
Post your comment on this topic.