StormCrawler
Angelegt Samstag 07 März 2020
The preferred web crawler for Termcatcher is StormCrawler (StormCrawler FAQ). There is an alternative, which is probably less fast: Nutch. The Nutch tutorial can be found here: https://cwiki.apache.org/confluence/display/NUTCH/NutchTutorial.
Backlinks: Home:Technical Background