Hi everyone,
I'm trying to crawl several websites using Apache Nutch and extract only their title, keyword and description (and nothing else)
I saw several examples on how to do that.
However they all propose complicated (at least to a Nutch newbie) plugins configuration and settings Since my use case sounds like a very common one I was wondering if there is any simpler solution?
If there is no easier solution, can anyone at least explain what are the steps required for me to extract just these specific tags?
Thanks in advance