solr - how to index all metatags in nutch -
solr - how to index all metatags in nutch -
i have installed nutch 1.9 , configured crawl solr 4.10.1. trying set nutch index metadata outlined here https://wiki.apache.org/nutch/indexmetatags
how set index of metadata on site? set value metatags.names *
class="snippet-code-html lang-html prettyprint-override"><property> <name>metatags.names</name> <value>*</value> <description>names of metatags extract, separated ','. utilize '*' extract metatags. prefixes names 'metatag.' in parse-metadata. instance index description , keywords, need activate plugin index-metadata , set value of parameter 'index.parse.md' 'metatag.description,metatag.keywords'. </description> </property>
but unsure of how set value index.parse.md without listing individual metatag names. tried
class="snippet-code-html lang-html prettyprint-override"><property> <name>index.parse.md</name> <value>meta*</value> <description>comma-separated list of keys taken parse metadata generate fields. can used e.g. 'description' or 'keywords' provided these values generated parser (see parse-metatags plugin) </description> </property>
but doesn't display metadata when running
class="snippet-code-html lang-html prettyprint-override">bin/nutch indexchecker http://nutch.apache.org/
and sure there metadata on site because returns parse metadata when running
class="snippet-code-html lang-html prettyprint-override">bin/nutch parsechecker http://nutch.apache.org/
any help appreciated! thanks
solr nutch
Comments
Post a Comment