solr - how to index all metatags in nutch -



solr - how to index all metatags in nutch -

i have installed nutch 1.9 , configured crawl solr 4.10.1. trying set nutch index metadata outlined here https://wiki.apache.org/nutch/indexmetatags

how set index of metadata on site? set value metatags.names *

class="snippet-code-html lang-html prettyprint-override"><property> <name>metatags.names</name> <value>*</value> <description>names of metatags extract, separated ','. utilize '*' extract metatags. prefixes names 'metatag.' in parse-metadata. instance index description , keywords, need activate plugin index-metadata , set value of parameter 'index.parse.md' 'metatag.description,metatag.keywords'. </description> </property>

but unsure of how set value index.parse.md without listing individual metatag names. tried

class="snippet-code-html lang-html prettyprint-override"><property> <name>index.parse.md</name> <value>meta*</value> <description>comma-separated list of keys taken parse metadata generate fields. can used e.g. 'description' or 'keywords' provided these values generated parser (see parse-metatags plugin) </description> </property>

but doesn't display metadata when running

class="snippet-code-html lang-html prettyprint-override">bin/nutch indexchecker http://nutch.apache.org/

and sure there metadata on site because returns parse metadata when running

class="snippet-code-html lang-html prettyprint-override">bin/nutch parsechecker http://nutch.apache.org/

any help appreciated! thanks

solr nutch

Comments

Popular posts from this blog

java Multi query from Mysql using netbeans -

c# - DotNetZip fails with "stream does not support seek operations" -

c++ - StartServiceCtrlDispatcher don't can access 1063 error -