首页 \ 教程 \ solr

知识点

Solr

solr中集成中文分词 mmseg4j

Solr学习笔记之2、集成IK中文分词器

[ solr入门 ] - 在schema.xml中加入中文分词（IKAnalyzer）

集成Nutch和Solr

nutch，solr集成在hadoop上

solr中文分词

【转】 solr中文分词

solr中文分词

Nutch-2.2.1学习之七Nutch与Solr的集成

Solr中文分词配置（2）

nutch1.8+solr 4 配置过程+ikanalayzer2012 中文分词器

Nutch和Solr的集成方案

[ solr入门 ] - 在schema.xml中加入自己的分词工具

安装solr中文分词系统

Nutch集成Solr中文分词Schema

2019-03-27 00:22|来源: 网路

<?xml version="1.0" encoding="UTF-8" ?>

<!-- Licensed to the Apache Software Foundation (ASF) under one or more contributor

license agreements. See the NOTICE file distributed with this work for additional

information regarding copyright ownership. The ASF licenses this file to

You under the Apache License, Version 2.0 (the "License"); you may not use

this file except in compliance with the License. You may obtain a copy of

the License at http://www.apache.org/licenses/LICENSE-2.0 Unless required

by applicable law or agreed to in writing, software distributed under the

License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS

OF ANY KIND, either express or implied. See the License for the specific

language governing permissions and limitations under the License. -->

<!-- Description: This document contains Solr 3.1 schema definition to be

used with Solr integration currently build into Nutch. See https://issues.apache.org/jira/browse/NUTCH-442

https://issues.apache.org/jira/browse/NUTCH-699 https://issues.apache.org/jira/browse/NUTCH-994

https://issues.apache.org/jira/browse/NUTCH-997 and http://svn.apache.org/viewvc/lucene/dev/branches/branch_3x/solr/

example/solr/conf/schema.xml?view=markup for more info. -->

<types>

<fieldType name="string" class="solr.StrField"

sortMissingLast="true" omitNorms="true" />

<fieldType name="long" class="solr.TrieLongField"

precisionStep="0" omitNorms="true" positionIncrementGap="0" />

<fieldType name="float" class="solr.TrieFloatField"

precisionStep="0" omitNorms="true" positionIncrementGap="0" />

<fieldType name="date" class="solr.TrieDateField"

precisionStep="0" omitNorms="true" positionIncrementGap="0" />

<fieldType name="cache_text" class="solr.TextField"

positionIncrementGap="100">

</fieldType>

<fieldType name="text" class="solr.TextField"

positionIncrementGap="100">

<tokenizer class="com.chenlb.mmseg4j.solr.MMSegTokenizerFactory"

mode="complex" dicPath="dic" />

</analyzer>

<tokenizer class="com.chenlb.mmseg4j.solr.MMSegTokenizerFactory"

mode="complex" dicPath="dic" />

</analyzer>

</fieldType>

<fieldType name="url" class="solr.TextField"

positionIncrementGap="100">

<filter class="solr.WordDelimiterFilterFactory"

generateWordParts="1" generateNumberParts="1" />

</analyzer>

</fieldType>

</types>

<field name="url" type="url" stored="true" indexed="true"

required="true" />

<field name="cache_content" type="cache_text" stored="true"

indexed="false" />

<field name="anchor" type="string" stored="true" indexed="true"

multiValued="true" />

<field name="type" type="string" stored="true" indexed="true"

multiValued="true" />

<field name="subcollection" type="string" stored="true" indexed="true"

multiValued="true" />

<field name="tag" type="string" stored="true" indexed="true"

multiValued="true" />

<field name="cc" type="string" stored="true" indexed="true"

multiValued="true" />

</fields>

<defaultSearchField>content</defaultSearchField>

</schema>

本文出自 “果壳中的宇宙” 博客，请务必保留此出处http://williamx.blog.51cto.com/3629295/773815

转自：http://williamx.blog.51cto.com/3629295/773815

知识点

相关文章

最近更新

Nutch集成Solr中文分词Schema

相关问答

如何解决：请教nutch和solr集成问题[2023-12-14]

如何解决：请教nutch和solr集成问题[2021-11-19]

solr用qieqie庖丁加入中文分词问题[2021-12-08]

lucene 中文分词？[2022-07-24]

Nutch与Solr(Nutch versus Solr)[2022-06-20]

Nutch 1.2 Solr 3.6集成问题(Nutch 1.2 Solr 3.6 integration issue)[2022-08-14]

Apache Nutch 1.12和Solr 5.4.1的集成失败(Integration of Apache Nutch 1.12 and Solr 5.4.1 failed)[2024-01-20]

nutch 1.2 solr 3.1集成问题(nutch 1.2 solr 3.1 integration issue)[2023-02-21]

我应该使用cygwin进行nutch和solr集成吗？(Should i use cygwin for nutch and solr integration?)[2023-01-10]

Nutch v Solr v Nutch + Solr(Nutch v Solr v Nutch+Solr)[2022-04-21]