Zend_Lucene CJK support

Does anyone know whether the Zend_Lucene class supports CJK (Chinese, Japanese, Korean)?

I want to use it on my own website; the only catch is that it has to work with both English and Japanese.

Also, if anyone has some resources about the Java version of Lucene, that would be appreciated too.

Thanks.

1 answer


These are currently the only UTF-8 compatible analyzers built into Zend_Search_Lucene:

  • Zend_Search_Lucene_Analysis_Analyzer_Common_Utf8
  • Zend_Search_Lucene_Analysis_Analyzer_Common_Utf8Num
  • Zend_Search_Lucene_Analysis_Analyzer_Common_Utf8_CaseInsensitive
  • Zend_Search_Lucene_Analysis_Analyzer_Common_Utf8Num_CaseInsensitive

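You can make one of them the default analyzer with the following code:

    // Use a UTF-8 aware analyzer instead of the default ASCII-only text analyzer
    Zend_Search_Lucene_Analysis_Analyzer::setDefault(
        new Zend_Search_Lucene_Analysis_Analyzer_Common_Utf8()
    );
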
You can also create your own analyzer if you like.
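
As a rough idea of what such an analyzer could do for Japanese, here is a minimal sketch of bigram tokenization in plain PHP. The function name cjkBigramTokens is hypothetical and not part of Zend Framework; it only shows the splitting logic, assuming you want Latin words kept whole and CJK runs broken into overlapping two-character terms.

```php
<?php
// Hypothetical helper (not a Zend Framework API): turns mixed
// English/CJK UTF-8 text into a flat list of index terms.
function cjkBigramTokens(string $text): array
{
    $tokens = [];

    // Find runs of ASCII letters/digits, or runs of CJK characters.
    // The prolonged sound mark "ー" is listed explicitly as a precaution.
    $pattern = '/[A-Za-z0-9]+|[\p{Han}\p{Hiragana}\p{Katakana}\p{Hangul}ー]+/u';
    preg_match_all($pattern, $text, $matches);

    foreach ($matches[0] as $run) {
        if (preg_match('/^[A-Za-z0-9]+$/', $run)) {
            // Latin word: keep it whole, lowercased.
            $tokens[] = strtolower($run);
            continue;
        }

        // CJK run: split into individual characters.
        $chars = preg_split('//u', $run, -1, PREG_SPLIT_NO_EMPTY);
        $count = count($chars);

        if ($count === 1) {
            // A lone character becomes its own term.
            $tokens[] = $chars[0];
            continue;
        }

        // Emit overlapping pairs: ABC -> AB, BC.
        for ($i = 0; $i < $count - 1; $i++) {
            $tokens[] = $chars[$i] . $chars[$i + 1];
        }
    }

    return $tokens;
}

print_r(cjkBigramTokens('Zend 東京都 search'));
```

To plug logic like this into the index, a custom analyzer would extend Zend_Search_Lucene_Analysis_Analyzer_Common and implement reset() and nextToken(), returning one Zend_Search_Lucene_Analysis_Token per term. I haven't tested that part against a real index.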

An alternative solution would be to create the index with Java Lucene and then use that index from PHP, since the index formats are meant to be compatible. I haven't tried this though.

Zend_Search_Lucene was derived from the Apache Lucene project. Currently (since ZF 1.6), Lucene index format versions 1.4 to 2.3 are supported.

You can read more about this in the Zend Framework manual.
