How to detect a document's language in PHP?

The basics have already been answered here. But is there a built-in PHP library similar to Lingua::Identify from CPAN?



2 answers


I have used the PEAR package Text_LanguageDetect before. It gets the job done well enough. I'm not aware of any other libraries that are more mature.
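
For illustration, a minimal sketch of calling the package might look like the following. It assumes Text_LanguageDetect is installed and reachable on the include path (or through an autoloader). The wrapper name `guessLanguage` is hypothetical and not part of the package; only the `Text_LanguageDetect` class and its `detectSimple()` method come from the library.

```php
<?php
// Hypothetical helper: returns the most likely language name, or null
// when the package is missing or no language could be determined.
function guessLanguage(string $text): ?string
{
    // Assumption: the class is either autoloadable or shipped as
    // Text/LanguageDetect.php somewhere on the include path.
    if (!class_exists('Text_LanguageDetect')
        && !(@include_once 'Text/LanguageDetect.php')) {
        return null; // package not installed
    }

    try {
        $detector = new Text_LanguageDetect();
        // detectSimple() returns the single best guess.
        $language = $detector->detectSimple($text);
    } catch (Throwable $e) {
        return null; // treat any library error as "unknown"
    }

    return is_string($language) ? $language : null;
}

var_dump(guessLanguage('Ceci est un petit texte écrit en français.'));
```

Short inputs tend to give unreliable guesses, so it is probably worth passing a full paragraph or more where possible.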





1- You could do it yourself (the hard way): detect both the language and the code page by looking at character and n-gram frequencies. You will need a lot of training data, but it's doable.

2- You can run a Perl script from PHP to do the detection (much easier).
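
To give an idea of what option 1 involves, here is a rough sketch of trigram-based detection using a rank-order ("out-of-place") comparison in the style of Cavnar and Trenkle. Everything in it is an assumption made for illustration: the function names are hypothetical, the two training sentences are toy data far too small for real use, the input is assumed to be UTF-8 already (so code page detection is not covered), and the mbstring extension is assumed to be available.

```php
<?php
declare(strict_types=1);

// Build a ranked trigram profile: trigram => rank (0 = most frequent).
function trigramProfile(string $text, int $size = 300): array
{
    $text = mb_strtolower($text, 'UTF-8');
    // Keep letters only; collapse everything else into single spaces.
    $text = ' ' . trim(preg_replace('/[^\p{L}]+/u', ' ', $text)) . ' ';

    $counts = [];
    $length = mb_strlen($text, 'UTF-8');
    for ($i = 0; $i + 3 <= $length; $i++) {
        $gram = mb_substr($text, $i, 3, 'UTF-8');
        $counts[$gram] = ($counts[$gram] ?? 0) + 1;
    }

    arsort($counts); // most frequent trigrams first
    return array_flip(array_slice(array_keys($counts), 0, $size));
}

// Sum of rank differences; trigrams missing from the language
// profile get a fixed maximum penalty.
function outOfPlaceDistance(array $document, array $language): int
{
    $penalty = count($language);
    $distance = 0;
    foreach ($document as $gram => $rank) {
        $distance += isset($language[$gram])
            ? abs($rank - $language[$gram])
            : $penalty;
    }
    return $distance;
}

// Return the name of the closest profile, or null if none were given.
function detectLanguage(string $text, array $profiles): ?string
{
    $document = trigramProfile($text);
    $best = null;
    $bestDistance = PHP_INT_MAX;
    foreach ($profiles as $name => $profile) {
        $distance = outOfPlaceDistance($document, $profile);
        if ($distance < $bestDistance) {
            $bestDistance = $distance;
            $best = $name;
        }
    }
    return $best;
}

// Toy training data; a real detector needs far larger samples.
$samples = [
    'english' => 'The quick brown fox jumps over the lazy dog and then runs through the forest while the birds are singing in the trees.',
    'spanish' => 'El rápido zorro marrón salta sobre el perro perezoso y luego corre por el bosque mientras los pájaros cantan en los árboles.',
];
$profiles = array_map('trigramProfile', $samples);

echo detectLanguage('The dog runs through the forest.', $profiles), PHP_EOL;
```

With realistic training corpora per language, the same structure should carry over; the profile size of 300 is an arbitrary choice here and would need tuning.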







