info.aduna.language.identification
Interface LanguageIdentifier

All Known Implementing Classes:
NGramLanguageIdentifier

public interface LanguageIdentifier

A LanguageIdentifier uses heuristics to determine the language of a text fragment. Typically, a few dozen to a few hundred characters are enough to identify the language of a text.
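
To make the contract concrete, here is a minimal sketch of an implementation. It rests on several assumptions: the interface is redeclared locally so the example compiles on its own, and StopWordLanguageIdentifier and its word lists are hypothetical names invented for illustration. It is a toy heuristic that counts common function words, not the algorithm used by NGramLanguageIdentifier.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Local mirror of the documented interface, so the sketch is self-contained.
interface LanguageIdentifier {
    String identify(String text);
}

// Hypothetical toy implementation (an assumption, not part of the library):
// it scores each language by how many of its common function words occur.
class StopWordLanguageIdentifier implements LanguageIdentifier {
    private final Map<String, Set<String>> stopWords = new LinkedHashMap<>();

    StopWordLanguageIdentifier() {
        // Illustrative word lists only; a real identifier needs far more data.
        stopWords.put("en", Set.of("the", "and", "is", "of", "to"));
        stopWords.put("nl", Set.of("de", "het", "een", "en", "van"));
        stopWords.put("de", Set.of("der", "die", "das", "und", "ist"));
    }

    @Override
    public String identify(String text) {
        if (text == null || text.trim().isEmpty()) {
            return null;
        }
        // Split on runs of non-letter characters to get lower-cased words.
        String[] tokens = text.toLowerCase(Locale.ROOT).split("\\P{L}+");
        String best = null;
        int bestScore = 0;
        for (Map.Entry<String, Set<String>> entry : stopWords.entrySet()) {
            int score = 0;
            for (String token : tokens) {
                if (entry.getValue().contains(token)) {
                    score++;
                }
            }
            if (score > bestScore) {
                bestScore = score;
                best = entry.getKey();
            }
        }
        // Stays null when no known word matched, mirroring the documented contract.
        return best;
    }
}
```

A caller would depend only on the interface type, for example `LanguageIdentifier identifier = new StopWordLanguageIdentifier();`, so that a different implementation can be substituted without changing the calling code.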


Method Summary
 String identify(String text)
          Identifies the language of the supplied text.
 

Method Detail

identify

String identify(String text)
Identifies the language of the supplied text.

Parameters:
text - The text whose language is to be determined.
Returns:
The determined language, encoded as a two-letter ISO 639-1 code, or null if the language could not be determined.
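
Because identify can return null, callers need to handle the undetermined case explicitly. The following sketch shows one way to do that with the standard java.util.Optional and java.util.Locale classes. LanguageResults and its methods are hypothetical helper names, not part of this package, and the sketch assumes the returned code is a valid language tag.

```java
import java.util.Locale;
import java.util.Optional;

// Hypothetical helper (an assumption for illustration, not library API).
final class LanguageResults {
    private LanguageResults() {
    }

    // Wraps the nullable result of identify(...) so the null case is explicit.
    static Optional<Locale> toLocale(String isoCode) {
        if (isoCode == null) {
            return Optional.empty();
        }
        return Optional.of(Locale.forLanguageTag(isoCode));
    }

    // Returns an English display name, or "unknown" when no language was determined.
    static String displayName(String isoCode) {
        return toLocale(isoCode)
                .map(locale -> locale.getDisplayLanguage(Locale.ENGLISH))
                .orElse("unknown");
    }
}
```

With this helper, `LanguageResults.displayName(identifier.identify(text))` yields a printable label whether or not the language was determined.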


Copyright © 1997-2008 Aduna. All Rights Reserved.