- All Implemented Interfaces:
- Tokenizer
public class RomanianWordTokenizer
extends Object
implements Tokenizer
Tokenizes a sentence into words. Punctuation and whitespace gets its own
token. Like EnglishWordTokenizer except for some characters: eg: "-'
- Since:
- 20.02.2009 19:53:50
- Author:
- Ionuț Păduraru