Package de.jplag.text
Class Language
java.lang.Object
de.jplag.text.Language
- All Implemented Interfaces:
Language
Language module for parsing natural-language text. It takes a deliberately simple approach: each individual
word is interpreted as its own token type, and whitespace and special characters are ignored. This approach works,
but more sophisticated techniques for detecting text plagiarism exist, for example ones based on NLP.
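The word-as-token idea described above can be illustrated with a minimal sketch. This is not the code used by this class: the class name WordTokenizerSketch, the method tokenize, and the choice of a letter-or-digit regular expression as the word boundary are all assumptions made for illustration, and the real module may treat case, digits, or punctuation differently.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch only; NOT the de.jplag.text implementation.
// All names here (WordTokenizerSketch, tokenize) are hypothetical.
class WordTokenizerSketch {

    // Splits text into words, discarding whitespace and special characters.
    // Each distinct word then plays the role of its own token type.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        // Assumption: any run of characters that are neither letters nor digits separates words.
        for (String word : text.split("[^\\p{L}\\p{N}]+")) {
            if (!word.isEmpty()) { // split can yield an empty leading element
                tokens.add(word);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Prints: [Hello, world, Hello, again]
        System.out.println(tokenize("Hello, world! Hello again."));
    }
}
```

Because punctuation and spacing are dropped before comparison, two texts that differ only in layout or punctuation produce the same token sequence under this scheme. The sketch keeps the original letter case; whether the actual module normalizes case is not stated here.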
-
Constructor Details
-
Language
public Language()
-
-
Method Details
-
suffixes
-
getName
-
getIdentifier
- Specified by:
getIdentifier in interface Language
-
minimumTokenMatch
public int minimumTokenMatch()
- Specified by:
minimumTokenMatch in interface Language
-
parse
- Specified by:
parse in interface Language
- Throws:
ParsingException