Package de.jplag.text
Class NaturalLanguage
java.lang.Object
de.jplag.text.NaturalLanguage
- All Implemented Interfaces:
Language
@MetaInfServices(de.jplag.Language.class)
public class NaturalLanguage
extends Object
implements Language
Language module for parsing natural-language text. It takes a primitive approach: each individual word is
interpreted as a token type, while whitespace and special characters are ignored. This approach works, but
NLP-based techniques are better suited to detecting plagiarism in text.
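The word-as-token idea described above can be illustrated with a minimal, standalone sketch. This is not JPlag's implementation: the class `WordTokenSketch` and its `tokenize` method are hypothetical names, and both the splitting rule (any run of characters that are neither letters nor digits is a separator) and the lower-casing step are assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration only; not part of de.jplag.text.
public class WordTokenSketch {

    /**
     * Splits text into words. Whitespace and special characters serve
     * only as separators and never become tokens themselves.
     */
    public static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        // Assumption: a "word" is a maximal run of Unicode letters or digits.
        for (String word : text.split("[^\\p{L}\\p{N}]+")) {
            if (!word.isEmpty()) {
                // Assumption: case is ignored so that "Hello" and "hello" share a token type.
                tokens.add(word.toLowerCase(Locale.ROOT));
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(tokenize("Hello, world! Hello again."));
    }
}
```

Under these assumptions, two texts that differ only in punctuation, spacing, or capitalization yield the same token sequence, which is why such a scheme can match copied passages but cannot recognize paraphrases.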
-
Constructor Summary
Constructors
-
Method Summary
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface de.jplag.Language
customizeSubmissionOrder, expectsSubmissionOrder, getOptions, isPreformatted, parse, requiresCoreNormalization, supportsNormalization, tokensHaveSemantics, useViewFiles, viewFileSuffix
-
Constructor Details
-
NaturalLanguage
public NaturalLanguage()
-
Method Details
-
suffixes
-
getName
-
getIdentifier
- Specified by:
getIdentifier in interface Language
-
minimumTokenMatch
public int minimumTokenMatch()
- Specified by:
minimumTokenMatch in interface Language
-
parse
- Specified by:
parse in interface Language
- Throws:
ParsingException
-