Class AbnerTokenizer


  • public class AbnerTokenizer
    extends Object
    Author:
    shashank
    • Constructor Detail

      • AbnerTokenizer

        public AbnerTokenizer()
    • Method Detail

      • tokenize

        public static String tokenize​(String s)

        Take raw text apply ABNER's built-in tokenization on it.

      • splitTermsByPunctuation

        public static String splitTermsByPunctuation​(String input)
        Takes an input and splits the sentence by punctuations and spaces, then stitches it back together with a space and returns
        Parameters:
        input - the input string to process
        Returns:
        processed input string, where all words and punctuations are seperated by space