Class USAddressParser


  • public class USAddressParser
    extends java.lang.Object
    The parser class for usaddress. This will load a CRF model from resource folder.
    • Field Summary

      Fields 
      Modifier and Type Field Description
      static java.util.Set<java.lang.String> DIRECTIONS  
      static java.lang.String GROUP_LABEL  
      static java.lang.String[] LABLES  
      static java.lang.String MODEL_FILE  
      static java.lang.String PARENT_LABEL  
      static java.util.Set<java.lang.String> STREET_NAMES  
      static com.github.jcrfsuite.CrfTagger tagger  
    • Constructor Summary

      Constructors 
      Constructor Description
      USAddressParser()  
    • Method Summary

      All Methods Static Methods Concrete Methods 
      Modifier and Type Method Description
      static java.lang.String digits​(java.lang.String token)  
      static java.util.List<com.github.jcrfsuite.util.Pair<java.lang.String,​java.lang.String>> parse​(java.lang.String addressString)
      Parse an address string and return words with tags.
      static java.util.Set<java.lang.Character> stringIntersect​(java.lang.String str1, java.lang.String str2)  
      static DictFeature tokenFeatures​(java.lang.String token)  
      static java.util.List<java.lang.String> tokenize​(java.lang.String addressString)  
      static java.util.List<DictFeature> tokens2features​(java.util.List<java.lang.String> tokens)  
      static java.lang.String trailingZeros​(java.lang.String token)  
      • Methods inherited from class java.lang.Object

        clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
    • Field Detail

      • LABLES

        public static final java.lang.String[] LABLES
      • DIRECTIONS

        public static final java.util.Set<java.lang.String> DIRECTIONS
      • STREET_NAMES

        public static final java.util.Set<java.lang.String> STREET_NAMES
      • tagger

        public static com.github.jcrfsuite.CrfTagger tagger
    • Constructor Detail

      • USAddressParser

        public USAddressParser()
    • Method Detail

      • parse

        public static java.util.List<com.github.jcrfsuite.util.Pair<java.lang.String,​java.lang.String>> parse​(java.lang.String addressString)
        Parse an address string and return words with tags.
        Parameters:
        addressString - the input address string
        Returns:
        list of words with tags
      • tokenize

        public static java.util.List<java.lang.String> tokenize​(java.lang.String addressString)
      • tokens2features

        public static java.util.List<DictFeature> tokens2features​(java.util.List<java.lang.String> tokens)
      • tokenFeatures

        public static DictFeature tokenFeatures​(java.lang.String token)
      • trailingZeros

        public static java.lang.String trailingZeros​(java.lang.String token)
      • digits

        public static java.lang.String digits​(java.lang.String token)
      • stringIntersect

        public static java.util.Set<java.lang.Character> stringIntersect​(java.lang.String str1,
                                                                         java.lang.String str2)