Package com.github.laohyx.usaddress
Class USAddressParser
- java.lang.Object
-
- com.github.laohyx.usaddress.USAddressParser
-
public class USAddressParser extends java.lang.ObjectThe parser class for usaddress. This will load a CRF model from resource folder.
-
-
Field Summary
Fields Modifier and Type Field Description static java.util.Set<java.lang.String>DIRECTIONSstatic java.lang.StringGROUP_LABELstatic java.lang.String[]LABLESstatic java.lang.StringMODEL_FILEstatic java.lang.StringPARENT_LABELstatic java.util.Set<java.lang.String>STREET_NAMESstatic com.github.jcrfsuite.CrfTaggertagger
-
Constructor Summary
Constructors Constructor Description USAddressParser()
-
Method Summary
All Methods Static Methods Concrete Methods Modifier and Type Method Description static java.lang.Stringdigits(java.lang.String token)static java.util.List<com.github.jcrfsuite.util.Pair<java.lang.String,java.lang.String>>parse(java.lang.String addressString)Parse an address string and return words with tags.static java.util.Set<java.lang.Character>stringIntersect(java.lang.String str1, java.lang.String str2)static DictFeaturetokenFeatures(java.lang.String token)static java.util.List<java.lang.String>tokenize(java.lang.String addressString)static java.util.List<DictFeature>tokens2features(java.util.List<java.lang.String> tokens)static java.lang.StringtrailingZeros(java.lang.String token)
-
-
-
Field Detail
-
LABLES
public static final java.lang.String[] LABLES
-
PARENT_LABEL
public static final java.lang.String PARENT_LABEL
- See Also:
- Constant Field Values
-
GROUP_LABEL
public static final java.lang.String GROUP_LABEL
- See Also:
- Constant Field Values
-
MODEL_FILE
public static final java.lang.String MODEL_FILE
- See Also:
- Constant Field Values
-
DIRECTIONS
public static final java.util.Set<java.lang.String> DIRECTIONS
-
STREET_NAMES
public static final java.util.Set<java.lang.String> STREET_NAMES
-
tagger
public static com.github.jcrfsuite.CrfTagger tagger
-
-
Method Detail
-
parse
public static java.util.List<com.github.jcrfsuite.util.Pair<java.lang.String,java.lang.String>> parse(java.lang.String addressString)
Parse an address string and return words with tags.- Parameters:
addressString- the input address string- Returns:
- list of words with tags
-
tokenize
public static java.util.List<java.lang.String> tokenize(java.lang.String addressString)
-
tokens2features
public static java.util.List<DictFeature> tokens2features(java.util.List<java.lang.String> tokens)
-
tokenFeatures
public static DictFeature tokenFeatures(java.lang.String token)
-
trailingZeros
public static java.lang.String trailingZeros(java.lang.String token)
-
digits
public static java.lang.String digits(java.lang.String token)
-
stringIntersect
public static java.util.Set<java.lang.Character> stringIntersect(java.lang.String str1, java.lang.String str2)
-
-