public class FileIterator extends AbstractPipeInputIterator
In document classification, the file name in the data field is commonly processed by one or more pipes until it contains a feature vector. The pattern applied to the file name is often used to extract a directory name that serves as the true label of the instance; this label is stored in the target field.
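As a rough illustration of how a pattern can turn a file name into a label, the sketch below uses only `java.util.regex`. The class name `LabelFromPathSketch`, the method `labelOf`, and the pattern itself are hypothetical and are not taken from FileIterator; the pattern assumes `/` as the path separator, and returning null when nothing matches is also an assumption.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class LabelFromPathSketch {
    // Hypothetical pattern (not the library's own): the first group captures
    // the directory that directly contains the file. Assumes '/' separators.
    static final Pattern LAST_DIR = Pattern.compile("([^/]+)/[^/]+$");

    // Returns the first parenthesized group if the pattern is found in the
    // path, or null if it is not (an assumption made for this sketch).
    static String labelOf(String path) {
        Matcher m = LAST_DIR.matcher(path);
        return m.find() ? m.group(1) : null;
    }

    public static void main(String[] args) {
        System.out.println(labelOf("corpus/sports/doc1.txt")); // prints sports
        System.out.println(labelOf("doc1.txt"));               // prints null
    }
}
```

Here the label comes from the directory layout alone, which is why corpora for this kind of iterator are usually organized with one directory per class.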
| Modifier and Type | Field and Description |
|---|---|
| static Pattern | ALL_DIRECTORIES: Use as label names all the directory names in the filename. |
| static Pattern | FIRST_DIRECTORY: Use as label names the first directory in the filename. |
| static Pattern | LAST_DIRECTORY: Use as label name the last directory in the filename. |
| static Pattern | STARTING_DIRECTORIES: Use as label names the directories specified in the constructor, optionally removing common prefix of all starting directories |
Fields inherited from class AbstractPipeInputIterator: parentInstance

| Modifier | Constructor and Description |
|---|---|
| | FileIterator(File directory) |
| | FileIterator(File[] directories, FileFilter fileFilter, Pattern targetPattern) |
| protected | FileIterator(File[] directories, FileFilter fileFilter, Pattern targetPattern, boolean removeCommonPrefix): Construct a FileIterator that will supply filenames within initial directories as instances |
| | FileIterator(File[] directories, Pattern targetPattern): Iterate over Files that pass the fileFilter test, setting... |
| | FileIterator(File[] directories, Pattern targetPattern, boolean removeCommonPrefix) |
| | FileIterator(File directory, FileFilter fileFilter) |
| | FileIterator(File directory, FileFilter fileFilter, Pattern targetPattern) |
| | FileIterator(File directory, FileFilter fileFilter, Pattern targetPattern, boolean removeCommonPrefix) |
| | FileIterator(File directory, Pattern targetPattern) |
| | FileIterator(File directory, Pattern targetPattern, boolean removeCommonPrefix) |
| | FileIterator(String directory) |
| | FileIterator(String[] directories, FileFilter ff) |
| | FileIterator(String[] directories, Pattern targetPattern) |
| | FileIterator(String[] directories, Pattern targetPattern, boolean removeCommonPrefix) |
| | FileIterator(String[] directories, String targetPattern) |
| | FileIterator(String directory, FileFilter filter) |
| | FileIterator(String directory, Pattern targetPattern) |
| | FileIterator(String directory, Pattern targetPattern, boolean removeCommonPrefix) |

| Modifier and Type | Method and Description |
|---|---|
| ArrayList | getFileArray() |
| boolean | hasNext() |
| File | nextFile() |
| Instance | nextInstance() |
| static File[] | stringArray2FileArray(String[] sa) |

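Methods inherited from class AbstractPipeInputIterator: next, remove, setParentInstance
Methods inherited from class Object: clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface Iterator: forEachRemaining
public static final Pattern STARTING_DIRECTORIES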
public static final Pattern FIRST_DIRECTORY
public static final Pattern LAST_DIRECTORY
public static final Pattern ALL_DIRECTORIES
protected FileIterator(File[] directories, FileFilter fileFilter, Pattern targetPattern, boolean removeCommonPrefix)
directories - Array of directories to collect files from.
fileFilter - class implementing interface FileFilter that will decide which names to accept. May be null.
targetPattern - regex Pattern applied to the filename whose first parenthesized group on matching is taken to be the target value of the generated instance. The pattern is applied to the directory with the matcher.find() method. If null, then all instances will have target null.
removeCommonPrefix - boolean that modifies the behavior of the STARTING_DIRECTORIES pattern, removing the common prefix of all initially specified directories, leaving the remainder of each filename as the target value.
public FileIterator(File[] directories, FileFilter fileFilter, Pattern targetPattern)
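The removeCommonPrefix parameter of the protected constructor describes stripping the prefix shared by all starting directories. A minimal sketch of that idea follows, using only the standard library. The class `CommonPrefixSketch` and its methods are hypothetical, and trimming the shared prefix back to the last `/` is an assumption of this sketch; the library's own prefix computation may differ.

```java
import java.util.ArrayList;
import java.util.List;

class CommonPrefixSketch {
    // Longest prefix shared by all paths, trimmed back so that it ends just
    // after a '/' (an assumption of this sketch, so names are not cut mid-word).
    static String commonPrefix(String[] paths) {
        if (paths.length == 0) return "";
        String prefix = paths[0];
        for (String p : paths) {
            int max = Math.min(prefix.length(), p.length());
            int i = 0;
            while (i < max && prefix.charAt(i) == p.charAt(i)) i++;
            prefix = prefix.substring(0, i);
        }
        // lastIndexOf returns -1 when there is no '/', giving an empty prefix.
        return prefix.substring(0, prefix.lastIndexOf('/') + 1);
    }

    // Removes the shared prefix from every path, leaving the remainder
    // that would serve as the label.
    static List<String> stripPrefix(String[] paths) {
        String prefix = commonPrefix(paths);
        List<String> out = new ArrayList<>();
        for (String p : paths) out.add(p.substring(prefix.length()));
        return out;
    }

    public static void main(String[] args) {
        String[] dirs = {"data/train/sports", "data/train/science"};
        System.out.println(stripPrefix(dirs)); // prints [sports, science]
    }
}
```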
public FileIterator(File[] directories, Pattern targetPattern)
public FileIterator(File[] directories, Pattern targetPattern, boolean removeCommonPrefix)
public FileIterator(String[] directories, FileFilter ff)
public FileIterator(String[] directories, Pattern targetPattern, boolean removeCommonPrefix)
public FileIterator(File directory, FileFilter fileFilter, Pattern targetPattern)
public FileIterator(File directory, FileFilter fileFilter, Pattern targetPattern, boolean removeCommonPrefix)
public FileIterator(File directory, FileFilter fileFilter)
public FileIterator(String directory, Pattern targetPattern, boolean removeCommonPrefix)
public FileIterator(File directory)
public FileIterator(String directory)
public FileIterator(String directory, FileFilter filter)
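Several of the constructors above accept a `java.io.FileFilter` that decides which names to accept. The sketch below shows one possible filter, written as a lambda, that accepts only names ending in `.txt`. The class name `TxtFilterSketch` and the `.txt` convention are assumptions chosen for illustration; how FileIterator treats directories passed to the filter is not covered here.

```java
import java.io.File;
import java.io.FileFilter;

class TxtFilterSketch {
    // FileFilter is a functional interface with one method, accept(File).
    // This filter only inspects the name and never touches the file system.
    static final FileFilter TXT_ONLY = f -> f.getName().endsWith(".txt");

    public static void main(String[] args) {
        System.out.println(TXT_ONLY.accept(new File("corpus/sports/doc1.txt"))); // prints true
        System.out.println(TXT_ONLY.accept(new File("corpus/sports/doc1.pdf"))); // prints false
    }
}
```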
public ArrayList getFileArray()
public Instance nextInstance()
nextInstance in interface PipeInputIterator
nextInstance in class AbstractPipeInputIterator
public File nextFile()
public boolean hasNext()
hasNext in interface Iterator
hasNext in class AbstractPipeInputIterator

Copyright © 2019 JULIE Lab, Germany. All rights reserved.