public class S3Emitter
extends org.apache.tika.pipes.emitter.AbstractEmitter
implements org.apache.tika.config.Initializable, org.apache.tika.pipes.emitter.StreamEmitter
<properties>
<emitters>
<emitter class="org.apache.tika.pipes.emitter.s3.S3Emitter>
<params>
<!-- required -->
<param name="name" type="string">s3e</param>
<!-- required -->
<param name="region" type="string">us-east-1</param>
<!-- required -->
<param name="credentialsProvider"
type="string">(profile|instance)</param>
<!-- required if credentialsProvider=profile-->
<param name="profile" type="string">my-profile</param>
<!-- required -->
<param name="bucket" type="string">my-bucket</param>
<!-- optional; prefix to add to the path before emitting;
default is no prefix -->
<param name="prefix" type="string">my-prefix</param>
<!-- optional; default is 'json' this will be added to the SOURCE_PATH
if no emitter key is specified. Do not add a "."
before the extension -->
<param name="fileExtension" type="string">json</param>
<!-- optional; default is 'true'-- whether to copy the
json to a local file before putting to s3 -->
<param name="spoolToTemp" type="bool">true</param>
</params>
</emitter>
</emitters>
</properties>| Constructor and Description |
|---|
S3Emitter() |
| Modifier and Type | Method and Description |
|---|---|
void |
checkInitialization(org.apache.tika.config.InitializableProblemHandler problemHandler) |
void |
emit(String path,
InputStream is,
org.apache.tika.metadata.Metadata userMetadata) |
void |
emit(String emitKey,
List<org.apache.tika.metadata.Metadata> metadataList)
Requires the src-bucket/path/to/my/file.txt in the
TikaCoreProperties.SOURCE_PATH. |
void |
initialize(Map<String,org.apache.tika.config.Param> params)
This initializes the s3 client.
|
void |
setAccessKey(String accessKey) |
void |
setBucket(String bucket) |
void |
setCredentialsProvider(String credentialsProvider) |
void |
setEndpointConfigurationService(String endpointConfigurationService) |
void |
setFileExtension(String fileExtension)
If you want to customize the output file's file extension.
|
void |
setMaxConnections(int maxConnections)
maximum number of http connections allowed.
|
void |
setPathStyleAccessEnabled(boolean pathStyleAccessEnabled) |
void |
setPrefix(String prefix) |
void |
setProfile(String profile) |
void |
setRegion(String region) |
void |
setSecretKey(String secretKey) |
void |
setSpoolToTemp(boolean spoolToTemp)
Whether or not to spool the metadatalist to a tmp file before putting object.
|
public void emit(String emitKey, List<org.apache.tika.metadata.Metadata> metadataList) throws IOException, org.apache.tika.pipes.emitter.TikaEmitterException
TikaCoreProperties.SOURCE_PATH.emit in interface org.apache.tika.pipes.emitter.EmittermetadataList - IOExceptionorg.apache.tika.exception.TikaExceptionorg.apache.tika.pipes.emitter.TikaEmitterExceptionpublic void emit(String path, InputStream is, org.apache.tika.metadata.Metadata userMetadata) throws IOException, org.apache.tika.pipes.emitter.TikaEmitterException
emit in interface org.apache.tika.pipes.emitter.StreamEmitterpath - -- object path, not including the bucketis - inputStream to copyuserMetadata - this will be written to the s3 ObjectMetadata's userMetadataorg.apache.tika.pipes.emitter.TikaEmitterException - or IOexception if there is a Runtime s3 client exceptionIOException@Field public void setSpoolToTemp(boolean spoolToTemp)
true. If this is set to false,
this emitter writes the json object to memory and then puts that into s3.spoolToTemp - @Field public void setRegion(String region)
@Field public void setProfile(String profile)
@Field public void setBucket(String bucket)
@Field public void setPrefix(String prefix)
@Field public void setCredentialsProvider(String credentialsProvider)
@Field public void setFileExtension(String fileExtension)
fileExtension - @Field public void setAccessKey(String accessKey)
@Field public void setSecretKey(String secretKey)
@Field public void setMaxConnections(int maxConnections)
maxConnections - @Field public void setEndpointConfigurationService(String endpointConfigurationService)
public void initialize(Map<String,org.apache.tika.config.Param> params) throws org.apache.tika.exception.TikaConfigException
initialize in interface org.apache.tika.config.Initializableparams - params to use for initializationorg.apache.tika.exception.TikaConfigExceptionpublic void checkInitialization(org.apache.tika.config.InitializableProblemHandler problemHandler)
throws org.apache.tika.exception.TikaConfigException
checkInitialization in interface org.apache.tika.config.Initializableorg.apache.tika.exception.TikaConfigException@Field public void setPathStyleAccessEnabled(boolean pathStyleAccessEnabled)
Copyright © 2007–2025 The Apache Software Foundation. All rights reserved.