Skip to content

Azure Blob

Azure Blob (azure-blob)

Send data to a Microsoft Azure Storage blob (Block Storage).

Block Store Cloud binary json raw

Minimal example

output:
azure-blob:
blob-destination:
name: ~
container-name: ~
storage-account: ~
storage-master-key: ~
JSON
{
"output": {
"azure-blob": {
"blob-destination": {
"name": null
},
"container-name": null,
"storage-account": null,
"storage-master-key": null
}
}
}

Contents

Authentication

Authentication
FieldTypeRequiredDescription
storage-accountstringThe Storage Account Name to be used (credential).
storage-master-keystringThe Storage Master Key to be used (credential).

Behavior

Behavior
FieldTypeRequiredDescription
modeModeEither writing or deleting a blob.
Allowed values: put, delete

Location

Location
FieldTypeRequiredDescription
container-namestringThe storage service container for created blobs.
blob-destinationBlob DestinationEither a literal blob name or the name of the field to read it from.
Allowed values: name, field

Object Properties

Object Properties
FieldTypeRequiredDescription
disable-blob-name-guidboolean (bool)Do not append a GUID to the output blob name.
Default: false
guid-prefixstringGUID Prefix, will be prepended to the GUID, the default value is ”/”.
guid-suffixstringGUID Suffix, will be appended to the GUID if specified.
content-typestringSet the created blob to the following content-type.
Default: text/plain

Processing

Processing
FieldTypeRequiredDescription
batchBatchBatching output events.
input-fieldfield (string)Send only the content of the specified field to the blob.
Examples: data_field
preprocessorsOutput Preprocessor[]Preprocessors (process data before making it available for upload) these processors will be run in the order they are specified.
Allowed values: gzip, parquet, base64

Reliability

Reliability
FieldTypeRequiredDescription
retryRetryHow to retry operation if it fails.
track-schemaboolean (bool)Check the schema of the written data and update __SCHEMA_NUMBER (written data must be JSON).
Default: false

Schema

Blob Destination Options

OptionNameTypeDescription
nameNamestringThe name for the output blob, this will be suffixed by a GUID by default.
fieldFieldstringA field containing the value for output_blob.

Batch Fields

FieldTypeRequiredDescription
fixed-sizenumber (integer)maximum number of events in an output batch.
Examples: 42, 1.2e-10
modeModeIf ‘document’ send on end of document generated by input. If ‘fixed’, use fixed_size.
Allowed values: fixed, document
timeouttime-interval (string)interval after which the batch is sent, to keep throughput going (default 100ms).
Default: 100ms
Examples: 500ms, 2h
headermultiline-text (string)put a header line before the batch.
footermultiline-text (string)put a header line after the last line of the batch.
use-document-markerboolean (bool)Enrich the job metadata with a document marker (for document handling in batch mode).
Default: false
wrap-as-jsonboolean (bool)Format the output batch as a JSON array.
Default: false

Retry Fields

FieldTypeRequiredDescription
timeouttime-interval (string)timeout (e.g. 500ms, 2s etc. - default is 30).
Examples: 500ms, 2h
retriesnumber (integer)number of retries.
Examples: 42, 1.2e-10

Mode Options

ValueNameDescription
putputPut Objects
deletedeleteDelete Objects

Batch - Mode Options

ValueNameDescription
fixedfixedFixed
documentdocumentDocument

Output Preprocessor Options

ValueNameDescription
gzipgzipGzip the output data
parquetparquetExtract the received data as JSON rows from a parquet file
base64base64Decode base64 as binary