Docx to Text (docx-to-text)
Extract content from Microsoft Word documents with optional image emission.
Transform binary json
Minimal example
YAML

    actions:
      - docx-to-text: {}

JSON

    { "actions": [ { "docx-to-text": {} } ] }
Advanced
| Field | Type | Required | Description |
|---|---|---|---|
| include-images ✓ | boolean (bool) | | Emit embedded images as separate binary events alongside extracted text. Default: false |
| preserve-styles ✓ | boolean (bool) | | Preserve inline style markers (bold/italic/etc.) in markdown output. Default: false |
| emit-document-events ✓ | boolean (bool) | | Emit a synthetic document-level event with metadata alongside page/paragraph events. Default: false |
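
As an illustration of the options above, the following sketch enables image emission and style preservation. It assumes the fields are set directly inside the `docx-to-text` map, as the empty `{}` in the minimal example suggests; that nesting is an assumption, not something this page confirms.

```yaml
actions:
  - docx-to-text:
      # Assumed placement: options as keys directly under the action.
      include-images: true         # emit embedded images as separate binary events
      preserve-styles: true        # keep bold/italic markers in the markdown output
      emit-document-events: false  # documented default, shown only for completeness
```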
General
| Field | Type | Required | Description |
|---|---|---|---|
| description | string | | Short summary shown next to the action in the editor. |
| condition | lua-expression (string) | | Conditional expression that must evaluate to a truthy value for the action to run. Example: `2 * count()` |
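
A possible way to combine the two general fields is sketched below. The description text is made up for illustration, and the condition simply reuses the example expression from the table; placing both keys directly under the action is an assumption.

```yaml
actions:
  - docx-to-text:
      # Hypothetical summary text, shown next to the action in the editor.
      description: Extract text from incoming Word documents
      # Lua expression taken verbatim from the table; the action runs only
      # when it evaluates to a truthy value.
      condition: "2 * count()"
```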
Output
| Field | Type | Required | Description |
|---|---|---|---|
| mode | string | | Output mode: markdown |
| split | string | | Splitting strategy: none |
Parser
| Field | Type | Required | Description |
|---|---|---|---|
| parser | string | | Parser backend: ooxml (quick-xml). |
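
For reference, a configuration that spells out the output and parser settings explicitly might look like the sketch below. It uses only the values listed in the tables (`markdown`, `none`, `ooxml`) and assumes they are written as plain strings directly under the action; whether other values are accepted is not stated here.

```yaml
actions:
  - docx-to-text:
      mode: markdown   # output mode listed in the Output table
      split: none      # splitting strategy listed in the Output table
      parser: ooxml    # parser backend listed in the Parser table (assumed spelling of the value)
```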