Docx to Text

Docx to Text (`docx-to-text`)

Extract content from Microsoft Word documents with optional image emission.

Transform binary json

actions:
  - docx-to-text: {}

JSON

{
  "actions": [
    {
      "docx-to-text": {}
    }
  ]
}

Field	Type	Description
`include-images` ✓	`boolean` (`bool`)	Emit embedded images as separate binary events alongside extracted text. Default: `false`
`preserve-styles` ✓	`boolean` (`bool`)	Preserve inline style markers (bold/italic/etc.) in markdown output. Default: `false`
`emit-document-events` ✓	`boolean` (`bool`)	Emit a synthetic document-level event with metadata alongside page/paragraph events. Default: `false`

Field	Type	Required	Description
`description`	`string`		Short summary shown next to the action in the editor.
`condition`	`lua-expression` (`string`)		Conditional expression that must evaluate truthy for the action to run. Examples: `2 * count()`

Field	Type	Required	Description
`mode`	`string`		Output mode: markdown
`split`	`string`		Splitting strategy: none

Field	Type	Required	Description
`parser`	`string`		Parser backend: ooxml (quick-xml).