You have a batch of DOCX files (contracts, reports, form letters) and you need the raw text out of them. Maybe you are feeding text to a search index, importing it into a database, or cleaning up content for a CMS. Microsoft Word can "Save As" plain text, but only one file at a time, and its default Windows (ANSI) encoding replaces every character outside the system code page, so on a Western-locale system non-Latin text is lost. Total Doc Converter exports DOCX to Unicode text (UTF-8 or UTF-16) in batch, preserving every character (Arabic, Chinese, Cyrillic, accented Latin, emoji) without manual re-encoding.
DOCX is a ZIP archive of XML files. It stores text together with fonts, styles, images, tables, headers, and footers. Formatting markup and embedded media often account for most of the file size. When you only need the text (for indexing, data extraction, or migration), the DOCX wrapper is unnecessary overhead.
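To make the "ZIP archive of XML files" point concrete, here is a minimal Python sketch that reads paragraph text straight out of the main document part. It is not how Total Doc Converter works internally; it only illustrates the container layout. The helper `build_sample_archive` is hypothetical and writes just enough of an archive (a single `word/document.xml`) to run the example, which is not a complete DOCX that Word would open.

```python
import io
import zipfile
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

# WordprocessingML namespace used by the elements in word/document.xml.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def docx_paragraphs(data: bytes) -> list[str]:
    """Return the text of each paragraph found in word/document.xml."""
    with zipfile.ZipFile(io.BytesIO(data)) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))
    paragraphs = []
    for para in root.iter(f"{{{W_NS}}}p"):
        # A paragraph's text is split across one or more <w:t> elements.
        pieces = [node.text or "" for node in para.iter(f"{{{W_NS}}}t")]
        paragraphs.append("".join(pieces))
    return paragraphs


def build_sample_archive(lines: list[str]) -> bytes:
    """Hypothetical helper: a ZIP holding only a minimal word/document.xml."""
    body = "".join(
        f"<w:p><w:r><w:t>{escape(line)}</w:t></w:r></w:p>" for line in lines
    )
    xml = (
        '<?xml version="1.0" encoding="UTF-8"?>'
        f'<w:document xmlns:w="{W_NS}"><w:body>{body}</w:body></w:document>'
    )
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("word/document.xml", xml)
    return buffer.getvalue()


sample = build_sample_archive(["Привет", "合同 A&B"])
paragraphs = docx_paragraphs(sample)
```

A sketch like this ignores headers, footers, footnotes, and nested content such as text boxes, which is where a dedicated converter earns its keep.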
A Unicode text file contains only characters and line breaks. It opens in any editor on any operating system. UTF-8 is the standard encoding for web applications, databases, and APIs. UTF-16 is preferred by some legacy Windows tools. Both encodings can represent every writing system, so text does not turn into garbled characters when a file is opened on a machine with a different locale.
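The difference between Unicode and a legacy code page is easy to see in a few lines of Python. The sketch below encodes one string three ways; `cp1252` stands in for a Western "ANSI" code page, and the function name `encode_three_ways` is just an illustrative choice.

```python
def encode_three_ways(text: str) -> dict[str, bytes]:
    """Encode the same text as UTF-8, UTF-16, and a legacy ANSI code page."""
    return {
        "utf-8": text.encode("utf-8"),
        # Python's "utf-16" codec prepends a byte order mark (BOM).
        "utf-16": text.encode("utf-16"),
        # cp1252 is the Western European Windows code page; characters it
        # cannot represent are replaced with "?".
        "cp1252": text.encode("cp1252", errors="replace"),
    }


encoded = encode_three_ways("Café 合同")
utf8_round_trip = encoded["utf-8"].decode("utf-8")      # "Café 合同"
utf16_round_trip = encoded["utf-16"].decode("utf-16")   # "Café 合同"
ansi_round_trip = encoded["cp1252"].decode("cp1252")    # "Café ??"
```

Both Unicode encodings round-trip the string unchanged, while the code page keeps the accented Latin letter and loses the Chinese characters.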
Launch Total Doc Converter. The folder tree on the left shows your drives and directories. Navigate to the folder with your DOCX files. Tick individual files or check the folder to select everything inside it.
Click the TXT button in the format bar at the top. The settings wizard opens.
In the wizard, choose Unicode (UTF-8) or Unicode (UTF-16) as the encoding. Specify the destination folder. If you want all texts merged into one file, enable the Combine files option.
Press Start. The converter processes every selected file, strips formatting, and writes plain text with the chosen Unicode encoding. A log shows the result for each file.

Total Doc Converter includes a command-line interface for scripting and automation. A typical command:
DocConverter.exe "C:\Contracts\*.docx" "C:\Output\" -cTXT -oUTF8
The first argument is the source path (wildcards are supported) and the second is the destination folder; -cTXT sets the target format and -oUTF8 sets the output encoding. Save the command in a .bat file and schedule it with Windows Task Scheduler to run nightly, or run it on demand. Total Doc Converter X (server edition) adds ActiveX support for integration into web applications and document workflows without a GUI.
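If the conversion is one step in a larger script, the same command can be launched from Python instead of a .bat file. This is a sketch under assumptions: the executable name and the two flags are copied from the command shown here, while the function names `build_command` and `convert`, and the idea that the tool reports failure through a non-zero exit code, are assumptions to check against the product documentation.

```python
import subprocess


def build_command(source_pattern: str, output_dir: str,
                  exe: str = "DocConverter.exe") -> list[str]:
    """Assemble the argument list for a DOCX to UTF-8 text conversion."""
    # -cTXT and -oUTF8 are the flags from the example command; no other
    # flags are assumed here.
    return [exe, source_pattern, output_dir, "-cTXT", "-oUTF8"]


def convert(source_pattern: str, output_dir: str) -> subprocess.CompletedProcess:
    """Run the converter and wait for it to finish."""
    # check=True raises CalledProcessError on a non-zero exit code.
    # Assumption: the converter uses its exit code to signal failure.
    return subprocess.run(build_command(source_pattern, output_dir), check=True)


# Example call on a Windows machine with the converter on PATH:
# convert(r"C:\Contracts\*.docx", "C:\\Output\\")
```

Passing the arguments as a list lets `subprocess` handle the quoting of paths that contain spaces.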
| Feature | Online Converter | Total Doc Converter |
|---|---|---|
| Batch conversion (100+ files) | No — most accept one file at a time | Yes — unlimited files and folders |
| Unicode encoding choice | Usually only UTF-8, no control | UTF-8 or UTF-16 with BOM |
| Combine output into one file | No | Yes |
| Table handling | Stripped or garbled | Tab-separated values |
| Command line / automation | No | Yes — CLI + .bat scripting |
| File size limit | Typically 10–50 MB | No limit |
| Privacy | Files uploaded to a third-party server | 100% offline — files never leave your PC |
| Multilingual accuracy | Varies — encoding errors common | Correct BOM, tested with CJK, Arabic, Cyrillic |
The converter writes a proper BOM header and uses the encoding you choose. Chinese, Japanese, Korean, Arabic, Hebrew, Cyrillic, and accented Latin characters survive the conversion without substitution or question marks.
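A downstream script can use that BOM to pick the right decoder. The Python sketch below is one way to do it; the function names `detect_bom` and `read_text` are illustrative, and the UTF-8 fallback for files without a BOM is an assumption.

```python
import codecs
from typing import Optional

# UTF-32 is checked before UTF-16 because the UTF-32 LE BOM begins with
# the same two bytes as the UTF-16 LE BOM.
_BOMS = [
    (codecs.BOM_UTF8, "utf-8-sig"),
    (codecs.BOM_UTF32_LE, "utf-32"),
    (codecs.BOM_UTF32_BE, "utf-32"),
    (codecs.BOM_UTF16_LE, "utf-16"),
    (codecs.BOM_UTF16_BE, "utf-16"),
]


def detect_bom(data: bytes) -> Optional[str]:
    """Return a Python codec name matching the BOM, or None if there is none."""
    for bom, codec in _BOMS:
        if data.startswith(bom):
            return codec
    return None


def read_text(data: bytes, fallback: str = "utf-8") -> str:
    """Decode bytes using the BOM when present, else the fallback codec."""
    # The "utf-8-sig", "utf-16", and "utf-32" codecs consume the BOM,
    # so it does not appear in the decoded string.
    return data.decode(detect_bom(data) or fallback)
```

For a file on disk, `read_text(pathlib.Path(name).read_bytes())` returns the text without the BOM character at the start.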
Tables become tab-separated rows. Bullet lists become plain lines. Headers and footers are either included or stripped — your choice. The output is ready for import into a database, search engine, or text-processing pipeline.
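Because table rows arrive tab-separated, pulling them back out of the text takes only a few lines. The Python sketch below assumes one table row per line and no tabs inside cell text; the function name `table_rows` and the sample text are made up for illustration.

```python
def table_rows(text: str) -> list[list[str]]:
    """Collect lines that contain tabs and split each into its cells."""
    rows = []
    for line in text.splitlines():
        # Assumption: ordinary paragraphs contain no tab characters,
        # so any line with a tab is treated as a table row.
        if "\t" in line:
            rows.append(line.split("\t"))
    return rows


converted = "Price list\nItem\tQty\tPrice\nWidget\t4\t9.50\nThank you"
rows = table_rows(converted)
# rows == [["Item", "Qty", "Price"], ["Widget", "4", "9.50"]]
```

From here the rows can go to a database insert or to `csv.writer` without further parsing.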
The same tool handles DOC, RTF, ODT, DOCM, HTML, and TXT. If you receive documents in mixed formats, Total Doc Converter normalizes them all to Unicode text in one batch.
Total Doc Converter X is the server edition. It runs as a background process with no GUI, accepts commands via ActiveX or command line, and processes files around the clock. Ideal for document ingestion pipelines, helpdesk systems, or archival workflows.
Download the free 30-day trial — no email or credit card required. A personal license costs $49.90 and includes one year of free upgrades. Works on Windows 7/8/10/11.
"We index product descriptions that arrive as DOCX files from hundreds of suppliers. Total Doc Converter extracts the text to UTF-8 in batch — 2,000 files in about three minutes. The output plugs straight into our Elasticsearch pipeline. Before this tool we had a Python script that choked on Asian characters."
Martin Lindqvist, Search Engineer, E-Commerce Company
"Client declarations come in as DOCX in Spanish, Portuguese, and Haitian Creole. I convert them to Unicode text for our case management database. Every accent and special character survives. The combine option is handy — I merge all declarations for one case into a single text file for the attorney to review."
Rebecca Torres, Paralegal, Immigration Law Firm
"Translators submit files in DOCX, DOC, and RTF. I normalize everything to UTF-8 text before feeding it to our CAT tool. Total Doc Converter handles all three formats in one batch. The command-line mode runs on our server every night via Task Scheduler. Japanese, Chinese, and Korean text comes through without issues."
Kenji Watanabe, IT Administrator, Translation Agency
© 2026. All rights reserved. CoolUtils File Converters