{"id":19676,"date":"2024-03-15T19:18:05","date_gmt":"2024-03-15T19:18:05","guid":{"rendered":"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/"},"modified":"2024-03-21T16:16:14","modified_gmt":"2024-03-21T16:16:14","slug":"how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files","status":"publish","type":"post","link":"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/","title":{"rendered":"How to use Python3 to convert multiple DOCX documents to TXT files"},"content":{"rendered":"<p>To batch convert DOCX documents to TXT with Python 3, you can use the python-docx library. Below is a simple example:<\/p>\n<pre class=\"post-pre\"><code><span class=\"hljs-keyword\">from<\/span> docx <span class=\"hljs-keyword\">import<\/span> Document\r\n\r\n<span class=\"hljs-keyword\">def<\/span> <span class=\"hljs-title function_\">convert_docx_to_txt<\/span>(<span class=\"hljs-params\">docx_file, txt_file<\/span>):\r\n    doc = Document(docx_file)\r\n    <span class=\"hljs-keyword\">with<\/span> <span class=\"hljs-built_in\">open<\/span>(txt_file, <span class=\"hljs-string\">'w'<\/span>, encoding=<span class=\"hljs-string\">'utf-8'<\/span>) <span class=\"hljs-keyword\">as<\/span> f:\r\n        <span class=\"hljs-keyword\">for<\/span> paragraph <span class=\"hljs-keyword\">in<\/span> doc.paragraphs:\r\n            f.write(paragraph.text + <span class=\"hljs-string\">'\\n'<\/span>)\r\n\r\n<span class=\"hljs-comment\"># Batch conversion<\/span>\r\ndocx_files = [<span class=\"hljs-string\">'file1.docx'<\/span>, <span class=\"hljs-string\">'file2.docx'<\/span>, <span class=\"hljs-string\">'file3.docx'<\/span>]\r\n<span class=\"hljs-keyword\">for<\/span> docx_file <span class=\"hljs-keyword\">in<\/span> docx_files:\r\n    <span class=\"hljs-comment\"># Build the output file name<\/span>\r\n    txt_file = docx_file.replace(<span 
class=\"hljs-string\">'.docx'<\/span>, <span class=\"hljs-string\">'.txt'<\/span>)\r\n    convert_docx_to_txt(docx_file, txt_file)\r\n<\/code><\/pre>\n<p>The convert_docx_to_txt function above takes a DOCX file path and a TXT file path, and writes the text of each paragraph in the DOCX document to the TXT file, one paragraph per line. Note that doc.paragraphs covers only the body paragraphs, so text inside tables, headers, and footers is not written out. The names of the DOCX files to convert are stored in the docx_files list; the loop calls convert_docx_to_txt for each one and saves the result under the same name with a .txt extension.<\/p>\n<p>The code depends on the python-docx library, so you need to install it first. You can install it with the following command:<\/p>\n<pre class=\"post-pre\"><code>pip install python-docx\r\n<\/code><\/pre>\n<p>Make sure Python 3 and pip are installed, and place the DOCX files in the directory you run the script from, because the file names in docx_files are relative paths.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>To batch convert DOCX documents to TXT using Python3, you can utilize the python-docx library. 
Below is a simple sample code: from docx import Document def convert_docx_to_txt(docx_file, txt_file): doc = Document(docx_file) with open(txt_file, &#8216;w&#8217;, encoding=&#8217;utf-8&#8242;) as f: for paragraph in doc.paragraphs: f.write(paragraph.text + &#8216;\\n&#8217;) # \u6279\u91cf\u8f6c\u6362 docx_files = [&#8216;file1.docx&#8217;, &#8216;file2.docx&#8217;, &#8216;file3.docx&#8217;] for docx_file in docx_files: [&hellip;]<\/p>\n","protected":false},"author":8,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_import_markdown_pro_load_document_selector":0,"_import_markdown_pro_submit_text_textarea":"","footnotes":""},"categories":[1],"tags":[],"class_list":["post-19676","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v21.5 (Yoast SEO v21.5) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>How to use Python3 to convert multiple DOCX documents to TXT files - Blog - Silicon Cloud<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"How to use Python3 to convert multiple DOCX documents to TXT files\" \/>\n<meta property=\"og:description\" content=\"To batch convert DOCX documents to TXT using Python3, you can utilize the python-docx library. 
Below is a simple sample code: from docx import Document def convert_docx_to_txt(docx_file, txt_file): doc = Document(docx_file) with open(txt_file, &#039;w&#039;, encoding=&#039;utf-8&#039;) as f: for paragraph in doc.paragraphs: f.write(paragraph.text + &#039;n&#039;) # \u6279\u91cf\u8f6c\u6362 docx_files = [&#039;file1.docx&#039;, &#039;file2.docx&#039;, &#039;file3.docx&#039;] for docx_file in docx_files: [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/\" \/>\n<meta property=\"og:site_name\" content=\"Blog - Silicon Cloud\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/SiliCloudGlobal\/\" \/>\n<meta property=\"article:published_time\" content=\"2024-03-15T19:18:05+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-03-21T16:16:14+00:00\" \/>\n<meta name=\"author\" content=\"William Carter\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@SiliCloudGlobal\" \/>\n<meta name=\"twitter:site\" content=\"@SiliCloudGlobal\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"William Carter\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/\"},\"author\":{\"name\":\"William Carter\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/f697031891aacefc4b681d139781d3c0\"},\"headline\":\"How to use Python3 to convert multiple DOCX documents to TXT files\",\"datePublished\":\"2024-03-15T19:18:05+00:00\",\"dateModified\":\"2024-03-21T16:16:14+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/\"},\"wordCount\":156,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/#organization\"},\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/\",\"url\":\"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/\",\"name\":\"How to use Python3 to convert multiple DOCX documents to TXT files - Blog - Silicon 
Cloud\",\"isPartOf\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/#website\"},\"datePublished\":\"2024-03-15T19:18:05+00:00\",\"dateModified\":\"2024-03-21T16:16:14+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.silicloud.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"How to use Python3 to convert multiple DOCX documents to TXT files\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#website\",\"url\":\"https:\/\/www.silicloud.com\/blog\/\",\"name\":\"Silicon Cloud Blog\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/#organization\"},\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#organization\",\"name\":\"Silicon Cloud Blog\",\"url\":\"https:\/\/www.silicloud.com\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.silicloud.com\/blog\/wp-content\/uploads\/2023\/11\/EN-SILICON-Full.png\",\"contentUrl\":\"https:\/\/www.silicloud.com\/blog\/wp-content\/uploads\/2023\/11\/EN-SILICON-Full.png\",\"width\":1024,\"height\":1024,\"caption\":\"Silicon Cloud 
Blog\"},\"image\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/SiliCloudGlobal\/\",\"https:\/\/twitter.com\/SiliCloudGlobal\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/f697031891aacefc4b681d139781d3c0\",\"name\":\"William Carter\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/1786698071dd8d74bec894b512f9e3c610c3a2a32985f67e688976cee3c8bbef?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/1786698071dd8d74bec894b512f9e3c610c3a2a32985f67e688976cee3c8bbef?s=96&d=mm&r=g\",\"caption\":\"William Carter\"},\"url\":\"https:\/\/www.silicloud.com\/blog\/author\/williamcarter\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"How to use Python3 to convert multiple DOCX documents to TXT files - Blog - Silicon Cloud","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/","og_locale":"en_US","og_type":"article","og_title":"How to use Python3 to convert multiple DOCX documents to TXT files","og_description":"To batch convert DOCX documents to TXT using Python3, you can utilize the python-docx library. 
Below is a simple sample code: from docx import Document def convert_docx_to_txt(docx_file, txt_file): doc = Document(docx_file) with open(txt_file, 'w', encoding='utf-8') as f: for paragraph in doc.paragraphs: f.write(paragraph.text + 'n') # \u6279\u91cf\u8f6c\u6362 docx_files = ['file1.docx', 'file2.docx', 'file3.docx'] for docx_file in docx_files: [&hellip;]","og_url":"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/","og_site_name":"Blog - Silicon Cloud","article_publisher":"https:\/\/www.facebook.com\/SiliCloudGlobal\/","article_published_time":"2024-03-15T19:18:05+00:00","article_modified_time":"2024-03-21T16:16:14+00:00","author":"William Carter","twitter_card":"summary_large_image","twitter_creator":"@SiliCloudGlobal","twitter_site":"@SiliCloudGlobal","twitter_misc":{"Written by":"William Carter","Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/#article","isPartOf":{"@id":"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/"},"author":{"name":"William Carter","@id":"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/f697031891aacefc4b681d139781d3c0"},"headline":"How to use Python3 to convert multiple DOCX documents to TXT 
files","datePublished":"2024-03-15T19:18:05+00:00","dateModified":"2024-03-21T16:16:14+00:00","mainEntityOfPage":{"@id":"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/"},"wordCount":156,"commentCount":0,"publisher":{"@id":"https:\/\/www.silicloud.com\/blog\/#organization"},"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/","url":"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/","name":"How to use Python3 to convert multiple DOCX documents to TXT files - Blog - Silicon Cloud","isPartOf":{"@id":"https:\/\/www.silicloud.com\/blog\/#website"},"datePublished":"2024-03-15T19:18:05+00:00","dateModified":"2024-03-21T16:16:14+00:00","breadcrumb":{"@id":"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.silicloud.com\/blog\/how-to-use-python3-to-convert-multiple-docx-documents-to-txt-files\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.silicloud.com\/blog\/"},{"@type":"ListItem","position":2,"name":"How to use Python3 to convert multiple DOCX documents to TXT files"}]},{"@type":"WebSite","@id":"https:\/\/www.silicloud.com\/blog\/#website","url":"https:\/\/www.silicloud.com\/blog\/","name":"Silicon Cloud Blog","description":"","publisher":{"@id":"https:\/\/www.silicloud.com\/blog\/#organization"},"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.silicloud.com\/blog\/#organization","name":"Silicon Cloud 
Blog","url":"https:\/\/www.silicloud.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.silicloud.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.silicloud.com\/blog\/wp-content\/uploads\/2023\/11\/EN-SILICON-Full.png","contentUrl":"https:\/\/www.silicloud.com\/blog\/wp-content\/uploads\/2023\/11\/EN-SILICON-Full.png","width":1024,"height":1024,"caption":"Silicon Cloud Blog"},"image":{"@id":"https:\/\/www.silicloud.com\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/SiliCloudGlobal\/","https:\/\/twitter.com\/SiliCloudGlobal"]},{"@type":"Person","@id":"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/f697031891aacefc4b681d139781d3c0","name":"William Carter","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/1786698071dd8d74bec894b512f9e3c610c3a2a32985f67e688976cee3c8bbef?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/1786698071dd8d74bec894b512f9e3c610c3a2a32985f67e688976cee3c8bbef?s=96&d=mm&r=g","caption":"William 
Carter"},"url":"https:\/\/www.silicloud.com\/blog\/author\/williamcarter\/"}]}},"_links":{"self":[{"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/posts\/19676","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/comments?post=19676"}],"version-history":[{"count":1,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/posts\/19676\/revisions"}],"predecessor-version":[{"id":53430,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/posts\/19676\/revisions\/53430"}],"wp:attachment":[{"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/media?parent=19676"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/categories?post=19676"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/tags?post=19676"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}