PythonでWord文書のテキスト内容を読み取る方法は何ですか？

2年 ago

陽, 向宇

1 minute

Pythonでは、python-docxライブラリを使用してWord文書からテキストコンテンツを読み取ることができます。以下はサンプルコードです：

from docx import Document

# 打开Word文档
doc = Document('example.docx')

# 遍历文档的段落并输出文本内容
for para in doc.paragraphs:
    print(para.text)

# 遍历文档的表格并输出单元格内容
for table in doc.tables:
    for row in table.rows:
        for cell in row.cells:
            print(cell.text)

この例では、まずDocumentクラスをインポートします。その後、Word文書のファイルパスを渡してDocumentオブジェクトを作成します。次に、paragraphs属性を使用してドキュメントの段落を走査し、各段落のテキストコンテンツを出力できます。同様に、tables属性を使用してドキュメントの表を走査し、各セルのコンテンツを出力できます。

#Python #プログラミング