{"id":36867,"date":"2024-01-18T08:44:31","date_gmt":"2023-12-22T04:47:30","guid":{"rendered":"https:\/\/www.silicloud.com\/zh\/blog\/%e5%b0%9d%e8%af%95%e4%bd%bf%e7%94%a8apache-tika%e8%af%bb%e5%8f%96%e5%86%85%e5%ae%b9%e3%80%82\/"},"modified":"2024-05-04T18:54:01","modified_gmt":"2024-05-04T10:54:01","slug":"%e5%b0%9d%e8%af%95%e4%bd%bf%e7%94%a8apache-tika%e8%af%bb%e5%8f%96%e5%86%85%e5%ae%b9%e3%80%82","status":"publish","type":"post","link":"https:\/\/www.silicloud.com\/zh\/blog\/%e5%b0%9d%e8%af%95%e4%bd%bf%e7%94%a8apache-tika%e8%af%bb%e5%8f%96%e5%86%85%e5%ae%b9%e3%80%82\/","title":{"rendered":"Trying Apache Tika to Read Content"},"content":{"rendered":"<h1>What is Apache Tika?<\/h1>\n<p>Apache Tika<\/p>\n<blockquote><p>Apache Tika is a document analysis and metadata extraction toolkit written in Java. It supports many file formats and can extract metadata from the target data. Tika was originally a subproject of Apache Lucene, but it is now a project of the Apache Software Foundation in its own right.<\/p><\/blockquote>\n<p>Apache Tika 1.0 has been released and can extract metadata from PDF and Office documents.<\/p>\n<h2>Trying it out<\/h2>\n<p>For now, I just wanted to give it a quick try.<\/p>\n<p>Download tika-app-1.4.jar from the official Apache Tika website.<\/p>\n<p>It runs on Java 5 or later.<\/p>\n<p>I will follow the article Getting Started with Apache Tika to run it.<\/p>\n<p>For example, extract the text from the Qiita home page.<\/p>\n<pre class=\"post-pre\"><code>curl 
http:\/\/qiita.com | java <span class=\"nt\">-jar<\/span> tika-app-1.4.jar <span class=\"nt\">-t<\/span>\r\n<span class=\"c\"># (output omitted)<\/span>\r\n<\/code><\/pre>\n<p>Only the text is extracted, with the HTML markup stripped out.<\/p>\n<p>Next, let's try extracting the metadata.<\/p>\n<pre class=\"post-pre\"><code>curl http:\/\/qiita.com |java <span class=\"nt\">-jar<\/span> tika-app-1.4.jar <span class=\"nt\">-m<\/span>\r\n  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current\r\n                                 Dload  Upload   Total   Spent    Left  Speed\r\n100 13354  100 13354    0     0   6295      0  0:00:02  0:00:02 <span class=\"nt\">--<\/span>:--:--  149k\r\nContent-Encoding: UTF-8\r\nContent-Type: text\/html<span class=\"p\">;<\/span> <span class=\"nv\">charset<\/span><span class=\"o\">=<\/span>UTF-8\r\ncsrf-param: authenticity_token\r\ncsrf-token: dSy4U4+9rRNQFC4caHMvMF7HACh52MIeIv2T6whBYD8<span class=\"o\">=<\/span>\r\ndc:title: Qiita <span class=\"o\">[<\/span>\u30ad\u30fc\u30bf] - \u30d7\u30ed\u30b0\u30e9\u30de\u306e\u6280\u8853\u60c5\u5831\u5171\u6709\u30b5\u30fc\u30d3\u30b9\r\ndescription: Qiita\u306f\u3001\u30d7\u30ed\u30b0\u30e9\u30de\u306e\u305f\u3081\u306e\u6280\u8853\u60c5\u5831\u5171\u6709\u30b5\u30fc\u30d3\u30b9\u3067\u3059\u3002\u30d7\u30ed\u30b0\u30e9\u30df\u30f3\u30b0\u306b\u95a2\u3059\u308bTips\u3001\u30ce\u30a6\u30cf\u30a6\u3001\u30e1\u30e2\u3092\u7c21\u5358\u306b\u8a18\u9332&amp;amp<span class=\"p\">;<\/span>\u516c\u958b\u3059\u308b\u3053\u3068\u304c\u3067\u304d\u307e\u3059\u3002\r\nfb:admins: 564524038\r\nog:description: 
Qiita\u306f\u3001\u30d7\u30ed\u30b0\u30e9\u30de\u306e\u305f\u3081\u306e\u6280\u8853\u60c5\u5831\u5171\u6709\u30b5\u30fc\u30d3\u30b9\u3067\u3059\u3002\u30d7\u30ed\u30b0\u30e9\u30df\u30f3\u30b0\u306b\u95a2\u3059\u308bTips\u3001\u30ce\u30a6\u30cf\u30a6\u3001\u30e1\u30e2\u3092\u7c21\u5358\u306b\u8a18\u9332&amp;amp<span class=\"p\">;<\/span>\u516c\u958b\u3059\u308b\u3053\u3068\u304c\u3067\u304d\u307e\u3059\u3002\r\nog:image: http:\/\/qiita.com\/\/assets\/qiita-fb-ced1f2e92fd6f8d912353b746a063723.png\r\nog:site_name: Qiita\r\nog:title: Qiita <span class=\"o\">[<\/span>\u30ad\u30fc\u30bf] - \u30d7\u30ed\u30b0\u30e9\u30de\u306e\u6280\u8853\u60c5\u5831\u5171\u6709\u30b5\u30fc\u30d3\u30b9\r\nog:type: website\r\nog:url: http:\/\/qiita.com\/\r\ntitle: Qiita <span class=\"o\">[<\/span>\u30ad\u30fc\u30bf] - \u30d7\u30ed\u30b0\u30e9\u30de\u306e\u6280\u8853\u60c5\u5831\u5171\u6709\u30b5\u30fc\u30d3\u30b9\r\ntwitter:card: summary\r\ntwitter:site: @Qiita\r\nviewport: <span class=\"nv\">width<\/span><span class=\"o\">=<\/span>device-width,height<span class=\"o\">=<\/span>device-height,initial-scale<span class=\"o\">=<\/span>1\r\n<\/code><\/pre>\n<p>For example, dc:title is one of the basic Dublin Core elements. Other metadata is picked up as well, such as the og: properties of OGP (the Open Graph protocol) and the Twitter ID.<\/p>\n<p>Tika supports many document formats besides HTML. As an example, let's apply it to the PDF of the summary edition of the 2012 (Heisei 24) Health, Labour and Welfare White Paper. Hitting the server repeatedly would be inconsiderate, so we download the file first and then get its metadata in JSON format.<\/p>\n<pre 
class=\"post-pre\"><code>wget http:\/\/www.mhlw.go.jp\/wp\/hakusyo\/kousei\/12-1\/dl\/gaiyou.pdf\r\njava <span class=\"nt\">-jar<\/span> tika-app-1.4.jar <span class=\"nt\">-j<\/span>  &lt; gaiyou.pdf\r\n<span class=\"o\">{<\/span> <span class=\"s2\">\"Author\"<\/span>:<span class=\"s2\">\"\u539a\u751f\u52b4\u50cd\u7701\u30cd\u30c3\u30c8\u30ef\u30fc\u30af\u30b7\u30b9\u30c6\u30e0\"<\/span>,\r\n<span class=\"s2\">\"Company\"<\/span>:<span class=\"s2\">\"\u539a\u751f\u52b4\u50cd\u7701\"<\/span>,\r\n<span class=\"s2\">\"Content-Type\"<\/span>:<span class=\"s2\">\"application\/pdf\"<\/span>,\r\n<span class=\"s2\">\"ContentTypeId\"<\/span>:<span class=\"s2\">\"0x0101002DA299AC048A4B8EA9C1D19079C1A322009BEBE826950D474BAD6B2F2400F1439F\"<\/span>,\r\n<span class=\"s2\">\"Creation-Date\"<\/span>:<span class=\"s2\">\"2012-11-01T05:13:10Z\"<\/span>,\r\n<span class=\"s2\">\"Last-Modified\"<\/span>:<span class=\"s2\">\"2012-11-01T05:13:44Z\"<\/span>,\r\n<span class=\"s2\">\"Last-Save-Date\"<\/span>:<span class=\"s2\">\"2012-11-01T05:13:44Z\"<\/span>,\r\n<span class=\"s2\">\"created\"<\/span>:<span class=\"s2\">\"Wed Oct 31 22:13:10 PDT 2012\"<\/span>,\r\n<span class=\"s2\">\"creator\"<\/span>:<span class=\"s2\">\"\u539a\u751f\u52b4\u50cd\u7701\u30cd\u30c3\u30c8\u30ef\u30fc\u30af\u30b7\u30b9\u30c6\u30e0\"<\/span>,\r\n<span class=\"s2\">\"date\"<\/span>:<span class=\"s2\">\"2012-11-01T05:13:44Z\"<\/span>,\r\n<span class=\"s2\">\"dc:creator\"<\/span>:<span class=\"s2\">\"\u539a\u751f\u52b4\u50cd\u7701\u30cd\u30c3\u30c8\u30ef\u30fc\u30af\u30b7\u30b9\u30c6\u30e0\"<\/span>,\r\n<span class=\"s2\">\"dc:title\"<\/span>:<span class=\"s2\">\"\u30b9\u30e9\u30a4\u30c9 1\"<\/span>,\r\n<span class=\"s2\">\"dcterms:created\"<\/span>:<span class=\"s2\">\"2012-11-01T05:13:10Z\"<\/span>,\r\n<span class=\"s2\">\"dcterms:modified\"<\/span>:<span class=\"s2\">\"2012-11-01T05:13:44Z\"<\/span>,\r\n<span class=\"s2\">\"meta:author\"<\/span>:<span 
class=\"s2\">\"\u539a\u751f\u52b4\u50cd\u7701\u30cd\u30c3\u30c8\u30ef\u30fc\u30af\u30b7\u30b9\u30c6\u30e0\"<\/span>,\r\n<span class=\"s2\">\"meta:creation-date\"<\/span>:<span class=\"s2\">\"2012-11-01T05:13:10Z\"<\/span>,\r\n<span class=\"s2\">\"meta:save-date\"<\/span>:<span class=\"s2\">\"2012-11-01T05:13:44Z\"<\/span>,\r\n<span class=\"s2\">\"modified\"<\/span>:<span class=\"s2\">\"2012-11-01T05:13:44Z\"<\/span>,\r\n<span class=\"s2\">\"producer\"<\/span>:<span class=\"s2\">\"Adobe PDF Library 9.0\"<\/span>,\r\n<span class=\"s2\">\"title\"<\/span>:<span class=\"s2\">\"\u30b9\u30e9\u30a4\u30c9 1\"<\/span>,\r\n<span class=\"s2\">\"xmp:CreatorTool\"<\/span>:<span class=\"s2\">\"PowerPoint \u7528 Acrobat PDFMaker 9.1\"<\/span>,\r\n<span class=\"s2\">\"xmpTPg:NPages\"<\/span>:12 <span class=\"o\">}<\/span>\r\n<\/code><\/pre>\n<p>Apparently the uppercase -T option (--text-main) extracts only the main text.<br \/>\nI tried it on the MacBook Air (13-inch, Mid 2013) &#8211; Quick Start guide.<\/p>\n<pre class=\"post-pre\"><code>curl http:\/\/manuals.info.apple.com\/ja_JP\/macbook_air-13-inch-mid-2013_quick_start_jp.pdf | java <span class=\"nt\">-jar<\/span> tika-app-1.4.jar <span class=\"nt\">-T<\/span>\r\n  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current\r\n                                 Dload  Upload   Total   Spent    Left  Speed\r\n100 4679k  100 4679k    0     0  1757k      0  0:00:02  0:00:02 <span class=\"nt\">--<\/span>:--:-- 7232k\r\n\u306f\u3058\u3081\u306b \u304a\u8cb7\u3044\u6c42\u3081\u306eMacBook Air\u3092\u306f\u3058\u3081\u3066\u8d77\u52d5\u3059\u308b\u3068\u3001\u300c\u8a2d\u5b9a\u30a2\u30b7\u30b9\u30bf\u30f3\u30c8\u300d\u304cMac\u306e\u8a2d\u5b9a\u624b 
\u9806\u3092\u3054\u6848\u5185\u3057\u307e\u3059\u3002\u8868\u793a\u3055\u308c\u308b\u8aac\u660e\u306b\u5f93\u3063\u3066\u3001Wi-Fi\u30cd\u30c3\u30c8\u30ef\u30fc\u30af\u3078\u306e\u63a5\u7d9a\u3001\u307b\u304b\u306eMac\u307e \u305f\u306fWindows\u30b3\u30f3\u30d4\u30e5\u30fc\u30bf\u304b\u3089\u306e\u30c7\u30fc\u30bf\u306e\u8ee2\u9001\u3001Mac\u306e\u30e6\u30fc\u30b6\u30a2\u30ab\u30a6\u30f3\u30c8\u306e\u8a2d\u5b9a\u304c\u7c21\u5358\u306b \u3067\u304d\u307e\u3059\u3002\r\n\r\n<span class=\"c\"># (middle part omitted)<\/span>\r\n\r\n\u65b0\u3057\u3044\u30a2\u30d7\u30ea\u30b1\u30fc\u30b7\u30e7\u30f3\u3092\u30c1\u30a7\u30c3\u30af \u3055\u307e\u3056\u307e\u306a\u30a2\u30d7\u30ea\u30b1\u30fc\u30b7\u30e7\u30f3\u3092\u30d6\u30e9 \u30a6\u30ba\u3057\u3066\u3001\u300cLaunchpad\u300d\u306b\u76f4\u63a5 \u30c0\u30a6\u30f3\u30ed \u30c9\u30fc\u3067\u304d\u307e\u3059\u3002\r\n\u30ab\u30ec\u30f3\u30c0\u30fc\u8868\u793a \u65e5\u3001\u9031\u3001\u6708\u3001\u307e\u305f\u306f\u5e74\u8868\u793a\u3092 \u9078\u629e\u3067\u304d\u307e\u3059\u3002\r\n\u30a4\u30d9\u30f3\u30c8\u3092\u8ffd\u52a0 \u30ab\u30ec\u30f3\u30c0\u30fc\u5185\u3092\u30c0\u30d6\u30eb \u30af\u30ea\u30c3\u30af\u3059\u308c\u3070\u65b0\u3057\u3044\u30a4\u30d9 \u30f3\u30c8\u3092\u8ffd\u52a0\u3067\u304d\u307e\u3059\u3002\r\n<\/code><\/pre>\n<p>I have not yet looked into how it decides what counts as the main text, but the result does feel like the main content.<\/p>\n<p>That's all for today, but I have a feeling there are some interesting things that could be done with this!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>What is Apache Tika? Apache Tika Apache Tika is a Java-based document analysis and metadata extraction 
[&hellip;]<\/p>\n","protected":false},"author":9,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-36867","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v21.5 (Yoast SEO v21.5) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>\u5c1d\u8bd5\u4f7f\u7528Apache Tika\u8bfb\u53d6\u5185\u5bb9 - Blog - Silicon Cloud<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.silicloud.com\/zh\/blog\/\u5c1d\u8bd5\u4f7f\u7528apache-tika\u8bfb\u53d6\u5185\u5bb9\u3002\/\" \/>\n<meta property=\"og:locale\" content=\"zh_CN\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"\u5c1d\u8bd5\u4f7f\u7528Apache Tika\u8bfb\u53d6\u5185\u5bb9\" \/>\n<meta property=\"og:description\" content=\"Apache Tika\u662f\u4ec0\u4e48 \u963f\u5e15\u5947\u63d0\u5361 Apache Tika\uff0c\u662f\u4e00\u500b\u4f7f\u7528Java\u958b\u767c\u7684\u6587\u4ef6\u5206\u6790\u548c\u5143\u6578\u64da\u63d0\u53d6 [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.silicloud.com\/zh\/blog\/\u5c1d\u8bd5\u4f7f\u7528apache-tika\u8bfb\u53d6\u5185\u5bb9\u3002\/\" \/>\n<meta property=\"og:site_name\" content=\"Blog - Silicon Cloud\" \/>\n<meta property=\"article:published_time\" content=\"2023-12-22T04:47:30+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-05-04T10:54:01+00:00\" \/>\n<meta name=\"author\" content=\"\u6e05, \u626c\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"\u4f5c\u8005\" \/>\n\t<meta name=\"twitter:data1\" content=\"\u6e05, \u626c\" \/>\n\t<meta name=\"twitter:label2\" 
content=\"\u9884\u8ba1\u9605\u8bfb\u65f6\u95f4\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 \u5206\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/%e5%b0%9d%e8%af%95%e4%bd%bf%e7%94%a8apache-tika%e8%af%bb%e5%8f%96%e5%86%85%e5%ae%b9%e3%80%82\/\",\"url\":\"https:\/\/www.silicloud.com\/zh\/blog\/%e5%b0%9d%e8%af%95%e4%bd%bf%e7%94%a8apache-tika%e8%af%bb%e5%8f%96%e5%86%85%e5%ae%b9%e3%80%82\/\",\"name\":\"\u5c1d\u8bd5\u4f7f\u7528Apache Tika\u8bfb\u53d6\u5185\u5bb9 - Blog - Silicon Cloud\",\"isPartOf\":{\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#website\"},\"datePublished\":\"2023-12-22T04:47:30+00:00\",\"dateModified\":\"2024-05-04T10:54:01+00:00\",\"author\":{\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/cb5556d2501da73d864cac945e8d9461\"},\"breadcrumb\":{\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/%e5%b0%9d%e8%af%95%e4%bd%bf%e7%94%a8apache-tika%e8%af%bb%e5%8f%96%e5%86%85%e5%ae%b9%e3%80%82\/#breadcrumb\"},\"inLanguage\":\"zh-Hans\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.silicloud.com\/zh\/blog\/%e5%b0%9d%e8%af%95%e4%bd%bf%e7%94%a8apache-tika%e8%af%bb%e5%8f%96%e5%86%85%e5%ae%b9%e3%80%82\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/%e5%b0%9d%e8%af%95%e4%bd%bf%e7%94%a8apache-tika%e8%af%bb%e5%8f%96%e5%86%85%e5%ae%b9%e3%80%82\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"\u9996\u9875\",\"item\":\"https:\/\/www.silicloud.com\/zh\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"\u5c1d\u8bd5\u4f7f\u7528Apache Tika\u8bfb\u53d6\u5185\u5bb9\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#website\",\"url\":\"https:\/\/www.silicloud.com\/zh\/blog\/\",\"name\":\"Blog - Silicon 
Cloud\",\"description\":\"\",\"inLanguage\":\"zh-Hans\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/cb5556d2501da73d864cac945e8d9461\",\"name\":\"\u6e05, \u626c\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"zh-Hans\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/32a4239de8ff29adace466261d309424a1e5fe9f7e3036bf89fe03f2e3dbe717?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/32a4239de8ff29adace466261d309424a1e5fe9f7e3036bf89fe03f2e3dbe717?s=96&d=mm&r=g\",\"caption\":\"\u6e05, \u626c\"},\"url\":\"https:\/\/www.silicloud.com\/zh\/blog\/author\/qingyang\/\"},{\"@type\":\"ImageObject\",\"inLanguage\":\"zh-Hans\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/%e5%b0%9d%e8%af%95%e4%bd%bf%e7%94%a8apache-tika%e8%af%bb%e5%8f%96%e5%86%85%e5%ae%b9%e3%80%82\/#local-main-organization-logo\",\"url\":\"\",\"contentUrl\":\"\",\"caption\":\"Blog - Silicon Cloud\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. 
-->","yoast_head_json":{"title":"\u5c1d\u8bd5\u4f7f\u7528Apache Tika\u8bfb\u53d6\u5185\u5bb9 - Blog - Silicon Cloud","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.silicloud.com\/zh\/blog\/\u5c1d\u8bd5\u4f7f\u7528apache-tika\u8bfb\u53d6\u5185\u5bb9\u3002\/","og_locale":"zh_CN","og_type":"article","og_title":"\u5c1d\u8bd5\u4f7f\u7528Apache Tika\u8bfb\u53d6\u5185\u5bb9","og_description":"Apache Tika\u662f\u4ec0\u4e48 \u963f\u5e15\u5947\u63d0\u5361 Apache Tika\uff0c\u662f\u4e00\u500b\u4f7f\u7528Java\u958b\u767c\u7684\u6587\u4ef6\u5206\u6790\u548c\u5143\u6578\u64da\u63d0\u53d6 [&hellip;]","og_url":"https:\/\/www.silicloud.com\/zh\/blog\/\u5c1d\u8bd5\u4f7f\u7528apache-tika\u8bfb\u53d6\u5185\u5bb9\u3002\/","og_site_name":"Blog - Silicon Cloud","article_published_time":"2023-12-22T04:47:30+00:00","article_modified_time":"2024-05-04T10:54:01+00:00","author":"\u6e05, \u626c","twitter_card":"summary_large_image","twitter_misc":{"\u4f5c\u8005":"\u6e05, \u626c","\u9884\u8ba1\u9605\u8bfb\u65f6\u95f4":"2 \u5206"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.silicloud.com\/zh\/blog\/%e5%b0%9d%e8%af%95%e4%bd%bf%e7%94%a8apache-tika%e8%af%bb%e5%8f%96%e5%86%85%e5%ae%b9%e3%80%82\/","url":"https:\/\/www.silicloud.com\/zh\/blog\/%e5%b0%9d%e8%af%95%e4%bd%bf%e7%94%a8apache-tika%e8%af%bb%e5%8f%96%e5%86%85%e5%ae%b9%e3%80%82\/","name":"\u5c1d\u8bd5\u4f7f\u7528Apache Tika\u8bfb\u53d6\u5185\u5bb9 - Blog - Silicon 
Cloud","isPartOf":{"@id":"https:\/\/www.silicloud.com\/zh\/blog\/#website"},"datePublished":"2023-12-22T04:47:30+00:00","dateModified":"2024-05-04T10:54:01+00:00","author":{"@id":"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/cb5556d2501da73d864cac945e8d9461"},"breadcrumb":{"@id":"https:\/\/www.silicloud.com\/zh\/blog\/%e5%b0%9d%e8%af%95%e4%bd%bf%e7%94%a8apache-tika%e8%af%bb%e5%8f%96%e5%86%85%e5%ae%b9%e3%80%82\/#breadcrumb"},"inLanguage":"zh-Hans","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.silicloud.com\/zh\/blog\/%e5%b0%9d%e8%af%95%e4%bd%bf%e7%94%a8apache-tika%e8%af%bb%e5%8f%96%e5%86%85%e5%ae%b9%e3%80%82\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.silicloud.com\/zh\/blog\/%e5%b0%9d%e8%af%95%e4%bd%bf%e7%94%a8apache-tika%e8%af%bb%e5%8f%96%e5%86%85%e5%ae%b9%e3%80%82\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"\u9996\u9875","item":"https:\/\/www.silicloud.com\/zh\/blog\/"},{"@type":"ListItem","position":2,"name":"\u5c1d\u8bd5\u4f7f\u7528Apache Tika\u8bfb\u53d6\u5185\u5bb9"}]},{"@type":"WebSite","@id":"https:\/\/www.silicloud.com\/zh\/blog\/#website","url":"https:\/\/www.silicloud.com\/zh\/blog\/","name":"Blog - Silicon Cloud","description":"","inLanguage":"zh-Hans"},{"@type":"Person","@id":"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/cb5556d2501da73d864cac945e8d9461","name":"\u6e05, \u626c","image":{"@type":"ImageObject","inLanguage":"zh-Hans","@id":"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/32a4239de8ff29adace466261d309424a1e5fe9f7e3036bf89fe03f2e3dbe717?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/32a4239de8ff29adace466261d309424a1e5fe9f7e3036bf89fe03f2e3dbe717?s=96&d=mm&r=g","caption":"\u6e05, 
\u626c"},"url":"https:\/\/www.silicloud.com\/zh\/blog\/author\/qingyang\/"},{"@type":"ImageObject","inLanguage":"zh-Hans","@id":"https:\/\/www.silicloud.com\/zh\/blog\/%e5%b0%9d%e8%af%95%e4%bd%bf%e7%94%a8apache-tika%e8%af%bb%e5%8f%96%e5%86%85%e5%ae%b9%e3%80%82\/#local-main-organization-logo","url":"","contentUrl":"","caption":"Blog - Silicon Cloud"}]}},"_links":{"self":[{"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/posts\/36867","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/users\/9"}],"replies":[{"embeddable":true,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/comments?post=36867"}],"version-history":[{"count":2,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/posts\/36867\/revisions"}],"predecessor-version":[{"id":100085,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/posts\/36867\/revisions\/100085"}],"wp:attachment":[{"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/media?parent=36867"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/categories?post=36867"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/tags?post=36867"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}