{"id":46124,"date":"2022-12-16T19:15:05","date_gmt":"2023-06-20T22:57:48","guid":{"rendered":"https:\/\/www.silicloud.com\/zh\/blog\/46124-2\/"},"modified":"2024-04-29T05:36:46","modified_gmt":"2024-04-28T21:36:46","slug":"46124-2","status":"publish","type":"post","link":"https:\/\/www.silicloud.com\/zh\/blog\/46124-2\/","title":{"rendered":""},"content":{"rendered":"<h2>\u76ee\u7684<\/h2>\n<p>PyPI (pip) \u3067Tesseract-OCR\u3092\u69cb\u7bc9\u3059\u308b\u65b9\u6cd5\u306f\u30cd\u30c3\u30c8\u4e0a\u306b\u305f\u304f\u3055\u3093\u60c5\u5831\u304c\u51fa\u3066\u3044\u308b\u306e\u3067\u3059\u304c\u3001Anaconda (conda) \u306b\u95a2\u3059\u308b\u60c5\u5831\u304c\u5c11\u306a\u3044\u3067\u3059\u3002<br \/>\n\u79c1\u304c\u69cb\u7bc9\u3057\u305f\u969b\u306b\u3064\u307e\u305a\u3044\u305f\u70b9\u304c\u3042\u3063\u305f\u306e\u3067\u5099\u5fd8\u9332\u3068\u3057\u3066\u5bfe\u7b56\u307e\u3068\u3081\u3066\u304a\u304d\u307e\u3059\u3002<\/p>\n<h2>\u30b7\u30b9\u30c6\u30e0\u8981\u4ef6<\/h2>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">Windows 11 Home 22H2<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">Anaconda 3.11.1<\/ul>\n<\/li>\n<\/ul>\n<p>Python 3.9.15<\/p>\n<h2>\u8a66\u3057\u305f\u3053\u3068<\/h2>\n<h3>\u6210\u529f<\/h3>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">Anaconda Prompt\u3067 conda install -c conda-forge tesseract<\/ul>\n<\/li>\n<\/ul>\n<p>Jupyter Notebook\u4e0a\u3067 tesseract \u3068 tessdata \u306e\u30d1\u30b9\u6307\u5b9a<\/p>\n<h3>\u5931\u6557<\/h3>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">Windows\u306b\u76f4\u63a5Tesseract-OCR\u3092\u30a4\u30f3\u30b9\u30c8\u30fc\u30eb<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">Jupyter Notebook\u4e0a\u3067 !conda install -c conda-forge tesseract<\/ul>\n<\/li>\n<\/ul>\n<p>Jupyter Notebook\u4e0a\u3067 !pip install tesseract-ocr libtesseract-dev tesseract-ocr-jpn<\/p>\n<p>libtesseract \u304c\u898b\u3064\u304b\u3089\u306a\u3044\u3068\u6012\u3089\u308c\u308b<\/p>\n<h2>\u6210\u529f\u3057\u305f\u65b9\u6cd5<\/h2>\n<h3>Anaconda Prompt\u3067Tesseract-OCR\u3092\u30a4\u30f3\u30b9\u30c8\u30fc\u30eb<\/h3>\n<pre class=\"post-pre\"><code>(base) &gt; conda install -c conda-forge tesseract\r\n<\/code><\/pre>\n<p>\u3053\u3053\u3067Anaconda Prompt\u4e0a\u3067 tesseract \u304c\u52d5\u304f\u304b\u8a66\u3057\u307e\u3059\u3002<br \/>\n\u6b63\u3057\u304f\u8a00\u8a9e\u30ea\u30b9\u30c8\u304c\u8868\u793a\u3055\u308c\u3066\u3044\u307e\u3059\u3002<\/p>\n<pre class=\"post-pre\"><code>(base) &gt; tesseract --list-langs\r\nList of available languages in \"C:\\Users\\kabeg\\anaconda3\\share\\tessdata\/\" (125):\r\nafr\r\namh\r\nara\r\nasm\r\n(\u4e2d\u7565)\r\n<\/code><\/pre>\n<h3>Jupyter Notebook\u3067\u52d5\u4f5c\u78ba\u8a8d \u2192 \u8a00\u8a9e\u30ea\u30b9\u30c8\u304c\u7a7a??<\/h3>\n<p>\u7d9a\u3044\u3066Jupyter Notebook\u4e0a\u3067\u3082 tesseract \u306e\u52d5\u4f5c\u78ba\u8a8d\u3067\u3059\u3002<br \/>\ntesseract \u81ea\u4f53\u306f\u52d5\u3044\u3066\u3044\u308b\u3088\u3046\u306b\u898b\u3048\u307e\u3059\u304c\u3001\u8a00\u8a9e\u30ea\u30b9\u30c8\u304c\u7a7a\u3067\u3059\u3002<br \/>\n\u307e\u305f\u8a00\u8a9e\u30c7\u30fc\u30bf tessdata \u306e\u30c7\u30a3\u30ec\u30af\u30c8\u30ea\u306fC:\/Users\/username\/anaconda3\/share\/tessdata \u306b\u3042\u308b\u306f\u305a\u3067\u3059\u304c\u3001\u3069\u3046\u3044\u3046\u308f\u3051\u304b .\/ \u306b\u306a\u3063\u3066\u3044\u307e\u3059\u3002<\/p>\n<pre class=\"post-pre\"><code><span class=\"err\">!<\/span><span class=\"n\">tesseract<\/span> <span class=\"o\">--<\/span><span class=\"nb\">list<\/span><span class=\"o\">-<\/span><span class=\"n\">langs<\/span>\r\n<span class=\"c1\"># List of available languages in \".\/\" (0):\r\n<\/span><\/code><\/pre>\n<h3>Jupyter Notebook\u4e0a\u3067Path\u3092\u901a\u3059<\/h3>\n<p>Windows\u306e\u74b0\u5883\u5909\u6570\u3082\u3044\u3058\u3063\u3066\u307f\u307e\u3057\u305f\u304c\u3001\u52b9\u679c\u304c\u306a\u304b\u3063\u305f\u306e\u3067Jupyter Notebook\u4e0a\u3067\u30d1\u30b9\u3092\u6307\u5b9a\u3057\u307e\u3059\u3002<br \/>\n\u4ee5\u4e0b\u306e\u30b3\u30fc\u30c9\u3092 import \u6587\u306e\u4e0b\u306a\u3069\u306b\u304a\u3044\u3066\u5b9f\u884c\u3057\u307e\u3059\u3002<\/p>\n<pre class=\"post-pre\"><code><span class=\"kn\">import<\/span> <span class=\"n\">pyocr<\/span>\r\n<span class=\"kn\">import<\/span> <span class=\"n\">os<\/span>\r\n\r\n<span class=\"n\">TESSDATA_PATH<\/span> <span class=\"o\">=<\/span> <span class=\"sh\">'<\/span><span class=\"s\">C:<\/span><span class=\"se\">\\\\<\/span><span class=\"s\">Users<\/span><span class=\"se\">\\\\<\/span><span class=\"s\">username<\/span><span class=\"se\">\\\\<\/span><span class=\"s\">anaconda3<\/span><span class=\"se\">\\\\<\/span><span class=\"s\">share<\/span><span class=\"se\">\\\\<\/span><span class=\"s\">tessdata<\/span><span class=\"sh\">'<\/span>        <span class=\"c1\"># tessdata\u3078\u306e\u30d1\u30b9 (anaconda\u3067\u30a4\u30f3\u30b9\u30c8\u30fc\u30eb\u306e\u5834\u5408)\r\n<\/span>                                                                         <span class=\"c1\"># username\u306f\u74b0\u5883\u306b\u3088\u3063\u3066\u5909\u308f\u308a\u307e\u3059\r\n# TESSDATA_PATH = 'C:\\\\Program Files\\\\Tesseract-OCR\\\\tessdata'           # tessdata\u3078\u306e\u30d1\u30b9 (Windows\u76f4\u30a4\u30f3\u30b9\u30c8\u30fc\u30eb\u306e\u5834\u5408)\r\n# TESSERACT_PATH = 'C:\\\\Program Files\\\\Tesseract-OCR\\\\tesseract.exe'     # tesseract\u304c\u5b9f\u884c\u3067\u304d\u306a\u3044\u5834\u5408\u306ftesseract.exe\u306b\u30d1\u30b9\u3092\u901a\u3059\r\n<\/span>\r\n<span class=\"n\">os<\/span><span class=\"p\">.<\/span><span class=\"n\">environ<\/span><span class=\"p\">[<\/span><span class=\"sh\">\"<\/span><span class=\"s\">TESSDATA_PREFIX<\/span><span class=\"sh\">\"<\/span><span class=\"p\">]<\/span> <span class=\"o\">=<\/span> <span class=\"n\">TESSDATA_PATH<\/span>          <span class=\"c1\"># tessdata\u3078\u306e\u30d1\u30b9\u3092\u901a\u3059\r\n# os.environ[\"PATH\"] += os.pathsep + TESSERACT_PATH    # tesseract.exe\u306e\u30d1\u30b9\u3082\u6307\u5b9a\u3059\u308b\u5fc5\u8981\u304c\u6709\u308b\u5834\u5408\r\n<\/span><\/code><\/pre>\n<h3>\u518d\u5ea6Jupyter Notebook\u3067\u52d5\u4f5c\u78ba\u8a8d<\/h3>\n<p>\u4eca\u5ea6\u306f\u554f\u984c\u306a\u304f\u8a00\u8a9e\u30ea\u30b9\u30c8\u304c\u8aad\u307f\u8fbc\u3081\u307e\u3057\u305f\u3002<\/p>\n<pre class=\"post-pre\"><code>!tesseract --list-langs\r\nList of available languages in \"C:\\Program Files\\Tesseract-OCR\\tessdata\/\" (6):\r\neng\r\njpn\r\njpn_vert\r\nosd\r\nscript\/Japanese\r\nscript\/Japanese_vert\r\n<\/code><\/pre>\n<h2>\u8003\u5bdf<\/h2>\n<p>\u4e00\u756a\u4e0d\u601d\u8b70\u306a\u306e\u306fAnaconda Prompt\u4e0a\u3067\u306f\u8a00\u8a9e\u30c7\u30fc\u30bf\u3078\u306e\u30d1\u30b9\u304c\u3046\u307e\u304f\u901a\u3063\u3066\u3044\u305f\u306e\u306bJupyter NB\u4e0a\u3067\u306f\u30d1\u30b9\u304c\u901a\u3063\u3066\u3044\u306a\u304b\u3063\u305f\u3053\u3068\u3067\u3059\u3002<br \/>\n\u8003\u3048\u3089\u308c\u308b\u539f\u56e0\u3068\u3057\u3066\u306fJupyter NB\u3067\u8d70\u3063\u3066\u3044\u308bAnaconda\u306f\u4eee\u60f3\u74b0\u5883\u3067\u3042\u308b\u305f\u3081\u30d1\u30c3\u30b1\u30fc\u30b8\u7b49\u306f\u5f15\u304d\u7d99\u304c\u308c\u308b\u304c\u30d1\u30b9\u7b49\u306f\u5f15\u304d\u7d99\u304c\u308c\u306a\u3044\u3053\u3068\u3067\u3059\u3002<br \/>\n\u6b63\u76f4\u79c1\u306fPython\u5468\u308a\u306b\u3042\u307e\u308a\u660e\u308b\u304f\u306a\u3044\u306e\u3067\u3082\u3057\u539f\u56e0\u306b\u5fc3\u5f53\u305f\u308a\u304c\u3042\u308b\u65b9\u304c\u3044\u3089\u3063\u3057\u3083\u308c\u3070\u30b3\u30e1\u30f3\u30c8\u7b49\u3067\u3054\u6559\u793a\u3044\u305f\u3060\u3051\u308b\u3068\u5e78\u3044\u3067\u3059\u3002<\/p>\n","protected":false},"excerpt":{"rendered":"<p>\u76ee\u7684 PyPI (pip) \u3067Tesseract-OCR\u3092\u69cb\u7bc9\u3059\u308b\u65b9\u6cd5\u306f\u30cd\u30c3\u30c8\u4e0a\u306b\u305f\u304f\u3055\u3093\u60c5\u5831\u304c\u51fa\u3066\u3044\u308b\u306e\u3067\u3059 [&hellip;]<\/p>\n","protected":false},"author":4,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-46124","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v21.5 (Yoast SEO v21.5) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>- Blog - Silicon Cloud<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.silicloud.com\/zh\/blog\/46124-2\/\" \/>\n<meta property=\"og:locale\" content=\"zh_CN\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:description\" content=\"\u76ee\u7684 PyPI (pip) \u3067Tesseract-OCR\u3092\u69cb\u7bc9\u3059\u308b\u65b9\u6cd5\u306f\u30cd\u30c3\u30c8\u4e0a\u306b\u305f\u304f\u3055\u3093\u60c5\u5831\u304c\u51fa\u3066\u3044\u308b\u306e\u3067\u3059 [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.silicloud.com\/zh\/blog\/46124-2\/\" \/>\n<meta property=\"og:site_name\" content=\"Blog - Silicon Cloud\" \/>\n<meta property=\"article:published_time\" content=\"2023-06-20T22:57:48+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-04-28T21:36:46+00:00\" \/>\n<meta name=\"author\" content=\"\u79d1, \u96c5\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"\u4f5c\u8005\" \/>\n\t<meta name=\"twitter:data1\" content=\"\u79d1, \u96c5\" \/>\n\t<meta name=\"twitter:label2\" content=\"\u9884\u8ba1\u9605\u8bfb\u65f6\u95f4\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 \u5206\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/46124-2\/\",\"url\":\"https:\/\/www.silicloud.com\/zh\/blog\/46124-2\/\",\"name\":\"- Blog - Silicon Cloud\",\"isPartOf\":{\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#website\"},\"datePublished\":\"2023-06-20T22:57:48+00:00\",\"dateModified\":\"2024-04-28T21:36:46+00:00\",\"author\":{\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/41e222757cdd2a3365361328bd79970a\"},\"inLanguage\":\"zh-Hans\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.silicloud.com\/zh\/blog\/46124-2\/\"]}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#website\",\"url\":\"https:\/\/www.silicloud.com\/zh\/blog\/\",\"name\":\"Blog - Silicon Cloud\",\"description\":\"\",\"inLanguage\":\"zh-Hans\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/41e222757cdd2a3365361328bd79970a\",\"name\":\"\u79d1, \u96c5\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"zh-Hans\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/1b2d3e00a7df03689797ebd4af8c5827ba5af936849a71050ec331f4cf902c5d?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/1b2d3e00a7df03689797ebd4af8c5827ba5af936849a71050ec331f4cf902c5d?s=96&d=mm&r=g\",\"caption\":\"\u79d1, \u96c5\"},\"url\":\"https:\/\/www.silicloud.com\/zh\/blog\/author\/keya\/\"},{\"@type\":\"ImageObject\",\"inLanguage\":\"zh-Hans\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/46124-2\/#local-main-organization-logo\",\"url\":\"\",\"contentUrl\":\"\",\"caption\":\"Blog - Silicon Cloud\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"- Blog - Silicon Cloud","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.silicloud.com\/zh\/blog\/46124-2\/","og_locale":"zh_CN","og_type":"article","og_description":"\u76ee\u7684 PyPI (pip) \u3067Tesseract-OCR\u3092\u69cb\u7bc9\u3059\u308b\u65b9\u6cd5\u306f\u30cd\u30c3\u30c8\u4e0a\u306b\u305f\u304f\u3055\u3093\u60c5\u5831\u304c\u51fa\u3066\u3044\u308b\u306e\u3067\u3059 [&hellip;]","og_url":"https:\/\/www.silicloud.com\/zh\/blog\/46124-2\/","og_site_name":"Blog - Silicon Cloud","article_published_time":"2023-06-20T22:57:48+00:00","article_modified_time":"2024-04-28T21:36:46+00:00","author":"\u79d1, \u96c5","twitter_card":"summary_large_image","twitter_misc":{"\u4f5c\u8005":"\u79d1, \u96c5","\u9884\u8ba1\u9605\u8bfb\u65f6\u95f4":"1 \u5206"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.silicloud.com\/zh\/blog\/46124-2\/","url":"https:\/\/www.silicloud.com\/zh\/blog\/46124-2\/","name":"- Blog - Silicon Cloud","isPartOf":{"@id":"https:\/\/www.silicloud.com\/zh\/blog\/#website"},"datePublished":"2023-06-20T22:57:48+00:00","dateModified":"2024-04-28T21:36:46+00:00","author":{"@id":"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/41e222757cdd2a3365361328bd79970a"},"inLanguage":"zh-Hans","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.silicloud.com\/zh\/blog\/46124-2\/"]}]},{"@type":"WebSite","@id":"https:\/\/www.silicloud.com\/zh\/blog\/#website","url":"https:\/\/www.silicloud.com\/zh\/blog\/","name":"Blog - Silicon Cloud","description":"","inLanguage":"zh-Hans"},{"@type":"Person","@id":"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/41e222757cdd2a3365361328bd79970a","name":"\u79d1, \u96c5","image":{"@type":"ImageObject","inLanguage":"zh-Hans","@id":"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/1b2d3e00a7df03689797ebd4af8c5827ba5af936849a71050ec331f4cf902c5d?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/1b2d3e00a7df03689797ebd4af8c5827ba5af936849a71050ec331f4cf902c5d?s=96&d=mm&r=g","caption":"\u79d1, \u96c5"},"url":"https:\/\/www.silicloud.com\/zh\/blog\/author\/keya\/"},{"@type":"ImageObject","inLanguage":"zh-Hans","@id":"https:\/\/www.silicloud.com\/zh\/blog\/46124-2\/#local-main-organization-logo","url":"","contentUrl":"","caption":"Blog - Silicon Cloud"}]}},"_links":{"self":[{"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/posts\/46124","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/comments?post=46124"}],"version-history":[{"count":2,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/posts\/46124\/revisions"}],"predecessor-version":[{"id":83364,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/posts\/46124\/revisions\/83364"}],"wp:attachment":[{"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/media?parent=46124"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/categories?post=46124"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/tags?post=46124"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}