{"id":45467,"date":"2022-12-05T16:59:46","date_gmt":"2023-09-11T14:11:34","guid":{"rendered":"https:\/\/www.silicloud.com\/zh\/blog\/45467-2\/"},"modified":"2024-04-29T03:51:52","modified_gmt":"2024-04-28T19:51:52","slug":"45467-2","status":"publish","type":"post","link":"https:\/\/www.silicloud.com\/zh\/blog\/45467-2\/","title":{"rendered":""},"content":{"rendered":"<p>Series contents<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">Ruby\/Rust Integration (1): Purpose<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">Ruby\/Rust Integration (2): Means<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">Ruby\/Rust Integration (3): Numerical Computation with FFI<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">Ruby\/Rust Integration (4): Numerical Computation with Rutie, Part 1<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">Ruby\/Rust Integration (5): Numerical Computation with Rutie, Part 2 (B\u00e9zier)<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">Ruby\/Rust Integration (6): Extracting Morphemes<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">Ruby\/Rust Integration (7): Building a Rust Extension Gem That Compiles at Install Time<\/ul>\n<h1>Introduction<\/h1>\n<p>I am gradually getting the hang of making Ruby and Rust work together, and it is getting fun.<br 
\/>\nSo far, in (3) to (5), I had Rust do numerical computation, so this time let's try text processing.<br \/>\nRight, without further ado: using a Rust morphological analysis library, I will extract from a text only the morphemes of particular parts of speech, such as proper nouns only, all nouns, or adjectives and adverbs.<\/p>\n<p>Note that although I have used Ruby for a long time (if not very intensively), I am a complete novice at Rust, and my experience with morphological analysis amounts to playing around a little with MeCab from Ruby. I do not understand the difficult parts.<\/p>\n<h1>Approach<\/h1>\n<p>As the morphological analysis library written in Rust, I use Lindera.<br \/>\nIt is a fork, by @mosuka, of kuromoji-rs, a library that was created as an experiment. It takes the form of a fork, but in effect development was taken over under a different name.<br \/>\nFor the background, see the following article by @mosuka.<br 
\/>\nRust\u521d\u5fc3\u8005\u304cRust\u88fd\u306e\u65e5\u672c\u8a9e\u5f62\u614b\u7d20\u89e3\u6790\u5668\u306e\u958b\u767a\u3092\u5f15\u304d\u7d99\u3044\u3067\u307f\u305f &#8211; Qiita (an article on how a Rust beginner took over development of a Japanese morphological analyzer written in Rust)<\/p>\n<p>As in (4) and (5), Rutie serves as the mechanism connecting Ruby and Rust.<\/p>\n<p>I have the following division of roles between Ruby and Rust in mind.<br \/>\nIn Rust, I write a Ruby class that could be called a morpheme extractor (Rutie lets you write Ruby classes in Rust). At initialization it is given a list of the parts of speech to pick up (every part of speech in the list is picked up).<br \/>\nYou create an instance of the morpheme extractor and give it a text, and it returns the matching morphemes as an array of strings (in order of appearance, duplicates included).<\/p>\n<p>The sample program on the Ruby side builds a frequency table from the returned list of morphemes and prints the entries in descending order of frequency.<\/p>\n<h1>Features of Lindera<\/h1>\n<p>For an overview of Lindera, please see the linked article; here I only want to point out the following.<\/p>\n<ul class=\"post-ul\">\n<li 
style=\"list-style-type: none;\">\n<ul class=\"post-ul\">IPADIC is bundled from the start<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">Other dictionaries such as IPADIC-NEologd are also easy to use<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">Adding user words is easy<\/ul>\n<p>Having a dictionary bundled from the start is a real help. When all you want is a quick trial, it is rather painful to first download a dictionary from somewhere, run some command or other, and then place the resulting files in some particular location.<\/p>\n<p>Also, IPADIC has far too small a vocabulary for handling the sentences that fly around SNS and other media, so it is welcome that a large dictionary such as IPADIC-NEologd can be used easily.<\/p>\n<p>Adding user words takes nothing more than a CSV 
file: you put it somewhere and specify its path.<\/p>\n<h1>Motivation<\/h1>\n<p>In this article I want to present, so that others can easily use it as a reference, &#8220;code that is of little practical use but simple enough that the path to practical code can be imagined&#8221;.<\/p>\n<p>For Ruby, there are gems such as natto and jumanpp_ruby for using the morphological analyzers MeCab and JUMAN++, respectively<sup>1<\/sup>.<\/p>\n<p>So why go out of the way to write code that calls Rust from Ruby?<br \/>\nBehind it lies the hypothesis I described under &#8220;GC \u3092\u907f\u3051\u305f\u3044&#8221; (wanting to avoid GC).<\/p>\n<p>When MeCab or the like is used from Ruby, a large amount of string data is brought over to the Ruby side for every morpheme. Most of it becomes garbage, and once enough accumulates it is subject to garbage collection. I suspect this is rather inefficient<sup>2<\/sup>.<\/p>\n<p>For a task like noun extraction, would it not be more efficient to pick out just the nouns on the Rust side and return only the strings that the Ruby 
side actually wants?<br \/>\nNo garbage collection takes place on the Rust side: a variable is dropped the moment it goes out of scope.<\/p>\n<h1>Implementation: Rust side<\/h1>\n<h2>Up to editing Cargo.toml<\/h2>\n<p>First, run:<\/p>\n<pre class=\"post-pre\"><code>cargo new phoneme_extractor <span class=\"nt\">--lib<\/span>\r\n<\/code><\/pre>\n<p>I use phoneme here in the sense of morpheme. (Strictly speaking, the correct English term is morpheme, and a phoneme is a unit of sound; the name is left as it is because the identifiers below depend on it.)<br \/>\nI do not know whether my Japanese coinage for &#8220;morpheme extractor&#8221; is apt, nor whether phoneme extractor is really the right English for it.<\/p>\n<p>Then write the following in Cargo.toml:<\/p>\n<pre class=\"post-pre\"><code><span class=\"nn\">[dependencies]<\/span>\r\n<span class=\"py\">lindera<\/span> <span class=\"p\">=<\/span> <span class=\"s\">\"0.5.1\"<\/span>\r\n<span class=\"py\">lazy_static<\/span> <span class=\"p\">=<\/span> <span class=\"s\">\"1.4.0\"<\/span>\r\n<span class=\"py\">rutie<\/span> <span class=\"p\">=<\/span> <span class=\"s\">\"0.8.1\"<\/span>\r\n<span class=\"py\">serde<\/span> <span class=\"p\">=<\/span> <span class=\"s\">\"1.0.115\"<\/span>\r\n<span class=\"py\">serde_json<\/span> <span class=\"p\">=<\/span> <span class=\"s\">\"1.0.57\"<\/span>\r\n\r\n<span class=\"nn\">[lib]<\/span>\r\n<span class=\"py\">crate-type<\/span> <span class=\"p\">=<\/span> <span class=\"nn\">[\"cdylib\"]<\/span>\r\n<\/code><\/pre>\n<p>(Addendum, 
2020-10-01) I had specified Rutie version &#8220;0.7.0&#8221;, but changed it to &#8220;0.8.1&#8221;, the latest version at the time of writing. With this change, the warnings that appeared under Rust 1.46 go away. If anyone finds that it compiled with 0.7.0 but not with 0.8.1, please let me know.<\/p>\n<p>lindera is the morphological analysis crate at the heart of this task.<br \/>\nrutie is the crate that connects Ruby and Rust.<br \/>\nlazy_static is a crate required when defining classes with Rutie.<\/p>\n<p>I could not think of a good way to pass information such as which parts of speech to extract from Ruby to Rust, so I decided to pass it as a JSON string.<br \/>\nThat is what serde and serde_json are for.<\/p>\n<h2>Code<\/h2>\n<p>Here is the whole of the Rust code.<\/p>\n<pre class=\"post-pre\"><code><span class=\"nd\">#[macro_use]<\/span>\r\n<span class=\"k\">extern<\/span> <span class=\"n\">crate<\/span> <span class=\"n\">rutie<\/span><span class=\"p\">;<\/span>\r\n\r\n<span class=\"nd\">#[macro_use]<\/span>\r\n<span class=\"k\">extern<\/span> <span class=\"n\">crate<\/span> <span class=\"n\">lazy_static<\/span><span 
class=\"p\">;<\/span>\r\n\r\n<span class=\"k\">use<\/span> <span class=\"nn\">serde<\/span><span class=\"p\">::{<\/span><span class=\"n\">Deserialize<\/span><span class=\"p\">};<\/span>\r\n\r\n<span class=\"k\">use<\/span> <span class=\"nn\">rutie<\/span><span class=\"p\">::{<\/span><span class=\"n\">Object<\/span><span class=\"p\">,<\/span> <span class=\"n\">Class<\/span><span class=\"p\">,<\/span> <span class=\"n\">RString<\/span><span class=\"p\">,<\/span> <span class=\"n\">Array<\/span><span class=\"p\">};<\/span>\r\n\r\n<span class=\"k\">use<\/span> <span class=\"nn\">lindera<\/span><span class=\"p\">::<\/span><span class=\"nn\">tokenizer<\/span><span class=\"p\">::<\/span><span class=\"n\">Tokenizer<\/span><span class=\"p\">;<\/span>\r\n\r\n<span class=\"nd\">#[derive(Deserialize)]<\/span>\r\n<span class=\"k\">pub<\/span> <span class=\"k\">struct<\/span> <span class=\"n\">RustPhonemeExtractor<\/span> <span class=\"p\">{<\/span>\r\n    <span class=\"n\">mode<\/span><span class=\"p\">:<\/span> <span class=\"nb\">String<\/span><span class=\"p\">,<\/span>\r\n    <span class=\"n\">allowed_poss<\/span><span class=\"p\">:<\/span> <span class=\"nb\">Vec<\/span><span class=\"o\">&lt;<\/span><span class=\"nb\">String<\/span><span class=\"o\">&gt;<\/span><span class=\"p\">,<\/span>\r\n<span class=\"p\">}<\/span>\r\n\r\n<span class=\"nd\">wrappable_struct!<\/span><span class=\"p\">(<\/span><span class=\"n\">RustPhonemeExtractor<\/span><span class=\"p\">,<\/span> <span class=\"n\">PhonemeExtractorWrapper<\/span><span class=\"p\">,<\/span> <span class=\"n\">PHONEME_EXTRACTOR_WRAPPER<\/span><span class=\"p\">);<\/span>\r\n\r\n<span class=\"nd\">class!<\/span><span class=\"p\">(<\/span><span class=\"n\">PhonemeExtractor<\/span><span class=\"p\">);<\/span>\r\n\r\n<span class=\"nd\">methods!<\/span><span class=\"p\">(<\/span>\r\n    <span class=\"n\">PhonemeExtractor<\/span><span class=\"p\">,<\/span>\r\n    <span class=\"n\">rtself<\/span><span class=\"p\">,<\/span>\r\n\r\n   
 <span class=\"k\">fn<\/span> <span class=\"nf\">phoneme_extractor_new<\/span><span class=\"p\">(<\/span><span class=\"n\">params<\/span><span class=\"p\">:<\/span> <span class=\"n\">RString<\/span><span class=\"p\">)<\/span> <span class=\"k\">-&gt;<\/span> <span class=\"n\">PhonemeExtractor<\/span> <span class=\"p\">{<\/span>\r\n        <span class=\"k\">let<\/span> <span class=\"n\">params<\/span> <span class=\"o\">=<\/span> <span class=\"n\">params<\/span><span class=\"nf\">.unwrap<\/span><span class=\"p\">()<\/span><span class=\"nf\">.to_string<\/span><span class=\"p\">();<\/span>\r\n        <span class=\"k\">let<\/span> <span class=\"n\">rpe<\/span><span class=\"p\">:<\/span> <span class=\"n\">RustPhonemeExtractor<\/span> <span class=\"o\">=<\/span> <span class=\"nn\">serde_json<\/span><span class=\"p\">::<\/span><span class=\"nf\">from_str<\/span><span class=\"p\">(<\/span><span class=\"o\">&amp;<\/span><span class=\"n\">params<\/span><span class=\"p\">)<\/span><span class=\"nf\">.unwrap<\/span><span class=\"p\">();<\/span>\r\n\r\n        <span class=\"nn\">Class<\/span><span class=\"p\">::<\/span><span class=\"nf\">from_existing<\/span><span class=\"p\">(<\/span><span class=\"s\">\"PhonemeExtractor\"<\/span><span class=\"p\">)<\/span><span class=\"nf\">.wrap_data<\/span><span class=\"p\">(<\/span><span class=\"n\">rpe<\/span><span class=\"p\">,<\/span> <span class=\"o\">&amp;*<\/span><span class=\"n\">PHONEME_EXTRACTOR_WRAPPER<\/span><span class=\"p\">)<\/span>\r\n    <span class=\"p\">}<\/span>\r\n\r\n    <span class=\"k\">fn<\/span> <span class=\"nf\">extract<\/span><span class=\"p\">(<\/span><span class=\"n\">input<\/span><span class=\"p\">:<\/span> <span class=\"n\">RString<\/span><span class=\"p\">)<\/span> <span class=\"k\">-&gt;<\/span> <span class=\"n\">Array<\/span> <span class=\"p\">{<\/span>\r\n        <span class=\"k\">let<\/span> <span class=\"n\">extractor<\/span> <span class=\"o\">=<\/span> <span class=\"n\">rtself<\/span><span 
class=\"nf\">.get_data<\/span><span class=\"p\">(<\/span><span class=\"o\">&amp;*<\/span><span class=\"n\">PHONEME_EXTRACTOR_WRAPPER<\/span><span class=\"p\">);<\/span>\r\n        <span class=\"k\">let<\/span> <span class=\"n\">input<\/span> <span class=\"o\">=<\/span> <span class=\"n\">input<\/span><span class=\"nf\">.unwrap<\/span><span class=\"p\">();<\/span>\r\n        <span class=\"k\">let<\/span> <span class=\"k\">mut<\/span> <span class=\"n\">tokenizer<\/span> <span class=\"o\">=<\/span> <span class=\"nn\">Tokenizer<\/span><span class=\"p\">::<\/span><span class=\"nf\">new<\/span><span class=\"p\">(<\/span><span class=\"o\">&amp;<\/span><span class=\"n\">extractor<\/span><span class=\"py\">.mode<\/span><span class=\"p\">,<\/span> <span class=\"s\">\"\"<\/span><span class=\"p\">);<\/span>\r\n        <span class=\"k\">let<\/span> <span class=\"n\">tokens<\/span> <span class=\"o\">=<\/span> <span class=\"n\">tokenizer<\/span><span class=\"nf\">.tokenize<\/span><span class=\"p\">(<\/span><span class=\"n\">input<\/span><span class=\"nf\">.to_str<\/span><span class=\"p\">());<\/span>\r\n\r\n        <span class=\"k\">let<\/span> <span class=\"k\">mut<\/span> <span class=\"n\">result<\/span> <span class=\"o\">=<\/span> <span class=\"nn\">Array<\/span><span class=\"p\">::<\/span><span class=\"nf\">new<\/span><span class=\"p\">();<\/span>\r\n        <span class=\"k\">for<\/span> <span class=\"n\">token<\/span> <span class=\"n\">in<\/span> <span class=\"n\">tokens<\/span> <span class=\"p\">{<\/span>\r\n            <span class=\"k\">let<\/span> <span class=\"n\">detail<\/span> <span class=\"o\">=<\/span> <span class=\"n\">token<\/span><span class=\"py\">.detail<\/span><span class=\"p\">;<\/span>\r\n            <span class=\"k\">let<\/span> <span class=\"n\">pos<\/span><span class=\"p\">:<\/span> <span class=\"nb\">String<\/span> <span class=\"o\">=<\/span> <span class=\"n\">detail<\/span><span class=\"nf\">.join<\/span><span class=\"p\">(<\/span><span 
class=\"s\">\",\"<\/span><span class=\"p\">);<\/span>\r\n            <span class=\"k\">if<\/span> <span class=\"n\">extractor<\/span><span class=\"py\">.allowed_poss<\/span><span class=\"nf\">.iter<\/span><span class=\"p\">()<\/span><span class=\"nf\">.any<\/span><span class=\"p\">(|<\/span><span class=\"n\">s<\/span><span class=\"p\">|<\/span> <span class=\"n\">pos<\/span><span class=\"nf\">.starts_with<\/span><span class=\"p\">(<\/span><span class=\"n\">s<\/span><span class=\"p\">))<\/span> <span class=\"p\">{<\/span>\r\n                <span class=\"n\">result<\/span><span class=\"nf\">.push<\/span><span class=\"p\">(<\/span><span class=\"nn\">RString<\/span><span class=\"p\">::<\/span><span class=\"nf\">new_utf8<\/span><span class=\"p\">(<\/span><span class=\"o\">&amp;<\/span><span class=\"n\">token<\/span><span class=\"py\">.text<\/span><span class=\"p\">));<\/span>\r\n            <span class=\"p\">}<\/span>\r\n        <span class=\"p\">}<\/span>\r\n\r\n        <span class=\"n\">result<\/span>\r\n    <span class=\"p\">}<\/span>\r\n<span class=\"p\">);<\/span>\r\n\r\n<span class=\"nd\">#[allow(non_snake_case)]<\/span>\r\n<span class=\"nd\">#[no_mangle]<\/span>\r\n<span class=\"k\">pub<\/span> <span class=\"k\">extern<\/span> <span class=\"s\">\"C\"<\/span> <span class=\"k\">fn<\/span> <span class=\"nf\">Init_phoneme_extractor<\/span><span class=\"p\">()<\/span> <span class=\"p\">{<\/span>\r\n    <span class=\"nn\">Class<\/span><span class=\"p\">::<\/span><span class=\"nf\">new<\/span><span class=\"p\">(<\/span><span class=\"s\">\"PhonemeExtractor\"<\/span><span class=\"p\">,<\/span> <span class=\"nb\">None<\/span><span class=\"p\">)<\/span><span class=\"nf\">.define<\/span><span class=\"p\">(|<\/span><span class=\"n\">klass<\/span><span class=\"p\">|<\/span> <span class=\"p\">{<\/span>\r\n        <span class=\"n\">klass<\/span><span class=\"nf\">.def_self<\/span><span class=\"p\">(<\/span><span class=\"s\">\"new\"<\/span><span class=\"p\">,<\/span> <span 
class=\"n\">phoneme_extractor_new<\/span><span class=\"p\">);<\/span>\r\n        <span class=\"n\">klass<\/span><span class=\"nf\">.def<\/span><span class=\"p\">(<\/span><span class=\"s\">\"extract\"<\/span><span class=\"p\">,<\/span> <span class=\"n\">extract<\/span><span class=\"p\">);<\/span>\r\n    <span class=\"p\">});<\/span>\r\n<span class=\"p\">}<\/span>\r\n<\/code><\/pre>\n<p>Below I add a few explanatory notes.<\/p>\n<h2>RustPhonemeExtractor<\/h2>\n<p>Using Rutie, I create a Ruby class called PhonemeExtractor.<br \/>\nFirst I define a struct called RustPhonemeExtractor, and then wrap it to make PhonemeExtractor.<\/p>\n<p>This is the definition of RustPhonemeExtractor.<\/p>\n<pre class=\"post-pre\"><code><span class=\"nd\">#[derive(Deserialize)]<\/span>\r\n<span class=\"k\">pub<\/span> <span class=\"k\">struct<\/span> <span class=\"n\">RustPhonemeExtractor<\/span> <span class=\"p\">{<\/span>\r\n    <span class=\"n\">mode<\/span><span class=\"p\">:<\/span> <span class=\"nb\">String<\/span><span class=\"p\">,<\/span>\r\n    <span class=\"n\">allowed_poss<\/span><span class=\"p\">:<\/span> <span class=\"nb\">Vec<\/span><span class=\"o\">&lt;<\/span><span class=\"nb\">String<\/span><span class=\"o\">&gt;<\/span><span class=\"p\">,<\/span>\r\n<span class=\"p\">}<\/span>\r\n<\/code><\/pre>\n<p>Oh, I had not mentioned it yet, but Lindera has two &#8220;modes&#8221;, normal and decompose. Roughly speaking, decompose is the mode that breaks up compound words. In other words, compared with normal, decompose 
yields a finer-grained segmentation.<br \/>\nThe mode field lets you choose between them.<br \/>\nallowed_poss, on the other hand, holds the list of parts of speech to pick up, as a vector.<br \/>\nposs is a rather sloppy name: the English for the grammatical term is &#8220;part of speech&#8221;, abbreviated pos, which I then pluralized (?) as poss (poses would be confusing, since it is also the third-person singular present of pose).<\/p>\n<h2>PhonemeExtractor<\/h2>\n<p>Next, I create the Ruby class PhonemeExtractor.<\/p>\n<p>To make PhonemeExtractor by wrapping RustPhonemeExtractor, the code has the line<\/p>\n<pre class=\"post-pre\"><code><span class=\"nd\">wrappable_struct!<\/span><span class=\"p\">(<\/span><span class=\"n\">RustPhonemeExtractor<\/span><span class=\"p\">,<\/span> <span class=\"n\">PhonemeExtractorWrapper<\/span><span class=\"p\">,<\/span> <span class=\"n\">PHONEME_EXTRACTOR_WRAPPER<\/span><span class=\"p\">);<\/span>\r\n<\/code><\/pre>\n<p>For an explanation, see the previous article:<br \/>\nRuby\/Rust Integration (5): Numerical Computation with Rutie, Part 2 (B\u00e9zier) &#8211; Qiita<\/p>\n<p>And to create the class, write<\/p>\n<pre class=\"post-pre\"><code><span class=\"nd\">class!<\/span><span class=\"p\">(<\/span><span class=\"n\">PhonemeExtractor<\/span><span 
class=\"p\">);<\/span>\r\n<\/code><\/pre>\n<p>That single line is all it takes.<\/p>\n<h2>Methods of PhonemeExtractor<\/h2>\n<p>Next, I write the methods of PhonemeExtractor with the methods! macro.<br \/>\nI defined the following two methods.<\/p>\n<p>phoneme_extractor_new (creates an instance)<\/p>\n<p>extract (extracts morphemes)<\/p>\n<h3>The phoneme_extractor_new method<\/h3>\n<p>Here is its definition.<\/p>\n<pre class=\"post-pre\"><code><span class=\"k\">fn<\/span> <span class=\"nf\">phoneme_extractor_new<\/span><span class=\"p\">(<\/span><span class=\"n\">params<\/span><span class=\"p\">:<\/span> <span class=\"n\">RString<\/span><span class=\"p\">)<\/span> <span class=\"k\">-&gt;<\/span> <span class=\"n\">PhonemeExtractor<\/span> <span class=\"p\">{<\/span>\r\n    <span class=\"k\">let<\/span> <span class=\"n\">params<\/span> <span class=\"o\">=<\/span> <span class=\"n\">params<\/span><span class=\"nf\">.unwrap<\/span><span class=\"p\">()<\/span><span class=\"nf\">.to_string<\/span><span class=\"p\">();<\/span>\r\n    <span class=\"k\">let<\/span> <span class=\"n\">rpe<\/span><span class=\"p\">:<\/span> <span class=\"n\">RustPhonemeExtractor<\/span> <span class=\"o\">=<\/span> <span class=\"nn\">serde_json<\/span><span class=\"p\">::<\/span><span class=\"nf\">from_str<\/span><span class=\"p\">(<\/span><span class=\"o\">&amp;<\/span><span class=\"n\">params<\/span><span class=\"p\">)<\/span><span class=\"nf\">.unwrap<\/span><span class=\"p\">();<\/span>\r\n\r\n    <span class=\"nn\">Class<\/span><span class=\"p\">::<\/span><span class=\"nf\">from_existing<\/span><span class=\"p\">(<\/span><span class=\"s\">\"PhonemeExtractor\"<\/span><span class=\"p\">)<\/span><span class=\"nf\">.wrap_data<\/span><span 
class=\"p\">(<\/span><span class=\"n\">rpe<\/span><span class=\"p\">,<\/span> <span class=\"o\">&amp;*<\/span><span class=\"n\">PHONEME_EXTRACTOR_WRAPPER<\/span><span class=\"p\">)<\/span>\r\n<span class=\"p\">}<\/span>\r\n<\/code><\/pre>\n<p>RString is the Rust type (defined by Rutie) that corresponds to Ruby's String class.<br \/>\nparams is a string that expresses, in JSON, the initialization settings: the Lindera mode and the list of parts of speech to pick up.<\/p>\n<p>Now, this is the interesting part: building a RustPhonemeExtractor value from the JSON string in params takes nothing more than<\/p>\n<pre class=\"post-pre\"><code><span class=\"nn\">serde_json<\/span><span class=\"p\">::<\/span><span class=\"nf\">from_str<\/span><span class=\"p\">(<\/span><span class=\"o\">&amp;<\/span><span class=\"n\">params<\/span><span class=\"p\">)<\/span><span class=\"nf\">.unwrap<\/span><span class=\"p\">()<\/span>\r\n<\/code><\/pre>\n<p>That alone does the job.<\/p>\n<p>This is what makes the Serde crate so impressive (or so I gather).<br \/>\nIt interprets the JSON according to the struct definition. If the JSON string does not match the struct definition, unwrap() 
panics and the program goes down. For a library meant for real use, errors should be handled properly.<\/p>\n<p>Incidentally, the code expects to be given a JSON string like this.<\/p>\n<pre class=\"post-pre\"><code><span class=\"p\">{<\/span>\r\n  <span class=\"nl\">\"mode\"<\/span><span class=\"p\">:<\/span> <span class=\"s2\">\"normal\"<\/span><span class=\"p\">,<\/span>\r\n  <span class=\"nl\">\"allowed_poss\"<\/span><span class=\"p\">:<\/span> <span class=\"p\">[<\/span>\r\n    <span class=\"s2\">\"\u540d\u8a5e,\u4e00\u822c\"<\/span><span class=\"p\">,<\/span>\r\n    <span class=\"s2\">\"\u540d\u8a5e,\u56fa\u6709\u540d\u8a5e\"<\/span><span class=\"p\">,<\/span>\r\n    <span class=\"s2\">\"\u540d\u8a5e,\u526f\u8a5e\u53ef\u80fd\"<\/span><span class=\"p\">,<\/span>\r\n    <span class=\"s2\">\"\u540d\u8a5e,\u30b5\u5909\u63a5\u7d9a\"<\/span><span class=\"p\">,<\/span>\r\n    <span class=\"s2\">\"\u540d\u8a5e,\u5f62\u5bb9\u52d5\u8a5e\u8a9e\u5e79\"<\/span><span class=\"p\">,<\/span>\r\n    <span class=\"s2\">\"\u540d\u8a5e,\u30ca\u30a4\u5f62\u5bb9\u8a5e\u8a9e\u5e79\"<\/span>\r\n  <span class=\"p\">]<\/span>\r\n<span class=\"p\">}<\/span>\r\n<\/code><\/pre>\n<p>Parts of speech are discussed later in a section of their own.<\/p>\n<h3>The extract method<\/h3>\n<p>This one becomes an instance method of the PhonemeExtractor class.<\/p>\n<p>Pulled out on its own, the definition looks like this.<\/p>\n<pre class=\"post-pre\"><code><span class=\"k\">fn<\/span> <span 
class=\"nf\">extract<\/span><span class=\"p\">(<\/span><span class=\"n\">input<\/span><span class=\"p\">:<\/span> <span class=\"n\">RString<\/span><span class=\"p\">)<\/span> <span class=\"k\">-&gt;<\/span> <span class=\"n\">Array<\/span> <span class=\"p\">{<\/span>\r\n    <span class=\"k\">let<\/span> <span class=\"n\">extractor<\/span> <span class=\"o\">=<\/span> <span class=\"n\">rtself<\/span><span class=\"nf\">.get_data<\/span><span class=\"p\">(<\/span><span class=\"o\">&amp;*<\/span><span class=\"n\">PHONEME_EXTRACTOR_WRAPPER<\/span><span class=\"p\">);<\/span>\r\n    <span class=\"k\">let<\/span> <span class=\"n\">input<\/span> <span class=\"o\">=<\/span> <span class=\"n\">input<\/span><span class=\"nf\">.unwrap<\/span><span class=\"p\">();<\/span>\r\n    <span class=\"k\">let<\/span> <span class=\"k\">mut<\/span> <span class=\"n\">tokenizer<\/span> <span class=\"o\">=<\/span> <span class=\"nn\">Tokenizer<\/span><span class=\"p\">::<\/span><span class=\"nf\">new<\/span><span class=\"p\">(<\/span><span class=\"o\">&amp;<\/span><span class=\"n\">extractor<\/span><span class=\"py\">.mode<\/span><span class=\"p\">,<\/span> <span class=\"s\">\"\"<\/span><span class=\"p\">);<\/span>\r\n    <span class=\"k\">let<\/span> <span class=\"n\">tokens<\/span> <span class=\"o\">=<\/span> <span class=\"n\">tokenizer<\/span><span class=\"nf\">.tokenize<\/span><span class=\"p\">(<\/span><span class=\"n\">input<\/span><span class=\"nf\">.to_str<\/span><span class=\"p\">());<\/span>\r\n\r\n    <span class=\"k\">let<\/span> <span class=\"k\">mut<\/span> <span class=\"n\">result<\/span> <span class=\"o\">=<\/span> <span class=\"nn\">Array<\/span><span class=\"p\">::<\/span><span class=\"nf\">new<\/span><span class=\"p\">();<\/span>\r\n    <span class=\"k\">for<\/span> <span class=\"n\">token<\/span> <span class=\"n\">in<\/span> <span class=\"n\">tokens<\/span> <span class=\"p\">{<\/span>\r\n        <span class=\"k\">let<\/span> <span class=\"n\">detail<\/span> <span 
class=\"o\">=<\/span> <span class=\"n\">token<\/span><span class=\"py\">.detail<\/span><span class=\"p\">;<\/span>\r\n        <span class=\"k\">let<\/span> <span class=\"n\">pos<\/span><span class=\"p\">:<\/span> <span class=\"nb\">String<\/span> <span class=\"o\">=<\/span> <span class=\"n\">detail<\/span><span class=\"nf\">.join<\/span><span class=\"p\">(<\/span><span class=\"s\">\",\"<\/span><span class=\"p\">);<\/span>\r\n        <span class=\"k\">if<\/span> <span class=\"n\">extractor<\/span><span class=\"py\">.allowed_poss<\/span><span class=\"nf\">.iter<\/span><span class=\"p\">()<\/span><span class=\"nf\">.any<\/span><span class=\"p\">(|<\/span><span class=\"n\">s<\/span><span class=\"p\">|<\/span> <span class=\"n\">pos<\/span><span class=\"nf\">.starts_with<\/span><span class=\"p\">(<\/span><span class=\"n\">s<\/span><span class=\"p\">))<\/span> <span class=\"p\">{<\/span>\r\n            <span class=\"n\">result<\/span><span class=\"nf\">.push<\/span><span class=\"p\">(<\/span><span class=\"nn\">RString<\/span><span class=\"p\">::<\/span><span class=\"nf\">new_utf8<\/span><span class=\"p\">(<\/span><span class=\"o\">&amp;<\/span><span class=\"n\">token<\/span><span class=\"py\">.text<\/span><span class=\"p\">));<\/span>\r\n        <span class=\"p\">}<\/span>\r\n    <span class=\"p\">}<\/span>\r\n\r\n    <span class=\"n\">result<\/span>\r\n<span class=\"p\">}<\/span>\r\n<\/code><\/pre>\n<p>\u5165\u529b\u30c6\u30ad\u30b9\u30c8\u3092 RString\uff08Ruby \u306e String \u306b\u5bfe\u5fdc\u3059\u308b\u3082\u306e\uff09\u3067\u4e0e\u3048\u308b\u3068\uff0c\u5f62\u614b\u7d20\u306e\u30ea\u30b9\u30c8\u304c Array of String \u306e\u5f62\u3067\u8fd4\u308b\u3002<\/p>\n<p>rtself \u306f methods! 
\u30de\u30af\u30ed\u306e\u7b2c\u4e8c\u5f15\u6570\u306b\u4e0e\u3048\u305f\u3082\u306e\u3067\uff0cRuby \u306e\u30af\u30e9\u30b9 PhonemeExtractor \u306e\u30a4\u30f3\u30b9\u30bf\u30f3\u30b9\u306b\u5bfe\u5fdc\u3059\u308b\uff08\uff1f\uff09\u3088\u3046\u3060\u3002<br \/>\n\u5909\u6570 extractor \u306f RustPhonemeExtractor \u306e\u30a4\u30f3\u30b9\u30bf\u30f3\u30b9\u3002<\/p>\n<p>\u30e6\u30fc\u30b6\u30fc\u8f9e\u66f8\u306e\u8ffd\u52a0\u3092\u3057\u306a\u3044\u3068\u304d\u306f\uff0c\u30c8\u30fc\u30af\u30ca\u30a4\u30b6\u30fc\u3092 Tokenizer::new \u3067\u751f\u6210\u3059\u308b\u3002\u7b2c\u4e00\u5f15\u6570\u306f\u5148\u8ff0\u306e\u30e2\u30fc\u30c9\u306e\u6587\u5b57\u5217\u3067\uff0c\u7b2c\u4e8c\u5f15\u6570\u306f\u4f7f\u3046\u8f9e\u66f8\u306e\u30c7\u30a3\u30ec\u30af\u30c8\u30ea\u30fc\u30d1\u30b9\u3092\u4e0e\u3048\u308b\u3002\u7b2c\u4e8c\u5f15\u6570\u306b\u7a7a\u6587\u5b57\u5217\u3092\u4e0e\u3048\u308b\u3068\uff0c\u30c7\u30d5\u30a9\u30eb\u30c8\u3067\u3042\u308b IPADIC \u304c\u4f7f\u308f\u308c\u308b\u3002<\/p>\n<p>\u30e6\u30fc\u30b6\u30fc\u8f9e\u66f8\u3092\u4f7f\u3046\u3068\u304d\u306f Tokenizer::new_with_userdic \u3092\u4f7f\u3044\uff0c\u7b2c\u4e09\u5f15\u6570\u306b\u30e6\u30fc\u30b6\u30fc\u8f9e\u66f8\uff08CSV \u5f62\u5f0f\uff09\u306e\u30d1\u30b9\u3092\u4e0e\u3048\u308b\u3002<\/p>\n<p>\u30c8\u30fc\u30af\u30ca\u30a4\u30b6\u30fc\u306e tokenize \u30e1\u30bd\u30c3\u30c9\u306b\u30c6\u30ad\u30b9\u30c8\u3092\u4e0e\u3048\u308b\u3068\u30c8\u30fc\u30af\u30f3\u5217\u304c\u30d9\u30af\u30bf\u30fc\u3067\u8fd4\u308b\u3002\u4e00\u3064\u306e\u5f62\u614b\u7d20\u304c\u4e00\u3064\u306e\u30c8\u30fc\u30af\u30f3\u306b\u5bfe\u5fdc\u3059\u308b\u3002<\/p>\n<p>\u30c8\u30fc\u30af\u30f3\u306f<\/p>\n<pre class=\"post-pre\"><code><span class=\"nd\">#[derive(Serialize,<\/span> <span class=\"nd\">Clone)]<\/span>\r\n<span class=\"k\">pub<\/span> <span class=\"k\">struct<\/span> <span class=\"n\">Token<\/span><span class=\"o\">&lt;<\/span><span class=\"nv\">'a<\/span><span class=\"o\">&gt;<\/span> <span 
class=\"p\">{<\/span>\r\n    <span class=\"k\">pub<\/span> <span class=\"n\">text<\/span><span class=\"p\">:<\/span> <span class=\"o\">&amp;<\/span><span class=\"nv\">'a<\/span> <span class=\"nb\">str<\/span><span class=\"p\">,<\/span>\r\n    <span class=\"k\">pub<\/span> <span class=\"n\">detail<\/span><span class=\"p\">:<\/span> <span class=\"nb\">Vec<\/span><span class=\"o\">&lt;<\/span><span class=\"nb\">String<\/span><span class=\"o\">&gt;<\/span><span class=\"p\">,<\/span>\r\n<span class=\"p\">}<\/span>\r\n<\/code><\/pre>\n<p>\u3068\u3044\u3046\u5b9a\u7fa9\u306b\u306a\u3063\u3066\u3044\u308b\u3002<\/p>\n<p>text \u306f\u5206\u89e3\u3055\u308c\u305f\u5f62\u614b\u7d20\u305d\u306e\u3082\u306e\u3002\u300c\u30b3\u30fc\u30c9\u3092\u66f8\u3053\u3046\u300d\u306e\u5834\u5408\uff0c\u300c\u30b3\u30fc\u30c9\u300d\u300c\u3092\u300d\u300c\u66f8\u3053\u300d\u300c\u3046\u300d\u306e\u56db\u3064\u304c\u8a72\u5f53\u3059\u308b\u3002<br \/>\ndetail \u306f\u53d6\u308a\u51fa\u3057\u305f\u4e00\u3064\u306e\u5f62\u614b\u7d20\u306b\u3064\u3044\u3066\u306e\u60c5\u5831\u3092\u307e\u3068\u3081\u3066\u683c\u7d0d\u3059\u308b String \u306e\u30d9\u30af\u30bf\u30fc\u3002\u3069\u3093\u306a\u60c5\u5831\u304c\u3069\u3093\u306a\u9806\u306b\u5165\u3063\u3066\u3044\u308b\u304b\u306f\u4f7f\u3046\u8f9e\u66f8\u306b\u3088\u3063\u3066\u7570\u306a\u308b\u3002<br \/>\n\u30c7\u30d5\u30a9\u30eb\u30c8\u306e IPADIC \u306e\u5834\u5408\uff0c\u30a4\u30f3\u30c7\u30c3\u30af\u30b9 0\u301c3 
\u304c\u54c1\u8a5e\u60c5\u5831\u3067\uff0c\u305d\u306e\u307b\u304b\u306b\u6d3b\u7528\u578b\u30fb\u6d3b\u7528\u5f62\u3060\u306e\u539f\u578b\u3060\u306e\u8aad\u307f\u3060\u306e\u3068\u3044\u3063\u305f\u60c5\u5831\u304c\u5165\u3063\u3066\u3044\u308b\u3002<\/p>\n<p>\u3053\u306e\u95a2\u6570\u306e\u809d\u306f\uff0c\u53d6\u308a\u51fa\u3057\u305f\u5f62\u614b\u7d20\u304c\u6307\u5b9a\u3057\u305f\u54c1\u8a5e\u306e\u3069\u308c\u304b\u306b\u5f53\u3066\u306f\u307e\u3063\u3066\u3044\u308b\u304b\u3069\u3046\u304b\u3092\u78ba\u8a8d\u3059\u308b\u3068\u3053\u308d\u3060\u304c\uff0c\u54c1\u8a5e\u4f53\u7cfb\u306e\u8aac\u660e\u3092\u5148\u306b\u3059\u308b\u5fc5\u8981\u304c\u3042\u308b\u306e\u3067\uff0c\u3044\u3063\u305f\u3093\u68da\u4e0a\u3052\u3059\u308b\u3002<br \/>\n\u3068\u3082\u304b\u304f\uff0cRuby \u306e\u914d\u5217 result \u306b\uff0c\u8a72\u5f53\u3059\u308b\u5f62\u614b\u7d20\u306e text \u3092\u653e\u308a\u8fbc\u3093\u3067\u884c\u304d\uff0c\u6700\u5f8c\u306b\u305d\u306e result \u3092\u8fd4\u3059\u3002<\/p>\n<h2>Ruby \u306e\u30af\u30e9\u30b9\u3068\u30e1\u30bd\u30c3\u30c9\u306e\u5272\u308a\u5f53\u3066<\/h2>\n<p>\u6b8b\u308b\u90e8\u5206\u306f<\/p>\n<pre class=\"post-pre\"><code><span class=\"nd\">#[allow(non_snake_case)]<\/span>\r\n<span class=\"nd\">#[no_mangle]<\/span>\r\n<span class=\"k\">pub<\/span> <span class=\"k\">extern<\/span> <span class=\"s\">\"C\"<\/span> <span class=\"k\">fn<\/span> <span class=\"nf\">Init_phoneme_extractor<\/span><span class=\"p\">()<\/span> <span class=\"p\">{<\/span>\r\n    <span class=\"nn\">Class<\/span><span class=\"p\">::<\/span><span class=\"nf\">new<\/span><span class=\"p\">(<\/span><span class=\"s\">\"PhonemeExtractor\"<\/span><span class=\"p\">,<\/span> <span class=\"nb\">None<\/span><span class=\"p\">)<\/span><span class=\"nf\">.define<\/span><span class=\"p\">(|<\/span><span class=\"n\">klass<\/span><span class=\"p\">|<\/span> <span class=\"p\">{<\/span>\r\n        <span class=\"n\">klass<\/span><span class=\"nf\">.def_self<\/span><span 
class=\"p\">(<\/span><span class=\"s\">\"new\"<\/span><span class=\"p\">,<\/span> <span class=\"n\">phoneme_extractor_new<\/span><span class=\"p\">);<\/span>\r\n        <span class=\"n\">klass<\/span><span class=\"nf\">.def<\/span><span class=\"p\">(<\/span><span class=\"s\">\"extract\"<\/span><span class=\"p\">,<\/span> <span class=\"n\">extract<\/span><span class=\"p\">);<\/span>\r\n    <span class=\"p\">});<\/span>\r\n<span class=\"p\">}<\/span>\r\n<\/code><\/pre>\n<p>\u306e\u307f\u3002<br \/>\nRuby \u306e PhonemeExtractor \u30af\u30e9\u30b9\u3068\uff0c\u305d\u306e\u7279\u7570\u30e1\u30bd\u30c3\u30c9 new \u304a\u3088\u3073\u30a4\u30f3\u30b9\u30bf\u30f3\u30b9\u30e1\u30bd\u30c3\u30c9 extract \u3092\uff0cmethods! \u30de\u30af\u30ed\u3067\u5b9a\u7fa9\u3057\u305f\u30e1\u30bd\u30c3\u30c9\u306b\u5272\u308a\u5f53\u3066\u3066\u3044\u308b\u3002<br \/>\n\u524d\u56de\u306e\u8a18\u4e8b\u3092\u53c2\u7167\u3002<\/p>\n<h1>\u54c1\u8a5e\u4f53\u7cfb<\/h1>\n<p>\u54c1\u8a5e\u306f\uff0cIPADIC \u306e\u5834\u5408\uff0c\u56db\u968e\u5c64\u304b\u3089\u306a\u308b\u300cIPA \u54c1\u8a5e\u4f53\u7cfb\u300d\u3068\u3044\u3046\u3082\u306e\u306b\u5f93\u3063\u3066\u3044\u308b\u3089\u3057\u3044\u3002<br \/>\n\u3053\u306e\u4f53\u7cfb\u306e\u4e00\u6b21\u60c5\u5831\u304c\u3069\u3053\u306b\u3042\u308b\u306e\u304b\u3055\u3063\u3071\u308a\u5206\u304b\u3089\u306a\u304b\u3063\u305f\u304c\uff0c\u4ee5\u4e0b\u306e\u30da\u30fc\u30b8\u306b\u3068\u308a\u3042\u3048\u305a\u66f8\u304b\u308c\u3066\u3044\u308b\u3002<br \/>\n\u5f62\u614b\u7d20\u89e3\u6790\u30c4\u30fc\u30eb\u306e\u54c1\u8a5e\u4f53\u7cfb<\/p>\n<p>\u3053\u308c\u306b\u3088\u308b\u3068\u4f8b\u3048\u3070\uff0c\u4ee5\u4e0b\u306e\u3088\u3046\u306b\u306a\u308b\u3088\u3046\u3060\u3002<\/p>\n<p>&#8220;\u82b1\u5b50&#8221; \u2192 [&#8220;\u540d\u8a5e&#8221;, &#8220;\u56fa\u6709\u540d\u8a5e&#8221;, &#8220;\u4eba\u540d&#8221;, &#8220;\u540d&#8221;]<\/p>\n<p>&#8220;\u7389\u306d\u304e&#8221; \u2192 [&#8220;\u540d\u8a5e&#8221;, &#8220;\u4e00\u822c&#8221;, 
&#8220;&#8221;, &#8220;&#8221;]<\/p>\n<p>\uff08\u30c8\u30fc\u30af\u30f3\u306e detail \u306e\u5148\u982d 4 \u8981\u7d20\u3092\u629c\u304d\u51fa\u3057\u305f\u30a4\u30e1\u30fc\u30b8\uff09<\/p>\n<p>\u6ce8\u610f\u3059\u3079\u304d\u306f\uff0cdetail \u306e\u9577\u3055\uff08\u8981\u7d20\u6570\uff09\u306f IPADIC \u3067\u306f\u57fa\u672c\u7684\u306b 9 \u306a\u306e\u3060\u304c\uff0c\u300c\u672a\u77e5\u8a9e\u300d\u3068\u5224\u5b9a\u3055\u308c\u308b\u5f62\u614b\u7d20\u306b\u9650\u3063\u3066\u306f detail \u304c [&#8220;UNK&#8221;] \u3068\u3044\u3046\u9577\u3055 1 \u306e\u30d9\u30af\u30bf\u30fc\u306b\u306a\u308b\u3068\u3044\u3046\u3053\u3068\u3002<\/p>\n<p>\uff082021-02-04 \u8ffd\u8a18\uff09<br \/>\n\u5f62\u614b\u7d20\u89e3\u6790\u7528\u8f9e\u66f8\u306e\u54c1\u8a5e\u306e\u4f53\u7cfb\u3092\u7406\u89e3\u3059\u308b\u306e\u306f\uff0c\u79c1\u306e\u3088\u3046\u306a\u7d20\u4eba\u306b\u306f\u306a\u304b\u306a\u304b\u96e3\u3057\u3044\u3002\u4e00\u89a7\u3092\u898b\u305f\u3060\u3051\u3067\u306f\u7121\u7406\u3002\u3084\u306f\u308a\u89e3\u8aac\u304c\u6b32\u3057\u3044\u3002\u4ee5\u4e0b\u306e\u8a18\u4e8b\u304c\u5f79\u306b\u7acb\u3064\u3068\u601d\u3046\u3002<br \/>\n[\u5f62\u614b\u7d20\u89e3\u6790] \u300c\u4eee\u5b9a\u7e2e\u7d04\uff11\u300d\u3068\u306f\uff1fMeCab\u30fbIPADIC\u306e\u54c1\u8a5e\u5206\u985e\u3092\u7406\u89e3\u3057\u3088\u3046 &#8211; Qiita<br \/>\n\u30bf\u30a4\u30c8\u30eb\u306b\u300c\u300c\u4eee\u5b9a\u7e2e\u7d04\uff11\u300d\u3068\u306f\uff1f\u300d\u3068\u3042\u308b\u304c\uff0c\u4eee\u5b9a\u7e2e\u7d04\u3092\u30c6\u30fc\u30de\u306b\u3057\u305f\u8a18\u4e8b\u3067\u306f\u306a\u304f\uff0cIPA \u8f9e\u66f8\u306e\u54c1\u8a5e\u4f53\u7cfb\u5168\u4f53\u306b\u3064\u3044\u3066\u66f8\u304b\u308c\u3066\u3044\u308b\u3002<\/p>\n<h2>\u54c1\u8a5e\u306e\u6307\u5b9a\u3068\u5224\u5b9a<\/h2>\n<p>\u3055\u3066\uff0c\u7528\u9014\u306b\u3088\u3063\u3066\uff0c\u54c1\u8a5e\u60c5\u5831\u306e\u7b2c 0 \u8981\u7d20\u304c \u540d\u8a5e 
\u306e\u3082\u306e\u3092\u3059\u3079\u3066\u62fe\u3044\u305f\u3044\u3053\u3068\u3082\u3042\u308c\u3070\uff0c\u7b2c 0\uff0c\u7b2c 1 \u8981\u7d20\u304c\u305d\u308c\u305e\u308c \u540d\u8a5e\uff0c\u56fa\u6709\u540d\u8a5e \u306e\u3082\u306e\uff08\u7b2c 2\uff0c\u7b2c 3 \u8981\u7d20\u306f\u554f\u308f\u306a\u3044\uff09\u3068\u3044\u3063\u305f\u5834\u5408\u3082\u3042\u308d\u3046\u3002<br \/>\n\u3064\u307e\u308a\uff0c\u3069\u3053\u307e\u3067\u7d30\u304b\u304f\u6307\u5b9a\u3057\u305f\u3044\u304b\u306f\u5834\u5408\u306b\u3088\u308a\u3051\u308a\u3002<\/p>\n<p>\u3053\u308c\u3092\u3069\u306e\u3088\u3046\u306b\u6307\u5b9a\u3055\u305b\u3066\uff0c\u3069\u306e\u3088\u3046\u306b\u5224\u5b9a\u3059\u308c\u3070\u3044\u3044\u304b\u3002<br \/>\n\u306a\u308b\u3079\u304f\u5358\u7d14\u306b\u3084\u308a\u305f\u3044\u306e\u3067\uff0c\u4ee5\u4e0b\u306e\u3088\u3046\u306b\u3059\u308b\u3053\u3068\u306b\u3057\u305f\u3002<\/p>\n<p>\u6307\u5b9a\u306f &#8220;\u540d\u8a5e&#8221; \u3068\u304b &#8220;\u540d\u8a5e,\u56fa\u6709\u540d\u8a5e&#8221; \u3068\u3044\u3063\u305f\u3088\u3046\u306b\uff0c\u5fc5\u8981\u306a\u6df1\u3055\u307e\u3067\u306e\u54c1\u8a5e\u60c5\u5831\u3092\u30ab\u30f3\u30de\u3067\u533a\u5207\u3063\u305f\u6587\u5b57\u5217\u3068\u3059\u308b\u3002<\/p>\n<p>\u307e\u305f\uff0c\u898b\u51fa\u3055\u308c\u305f\u5f62\u614b\u7d20\u306b\u3064\u3044\u3066\u306f\uff0cdetail \u3092\u30ab\u30f3\u30de\u3067\u533a\u5207\u3063\u305f\u6587\u5b57\u5217\uff08\u3064\u307e\u308a join(&#8220;,&#8221;) \u3057\u305f\u3082\u306e\uff09\u3068\u3059\u308b\u3002<\/p>\n<p>\u305d\u3057\u3066\uff0c\u5f8c\u8005\u306e\u5148\u982d\u306b\u524d\u8005\u304c\u5b58\u5728\u3059\u308b\u304b\u3092\uff0cString \u306e starts_with 
\u30e1\u30bd\u30c3\u30c9\u3067\u5224\u5b9a\u3059\u308b\u3002<\/p>\n<p>\u305f\u3060\u3057\uff0c\u54c1\u8a5e\u306e\u6307\u5b9a\u306f\u8907\u6570\u4e0e\u3048\u3089\u308c\u308b\u3088\u3046\u306b\u3057\uff0c\u305d\u306e\u3046\u3061\u306e\u3069\u308c\u304b\u306b\u5f53\u3066\u306f\u307e\u3063\u3066\u3044\u308c\u3070\u3088\u3044\u3053\u3068\u306b\u3059\u308b\u3002<br \/>\n\u305d\u308c\u304c\u3053\u306e\u90e8\u5206\uff1a<\/p>\n<pre class=\"post-pre\"><code><span class=\"k\">for<\/span> <span class=\"n\">token<\/span> <span class=\"n\">in<\/span> <span class=\"n\">tokens<\/span> <span class=\"p\">{<\/span>\r\n    <span class=\"k\">let<\/span> <span class=\"n\">detail<\/span> <span class=\"o\">=<\/span> <span class=\"n\">token<\/span><span class=\"py\">.detail<\/span><span class=\"p\">;<\/span>\r\n    <span class=\"k\">let<\/span> <span class=\"n\">pos<\/span><span class=\"p\">:<\/span> <span class=\"nb\">String<\/span> <span class=\"o\">=<\/span> <span class=\"n\">detail<\/span><span class=\"nf\">.join<\/span><span class=\"p\">(<\/span><span class=\"s\">\",\"<\/span><span class=\"p\">);<\/span>\r\n    <span class=\"k\">if<\/span> <span class=\"n\">extractor<\/span><span class=\"py\">.allowed_poss<\/span><span class=\"nf\">.iter<\/span><span class=\"p\">()<\/span><span class=\"nf\">.any<\/span><span class=\"p\">(|<\/span><span class=\"n\">s<\/span><span class=\"p\">|<\/span> <span class=\"n\">pos<\/span><span class=\"nf\">.starts_with<\/span><span class=\"p\">(<\/span><span class=\"n\">s<\/span><span class=\"p\">))<\/span> <span class=\"p\">{<\/span>\r\n        <span class=\"n\">result<\/span><span class=\"nf\">.push<\/span><span class=\"p\">(<\/span><span class=\"nn\">RString<\/span><span class=\"p\">::<\/span><span class=\"nf\">new_utf8<\/span><span class=\"p\">(<\/span><span class=\"o\">&amp;<\/span><span class=\"n\">token<\/span><span class=\"py\">.text<\/span><span class=\"p\">));<\/span>\r\n    <span class=\"p\">}<\/span>\r\n<span 
class=\"p\">}<\/span>\r\n<\/code><\/pre>\n<p>any \u306a\u3093\u3066\uff0cRuby \u306e Enumerable#any? \u305d\u3063\u304f\u308a\u3002<\/p>\n<p>\u306a\u304a\uff0cRString::new_utf8 \u306f Rust \u306e\u6587\u5b57\u5217\u304b\u3089 Ruby \u306e String \u3092\u4f5c\u308b\u3082\u306e\u3002<\/p>\n<h2>\u30b3\u30f3\u30d1\u30a4\u30eb<\/h2>\n<p>\u4f8b\u306b\u3088\u3063\u3066<\/p>\n<pre class=\"post-pre\"><code>cargo build <span class=\"nt\">--release<\/span>\r\n<\/code><\/pre>\n<p>\u3068\u3059\u308b\u3002<br \/>\n\u6210\u679c\u7269\u304c target\/release\/libphoneme_extractor.dylib \u3068\u3044\u3046\u30d1\u30b9\u306b\u51fa\u6765\u308b\uff08\u62e1\u5f35\u5b50\u306f\u30bf\u30fc\u30b2\u30c3\u30c8\u306b\u3088\u308b\uff09\u3002<\/p>\n<h1>\u5b9f\u88c5\uff1aRuby \u5074<\/h1>\n<p>Ruby \u30b9\u30af\u30ea\u30d7\u30c8\u306f\u3053\u308c\u3060\u3051\u3002<br \/>\n\u4f8b\u306b\u3088\u3063\u3066\uff0c\u3053\u306e\u30b9\u30af\u30ea\u30d7\u30c8\u304c Rust \u306e\u30d7\u30ed\u30b8\u30a7\u30af\u30c8\u306e\u30eb\u30fc\u30c8\u30c7\u30a3\u30ec\u30af\u30c8\u30ea\u30fc\u306b\u5b58\u5728\u3059\u308b\u3068\u3057\u3066\uff0cRust \u306e\u30e9\u30a4\u30d6\u30e9\u30ea\u30fc\u306e\u30d1\u30b9\u3092\u8a18\u8ff0\u3057\u3066\u3044\u308b\u3002<\/p>\n<pre class=\"post-pre\"><code><span class=\"c1\"># encoding: utf-8<\/span>\r\n\r\n<span class=\"nb\">require<\/span> <span class=\"s2\">\"rutie\"<\/span>\r\n\r\n<span class=\"no\">Rutie<\/span><span class=\"p\">.<\/span><span class=\"nf\">new<\/span><span class=\"p\">(<\/span><span class=\"ss\">:phoneme_extractor<\/span><span class=\"p\">,<\/span> <span class=\"ss\">lib_path: <\/span><span class=\"s2\">\"target\/release\"<\/span><span class=\"p\">).<\/span><span class=\"nf\">init<\/span> <span class=\"s2\">\"Init_phoneme_extractor\"<\/span><span class=\"p\">,<\/span> <span class=\"n\">__dir__<\/span>\r\n\r\n<span class=\"n\">pe<\/span> <span class=\"o\">=<\/span> <span class=\"no\">PhonemeExtractor<\/span><span class=\"p\">.<\/span><span class=\"nf\">new<\/span> <span 
class=\"o\">&lt;&lt;<\/span><span class=\"no\">JSON<\/span><span class=\"sh\">\r\n  {\r\n    \"mode\": \"normal\",\r\n    \"allowed_poss\": [\r\n      \"\u540d\u8a5e,\u4e00\u822c\",\r\n      \"\u540d\u8a5e,\u56fa\u6709\u540d\u8a5e\",\r\n      \"\u540d\u8a5e,\u526f\u8a5e\u53ef\u80fd\",\r\n      \"\u540d\u8a5e,\u30b5\u5909\u63a5\u7d9a\",\r\n      \"\u540d\u8a5e,\u5f62\u5bb9\u52d5\u8a5e\u8a9e\u5e79\",\r\n      \"\u540d\u8a5e,\u30ca\u30a4\u5f62\u5bb9\u8a5e\u8a9e\u5e79\"\r\n    ]\r\n  }\r\n<\/span><span class=\"no\">JSON<\/span>\r\n\r\n<span class=\"n\">text<\/span> <span class=\"o\">=<\/span> <span class=\"o\">&lt;&lt;<\/span><span class=\"no\">EOT<\/span><span class=\"sh\">\r\n\u300c\u9053\u7a0b\u300d\u3000\u9ad8\u6751\u5149\u592a\u90ce\r\n\u50d5\u306e\u524d\u306b\u9053\u306f\u306a\u3044\r\n\u50d5\u306e\u5f8c\u308d\u306b\u9053\u306f\u3067\u304d\u308b\r\n\u3042\u3042\u3001\u81ea\u7136\u3088\r\n\u7236\u3088\r\n\u50d5\u3092\u4e00\u4eba\u7acb\u3061\u306b\u3055\u305b\u305f\u5e83\u5927\u306a\u7236\u3088\r\n\u50d5\u304b\u3089\u76ee\u3092\u96e2\u3055\u306a\u3044\u3067\u5b88\u308b\u4e8b\u3092\u305b\u3088\r\n\u5e38\u306b\u7236\u306e\u6c17\u9b44\u3092\u50d5\u306b\u5145\u305f\u305b\u3088\r\n\u3053\u306e\u9060\u3044\u9053\u7a0b\u306e\u305f\u3081\r\n\u3053\u306e\u9060\u3044\u9053\u7a0b\u306e\u305f\u3081\r\n<\/span><span class=\"no\">EOT<\/span>\r\n\r\n\r\n<span class=\"n\">pe<\/span><span class=\"p\">.<\/span><span class=\"nf\">extract<\/span><span class=\"p\">(<\/span><span class=\"n\">text<\/span><span class=\"p\">).<\/span><span class=\"nf\">tally<\/span>\r\n  <span class=\"p\">.<\/span><span class=\"nf\">sort_by<\/span><span class=\"p\">{<\/span> <span class=\"o\">|<\/span><span class=\"n\">word<\/span><span class=\"p\">,<\/span> <span class=\"n\">freq<\/span><span class=\"o\">|<\/span> <span class=\"o\">-<\/span><span class=\"n\">freq<\/span> <span class=\"p\">}<\/span>\r\n  <span class=\"p\">.<\/span><span class=\"nf\">each<\/span><span class=\"p\">{<\/span> <span 
class=\"o\">|<\/span><span class=\"n\">word<\/span><span class=\"p\">,<\/span> <span class=\"n\">freq<\/span><span class=\"o\">|<\/span> <span class=\"nb\">puts<\/span> <span class=\"s2\">\"%4d %s\"<\/span> <span class=\"o\">%<\/span> <span class=\"p\">[<\/span><span class=\"n\">freq<\/span><span class=\"p\">,<\/span> <span class=\"n\">word<\/span><span class=\"p\">]<\/span> <span class=\"p\">}<\/span>\r\n<\/code><\/pre>\n<p>\u7d50\u679c\uff1a<\/p>\n<pre class=\"post-pre\"><code>   3 \u7236\r\n   3 \u9053\u7a0b\r\n   2 \u9053\r\n   1 \u524d\r\n   1 \u5f8c\u308d\r\n   1 \u81ea\u7136\r\n   1 \u7acb\u3061\r\n   1 \u5e83\u5927\r\n   1 \u76ee\r\n   1 \u6c17\u9b44\r\n   1 \u9ad8\u6751\r\n   1 \u5149\u592a\u90ce\r\n<\/code><\/pre>\n<p>\u3075\u3046\uff0c\u75b2\u308c\u305f\u3002<\/p>\n<h1>\u304a\u308f\u308a\u306b<\/h1>\n<p>\u8aac\u660e\u3092\u52a0\u3048\u3088\u3046\u3068\u3059\u308b\u3068\u3069\u3093\u3069\u3093\u9577\u304f\u306a\u308b\u3057\uff0c\u63a8\u6572\u3092\u91cd\u306d\u308b\u3068\u3044\u3064\u307e\u3067\u3082\u66f8\u304d\u7d42\u308f\u3089\u306a\u3044\u3002<br \/>\n\u7533\u3057\u8a33\u306a\u3044\u3051\u3069\uff0c\u8a18\u4e8b\u306e\u54c1\u8cea\u306f\u30a4\u30de\u30a4\u30c1\u304b\u3082\u3002<br 
\/>\n\u8cea\u554f\u306f\u6b53\u8fce\u306a\u306e\u3067\uff0c\u3069\u3093\u306a\u3053\u3068\u3067\u3082\u8a0a\u3044\u3066\u304f\u3060\u3055\u3044\u3002\u79c1\u306b\u5206\u304b\u308b\u3053\u3068\u306a\u3089\u7b54\u3048\u307e\u3059\u3002<\/p>\n<div>\n<p>\u307b\u304b\u306b\u3082\u3042\u308b\u3088\u3046\u3060\u304c\u3088\u304f\u77e5\u3089\u306a\u3044\u3002\u00a0\u21a9<\/p>\n<p>\u672c\u5f53\u306b\u52b9\u7387\u304c\u60aa\u3044\u306e\u304b\u3069\u3046\u304b\uff0c\u307e\u305f\uff0c\u3069\u306e\u7a0b\u5ea6\u306e\u91cf\u306e\u30c6\u30ad\u30b9\u30c8\u3092\u6271\u3048\u3070\u6027\u80fd\u306b\u5f71\u97ff\u3092\u4e0e\u3048\u308b\u306e\u304b\uff0c\u306b\u3064\u3044\u3066\u306f\u304d\u3061\u3093\u3068\u3057\u305f\u5b9f\u9a13\u3092\u884c\u308f\u306a\u3044\u3068\u4f55\u3068\u3082\u8a00\u3048\u306a\u3044\u3002\u00a0\u21a9<\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>\u9023\u8a18\u4e8b\u76ee\u6b21 Ruby\/Rust \u9023\u643a (1)\u3000\u76ee\u7684 &nbsp; Ruby\/Rust \u9023\u643a (2)\u3000\u624b\u6bb5 &#038;n [&hellip;]<\/p>\n","protected":false},"author":12,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-45467","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v21.5 (Yoast SEO v21.5) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>- Blog - Silicon Cloud<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.silicloud.com\/zh\/blog\/45467-2\/\" \/>\n<meta property=\"og:locale\" content=\"zh_CN\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:description\" content=\"\u9023\u8a18\u4e8b\u76ee\u6b21 Ruby\/Rust \u9023\u643a (1)\u3000\u76ee\u7684 &nbsp; Ruby\/Rust \u9023\u643a 
(2)\u3000\u624b\u6bb5 &amp;n [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.silicloud.com\/zh\/blog\/45467-2\/\" \/>\n<meta property=\"og:site_name\" content=\"Blog - Silicon Cloud\" \/>\n<meta property=\"article:published_time\" content=\"2023-09-11T14:11:34+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-04-28T19:51:52+00:00\" \/>\n<meta name=\"author\" content=\"\u9038, \u79d1\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"\u4f5c\u8005\" \/>\n\t<meta name=\"twitter:data1\" content=\"\u9038, \u79d1\" \/>\n\t<meta name=\"twitter:label2\" content=\"\u9884\u8ba1\u9605\u8bfb\u65f6\u95f4\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 \u5206\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/45467-2\/\",\"url\":\"https:\/\/www.silicloud.com\/zh\/blog\/45467-2\/\",\"name\":\"- Blog - Silicon Cloud\",\"isPartOf\":{\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#website\"},\"datePublished\":\"2023-09-11T14:11:34+00:00\",\"dateModified\":\"2024-04-28T19:51:52+00:00\",\"author\":{\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/85c1dae56e6ea1e695c73d33c684d487\"},\"inLanguage\":\"zh-Hans\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.silicloud.com\/zh\/blog\/45467-2\/\"]}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#website\",\"url\":\"https:\/\/www.silicloud.com\/zh\/blog\/\",\"name\":\"Blog - Silicon Cloud\",\"description\":\"\",\"inLanguage\":\"zh-Hans\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/85c1dae56e6ea1e695c73d33c684d487\",\"name\":\"\u9038, 
\u79d1\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"zh-Hans\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/c94f6d9cbbfbca863fab309840bd690c153c95f8490c290ad2ed54dd693dad16?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/c94f6d9cbbfbca863fab309840bd690c153c95f8490c290ad2ed54dd693dad16?s=96&d=mm&r=g\",\"caption\":\"\u9038, \u79d1\"},\"url\":\"https:\/\/www.silicloud.com\/zh\/blog\/author\/keyi\/\"},{\"@type\":\"ImageObject\",\"inLanguage\":\"zh-Hans\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/45467-2\/#local-main-organization-logo\",\"url\":\"\",\"contentUrl\":\"\",\"caption\":\"Blog - Silicon Cloud\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"- Blog - Silicon Cloud","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.silicloud.com\/zh\/blog\/45467-2\/","og_locale":"zh_CN","og_type":"article","og_description":"\u9023\u8a18\u4e8b\u76ee\u6b21 Ruby\/Rust \u9023\u643a (1)\u3000\u76ee\u7684 &nbsp; Ruby\/Rust \u9023\u643a (2)\u3000\u624b\u6bb5 &n [&hellip;]","og_url":"https:\/\/www.silicloud.com\/zh\/blog\/45467-2\/","og_site_name":"Blog - Silicon Cloud","article_published_time":"2023-09-11T14:11:34+00:00","article_modified_time":"2024-04-28T19:51:52+00:00","author":"\u9038, \u79d1","twitter_card":"summary_large_image","twitter_misc":{"\u4f5c\u8005":"\u9038, \u79d1","\u9884\u8ba1\u9605\u8bfb\u65f6\u95f4":"4 \u5206"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.silicloud.com\/zh\/blog\/45467-2\/","url":"https:\/\/www.silicloud.com\/zh\/blog\/45467-2\/","name":"- Blog - Silicon 
Cloud","isPartOf":{"@id":"https:\/\/www.silicloud.com\/zh\/blog\/#website"},"datePublished":"2023-09-11T14:11:34+00:00","dateModified":"2024-04-28T19:51:52+00:00","author":{"@id":"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/85c1dae56e6ea1e695c73d33c684d487"},"inLanguage":"zh-Hans","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.silicloud.com\/zh\/blog\/45467-2\/"]}]},{"@type":"WebSite","@id":"https:\/\/www.silicloud.com\/zh\/blog\/#website","url":"https:\/\/www.silicloud.com\/zh\/blog\/","name":"Blog - Silicon Cloud","description":"","inLanguage":"zh-Hans"},{"@type":"Person","@id":"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/85c1dae56e6ea1e695c73d33c684d487","name":"\u9038, \u79d1","image":{"@type":"ImageObject","inLanguage":"zh-Hans","@id":"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/c94f6d9cbbfbca863fab309840bd690c153c95f8490c290ad2ed54dd693dad16?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/c94f6d9cbbfbca863fab309840bd690c153c95f8490c290ad2ed54dd693dad16?s=96&d=mm&r=g","caption":"\u9038, \u79d1"},"url":"https:\/\/www.silicloud.com\/zh\/blog\/author\/keyi\/"},{"@type":"ImageObject","inLanguage":"zh-Hans","@id":"https:\/\/www.silicloud.com\/zh\/blog\/45467-2\/#local-main-organization-logo","url":"","contentUrl":"","caption":"Blog - Silicon 
Cloud"}]}},"_links":{"self":[{"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/posts\/45467","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/users\/12"}],"replies":[{"embeddable":true,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/comments?post=45467"}],"version-history":[{"count":2,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/posts\/45467\/revisions"}],"predecessor-version":[{"id":79751,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/posts\/45467\/revisions\/79751"}],"wp:attachment":[{"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/media?parent=45467"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/categories?post=45467"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/tags?post=45467"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}