{"id":45517,"date":"2023-07-26T05:45:23","date_gmt":"2023-06-01T09:03:14","guid":{"rendered":"https:\/\/www.silicloud.com\/zh\/blog\/45517-2\/"},"modified":"2024-04-30T15:23:55","modified_gmt":"2024-04-30T07:23:55","slug":"45517-2","status":"publish","type":"post","link":"https:\/\/www.silicloud.com\/zh\/blog\/45517-2\/","title":{"rendered":""},"content":{"rendered":"<p>\u3061\u3087\u3063\u3068Web\u30b9\u30af\u30ec\u30a4\u30d4\u30f3\u30b0\u3059\u308b\u5fc5\u8981\u304c\u3042\u3063\u305f\u306e\u3067 scraper \u30af\u30ec\u30a4\u30c8\u3092\u4f7f\u3063\u3066\u307f\u307e\u3057\u305f\u3002<\/p>\n<p>\u5e38\u8b58\u7684\u306b\u8003\u3048\u3066 Python \u3067\u3084\u308a\u305d\u3046\u306a\u5c40\u9762\u3067\u3059\u304c\u3001\u4eca\u56de\u3082\u3042\u3048\u3066 Rust \u3067\u3044\u304d\u307e\u3059\u3002Rust \u4fee\u884c\u4e2d\u306a\u306e\u3067\u3002<\/p>\n<p>\u307b\u307c\u9700\u8981\u306a\u3044\u3067\u3057\u3087\u3046\u304c\u81ea\u5206\u306e\u305f\u3081\u306b\u30e1\u30e2\u3092\u6b8b\u3057\u307e\u3059\u3002<\/p>\n<h1>\u304a\u984c<\/h1>\n<p>Rust Blog \u306e\u8a18\u4e8b\u30ea\u30b9\u30c8\u3092\u53d6\u5f97\u3057\u3001\u30a8\u30af\u30bb\u30eb\u30d5\u30a1\u30a4\u30eb\u306b\u4fdd\u5b58\u3057\u305f\u3044\u3002<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">\u8a18\u4e8b\u30ea\u30b9\u30c8\u3092\u53d6\u5f97\u3059\u308bWeb\u30b5\u30a4\u30c8\u306f https:\/\/blog.rust-lang.org\/ \u3068\u3059\u308b\u3002<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">\u8a18\u4e8b\u30ea\u30b9\u30c8\u306e\u9805\u76ee\u306f\u3001\u8a18\u4e8b\u306e\u65e5\u4ed8\u30fb\u8a18\u4e8b\u306e\u30bf\u30a4\u30c8\u30eb\u30fb\u8a18\u4e8b\u306eURL\u3068\u3059\u308b\u3002<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">\u8a18\u4e8b\u306e\u65e5\u4ed8\u306f\u300c\u897f\u66a6\/\u6708\/\u65e5\u300d\u306e\u5f62\u5f0f\u3068\u3059\u308b\u3002<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">\u51fa\u529b\u3059\u308b\u30a8\u30af\u30bb\u30eb\u30d5\u30a1\u30a4\u30eb\u306e\u540d\u524d\u306f out.xlsx \u3068\u3059\u308b<\/ul>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/657d5f5337434c4406cf7951\/6-0.png\" alt=\"image.png\" \/><\/div>\n<h1>\u691c\u8a0e<\/h1>\n<p>Rust Blog \u3092\u30d6\u30e9\u30a6\u30b6\u3067\u78ba\u8a8d\u3059\u308b\u3002<\/p>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/657d5f5337434c4406cf7951\/9-0.png\" alt=\"1.png\" \/><\/div>\n<p>\u30b5\u30a4\u30c8\u306e\u8868\u793a\u3068\u53d6\u5f97\u3057\u305f\u3044\u8a18\u4e8b\u30ea\u30b9\u30c8\u306e\u9805\u76ee\u306e\u5bfe\u5fdc\u306f\u3053\u3093\u306a\u611f\u3058\u3067\u8003\u3048\u308b\u3002<\/p>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/657d5f5337434c4406cf7951\/11-0.png\" alt=\"2.png\" \/><\/div>\n<p>\u6b21\u306b\u30b5\u30a4\u30c8\u306eHTML \u30b3\u30fc\u30c9\u3068\u53d6\u5f97\u3059\u308b\u8a18\u4e8b\u30ea\u30b9\u30c8\u306e\u9805\u76ee\u306e\u5bfe\u5fdc\u3092\u78ba\u8a8d\u3059\u308b\u3002<\/p>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/657d5f5337434c4406cf7951\/13-0.png\" alt=\"3.png\" \/><\/div>\n<p>\u8a18\u4e8b\u306e\u65e5\u4ed8\u306e\u897f\u66a6\u3001\u8a18\u4e8b\u306e\u65e5\u4ed8\u306e\u6708\u65e5\u3001\u8a18\u4e8b\u306e\u30bf\u30a4\u30c8\u30eb\u3068\u30ea\u30f3\u30af\u306e td \u8981\u7d20\u3092\u3046\u307e\u304f\u3068\u308c\u308c\u3070\u3088\u3055\u305d\u3046\u3002<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">\u897f\u66a6\u3068\u6708\u65e5\u304c\u5225\u3005\u306e\u8981\u7d20\u306b\u914d\u7f6e\u3055\u308c\u3066\u3044\u308b\u306e\u3067\u8a18\u4e8b\u306e\u65e5\u4ed8\u3068\u3057\u3066\u3053\u308c\u3089\u3092\u304f\u3063\u3064\u3051\u3066\u3084\u308b\u5fc5\u8981\u304c\u3042\u308b<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">\u6708\u306f\u300c\u6708\u540d\u300d\uff08Jan.\uff5eDec.\uff09\u306a\u306e\u3067\u3053\u308c\u3092\u6570\u5b57\u306b\u3057\u3066\u3084\u308b\u5fc5\u8981\u304c\u3042\u308b<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">\u6708\u540d\u3068\u65e5\u304c &amp;nbsp (No break space) \u3067\u533a\u5207\u3089\u308c\u3066\u3044\u308b\u3002\u5024\u306f &#8220;\\u{a0}&#8221; \u3068\u306a\u308b\u3053\u3068\u306b\u6ce8\u610f\u3002<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">\u30ea\u30f3\u30af\u306f\u76f8\u5bfe\u30d1\u30b9\u306b\u306a\u3063\u3066\u3044\u308b\u306e\u3067\u5b8c\u5168\u306a URL \u306b\u3057\u3066\u3084\u308b\u5fc5\u8981\u304c\u3042\u308b<\/ul>\n<p>\u53d6\u5f97\u5bfe\u8c61\u90e8\u5206\u3092DOM\u3067\u8003\u3048\u308b\u3068\u3053\u3093\u306a\u611f\u3058\u3002<\/p>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/657d5f5337434c4406cf7951\/17-0.png\" alt=\"4.png\" \/><\/div>\n<p>\u5fc5\u8981\u306a\u30ce\u30fc\u30c9\u3092\u7279\u5b9a\u3059\u308b\u30bb\u30ec\u30af\u30bf\u306f\u6b21\u306e\u3088\u3046\u306b\u3057\u3066\u307f\u305f\u3002<\/p>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/657d5f5337434c4406cf7951\/19-0.png\" alt=\"5.png\" \/><\/div>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">selector1 \u306f\u30c9\u30ad\u30e5\u30e1\u30f3\u30c8\u306e\u30c8\u30c3\u30d7\u304b\u3089 tr \u8981\u7d20\u3092\u9078\u629e\u3059\u308b\u305f\u3081\u306e\u30bb\u30ec\u30af\u30bf\u3002<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">selector2 \uff5e 4 \u306f\u305d\u308c\u305e\u308c h3 \/ td \/ a \u8981\u7d20\u3092\u4e0a\u306e tr \u8981\u7d20\u304b\u3089\u76f8\u5bfe\u7684\u306b\u9078\u629e\u3059\u308b\u305f\u3081\u306e\u30bb\u30ec\u30af\u30bf\u3002<\/ul>\n<p>\u305d\u306e\u4ed6\u3001\u52d5\u4f5c\u78ba\u8a8d\u306e\u305f\u3073\u306b Rust Blog \u306e\u30b5\u30a4\u30c8\u306b\u30a2\u30af\u30bb\u30b9\u3092\u767a\u751f\u3055\u305b\u308b\u306e\u306f\u7121\u99c4\u306a\u306e\u3067\u3001\u30ed\u30fc\u30ab\u30eb\u306b\u30ad\u30e3\u30c3\u30b7\u30e5\u30d5\u30a1\u30a4\u30eb\u3092\u4f5c\u6210\u3057\u3001\u305d\u3053\u304b\u3089\u304b\u3089HTML\u30b3\u30fc\u30c9\u3092\u8aad\u307f\u51fa\u3059\u3088\u3046\u306b\u3059\u308b\u3053\u3068\u3068\u3059\u308b\u3002<\/p>\n<h1>\u30b3\u30fc\u30c9<\/h1>\n<p>\u4ee5\u4e0b\u306b main.rs \u3092\u793a\u3059\u3002<\/p>\n<pre class=\"post-pre\"><code><span class=\"cm\">\/*\r\n  Rust Blog \u306e\u8a18\u4e8b\u30ea\u30b9\u30c8\u3092\u30a8\u30af\u30bb\u30eb\u306b\u4fdd\u5b58\u3059\u308b\u30b5\u30f3\u30d7\u30eb\r\n\r\n  Build and run:\r\n    cargo add reqwest --features=\"blocking\"\r\n    cargo add scraper\r\n    cargo add thiserror\r\n    cargo add url\r\n    cargo add xlsxwriter\r\n    cargo run\r\n\r\n  Respect for Mr. John Doe\r\n  https:\/\/qiita.com\/YoshiTheQiita\/items\/f66828d61293c75a4585\r\n  \r\n*\/<\/span>\r\n\r\n<span class=\"k\">const<\/span> <span class=\"n\">RUST_BLOG_URL<\/span><span class=\"p\">:<\/span><span class=\"o\">&amp;<\/span><span class=\"nb\">str<\/span> <span class=\"o\">=<\/span> <span class=\"s\">\"https:\/\/blog.rust-lang.org\/\"<\/span><span class=\"p\">;<\/span>\r\n\r\n<span class=\"k\">const<\/span> <span class=\"n\">CACHE_FILE<\/span><span class=\"p\">:<\/span> <span class=\"o\">&amp;<\/span><span class=\"nb\">str<\/span> <span class=\"o\">=<\/span> <span class=\"s\">\"cache_file\"<\/span><span class=\"p\">;<\/span>\r\n<span class=\"k\">const<\/span> <span class=\"n\">OUT_FILE<\/span><span class=\"p\">:<\/span><span class=\"o\">&amp;<\/span><span class=\"nb\">str<\/span> <span class=\"o\">=<\/span> <span class=\"s\">\"out.xlsx\"<\/span><span class=\"p\">;<\/span>\r\n\r\n<span class=\"k\">const<\/span> <span class=\"n\">NBSP<\/span><span class=\"p\">:<\/span> <span class=\"o\">&amp;<\/span><span class=\"nb\">str<\/span> <span class=\"o\">=<\/span> <span class=\"s\">\"<\/span><span class=\"se\">\\u{a0}<\/span><span class=\"s\">\"<\/span><span class=\"p\">;<\/span>\r\n<span class=\"k\">const<\/span> <span class=\"n\">MONTHS<\/span><span class=\"p\">:[<\/span><span class=\"o\">&amp;<\/span><span class=\"nb\">str<\/span><span class=\"p\">;<\/span><span class=\"mi\">12<\/span><span class=\"p\">]<\/span> <span class=\"o\">=<\/span> <span class=\"p\">[<\/span><span class=\"s\">\"Jan.\"<\/span><span class=\"p\">,<\/span><span class=\"s\">\"Feb.\"<\/span><span class=\"p\">,<\/span><span class=\"s\">\"Mar.\"<\/span><span class=\"p\">,<\/span><span class=\"s\">\"Apr.\"<\/span><span class=\"p\">,<\/span><span class=\"s\">\"May\"<\/span><span class=\"p\">,<\/span>\r\n    <span class=\"s\">\"June\"<\/span><span class=\"p\">,<\/span><span class=\"s\">\"July\"<\/span><span class=\"p\">,<\/span><span class=\"s\">\"Aug.\"<\/span><span class=\"p\">,<\/span><span class=\"s\">\"Sept.\"<\/span><span class=\"p\">,<\/span><span class=\"s\">\"Oct.\"<\/span><span class=\"p\">,<\/span><span class=\"s\">\"Nov.\"<\/span><span class=\"p\">,<\/span><span class=\"s\">\"Dec.\"<\/span><span class=\"p\">];<\/span>\r\n\r\n\r\n<span class=\"k\">use<\/span> <span class=\"nn\">std<\/span><span class=\"p\">::<\/span><span class=\"nn\">io<\/span><span class=\"p\">::<\/span><span class=\"n\">Write<\/span><span class=\"p\">;<\/span>\r\n\r\n<span class=\"k\">fn<\/span> <span class=\"nf\">main<\/span><span class=\"p\">()<\/span> <span class=\"p\">{<\/span>\r\n    <span class=\"k\">if<\/span> <span class=\"k\">let<\/span> <span class=\"nf\">Err<\/span><span class=\"p\">(<\/span><span class=\"n\">e<\/span><span class=\"p\">)<\/span> <span class=\"o\">=<\/span> <span class=\"nf\">run<\/span><span class=\"p\">()<\/span> <span class=\"p\">{<\/span>\r\n        <span class=\"nd\">eprintln!<\/span><span class=\"p\">(<\/span><span class=\"s\">\"{:?}\"<\/span><span class=\"p\">,<\/span> <span class=\"n\">e<\/span><span class=\"p\">)<\/span>\r\n    <span class=\"p\">}<\/span>\r\n<span class=\"p\">}<\/span>\r\n\r\n<span class=\"k\">fn<\/span> <span class=\"nf\">run<\/span> <span class=\"p\">()<\/span> <span class=\"k\">-&gt;<\/span> <span class=\"nb\">Result<\/span><span class=\"o\">&lt;<\/span><span class=\"p\">(),<\/span> <span class=\"n\">MyError<\/span><span class=\"o\">&gt;<\/span> <span class=\"p\">{<\/span>\r\n\r\n    <span class=\"c1\">\/\/ \u30d9\u30fc\u30b9URL\u306e\u7528\u610f\uff08\u76f8\u5bfe\u30d1\u30b9\u3067\u4e0e\u3048\u3089\u308c\u308b\u8a18\u4e8b\u306e\u30ea\u30f3\u30af\u3092URL\u306b\u5909\u63db\u3059\u308b\u306e\u306b\u5229\u7528\uff09<\/span>\r\n    <span class=\"k\">let<\/span> <span class=\"n\">base_url<\/span> <span class=\"o\">=<\/span> <span class=\"nn\">url<\/span><span class=\"p\">::<\/span><span class=\"nn\">Url<\/span><span class=\"p\">::<\/span><span class=\"nf\">parse<\/span><span class=\"p\">(<\/span><span class=\"n\">RUST_BLOG_URL<\/span><span class=\"p\">)<\/span><span class=\"o\">?<\/span><span class=\"p\">;<\/span>\r\n\r\n    <span class=\"c1\">\/\/ \u30bb\u30ec\u30af\u30bf\u306e\u5b9a\u7fa9<\/span>\r\n    <span class=\"k\">let<\/span> <span class=\"n\">selector1<\/span> <span class=\"o\">=<\/span> <span class=\"nn\">scraper<\/span><span class=\"p\">::<\/span><span class=\"nn\">Selector<\/span><span class=\"p\">::<\/span><span class=\"nf\">parse<\/span><span class=\"p\">(<\/span><span class=\"s\">\"table.post-list tr\"<\/span><span class=\"p\">)<\/span><span class=\"nf\">.unwrap<\/span><span class=\"p\">();<\/span>    \r\n    <span class=\"k\">let<\/span> <span class=\"n\">selector2<\/span> <span class=\"o\">=<\/span> <span class=\"nn\">scraper<\/span><span class=\"p\">::<\/span><span class=\"nn\">Selector<\/span><span class=\"p\">::<\/span><span class=\"nf\">parse<\/span><span class=\"p\">(<\/span><span class=\"s\">\"td.bn &gt; h3\"<\/span><span class=\"p\">)<\/span><span class=\"nf\">.unwrap<\/span><span class=\"p\">();<\/span>\r\n    <span class=\"k\">let<\/span> <span class=\"n\">selector3<\/span> <span class=\"o\">=<\/span> <span class=\"nn\">scraper<\/span><span class=\"p\">::<\/span><span class=\"nn\">Selector<\/span><span class=\"p\">::<\/span><span class=\"nf\">parse<\/span><span class=\"p\">(<\/span><span class=\"s\">\"td.tr\"<\/span><span class=\"p\">)<\/span><span class=\"nf\">.unwrap<\/span><span class=\"p\">();<\/span>\r\n    <span class=\"k\">let<\/span> <span class=\"n\">selector4<\/span> <span class=\"o\">=<\/span> <span class=\"nn\">scraper<\/span><span class=\"p\">::<\/span><span class=\"nn\">Selector<\/span><span class=\"p\">::<\/span><span class=\"nf\">parse<\/span><span class=\"p\">(<\/span><span class=\"s\">\"td.bn &gt; a\"<\/span><span class=\"p\">)<\/span><span class=\"nf\">.unwrap<\/span><span class=\"p\">();<\/span>\r\n\r\n    <span class=\"c1\">\/\/ \u30ad\u30e3\u30c3\u30b7\u30e5\u30d5\u30a1\u30a4\u30eb\u306e\u7528\u610f<\/span>\r\n    <span class=\"nf\">prepare_cache_file<\/span><span class=\"p\">()<\/span><span class=\"o\">?<\/span><span class=\"p\">;<\/span>\r\n\r\n    <span class=\"c1\">\/\/ \u30ad\u30e3\u30c3\u30b7\u30e5\u30d5\u30a1\u30a4\u30eb\u306e\u8aad\u307f\u8fbc\u307f<\/span>\r\n    <span class=\"k\">let<\/span> <span class=\"n\">content<\/span> <span class=\"o\">=<\/span> <span class=\"nn\">std<\/span><span class=\"p\">::<\/span><span class=\"nn\">fs<\/span><span class=\"p\">::<\/span><span class=\"nf\">read_to_string<\/span><span class=\"p\">(<\/span><span class=\"n\">CACHE_FILE<\/span><span class=\"p\">)<\/span><span class=\"o\">?<\/span><span class=\"p\">;<\/span>\r\n\r\n    <span class=\"c1\">\/\/ \u30ad\u30e3\u30c3\u30b7\u30e5\u30d5\u30a1\u30a4\u30eb\u304c\u7a7a\u306e\u3068\u304d<\/span>\r\n    <span class=\"k\">if<\/span> <span class=\"n\">content<\/span><span class=\"nf\">.len<\/span><span class=\"p\">()<\/span> <span class=\"o\">&lt;<\/span> <span class=\"mi\">1<\/span> <span class=\"p\">{<\/span>\r\n        <span class=\"k\">return<\/span> <span class=\"nf\">Err<\/span><span class=\"p\">(<\/span><span class=\"nn\">MyError<\/span><span class=\"p\">::<\/span><span class=\"n\">EmptyContentError<\/span><span class=\"p\">);<\/span>\r\n    <span class=\"p\">}<\/span>\r\n\r\n    <span class=\"c1\">\/\/ \u51fa\u529b\u5148\u30a8\u30af\u30bb\u30eb\u306e\u7528\u610f<\/span>\r\n    <span class=\"k\">let<\/span> <span class=\"n\">wb<\/span> <span class=\"o\">=<\/span> <span class=\"nn\">xlsxwriter<\/span><span class=\"p\">::<\/span><span class=\"nn\">workbook<\/span><span class=\"p\">::<\/span><span class=\"nn\">Workbook<\/span><span class=\"p\">::<\/span><span class=\"nf\">new<\/span><span class=\"p\">(<\/span><span class=\"n\">OUT_FILE<\/span><span class=\"p\">)<\/span><span class=\"o\">?<\/span><span class=\"p\">;<\/span>\r\n    <span class=\"k\">let<\/span> <span class=\"k\">mut<\/span> <span class=\"n\">sh<\/span> <span class=\"o\">=<\/span> <span class=\"n\">wb<\/span><span class=\"nf\">.add_worksheet<\/span><span class=\"p\">(<\/span><span class=\"nb\">None<\/span><span class=\"p\">)<\/span><span class=\"o\">?<\/span><span class=\"p\">;<\/span>\r\n    <span class=\"k\">let<\/span> <span class=\"k\">mut<\/span> <span class=\"n\">i<\/span><span class=\"p\">:<\/span><span class=\"nn\">xlsxwriter<\/span><span class=\"p\">::<\/span><span class=\"nn\">worksheet<\/span><span class=\"p\">::<\/span><span class=\"n\">WorksheetRow<\/span> <span class=\"o\">=<\/span> <span class=\"mi\">0<\/span><span class=\"p\">;<\/span>\r\n\r\n    <span class=\"k\">let<\/span> <span class=\"k\">mut<\/span> <span class=\"n\">year<\/span><span class=\"p\">:<\/span><span class=\"nb\">usize<\/span> <span class=\"o\">=<\/span> <span class=\"mi\">0<\/span><span class=\"p\">;<\/span>\r\n\r\n    <span class=\"c1\">\/\/ \u30ad\u30e3\u30c3\u30b7\u30e5\u30d5\u30a1\u30a4\u30eb\u306e\u4e2d\u8eab\u3092 HTML \u3068\u3057\u3066\u30d1\u30fc\u30b9<\/span>\r\n    <span class=\"k\">let<\/span> <span class=\"n\">document<\/span> <span class=\"o\">=<\/span> <span class=\"nn\">scraper<\/span><span class=\"p\">::<\/span><span class=\"nn\">Html<\/span><span class=\"p\">::<\/span><span class=\"nf\">parse_document<\/span><span class=\"p\">(<\/span><span class=\"o\">&amp;<\/span><span class=\"n\">content<\/span><span class=\"p\">);<\/span>\r\n\r\n    <span class=\"c1\">\/\/ \u30bb\u30ec\u30af\u30bf\u3067\u8a72\u5f53\u3059\u308b tr \u8981\u7d20\u3092\u62bd\u51fa\u3057\u3001\u305d\u308c\u305e\u308c\u306b\u3064\u3044\u3066\u51e6\u7406\u3059\u308b<\/span>\r\n    <span class=\"k\">let<\/span> <span class=\"n\">trs<\/span> <span class=\"o\">=<\/span> <span class=\"n\">document<\/span><span class=\"nf\">.select<\/span><span class=\"p\">(<\/span><span class=\"o\">&amp;<\/span><span class=\"n\">selector1<\/span><span class=\"p\">);<\/span>\r\n    <span class=\"k\">for<\/span> <span class=\"n\">tr<\/span> <span class=\"k\">in<\/span> <span class=\"n\">trs<\/span> <span class=\"p\">{<\/span>\r\n        <span class=\"k\">let<\/span> <span class=\"k\">mut<\/span> <span class=\"n\">tmp<\/span> <span class=\"o\">=<\/span> <span class=\"nn\">String<\/span><span class=\"p\">::<\/span><span class=\"nf\">new<\/span><span class=\"p\">();<\/span>\r\n\r\n        <span class=\"c1\">\/\/ tr \u8981\u7d20\u306e\u4e0b\u306e h3 \u8981\u7d20\u3088\u308a\u300c\u897f\u66a6\u300d\u3092\u53d6\u5f97\u3059\u308b<\/span>\r\n        <span class=\"k\">for<\/span> <span class=\"n\">h3<\/span> <span class=\"k\">in<\/span> <span class=\"n\">tr<\/span><span class=\"nf\">.select<\/span><span class=\"p\">(<\/span><span class=\"o\">&amp;<\/span><span class=\"n\">selector2<\/span><span class=\"p\">)<\/span> <span class=\"p\">{<\/span>\r\n            <span class=\"n\">tmp<\/span> <span class=\"o\">=<\/span> <span class=\"nf\">text2str<\/span><span class=\"p\">(<\/span><span class=\"o\">&amp;<\/span><span class=\"k\">mut<\/span> <span class=\"n\">h3<\/span><span class=\"nf\">.text<\/span><span class=\"p\">());<\/span>\r\n        <span class=\"p\">}<\/span>\r\n        <span class=\"k\">if<\/span> <span class=\"n\">tmp<\/span> <span class=\"o\">!=<\/span> <span class=\"s\">\"\"<\/span> <span class=\"p\">{<\/span>\r\n            <span class=\"n\">year<\/span> <span class=\"o\">=<\/span> <span class=\"nf\">parse_year<\/span><span class=\"p\">(<\/span><span class=\"n\">tmp<\/span><span class=\"p\">)<\/span><span class=\"o\">?<\/span><span class=\"p\">;<\/span>\r\n            <span class=\"k\">continue<\/span><span class=\"p\">;<\/span>\r\n        <span class=\"p\">}<\/span>\r\n\r\n        <span class=\"k\">let<\/span> <span class=\"k\">mut<\/span> <span class=\"n\">month<\/span><span class=\"p\">:<\/span><span class=\"nb\">usize<\/span> <span class=\"o\">=<\/span> <span class=\"mi\">0<\/span><span class=\"p\">;<\/span>\r\n        <span class=\"k\">let<\/span> <span class=\"k\">mut<\/span> <span class=\"n\">day<\/span><span class=\"p\">:<\/span><span class=\"nb\">usize<\/span> <span class=\"o\">=<\/span> <span class=\"mi\">0<\/span><span class=\"p\">;<\/span>\r\n        <span class=\"k\">let<\/span> <span class=\"k\">mut<\/span> <span class=\"n\">article_url<\/span> <span class=\"o\">=<\/span> <span class=\"nn\">String<\/span><span class=\"p\">::<\/span><span class=\"nf\">new<\/span><span class=\"p\">();<\/span>\r\n        <span class=\"k\">let<\/span> <span class=\"k\">mut<\/span> <span class=\"n\">article_title<\/span> <span class=\"o\">=<\/span> <span class=\"nn\">String<\/span><span class=\"p\">::<\/span><span class=\"nf\">new<\/span><span class=\"p\">();<\/span>\r\n\r\n        <span class=\"c1\">\/\/ tr \u8981\u7d20\u306e\u4e0b\u306e td \u8981\u7d20\u3088\u308a\u300c\u6708\u300d\u3068\u300c\u65e5\u300d\u3092\u53d6\u5f97\u3059\u308b<\/span>\r\n        <span class=\"k\">for<\/span> <span class=\"n\">td<\/span> <span class=\"k\">in<\/span> <span class=\"n\">tr<\/span><span class=\"nf\">.select<\/span><span class=\"p\">(<\/span><span class=\"o\">&amp;<\/span><span class=\"n\">selector3<\/span><span class=\"p\">)<\/span> <span class=\"p\">{<\/span>\r\n            <span class=\"k\">let<\/span> <span class=\"n\">date<\/span> <span class=\"o\">=<\/span> <span class=\"nf\">text2str<\/span><span class=\"p\">(<\/span><span class=\"o\">&amp;<\/span><span class=\"k\">mut<\/span> <span class=\"n\">td<\/span><span class=\"nf\">.text<\/span><span class=\"p\">());<\/span>\r\n            <span class=\"p\">(<\/span><span class=\"n\">month<\/span><span class=\"p\">,<\/span> <span class=\"n\">day<\/span><span class=\"p\">)<\/span> <span class=\"o\">=<\/span> <span class=\"nf\">parse_date<\/span><span class=\"p\">(<\/span><span class=\"n\">date<\/span><span class=\"p\">)<\/span><span class=\"o\">?<\/span><span class=\"p\">;<\/span>\r\n        <span class=\"p\">}<\/span>\r\n\r\n        <span class=\"c1\">\/\/ tr \u8981\u7d20\u306e\u4e0b\u306e a \u8981\u7d20\u3088\u308a\u300c\u8a18\u4e8b\u30bf\u30a4\u30c8\u30eb\u300d\u3068\u300c\u8a18\u4e8bURL\u300d\u3092\u53d6\u5f97\u3059\u308b<\/span>\r\n        <span class=\"k\">for<\/span> <span class=\"n\">a<\/span> <span class=\"k\">in<\/span> <span class=\"n\">tr<\/span><span class=\"nf\">.select<\/span><span class=\"p\">(<\/span><span class=\"o\">&amp;<\/span><span class=\"n\">selector4<\/span><span class=\"p\">)<\/span> <span class=\"p\">{<\/span>\r\n            <span class=\"n\">article_title<\/span> <span class=\"o\">=<\/span> <span class=\"nf\">text2str<\/span><span class=\"p\">(<\/span><span class=\"o\">&amp;<\/span><span class=\"k\">mut<\/span> <span class=\"n\">a<\/span><span class=\"nf\">.text<\/span><span class=\"p\">());<\/span>\r\n            <span class=\"k\">let<\/span> <span class=\"n\">a<\/span> <span class=\"o\">=<\/span> <span class=\"n\">a<\/span><span class=\"nf\">.value<\/span><span class=\"p\">();<\/span>\r\n            <span class=\"k\">if<\/span> <span class=\"k\">let<\/span> <span class=\"nf\">Some<\/span><span class=\"p\">(<\/span><span class=\"n\">h<\/span><span class=\"p\">)<\/span> <span class=\"o\">=<\/span> <span class=\"n\">a<\/span><span class=\"nf\">.attr<\/span><span class=\"p\">(<\/span><span class=\"s\">\"href\"<\/span><span class=\"p\">)<\/span> <span class=\"p\">{<\/span>\r\n                <span class=\"n\">article_url<\/span> <span class=\"o\">=<\/span> <span class=\"n\">base_url<\/span><span class=\"nf\">.join<\/span><span class=\"p\">(<\/span><span class=\"n\">h<\/span><span class=\"p\">)<\/span><span class=\"o\">?<\/span><span class=\"nf\">.to_string<\/span><span class=\"p\">();<\/span>\r\n            <span class=\"p\">}<\/span>\r\n        <span class=\"p\">}<\/span>\r\n\r\n        <span class=\"c1\">\/\/ \u3069\u308c\u304b\u4e00\u3064\u3067\u3082\u9805\u76ee\u3092\u53d6\u5f97\u3067\u304d\u306a\u304b\u3063\u305f\u5834\u5408\u306f\u6b21\u3078<\/span>\r\n        <span class=\"k\">if<\/span> <span class=\"n\">year<\/span> <span class=\"o\">==<\/span> <span class=\"mi\">0<\/span> <span class=\"p\">||<\/span> <span class=\"n\">month<\/span> <span class=\"o\">==<\/span> <span class=\"mi\">0<\/span> <span class=\"p\">||<\/span> <span class=\"n\">day<\/span> <span class=\"o\">==<\/span> <span class=\"mi\">0<\/span> <span class=\"p\">||<\/span> <span class=\"n\">article_title<\/span> <span class=\"o\">==<\/span> <span class=\"s\">\"\"<\/span> <span class=\"p\">||<\/span> <span class=\"n\">article_url<\/span> <span class=\"o\">==<\/span> <span class=\"s\">\"\"<\/span> <span class=\"p\">{<\/span>\r\n            <span class=\"k\">continue<\/span><span class=\"p\">;<\/span>\r\n        <span class=\"p\">}<\/span>\r\n\r\n        <span class=\"k\">if<\/span> <span class=\"nd\">cfg!<\/span><span class=\"p\">(<\/span><span class=\"n\">debug_assertions<\/span><span class=\"p\">)<\/span> <span class=\"p\">{<\/span>\r\n            <span class=\"nd\">println!<\/span><span class=\"p\">(<\/span><span class=\"s\">\"{y}\/{m}\/{d}, {t}, {u}\"<\/span><span class=\"p\">,<\/span>\r\n                <span class=\"n\">y<\/span> <span class=\"o\">=<\/span> <span class=\"n\">year<\/span><span class=\"p\">,<\/span>\r\n                <span class=\"n\">m<\/span> <span class=\"o\">=<\/span> <span class=\"n\">month<\/span><span class=\"p\">,<\/span>\r\n                <span class=\"n\">d<\/span> <span class=\"o\">=<\/span> <span class=\"n\">day<\/span><span class=\"p\">,<\/span>\r\n                <span class=\"n\">t<\/span> <span class=\"o\">=<\/span> <span class=\"n\">article_title<\/span><span class=\"p\">,<\/span>\r\n                <span class=\"n\">u<\/span> <span class=\"o\">=<\/span> <span class=\"n\">article_url<\/span><span class=\"p\">);<\/span>\r\n        <span class=\"p\">}<\/span>\r\n\r\n        <span class=\"c1\">\/\/ \u30a8\u30af\u30bb\u30eb\u306b\u51fa\u529b<\/span>\r\n        <span class=\"n\">sh<\/span><span class=\"nf\">.write_string<\/span><span class=\"p\">(<\/span><span class=\"n\">i<\/span><span class=\"p\">,<\/span> <span class=\"mi\">0<\/span><span class=\"p\">,<\/span> <span class=\"o\">&amp;<\/span><span class=\"nd\">format!<\/span><span class=\"p\">(<\/span><span class=\"s\">\"{}\/{}\/{}\"<\/span><span class=\"p\">,<\/span> <span class=\"n\">year<\/span><span class=\"p\">,<\/span> <span class=\"n\">month<\/span><span class=\"p\">,<\/span> <span class=\"n\">day<\/span><span class=\"p\">),<\/span> <span class=\"nb\">None<\/span><span class=\"p\">)<\/span><span class=\"o\">?<\/span><span class=\"p\">;<\/span>\r\n        <span class=\"n\">sh<\/span><span class=\"nf\">.write_string<\/span><span class=\"p\">(<\/span><span class=\"n\">i<\/span><span class=\"p\">,<\/span> <span class=\"mi\">1<\/span><span class=\"p\">,<\/span> <span class=\"o\">&amp;<\/span><span class=\"n\">article_title<\/span><span class=\"p\">,<\/span> <span class=\"nb\">None<\/span><span class=\"p\">)<\/span><span class=\"o\">?<\/span><span class=\"p\">;<\/span>\r\n        <span class=\"n\">sh<\/span><span class=\"nf\">.write_string<\/span><span class=\"p\">(<\/span><span class=\"n\">i<\/span><span class=\"p\">,<\/span> <span class=\"mi\">2<\/span><span class=\"p\">,<\/span> <span class=\"o\">&amp;<\/span><span class=\"n\">article_url<\/span><span class=\"p\">,<\/span> <span class=\"nb\">None<\/span><span class=\"p\">)<\/span><span class=\"o\">?<\/span><span class=\"p\">;<\/span>\r\n        <span class=\"n\">i<\/span> <span class=\"o\">+=<\/span> <span class=\"mi\">1<\/span><span class=\"p\">;<\/span>\r\n    <span class=\"p\">}<\/span>\r\n\r\n    <span class=\"c1\">\/\/ \u30a8\u30af\u30bb\u30eb\u3092\u30af\u30ed\u30fc\u30ba<\/span>\r\n    <span class=\"n\">wb<\/span><span class=\"nf\">.close<\/span><span class=\"p\">()<\/span><span class=\"o\">?<\/span><span class=\"p\">;<\/span>\r\n    <span class=\"nd\">eprintln!<\/span><span class=\"p\">(<\/span><span class=\"s\">\"# saved in {}\"<\/span><span class=\"p\">,<\/span> <span class=\"n\">OUT_FILE<\/span><span class=\"p\">);<\/span>\r\n\r\n    <span class=\"nf\">Ok<\/span><span class=\"p\">(())<\/span>\r\n<span class=\"p\">}<\/span>\r\n\r\n<span class=\"c1\">\/\/ \u897f\u66a6\u3092\u53d6\u5f97\u3059\u308b\u95a2\u6570<\/span>\r\n<span class=\"k\">fn<\/span> <span class=\"nf\">parse_year<\/span><span class=\"p\">(<\/span><span class=\"n\">s<\/span><span class=\"p\">:<\/span><span class=\"nb\">String<\/span><span class=\"p\">)<\/span> <span class=\"k\">-&gt;<\/span> <span class=\"nb\">Result<\/span><span class=\"o\">&lt;<\/span><span class=\"nb\">usize<\/span><span class=\"p\">,<\/span> <span class=\"n\">MyError<\/span><span class=\"o\">&gt;<\/span> <span class=\"p\">{<\/span>\r\n    <span class=\"k\">if<\/span> <span class=\"o\">!<\/span><span class=\"n\">s<\/span><span class=\"nf\">.starts_with<\/span><span class=\"p\">(<\/span><span class=\"s\">\"Posts in \"<\/span><span class=\"p\">)<\/span> <span class=\"p\">{<\/span>\r\n        <span class=\"k\">return<\/span> <span class=\"nf\">Err<\/span><span class=\"p\">(<\/span><span class=\"nn\">MyError<\/span><span class=\"p\">::<\/span><span class=\"n\">UnknownYearError<\/span><span class=\"p\">);<\/span>\r\n    <span class=\"p\">}<\/span>\r\n    <span class=\"c1\">\/\/ 9\u6587\u5b57\u76ee\u4ee5\u964d\u3092\u897f\u66a6\u3092\u793a\u3059\u6570\u5b57\u3068\u3057\u3066\u30d1\u30fc\u30b9\u3059\u308b<\/span>\r\n    <span class=\"nf\">Ok<\/span><span class=\"p\">(<\/span><span class=\"n\">s<\/span><span class=\"p\">[<\/span><span class=\"mi\">9<\/span><span class=\"o\">..<\/span><span class=\"p\">]<\/span><span class=\"nf\">.parse<\/span><span class=\"p\">()<\/span><span class=\"o\">?<\/span><span class=\"p\">)<\/span>\r\n<span class=\"p\">}<\/span>\r\n\r\n<span class=\"c1\">\/\/ \u65e5\u4ed8\uff08\u6708\u30fb\u65e5\uff09\u3092\u53d6\u5f97\u3059\u308b\u95a2\u6570<\/span>\r\n<span class=\"k\">fn<\/span> <span class=\"nf\">parse_date<\/span><span class=\"p\">(<\/span><span class=\"n\">s<\/span><span class=\"p\">:<\/span><span class=\"nb\">String<\/span><span class=\"p\">)<\/span> <span class=\"k\">-&gt;<\/span> <span class=\"nb\">Result<\/span><span class=\"o\">&lt;<\/span><span class=\"p\">(<\/span><span class=\"nb\">usize<\/span><span class=\"p\">,<\/span> <span class=\"nb\">usize<\/span><span class=\"p\">),<\/span> <span class=\"n\">MyError<\/span><span class=\"o\">&gt;<\/span> <span class=\"p\">{<\/span>\r\n    <span class=\"k\">let<\/span> <span class=\"n\">p<\/span><span class=\"p\">:<\/span><span class=\"nb\">Vec<\/span><span class=\"o\">&lt;&amp;<\/span><span class=\"nb\">str<\/span><span class=\"o\">&gt;<\/span> <span class=\"o\">=<\/span> <span class=\"n\">s<\/span><span class=\"nf\">.split<\/span><span class=\"p\">(<\/span><span class=\"n\">NBSP<\/span><span class=\"p\">)<\/span><span class=\"nf\">.collect<\/span><span class=\"p\">();<\/span>\r\n    <span class=\"k\">if<\/span> <span class=\"n\">p<\/span><span class=\"nf\">.len<\/span><span class=\"p\">()<\/span><span class=\"o\">&lt;<\/span><span class=\"mi\">2<\/span> <span class=\"p\">{<\/span>\r\n        <span class=\"k\">return<\/span> <span class=\"nf\">Err<\/span><span class=\"p\">(<\/span><span class=\"nn\">MyError<\/span><span class=\"p\">::<\/span><span class=\"n\">UnknownDateError<\/span><span class=\"p\">);<\/span>\r\n    <span class=\"p\">}<\/span>\r\n    <span class=\"nf\">Ok<\/span><span class=\"p\">((<\/span><span class=\"nf\">conv_month<\/span><span class=\"p\">(<\/span><span class=\"n\">p<\/span><span class=\"p\">[<\/span><span class=\"mi\">0<\/span><span class=\"p\">])<\/span><span class=\"o\">?<\/span><span class=\"p\">,<\/span> <span class=\"n\">p<\/span><span class=\"p\">[<\/span><span class=\"mi\">1<\/span><span class=\"p\">]<\/span><span class=\"nf\">.parse<\/span><span class=\"p\">()<\/span><span class=\"o\">?<\/span><span class=\"p\">))<\/span>\r\n<span class=\"p\">}<\/span>\r\n\r\n<span class=\"c1\">\/\/ \u6708\u540d\u3092\u6570\u5b57\u306b\u5909\u63db\u3059\u308b\u95a2\u6570<\/span>\r\n<span class=\"k\">fn<\/span> <span class=\"nf\">conv_month<\/span><span class=\"p\">(<\/span><span class=\"n\">s<\/span><span class=\"p\">:<\/span><span class=\"o\">&amp;<\/span><span class=\"nb\">str<\/span><span class=\"p\">)<\/span> <span class=\"k\">-&gt;<\/span> <span class=\"nb\">Result<\/span><span class=\"o\">&lt;<\/span><span class=\"nb\">usize<\/span><span class=\"p\">,<\/span> <span class=\"n\">MyError<\/span><span class=\"o\">&gt;<\/span> <span class=\"p\">{<\/span>\r\n    <span class=\"k\">for<\/span> <span class=\"p\">(<\/span><span class=\"n\">i<\/span><span class=\"p\">,<\/span> <span class=\"n\">m<\/span><span class=\"p\">)<\/span> <span class=\"k\">in<\/span> <span class=\"n\">MONTHS<\/span><span class=\"nf\">.iter<\/span><span class=\"p\">()<\/span><span class=\"nf\">.enumerate<\/span><span class=\"p\">()<\/span> <span class=\"p\">{<\/span>\r\n        <span class=\"k\">if<\/span> <span class=\"n\">s<\/span> <span class=\"o\">==<\/span> <span class=\"o\">*<\/span><span class=\"n\">m<\/span> <span class=\"p\">{<\/span>\r\n            <span class=\"k\">return<\/span> <span class=\"nf\">Ok<\/span><span class=\"p\">(<\/span><span class=\"n\">i<\/span><span class=\"o\">+<\/span><span class=\"mi\">1<\/span><span class=\"p\">);<\/span>\r\n        <span class=\"p\">}<\/span>\r\n    <span class=\"p\">}<\/span>\r\n    <span class=\"nf\">Err<\/span><span class=\"p\">(<\/span><span class=\"nn\">MyError<\/span><span class=\"p\">::<\/span><span class=\"n\">UnknownDateError<\/span><span class=\"p\">)<\/span>\r\n<span class=\"p\">}<\/span>\r\n\r\n<span class=\"c1\">\/\/ \u30c6\u30ad\u30b9\u30c8\u306e\u6587\u5b57\u5217\u3092\u8fd4\u3059\u95a2\u6570<\/span>\r\n<span class=\"k\">fn<\/span> <span class=\"nf\">text2str<\/span><span class=\"p\">(<\/span><span class=\"n\">t<\/span><span class=\"p\">:<\/span><span class=\"o\">&amp;<\/span><span class=\"k\">mut<\/span> <span class=\"nn\">scraper<\/span><span class=\"p\">::<\/span><span class=\"nn\">element_ref<\/span><span class=\"p\">::<\/span><span class=\"n\">Text<\/span><span class=\"p\">)<\/span> <span class=\"k\">-&gt;<\/span> <span class=\"nb\">String<\/span> <span class=\"p\">{<\/span>\r\n    <span class=\"c1\">\/\/String::from(t.next().unwrap_or(\"\")) \u3067\u3082\u3044\u3044\u3051\u3069\u30c0\u30b5\u3044\u3068\u611f\u3058\u308b<\/span>\r\n    <span class=\"k\">let<\/span> <span class=\"n\">r<\/span><span class=\"p\">:<\/span><span class=\"nb\">Vec<\/span><span class=\"o\">&lt;&amp;<\/span><span class=\"nb\">str<\/span><span class=\"o\">&gt;<\/span> <span class=\"o\">=<\/span> <span class=\"n\">t<\/span><span class=\"nf\">.collect<\/span><span class=\"p\">();<\/span>\r\n    <span class=\"n\">r<\/span><span class=\"nf\">.join<\/span><span class=\"p\">(<\/span><span class=\"s\">\"\"<\/span><span class=\"p\">)<\/span>\r\n<span class=\"p\">}<\/span>\r\n\r\n<span class=\"c1\">\/\/ \u30ad\u30e3\u30c3\u30b7\u30e5\u30d5\u30a1\u30a4\u30eb\u3092\u7528\u610f\u3059\u308b\u95a2\u6570\uff08\u4f55\u5ea6\u3082\u30b5\u30a4\u30c8\u306b\u30a2\u30af\u30bb\u30b9\u3055\u305b\u306a\u3044\u5de5\u592b\uff09<\/span>\r\n<span class=\"k\">fn<\/span> <span class=\"nf\">prepare_cache_file<\/span><span class=\"p\">()<\/span> <span class=\"k\">-&gt;<\/span> <span class=\"nb\">Result<\/span><span class=\"o\">&lt;<\/span><span class=\"p\">(),<\/span> <span class=\"n\">MyError<\/span><span class=\"o\">&gt;<\/span> <span class=\"p\">{<\/span>\r\n    <span class=\"c1\">\/\/ \u30ad\u30e3\u30c3\u30b7\u30e5\u30d5\u30a1\u30a4\u30eb\u304c\u306a\u3044\u3068\u304d\u306e\u307f\u30c0\u30a6\u30f3\u30ed\u30fc\u30c9\u3059\u308b<\/span>\r\n    <span class=\"k\">if<\/span> <span class=\"o\">!<\/span><span class=\"nf\">file_exists<\/span><span class=\"p\">(<\/span><span class=\"n\">CACHE_FILE<\/span><span class=\"p\">)<\/span> <span class=\"p\">{<\/span>\r\n        <span class=\"nd\">eprintln!<\/span><span class=\"p\">(<\/span><span class=\"s\">\"# downloading...\"<\/span><span class=\"p\">);<\/span>\r\n        <span class=\"k\">let<\/span> <span class=\"n\">body<\/span> <span class=\"o\">=<\/span> <span class=\"nn\">reqwest<\/span><span class=\"p\">::<\/span><span class=\"nn\">blocking<\/span><span class=\"p\">::<\/span><span class=\"nf\">get<\/span><span class=\"p\">(<\/span><span class=\"n\">RUST_BLOG_URL<\/span><span class=\"p\">)<\/span><span class=\"o\">?<\/span><span class=\"nf\">.text<\/span><span class=\"p\">()<\/span><span class=\"o\">?<\/span><span class=\"p\">;<\/span>\r\n\r\n        <span class=\"nd\">eprintln!<\/span><span class=\"p\">(<\/span><span class=\"s\">\"# saving in {}...\"<\/span><span class=\"p\">,<\/span> <span class=\"n\">CACHE_FILE<\/span><span class=\"p\">);<\/span>\r\n        <span class=\"k\">let<\/span> <span class=\"k\">mut<\/span> <span class=\"n\">w<\/span> <span class=\"o\">=<\/span> <span class=\"nn\">std<\/span><span class=\"p\">::<\/span><span class=\"nn\">fs<\/span><span class=\"p\">::<\/span><span class=\"nn\">File<\/span><span class=\"p\">::<\/span><span class=\"nf\">create<\/span><span class=\"p\">(<\/span><span class=\"n\">CACHE_FILE<\/span><span class=\"p\">)<\/span><span class=\"o\">?<\/span><span class=\"p\">;<\/span>\r\n        <span class=\"nd\">write!<\/span><span class=\"p\">(<\/span><span class=\"n\">w<\/span><span class=\"p\">,<\/span> <span class=\"s\">\"{}\"<\/span><span class=\"p\">,<\/span> <span class=\"n\">body<\/span><span class=\"p\">)<\/span><span class=\"o\">?<\/span><span class=\"p\">;<\/span>\r\n        <span class=\"n\">w<\/span><span class=\"nf\">.flush<\/span><span class=\"p\">()<\/span><span class=\"o\">?<\/span><span class=\"p\">;<\/span>\r\n    <span class=\"p\">}<\/span>\r\n    <span class=\"nf\">Ok<\/span><span class=\"p\">(())<\/span>\r\n<span class=\"p\">}<\/span>\r\n\r\n<span class=\"c1\">\/\/ \u30d5\u30a1\u30a4\u30eb\u306e\u6709\u7121\u3092\u30c1\u30a7\u30c3\u30af\u3059\u308b\u95a2\u6570<\/span>\r\n<span class=\"k\">fn<\/span> <span class=\"nf\">file_exists<\/span><span class=\"p\">(<\/span><span class=\"n\">file_path<\/span><span class=\"p\">:<\/span> <span class=\"o\">&amp;<\/span><span class=\"nb\">str<\/span><span class=\"p\">)<\/span> <span class=\"k\">-&gt;<\/span> <span class=\"nb\">bool<\/span> <span class=\"p\">{<\/span>\r\n    <span class=\"nn\">std<\/span><span class=\"p\">::<\/span><span class=\"nn\">path<\/span><span class=\"p\">::<\/span><span class=\"nn\">Path<\/span><span class=\"p\">::<\/span><span class=\"nf\">new<\/span><span class=\"p\">(<\/span><span class=\"n\">file_path<\/span><span class=\"p\">)<\/span><span class=\"nf\">.is_file<\/span><span class=\"p\">()<\/span>\r\n<span class=\"p\">}<\/span>\r\n\r\n<span class=\"c1\">\/\/ \u8907\u6570\u306e\u30a8\u30e9\u30fc\u3092\u307e\u3068\u3081\u308b\u30a8\u30e9\u30fc\u578b<\/span>\r\n<span class=\"nd\">#[derive(thiserror::Error,<\/span> <span class=\"nd\">Debug)]<\/span>\r\n<span class=\"k\">enum<\/span> <span class=\"n\">MyError<\/span> <span class=\"p\">{<\/span>\r\n    <span class=\"nd\">#[error(<\/span><span class=\"s\">\"UnknownYearError\"<\/span><span class=\"nd\">)]<\/span>\r\n    <span class=\"n\">UnknownYearError<\/span><span class=\"p\">,<\/span>\r\n\r\n    <span class=\"nd\">#[error(<\/span><span class=\"s\">\"UnknownDateError\"<\/span><span class=\"nd\">)]<\/span>\r\n    <span class=\"n\">UnknownDateError<\/span><span class=\"p\">,<\/span>\r\n\r\n    <span class=\"nd\">#[error(<\/span><span class=\"s\">\"EmptyContentError\"<\/span><span class=\"nd\">)]<\/span>\r\n    <span class=\"n\">EmptyContentError<\/span><span class=\"p\">,<\/span>\r\n\r\n    <span class=\"nd\">#[error(<\/span><span class=\"s\">\"ReqwestError({0})\"<\/span><span class=\"nd\">)]<\/span>\r\n    <span class=\"nf\">ReqwestError<\/span> <span class=\"p\">(<\/span><span class=\"nd\">#[from]<\/span> <span class=\"nn\">reqwest<\/span><span class=\"p\">::<\/span><span class=\"n\">Error<\/span><span class=\"p\">),<\/span>\r\n\r\n    <span class=\"nd\">#[error(<\/span><span class=\"s\">\"IOError({0})\"<\/span><span class=\"nd\">)]<\/span>\r\n    <span class=\"nf\">IOError<\/span> <span class=\"p\">(<\/span><span class=\"nd\">#[from]<\/span> <span class=\"nn\">std<\/span><span class=\"p\">::<\/span><span class=\"nn\">io<\/span><span class=\"p\">::<\/span><span class=\"n\">Error<\/span><span class=\"p\">),<\/span>\r\n\r\n    <span class=\"nd\">#[error(<\/span><span class=\"s\">\"ParseUrlError({0})\"<\/span><span class=\"nd\">)]<\/span>\r\n    <span class=\"nf\">ParseUrlError<\/span> <span class=\"p\">(<\/span><span class=\"nd\">#[from]<\/span> <span class=\"nn\">url<\/span><span class=\"p\">::<\/span><span class=\"n\">ParseError<\/span><span class=\"p\">),<\/span>\r\n\r\n    <span class=\"nd\">#[error(<\/span><span class=\"s\">\"ParseIntError({0})\"<\/span><span class=\"nd\">)]<\/span>\r\n    <span class=\"nf\">ParseIntError<\/span> <span class=\"p\">(<\/span><span class=\"nd\">#[from]<\/span> <span class=\"nn\">std<\/span><span class=\"p\">::<\/span><span class=\"nn\">num<\/span><span class=\"p\">::<\/span><span class=\"n\">ParseIntError<\/span><span class=\"p\">),<\/span>\r\n\r\n    <span class=\"nd\">#[error(<\/span><span class=\"s\">\"XlsxError({0})\"<\/span><span class=\"nd\">)]<\/span>\r\n    <span class=\"nf\">XlsxError<\/span> <span class=\"p\">(<\/span><span class=\"nd\">#[from]<\/span> <span class=\"nn\">xlsxwriter<\/span><span class=\"p\">::<\/span><span class=\"n\">XlsxError<\/span><span class=\"p\">),<\/span>\r\n<span class=\"p\">}<\/span>\r\n<\/code><\/pre>\n<p>\u4ee5\u4e0a\u3002<\/p>\n","protected":false},"excerpt":{"rendered":"<p>\u3061\u3087\u3063\u3068Web\u30b9\u30af\u30ec\u30a4\u30d4\u30f3\u30b0\u3059\u308b\u5fc5\u8981\u304c\u3042\u3063\u305f\u306e\u3067 scraper \u30af\u30ec\u30a4\u30c8\u3092\u4f7f\u3063\u3066\u307f\u307e\u3057\u305f\u3002 \u5e38\u8b58\u7684\u306b\u8003\u3048\u3066  [&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-45517","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v21.5 (Yoast SEO v21.5) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>- Blog - Silicon Cloud<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.silicloud.com\/zh\/blog\/45517-2\/\" \/>\n<meta property=\"og:locale\" content=\"zh_CN\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:description\" content=\"\u3061\u3087\u3063\u3068Web\u30b9\u30af\u30ec\u30a4\u30d4\u30f3\u30b0\u3059\u308b\u5fc5\u8981\u304c\u3042\u3063\u305f\u306e\u3067 scraper \u30af\u30ec\u30a4\u30c8\u3092\u4f7f\u3063\u3066\u307f\u307e\u3057\u305f\u3002 \u5e38\u8b58\u7684\u306b\u8003\u3048\u3066 [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.silicloud.com\/zh\/blog\/45517-2\/\" \/>\n<meta property=\"og:site_name\" content=\"Blog - Silicon Cloud\" \/>\n<meta property=\"article:published_time\" content=\"2023-06-01T09:03:14+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-04-30T07:23:55+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/657d5f5337434c4406cf7951\/6-0.png\" \/>\n<meta name=\"author\" content=\"\u97f5, \u79d1\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"\u4f5c\u8005\" \/>\n\t<meta name=\"twitter:data1\" content=\"\u97f5, \u79d1\" \/>\n\t<meta name=\"twitter:label2\" content=\"\u9884\u8ba1\u9605\u8bfb\u65f6\u95f4\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 \u5206\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/45517-2\/\",\"url\":\"https:\/\/www.silicloud.com\/zh\/blog\/45517-2\/\",\"name\":\"- Blog - Silicon Cloud\",\"isPartOf\":{\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#website\"},\"datePublished\":\"2023-06-01T09:03:14+00:00\",\"dateModified\":\"2024-04-30T07:23:55+00:00\",\"author\":{\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/6530331a63adef3b3443a1fab53a0e6e\"},\"inLanguage\":\"zh-Hans\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.silicloud.com\/zh\/blog\/45517-2\/\"]}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#website\",\"url\":\"https:\/\/www.silicloud.com\/zh\/blog\/\",\"name\":\"Blog - Silicon Cloud\",\"description\":\"\",\"inLanguage\":\"zh-Hans\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/6530331a63adef3b3443a1fab53a0e6e\",\"name\":\"\u97f5, \u79d1\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"zh-Hans\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/429ccb39b3fff5188bc17986222cfb0936cbadb8cc933cff04ab5ca01bd30a08?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/429ccb39b3fff5188bc17986222cfb0936cbadb8cc933cff04ab5ca01bd30a08?s=96&d=mm&r=g\",\"caption\":\"\u97f5, \u79d1\"},\"url\":\"https:\/\/www.silicloud.com\/zh\/blog\/author\/yunke\/\"},{\"@type\":\"ImageObject\",\"inLanguage\":\"zh-Hans\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/45517-2\/#local-main-organization-logo\",\"url\":\"\",\"contentUrl\":\"\",\"caption\":\"Blog - Silicon Cloud\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"- Blog - Silicon Cloud","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.silicloud.com\/zh\/blog\/45517-2\/","og_locale":"zh_CN","og_type":"article","og_description":"\u3061\u3087\u3063\u3068Web\u30b9\u30af\u30ec\u30a4\u30d4\u30f3\u30b0\u3059\u308b\u5fc5\u8981\u304c\u3042\u3063\u305f\u306e\u3067 scraper \u30af\u30ec\u30a4\u30c8\u3092\u4f7f\u3063\u3066\u307f\u307e\u3057\u305f\u3002 \u5e38\u8b58\u7684\u306b\u8003\u3048\u3066 [&hellip;]","og_url":"https:\/\/www.silicloud.com\/zh\/blog\/45517-2\/","og_site_name":"Blog - Silicon Cloud","article_published_time":"2023-06-01T09:03:14+00:00","article_modified_time":"2024-04-30T07:23:55+00:00","og_image":[{"url":"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/657d5f5337434c4406cf7951\/6-0.png"}],"author":"\u97f5, \u79d1","twitter_card":"summary_large_image","twitter_misc":{"\u4f5c\u8005":"\u97f5, \u79d1","\u9884\u8ba1\u9605\u8bfb\u65f6\u95f4":"3 \u5206"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.silicloud.com\/zh\/blog\/45517-2\/","url":"https:\/\/www.silicloud.com\/zh\/blog\/45517-2\/","name":"- Blog - Silicon Cloud","isPartOf":{"@id":"https:\/\/www.silicloud.com\/zh\/blog\/#website"},"datePublished":"2023-06-01T09:03:14+00:00","dateModified":"2024-04-30T07:23:55+00:00","author":{"@id":"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/6530331a63adef3b3443a1fab53a0e6e"},"inLanguage":"zh-Hans","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.silicloud.com\/zh\/blog\/45517-2\/"]}]},{"@type":"WebSite","@id":"https:\/\/www.silicloud.com\/zh\/blog\/#website","url":"https:\/\/www.silicloud.com\/zh\/blog\/","name":"Blog - Silicon Cloud","description":"","inLanguage":"zh-Hans"},{"@type":"Person","@id":"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/6530331a63adef3b3443a1fab53a0e6e","name":"\u97f5, \u79d1","image":{"@type":"ImageObject","inLanguage":"zh-Hans","@id":"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/429ccb39b3fff5188bc17986222cfb0936cbadb8cc933cff04ab5ca01bd30a08?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/429ccb39b3fff5188bc17986222cfb0936cbadb8cc933cff04ab5ca01bd30a08?s=96&d=mm&r=g","caption":"\u97f5, \u79d1"},"url":"https:\/\/www.silicloud.com\/zh\/blog\/author\/yunke\/"},{"@type":"ImageObject","inLanguage":"zh-Hans","@id":"https:\/\/www.silicloud.com\/zh\/blog\/45517-2\/#local-main-organization-logo","url":"","contentUrl":"","caption":"Blog - Silicon Cloud"}]}},"_links":{"self":[{"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/posts\/45517","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/comments?post=45517"}],"version-history":[{"count":2,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/posts\/45517\/revisions"}],"predecessor-version":[{"id":92713,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/posts\/45517\/revisions\/92713"}],"wp:attachment":[{"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/media?parent=45517"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/categories?post=45517"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/tags?post=45517"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}