{"id":22633,"date":"2024-03-15T23:54:19","date_gmt":"2024-03-15T23:54:19","guid":{"rendered":"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/"},"modified":"2024-03-21T23:20:57","modified_gmt":"2024-03-21T23:20:57","slug":"how-to-use-xpath-to-extract-li-tags-in-python","status":"publish","type":"post","link":"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/","title":{"rendered":"How to use XPath to extract li tags in Python?"},"content":{"rendered":"<p>In Python, you can use XPath to retrieve the li tags from an HTML or XML document. To start, you will need to import the necessary libraries, such as the lxml library.<\/p>\n<p>Here is example code that uses XPath to extract li tags:<\/p>\n<pre class=\"post-pre\"><code><span class=\"hljs-keyword\">from<\/span> lxml <span class=\"hljs-keyword\">import<\/span> etree\r\n\r\n<span class=\"hljs-comment\"># Example string holding an HTML or XML document<\/span>\r\nhtml = <span class=\"hljs-string\">\"\"\"\r\n&lt;html&gt;\r\n  &lt;body&gt;\r\n    &lt;ul&gt;\r\n      &lt;li&gt;Item 1&lt;\/li&gt;\r\n      &lt;li&gt;Item 2&lt;\/li&gt;\r\n      &lt;li&gt;Item 3&lt;\/li&gt;\r\n    &lt;\/ul&gt;\r\n  &lt;\/body&gt;\r\n&lt;\/html&gt;\r\n\"\"\"<\/span>\r\n\r\n<span class=\"hljs-comment\"># Create an HTML parser<\/span>\r\nparser = etree.HTMLParser()\r\n\r\n<span class=\"hljs-comment\"># Parse the string into an element tree that can be queried with XPath<\/span>\r\ntree = etree.fromstring(html, parser)\r\n\r\n<span class=\"hljs-comment\"># Use an XPath expression to select the li tags<\/span>\r\nli_tags = tree.xpath(<span class=\"hljs-string\">'\/\/li'<\/span>)\r\n\r\n<span class=\"hljs-comment\"># Iterate over the selected li tags and print their text<\/span>\r\n<span class=\"hljs-keyword\">for<\/span> li <span class=\"hljs-keyword\">in<\/span> li_tags:\r\n    <span 
class=\"hljs-built_in\">print<\/span>(li.text)\r\n<\/code><\/pre>\n<p>Running the above code prints:<\/p>\n<pre class=\"post-pre\"><code>Item 1\r\nItem 2\r\nItem 3\r\n<\/code><\/pre>\n<p>In the XPath expression &#8216;\/\/li&#8217;, the leading \/\/ matches nodes at any depth in the document, and li restricts the match to li elements. Therefore, tree.xpath(&#8216;\/\/li&#8217;) returns a list containing every li element in the document. You can then iterate through this list and read the text attribute of each element to retrieve its content.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>In Python, you can use XPath to retrieve the li tags from an HTML or XML document. To start, you will need to import the necessary libraries, such as the lxml library. Here is example code that uses XPath to extract li tags: from lxml import etree # Example string holding an HTML or XML document html = &#8220;&#8221;&#8221; &lt;html&gt; &lt;body&gt; &lt;ul&gt; [&hellip;]<\/p>\n","protected":false},"author":12,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_import_markdown_pro_load_document_selector":0,"_import_markdown_pro_submit_text_textarea":"","footnotes":""},"categories":[1],"tags":[],"class_list":["post-22633","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v21.5 (Yoast SEO v21.5) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>How to use XPath to extract li tags in Python? 
- Blog - Silicon Cloud<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"How to use XPath to extract li tags in Python?\" \/>\n<meta property=\"og:description\" content=\"In Python, you can use XPath to retrieve the li tags from an HTML or XML document. To start, you will need to import the necessary libraries, such as the lxml library. Here is example code that uses XPath to extract li tags: from lxml import etree # Example string holding an HTML or XML document html = &quot;&quot;&quot; &lt;html&gt; &lt;body&gt; &lt;ul&gt; [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/\" \/>\n<meta property=\"og:site_name\" content=\"Blog - Silicon Cloud\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/SiliCloudGlobal\/\" \/>\n<meta property=\"article:published_time\" content=\"2024-03-15T23:54:19+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-03-21T23:20:57+00:00\" \/>\n<meta name=\"author\" content=\"Liam\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@SiliCloudGlobal\" \/>\n<meta name=\"twitter:site\" content=\"@SiliCloudGlobal\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Liam\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/\"},\"author\":{\"name\":\"Liam\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/23786905eb7b377f45ddb01c17da7671\"},\"headline\":\"How to use XPath to extract li tags in Python?\",\"datePublished\":\"2024-03-15T23:54:19+00:00\",\"dateModified\":\"2024-03-21T23:20:57+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/\"},\"wordCount\":105,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/#organization\"},\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/\",\"url\":\"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/\",\"name\":\"How to use XPath to extract li tags in Python? 
- Blog - Silicon Cloud\",\"isPartOf\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/#website\"},\"datePublished\":\"2024-03-15T23:54:19+00:00\",\"dateModified\":\"2024-03-21T23:20:57+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.silicloud.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"How to use XPath to extract li tags in Python?\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#website\",\"url\":\"https:\/\/www.silicloud.com\/blog\/\",\"name\":\"Silicon Cloud Blog\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/#organization\"},\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#organization\",\"name\":\"Silicon Cloud Blog\",\"url\":\"https:\/\/www.silicloud.com\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.silicloud.com\/blog\/wp-content\/uploads\/2023\/11\/EN-SILICON-Full.png\",\"contentUrl\":\"https:\/\/www.silicloud.com\/blog\/wp-content\/uploads\/2023\/11\/EN-SILICON-Full.png\",\"width\":1024,\"height\":1024,\"caption\":\"Silicon Cloud 
Blog\"},\"image\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/SiliCloudGlobal\/\",\"https:\/\/twitter.com\/SiliCloudGlobal\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/23786905eb7b377f45ddb01c17da7671\",\"name\":\"Liam\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/8d37ed3e7f770dde8bf069ba0b4298688028c3abaacf1131742fc1352d174ebd?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/8d37ed3e7f770dde8bf069ba0b4298688028c3abaacf1131742fc1352d174ebd?s=96&d=mm&r=g\",\"caption\":\"Liam\"},\"sameAs\":[\"http:\/\/Wilson\"],\"url\":\"https:\/\/www.silicloud.com\/blog\/author\/liamwilson\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"How to use XPath to extract li tags in Python? - Blog - Silicon Cloud","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/","og_locale":"en_US","og_type":"article","og_title":"How to use XPath to extract li tags in Python?","og_description":"In Python, you can use XPath to retrieve the li tags from an HTML or XML document. To start, you will need to import the necessary libraries, such as the lxml library. 
Here is example code that uses XPath to extract li tags: from lxml import etree # Example string holding an HTML or XML document html = \"\"\" &lt;html&gt; &lt;body&gt; &lt;ul&gt; [&hellip;]","og_url":"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/","og_site_name":"Blog - Silicon Cloud","article_publisher":"https:\/\/www.facebook.com\/SiliCloudGlobal\/","article_published_time":"2024-03-15T23:54:19+00:00","article_modified_time":"2024-03-21T23:20:57+00:00","author":"Liam","twitter_card":"summary_large_image","twitter_creator":"@SiliCloudGlobal","twitter_site":"@SiliCloudGlobal","twitter_misc":{"Written by":"Liam","Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/#article","isPartOf":{"@id":"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/"},"author":{"name":"Liam","@id":"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/23786905eb7b377f45ddb01c17da7671"},"headline":"How to use XPath to extract li tags in Python?","datePublished":"2024-03-15T23:54:19+00:00","dateModified":"2024-03-21T23:20:57+00:00","mainEntityOfPage":{"@id":"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/"},"wordCount":105,"commentCount":0,"publisher":{"@id":"https:\/\/www.silicloud.com\/blog\/#organization"},"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/","url":"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/","name":"How to use XPath to extract li tags in Python? 
- Blog - Silicon Cloud","isPartOf":{"@id":"https:\/\/www.silicloud.com\/blog\/#website"},"datePublished":"2024-03-15T23:54:19+00:00","dateModified":"2024-03-21T23:20:57+00:00","breadcrumb":{"@id":"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.silicloud.com\/blog\/how-to-use-xpath-to-extract-li-tags-in-python\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.silicloud.com\/blog\/"},{"@type":"ListItem","position":2,"name":"How to use XPath to extract li tags in Python?"}]},{"@type":"WebSite","@id":"https:\/\/www.silicloud.com\/blog\/#website","url":"https:\/\/www.silicloud.com\/blog\/","name":"Silicon Cloud Blog","description":"","publisher":{"@id":"https:\/\/www.silicloud.com\/blog\/#organization"},"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.silicloud.com\/blog\/#organization","name":"Silicon Cloud Blog","url":"https:\/\/www.silicloud.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.silicloud.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.silicloud.com\/blog\/wp-content\/uploads\/2023\/11\/EN-SILICON-Full.png","contentUrl":"https:\/\/www.silicloud.com\/blog\/wp-content\/uploads\/2023\/11\/EN-SILICON-Full.png","width":1024,"height":1024,"caption":"Silicon Cloud 
Blog"},"image":{"@id":"https:\/\/www.silicloud.com\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/SiliCloudGlobal\/","https:\/\/twitter.com\/SiliCloudGlobal"]},{"@type":"Person","@id":"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/23786905eb7b377f45ddb01c17da7671","name":"Liam","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/8d37ed3e7f770dde8bf069ba0b4298688028c3abaacf1131742fc1352d174ebd?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/8d37ed3e7f770dde8bf069ba0b4298688028c3abaacf1131742fc1352d174ebd?s=96&d=mm&r=g","caption":"Liam"},"sameAs":["http:\/\/Wilson"],"url":"https:\/\/www.silicloud.com\/blog\/author\/liamwilson\/"}]}},"_links":{"self":[{"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/posts\/22633","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/users\/12"}],"replies":[{"embeddable":true,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/comments?post=22633"}],"version-history":[{"count":1,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/posts\/22633\/revisions"}],"predecessor-version":[{"id":56565,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/posts\/22633\/revisions\/56565"}],"wp:attachment":[{"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/media?parent=22633"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/categories?post=22633"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/tags?post=22633"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}
}