{"id":21813,"date":"2024-03-15T22:37:58","date_gmt":"2024-03-15T22:37:58","guid":{"rendered":"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/"},"modified":"2024-03-21T21:22:57","modified_gmt":"2024-03-21T21:22:57","slug":"how-to-merge-two-datasets-in-r-and-remove-duplicates","status":"publish","type":"post","link":"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/","title":{"rendered":"How to merge two datasets in R and remove duplicates?"},"content":{"rendered":"<p>In R, you can combine two datasets using the merge() function and remove duplicates using the unique() function.<\/p>\n<p>Here is example code for merging two datasets and removing duplicates.<\/p>\n<pre class=\"post-pre\"><code><span class=\"hljs-comment\"># Create two datasets<\/span>\r\ndf1 <span class=\"hljs-operator\">&lt;-<\/span> data.frame<span class=\"hljs-punctuation\">(<\/span>id <span class=\"hljs-operator\">=<\/span> <span class=\"hljs-built_in\">c<\/span><span class=\"hljs-punctuation\">(<\/span><span class=\"hljs-number\">1<\/span><span class=\"hljs-punctuation\">,<\/span> <span class=\"hljs-number\">2<\/span><span class=\"hljs-punctuation\">,<\/span> <span class=\"hljs-number\">3<\/span><span class=\"hljs-punctuation\">)<\/span><span class=\"hljs-punctuation\">,<\/span> name <span class=\"hljs-operator\">=<\/span> <span class=\"hljs-built_in\">c<\/span><span class=\"hljs-punctuation\">(<\/span><span class=\"hljs-string\">\"Alice\"<\/span><span class=\"hljs-punctuation\">,<\/span> <span class=\"hljs-string\">\"Bob\"<\/span><span class=\"hljs-punctuation\">,<\/span> <span class=\"hljs-string\">\"Charlie\"<\/span><span class=\"hljs-punctuation\">)<\/span><span class=\"hljs-punctuation\">)<\/span>\r\ndf2 <span class=\"hljs-operator\">&lt;-<\/span> data.frame<span class=\"hljs-punctuation\">(<\/span>id <span class=\"hljs-operator\">=<\/span> <span class=\"hljs-built_in\">c<\/span><span 
class=\"hljs-punctuation\">(<\/span><span class=\"hljs-number\">2<\/span><span class=\"hljs-punctuation\">,<\/span> <span class=\"hljs-number\">3<\/span><span class=\"hljs-punctuation\">,<\/span> <span class=\"hljs-number\">4<\/span><span class=\"hljs-punctuation\">)<\/span><span class=\"hljs-punctuation\">,<\/span> age <span class=\"hljs-operator\">=<\/span> <span class=\"hljs-built_in\">c<\/span><span class=\"hljs-punctuation\">(<\/span><span class=\"hljs-number\">25<\/span><span class=\"hljs-punctuation\">,<\/span> <span class=\"hljs-number\">30<\/span><span class=\"hljs-punctuation\">,<\/span> <span class=\"hljs-number\">35<\/span><span class=\"hljs-punctuation\">)<\/span><span class=\"hljs-punctuation\">)<\/span>\r\n\r\n<span class=\"hljs-comment\"># Merge the datasets with merge()<\/span>\r\nmerged_df <span class=\"hljs-operator\">&lt;-<\/span> merge<span class=\"hljs-punctuation\">(<\/span>df1<span class=\"hljs-punctuation\">,<\/span> df2<span class=\"hljs-punctuation\">,<\/span> by <span class=\"hljs-operator\">=<\/span> <span class=\"hljs-string\">\"id\"<\/span><span class=\"hljs-punctuation\">,<\/span> <span class=\"hljs-built_in\">all<\/span> <span class=\"hljs-operator\">=<\/span> <span class=\"hljs-literal\">TRUE<\/span><span class=\"hljs-punctuation\">)<\/span>\r\n\r\n<span class=\"hljs-comment\"># Remove duplicate rows with unique()<\/span>\r\nunique_df <span class=\"hljs-operator\">&lt;-<\/span> unique<span class=\"hljs-punctuation\">(<\/span>merged_df<span class=\"hljs-punctuation\">)<\/span>\r\n<\/code><\/pre>\n<p>In the code above, two datasets, df1 and df2, are created first. The merge() function then joins them on the id column; all = TRUE requests a full outer join, so rows from both datasets are kept and values with no match are filled with NA. 
The merged result is stored in merged_df.<\/p>\n<p>Finally, the unique() function removes fully duplicated rows from merged_df, and the result is stored in unique_df. In this example every id appears only once, so merged_df has no duplicate rows and unique_df contains the same rows.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>In R, you can combine two datasets using the merge() function and remove duplicates using the unique() function. Here is example code for merging two datasets and removing duplicates. # Create two datasets df1 &lt;- data.frame(id = c(1, 2, 3), name = c(&quot;Alice&quot;, &quot;Bob&quot;, &quot;Charlie&quot;)) df2 &lt;- data.frame(id = c(2, 3, 4), age = c(25, 30, [&hellip;]<\/p>\n","protected":false},"author":8,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_import_markdown_pro_load_document_selector":0,"_import_markdown_pro_submit_text_textarea":"","footnotes":""},"categories":[1],"tags":[],"class_list":["post-21813","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v21.5 (Yoast SEO v21.5) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>How to merge two datasets in R and remove duplicates? - Blog - Silicon Cloud<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"How to merge two datasets in R and remove duplicates?\" \/>\n<meta property=\"og:description\" content=\"In R, you can combine two datasets using the merge() function and remove duplicates using the unique() function. Here is example code for merging two datasets and removing duplicates. 
# Create two datasets df1 &lt;- data.frame(id = c(1, 2, 3), name = c(&quot;Alice&quot;, &quot;Bob&quot;, &quot;Charlie&quot;)) df2 &lt;- data.frame(id = c(2, 3, 4), age = c(25, 30, [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/\" \/>\n<meta property=\"og:site_name\" content=\"Blog - Silicon Cloud\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/SiliCloudGlobal\/\" \/>\n<meta property=\"article:published_time\" content=\"2024-03-15T22:37:58+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-03-21T21:22:57+00:00\" \/>\n<meta name=\"author\" content=\"William Carter\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@SiliCloudGlobal\" \/>\n<meta name=\"twitter:site\" content=\"@SiliCloudGlobal\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"William Carter\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/\"},\"author\":{\"name\":\"William Carter\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/f697031891aacefc4b681d139781d3c0\"},\"headline\":\"How to merge two datasets in R and remove duplicates?\",\"datePublished\":\"2024-03-15T22:37:58+00:00\",\"dateModified\":\"2024-03-21T21:22:57+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/\"},\"wordCount\":111,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/#organization\"},\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/\",\"url\":\"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/\",\"name\":\"How to merge two datasets in R and remove duplicates? 
- Blog - Silicon Cloud\",\"isPartOf\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/#website\"},\"datePublished\":\"2024-03-15T22:37:58+00:00\",\"dateModified\":\"2024-03-21T21:22:57+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.silicloud.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"How to merge two datasets in R and remove duplicates?\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#website\",\"url\":\"https:\/\/www.silicloud.com\/blog\/\",\"name\":\"Silicon Cloud Blog\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/#organization\"},\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#organization\",\"name\":\"Silicon Cloud Blog\",\"url\":\"https:\/\/www.silicloud.com\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.silicloud.com\/blog\/wp-content\/uploads\/2023\/11\/EN-SILICON-Full.png\",\"contentUrl\":\"https:\/\/www.silicloud.com\/blog\/wp-content\/uploads\/2023\/11\/EN-SILICON-Full.png\",\"width\":1024,\"height\":1024,\"caption\":\"Silicon Cloud 
Blog\"},\"image\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/SiliCloudGlobal\/\",\"https:\/\/twitter.com\/SiliCloudGlobal\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/f697031891aacefc4b681d139781d3c0\",\"name\":\"William Carter\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/1786698071dd8d74bec894b512f9e3c610c3a2a32985f67e688976cee3c8bbef?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/1786698071dd8d74bec894b512f9e3c610c3a2a32985f67e688976cee3c8bbef?s=96&d=mm&r=g\",\"caption\":\"William Carter\"},\"url\":\"https:\/\/www.silicloud.com\/blog\/author\/williamcarter\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"How to merge two datasets in R and remove duplicates? - Blog - Silicon Cloud","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/","og_locale":"en_US","og_type":"article","og_title":"How to merge two datasets in R and remove duplicates?","og_description":"In R, you can combine two datasets using the merge() function and remove duplicates using the unique() function. Here is example code for merging two datasets and removing duplicates. 
# Create two datasets df1 &lt;- data.frame(id = c(1, 2, 3), name = c(\"Alice\", \"Bob\", \"Charlie\")) df2 &lt;- data.frame(id = c(2, 3, 4), age = c(25, 30, [&hellip;]","og_url":"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/","og_site_name":"Blog - Silicon Cloud","article_publisher":"https:\/\/www.facebook.com\/SiliCloudGlobal\/","article_published_time":"2024-03-15T22:37:58+00:00","article_modified_time":"2024-03-21T21:22:57+00:00","author":"William Carter","twitter_card":"summary_large_image","twitter_creator":"@SiliCloudGlobal","twitter_site":"@SiliCloudGlobal","twitter_misc":{"Written by":"William Carter","Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/#article","isPartOf":{"@id":"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/"},"author":{"name":"William Carter","@id":"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/f697031891aacefc4b681d139781d3c0"},"headline":"How to merge two datasets in R and remove duplicates?","datePublished":"2024-03-15T22:37:58+00:00","dateModified":"2024-03-21T21:22:57+00:00","mainEntityOfPage":{"@id":"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/"},"wordCount":111,"commentCount":0,"publisher":{"@id":"https:\/\/www.silicloud.com\/blog\/#organization"},"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/","url":"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/","name":"How to merge two datasets in R and remove duplicates? 
- Blog - Silicon Cloud","isPartOf":{"@id":"https:\/\/www.silicloud.com\/blog\/#website"},"datePublished":"2024-03-15T22:37:58+00:00","dateModified":"2024-03-21T21:22:57+00:00","breadcrumb":{"@id":"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.silicloud.com\/blog\/how-to-merge-two-datasets-in-r-and-remove-duplicates\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.silicloud.com\/blog\/"},{"@type":"ListItem","position":2,"name":"How to merge two datasets in R and remove duplicates?"}]},{"@type":"WebSite","@id":"https:\/\/www.silicloud.com\/blog\/#website","url":"https:\/\/www.silicloud.com\/blog\/","name":"Silicon Cloud Blog","description":"","publisher":{"@id":"https:\/\/www.silicloud.com\/blog\/#organization"},"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.silicloud.com\/blog\/#organization","name":"Silicon Cloud Blog","url":"https:\/\/www.silicloud.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.silicloud.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.silicloud.com\/blog\/wp-content\/uploads\/2023\/11\/EN-SILICON-Full.png","contentUrl":"https:\/\/www.silicloud.com\/blog\/wp-content\/uploads\/2023\/11\/EN-SILICON-Full.png","width":1024,"height":1024,"caption":"Silicon Cloud Blog"},"image":{"@id":"https:\/\/www.silicloud.com\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/SiliCloudGlobal\/","https:\/\/twitter.com\/SiliCloudGlobal"]},{"@type":"Person","@id":"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/f697031891aacefc4b681d139781d3c0","name":"William 
Carter","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/1786698071dd8d74bec894b512f9e3c610c3a2a32985f67e688976cee3c8bbef?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/1786698071dd8d74bec894b512f9e3c610c3a2a32985f67e688976cee3c8bbef?s=96&d=mm&r=g","caption":"William Carter"},"url":"https:\/\/www.silicloud.com\/blog\/author\/williamcarter\/"}]}},"_links":{"self":[{"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/posts\/21813","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/comments?post=21813"}],"version-history":[{"count":1,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/posts\/21813\/revisions"}],"predecessor-version":[{"id":55696,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/posts\/21813\/revisions\/55696"}],"wp:attachment":[{"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/media?parent=21813"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/categories?post=21813"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/tags?post=21813"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}