How to select data based on conditions in R language?

In R programming language, you can use logical operators and conditional statements to select data based on certain conditions. Here are two commonly used methods:

  1. Use logical operators and subset operators []:
# 创建一个数据框
data <- data.frame(
  name = c("Alice", "Bob", "Charlie", "David"),
  age = c(25, 30, 35, 40),
  gender = c("Female", "Male", "Male", "Male")
)

# 选取age大于30的行
subset_data <- data[data$age > 30, ]
print(subset_data)

output:

    name age gender
3 Charlie  35   Male
4   David  40   Male
  1. By using logical operators and the filter() function:
# 安装并加载dplyr包
install.packages("dplyr")
library(dplyr)

# 创建一个数据框
data <- data.frame(
  name = c("Alice", "Bob", "Charlie", "David"),
  age = c(25, 30, 35, 40),
  gender = c("Female", "Male", "Male", "Male")
)

# 选取age大于30的行
subset_data <- filter(data, age > 30)
print(subset_data)

Output:

    name age gender
1 Charlie  35   Male
2   David  40   Male

Both methods can achieve the function of selecting data based on conditions, the choice of which method depends on personal preference and project requirements.

bannerAds