rCopy code# Summary statistics
summary(data)
# Calculate mean height and weight
mean_height <- mean(data$height)
mean_weight <- mean(data$weight)
cat("Mean Height:", mean_height, "\n")
cat("Mean Weight:", mean_weight, "\n")
Author: saqibkhan
-
Basic Data Summary
-
Analyzing and Visualizing a Dataset
Step 1: Create a Dataset
rCopy code# Create a sample dataset set.seed(123) # For reproducibility data <- data.frame( id = 1:100, age = sample(18:65, 100, replace = TRUE), height = rnorm(100, mean = 170, sd = 10), weight = rnorm(100, mean = 70, sd = 15) ) # View the first few rows of the dataset head(data) -
How to chain several operations together in R?
We can chain several operations in R by using the pipe operator (
%>%) provided by the tidyverse collection. Using this operator allows creating a pipeline of functions where the output of the first function is passed as the input into the second function and so on, until the pipeline ends. This eliminates the need for creating additional variables and significantly enhances the overall code readability.An example of using the pipe operator on a data frame:
df <- data.frame(a=1:4, b=11:14, c=21:24) print(df) df_new <- df %>% select(a, b) %>% filter(a > 2) print(df_new)Powered ByOutput:
a b c 1 1 11 21 2 2 12 22 3 3 13 23 4 4 14 24 a b 1 3 12 2 4 13 -
How to transpose two-dimensional data in R?
We can transpose a data frame or a matrix in R so that the columns become the rows and vice versa. For this purpose, we need to use the
t()function of the base R. For example:df <- data.frame(col_1=c(10, 20, 30), col_2=c(11, 22, 33)) print(df) transposed_df <- t(df) print(transposed_df)Powered ByOutput:
col_1 col_2 1 10 11 2 20 22 3 30 33
col_1 10 20 30 col_2 11 22 33[,1] [,2] [,3] -
Visualization in Publications
R’s visualizations are often favored in academic publications, as they can produce high-quality graphics suitable for journals and reports.
-
Popularity in Academia
R is widely used in academia and research, particularly in statistics, bioinformatics, and social sciences, due to its strong statistical capabilities and flexibility.
-
How to concatenate strings in R?
We can concatenate two or more strings in R by using the
paste()orcat()functions. The first approach is more popular. Both functions take in any number of strings to be concatenated and can also take in an optional parametersep(along with some other optional parameters)—a character or a sequence of characters that will separate attached strings in the resulting string (a white space by default). -
Community and Support
R has a large and active community. Resources such as forums, mailing lists, and user groups help users find solutions and share knowledge.
-
Integration with Other Languages
R can easily integrate with languages like Python, C++, and SQL, allowing users to leverage the strengths of multiple programming environments.
-
How to merge data in R?
1. Using the
cbind()function—only if the data frames have the same number of rows, and the records are the same and in the same order:df <- cbind(df1, df2)Powered By2. Using the
rbind()function to combine the data frames vertically—only if they have an equal number of identically named columns of the same data type and appearing in the same order:df <- rbind(df1, df2)Powered By3. Using the
merge()function to merge data frames by a column in common, usually an ID column:- Inner join:
df <- merge(df1, df2, by="ID")Powered By- Left join:
df <- merge(df1, df2, by="ID", all.x=TRUE)Powered By- Right join:
df <- merge(df1, df2, by="ID", all.y=TRUE)Powered By- Outer join:
df <- merge(df1, df2, by="ID", all=TRUE)Powered By4. Using the
join()function of the dplyr package to merge data frames by a column in common, usually an ID column:df <- join(df1, df2, by="ID", type="type_of_join")Powered ByThe
typeparameter takes in one of the following values:inner,left,right, orfull.