Getting the descriptive statistics in RStudio is quick for one or multiple variables. Descriptive statistics are measures we can use to learn more about the distribution of observations in variables for analysis, transforming variables, and reporting. Each descriptive statistic has their own formula that we will not be covering in this guide, but we will walk through the interpretation of each.
Below is the code for calculating the descriptive statistics of the variable wages.
The output chart shows us descriptive statistics and missing values. We are going to focus on a couple of descriptive statistics in this output. Moving from left to right, we can see the Min. (minimum), 1st Qu (first quartile), Median, Mean, 3rd Qu (third quartile), Max. (maximum), and NA's (missing values).
The average wage value in this dataset is 15.553 which is below the middle value of 26.11 ((49.92 – 2.30)/2) , indicating the distribution of the data is skewed toward lower values.