Data Screening

The ds_screener() function screens a data frame and returns details such as variable names, classes, levels and missing values. Its plot() method creates bar plots that show the percentage of missing observations for each variable in the data frame.

ds_screener() plot(<ds_screener>)

Screen data
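To make the kind of information a screen reports concrete, here is a minimal base R sketch. It is not descriptr's implementation: `screen_sketch` and the `toy` data frame are hypothetical names introduced only for illustration, and the actual ds_screener() output may be structured differently.

```r
# Base R sketch of a data-frame screen; NOT descriptr's implementation.
# `screen_sketch` is a hypothetical helper used only for illustration.
screen_sketch <- function(data) {
  data.frame(
    variable    = names(data),
    # first class of each column (e.g. "numeric", "factor")
    class       = vapply(data, function(x) class(x)[1], character(1)),
    # number of levels for factors, NA for other column types
    n_levels    = vapply(data, function(x) if (is.factor(x)) nlevels(x) else NA_integer_, integer(1)),
    # count and percentage of missing observations per column
    missing     = vapply(data, function(x) sum(is.na(x)), integer(1)),
    missing_pct = vapply(data, function(x) 100 * mean(is.na(x)), numeric(1)),
    row.names   = NULL
  )
}

# Hypothetical toy data with one missing value in each column
toy <- data.frame(
  score = c(10, NA, 30, 40),
  group = factor(c("a", "b", NA, "a"))
)

screened <- screen_sketch(toy)
print(screened)
```

The `missing_pct` column holds the quantity that the bar plot described above visualizes, one bar per variable.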

Continuous Data

The following functions ease the process of generating and visualizing descriptive statistics for continuous data.

ds_auto_summary_stats()

Descriptive statistics and frequency tables

ds_extreme_obs()

Extreme observations

ds_freq_table() plot(<ds_freq_table>)

Frequency table

ds_measures_location()

Measures of location

ds_measures_symmetry()

Measures of symmetry

ds_measures_variation()

Measures of variation

ds_percentiles()

Percentiles

ds_summary_stats()

Descriptive statistics

ds_tidy_stats()

Tidy descriptive statistics

Categorical Data

The following functions ease the process of generating and visualizing descriptive statistics for categorical data.

ds_auto_freq_table() ds_auto_cross_table()

Multiple One & Two Way Tables

ds_cross_table() plot(<ds_cross_table>) ds_twoway_table()

Two way table

ds_freq_table() plot(<ds_freq_table>)

Frequency table

Grouped Summary

The following functions generate grouped summary statistics.

ds_auto_group_summary()

Tabulation

ds_group_summary() plot(<ds_group_summary>)

Groupwise descriptive statistics

ds_group_summary_interact()

Category wise descriptive statistics

Visualization

The following functions generate plots for different data types.

ds_plot_bar()

Generate bar plots

ds_plot_bar_grouped()

Generate grouped bar plots

ds_plot_bar_stacked()

Generate stacked bar plots

ds_plot_box_group()

Compare distributions

ds_plot_box_single()

Generate box plots

ds_plot_density()

Generate density plots

ds_plot_histogram()

Generate histograms

ds_plot_scatter()

Generate scatter plots

Other Functions

Helper functions that compute the individual statistics reported in the summaries above.

ds_css()

Corrected Sum of Squares

ds_cvar()

Coefficient of Variation

ds_gmean()

Geometric Mean

ds_hmean()

Harmonic Mean

ds_kurtosis()

Kurtosis

ds_mdev()

Mean Absolute Deviation

ds_mode()

Mode

ds_range()

Range

ds_rindex()

Index Values

ds_skewness()

Skewness

ds_std_error()

Standard error of mean

ds_tailobs()

Tail Observations
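Several of the statistics listed above have short textbook definitions, sketched here in base R. These are assumptions about what each name conventionally means, not descriptr's source code: the `*_sketch` helper names are hypothetical, and the package's own functions may differ in NA handling, scaling (for example, whether the coefficient of variation is reported as a percentage) or edge cases.

```r
# Textbook definitions in base R; NOT descriptr's implementation.
# All helper names below are hypothetical and used only for illustration.
x <- c(2, 4, 4, 5, 10)

# Corrected sum of squares: squared deviations from the mean
css_sketch <- function(x) sum((x - mean(x))^2)

# Coefficient of variation, expressed as a percentage (assumed scaling)
cvar_sketch <- function(x) 100 * sd(x) / mean(x)

# Geometric mean; only defined for strictly positive values
gmean_sketch <- function(x) exp(mean(log(x)))

# Harmonic mean; only defined for non-zero values
hmean_sketch <- function(x) length(x) / sum(1 / x)

# Mean absolute deviation around the mean
mdev_sketch <- function(x) mean(abs(x - mean(x)))

# Standard error of the mean
se_sketch <- function(x) sd(x) / sqrt(length(x))

results <- c(
  css   = css_sketch(x),
  cvar  = cvar_sketch(x),
  gmean = gmean_sketch(x),
  hmean = hmean_sketch(x),
  mdev  = mdev_sketch(x),
  se    = se_sketch(x)
)
print(results)
```

For this toy vector the mean is 5, so the corrected sum of squares is 36, the sample standard deviation is 3, and the mean absolute deviation is 2.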

Data

descriptr comes with built-in datasets. The hsb dataset is used to illustrate the functions as well as to run tests.

hsb

High School and Beyond Data Set

mtcarz

mtcarz