# Data Exploration with Visualization

## Data Exploration with Visualization

Before you start exploring your sample data, look closely at the samples to see whether any erroneous observations were recorded. If so, you may remove them and obtain replacement samples from other observations. Once you have a clean set of samples, you will visualize the sample dataset with histograms and a stem-and-leaf diagram (Section 2.3). Based on the relative-frequency histogram of your sample data, you will determine the shape of the sample distribution (Section 2.4).

Items to report in this task:

1. Histograms of the dataset (both frequency histogram and relative-frequency histogram)

2. Stem-and-Leaf diagram of the dataset

3. The shape of the distribution of the sample data, identified from the relative-frequency histogram
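If you prefer a scripting tool over a spreadsheet, the frequency and relative-frequency counts behind both histograms can be sketched as below. This is only a minimal illustration: the `sample` values are made up, and the helper name `frequency_table`, the class width of 10, and the starting point of 50 are assumptions you would replace with choices that suit your own data.

```python
import math

# Hypothetical sample values, for illustration only (not real data).
sample = [52, 55, 58, 61, 61, 63, 64, 67, 68, 70, 72, 72, 75, 79, 84]


def frequency_table(data, class_width, start):
    """Count observations in the classes [start, start + width), [start + width, ...).

    Assumes start <= min(data). Returns one row per class:
    (lower limit, upper limit, frequency, relative frequency).
    """
    n_classes = math.floor((max(data) - start) / class_width) + 1
    counts = [0] * n_classes
    for x in data:
        counts[int((x - start) // class_width)] += 1

    rows = []
    for i, count in enumerate(counts):
        lower = start + i * class_width
        rows.append((lower, lower + class_width, count, count / len(data)))
    return rows


table = frequency_table(sample, class_width=10, start=50)
for lower, upper, freq, rel in table:
    # One text bar per class; the bar length equals the class frequency.
    print(f"{lower:>3} to under {upper:<3} freq={freq:>2} rel={rel:.3f} {'*' * freq}")
```

The frequency and relative-frequency histograms have the same shape because every relative frequency is the frequency divided by the same sample size; only the vertical scale changes.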

Tips: You may search online for instructions on creating histograms in Excel. You may use any software tool to create the graphs.
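A stem-and-leaf diagram can also be built with a few lines of code. The sketch below assumes non-negative whole-number observations, with the units digit as the leaf and the remaining digits as the stem; the `sample` values and the helper name `stem_and_leaf` are hypothetical.

```python
from collections import defaultdict

# Hypothetical sample values, for illustration only (not real data).
sample = [52, 55, 58, 61, 61, 63, 64, 67, 68, 70, 72, 72, 75, 79, 84]


def stem_and_leaf(data):
    """Map each stem (tens digit and above) to its sorted leaves (units digits)."""
    stems = defaultdict(list)
    for x in sorted(data):
        stem, leaf = divmod(x, 10)
        stems[stem].append(leaf)
    # Include empty stems so that gaps in the data remain visible.
    return {s: stems.get(s, []) for s in range(min(stems), max(stems) + 1)}


diagram = stem_and_leaf(sample)
for stem, leaves in diagram.items():
    print(f"{stem:>2} | {' '.join(str(leaf) for leaf in leaves)}")
```

If your data have decimals or span a very wide range, you would need to round or rescale the values first, which this sketch does not do.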

## Data Exploration with Descriptive Measures

In Task #2, you explored the sample data visually; the graphs gave you a sense of the overall distribution of the sample data. Here, you will summarize the sample data numerically, in particular with the sample mean and standard deviation, along with other measures.

Items to report in this task:

1. The sample mean, median, and mode of the sample data (Section 3.1 Measures of Center)

2. The sample variance and standard deviation of the sample data (Section 3.2 Measures of Variation)

3. The five-number summary of the sample data (Section 3.4)

4. A box-plot based on the five-number summary in Item 3 (Extra Credit: 5 points)
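As one possible way to check hand or spreadsheet calculations for Items 1 and 2, Python's standard `statistics` module computes the measures of center and variation. The `sample` values below are made up for illustration. Note that `statistics.variance` and `statistics.stdev` use the sample formulas, which divide by n − 1.

```python
import statistics

# Hypothetical sample values, for illustration only (not real data).
sample = [52, 55, 58, 61, 61, 63, 64, 67, 68, 70, 72, 72, 75, 79, 84]

# Measures of center
mean = statistics.mean(sample)
median = statistics.median(sample)
modes = statistics.multimode(sample)  # list of all most-frequent values

# Measures of variation (sample versions, dividing by n - 1)
variance = statistics.variance(sample)
std_dev = statistics.stdev(sample)

print(f"mean = {mean:.2f}, median = {median}, mode(s) = {modes}")
print(f"sample variance = {variance:.2f}, sample standard deviation = {std_dev:.2f}")
```

`multimode` is used here instead of `mode` because a sample can have more than one most-frequent value, and reporting only one of them would be incomplete.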

Tips: Box-plots can also be created with Excel.
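The five-number summary that a box-plot is drawn from can be sketched as follows. Quartile conventions differ between textbooks and software, so treat this as an assumption: the helper below (a hypothetical `five_number_summary`) takes the quartiles as the medians of the lower and upper halves of the ordered data, including the overall median in both halves when the sample size is odd. Check which convention your course uses before reporting results.

```python
import statistics

# Hypothetical sample values, for illustration only (not real data).
sample = [52, 55, 58, 61, 61, 63, 64, 67, 68, 70, 72, 72, 75, 79, 84]


def five_number_summary(data):
    """Return (min, Q1, median, Q3, max) using the median-of-halves convention."""
    ordered = sorted(data)
    n = len(ordered)
    half = (n + 1) // 2           # includes the median itself when n is odd
    lower = ordered[:half]        # observations at or below the median
    upper = ordered[n - half:]    # observations at or above the median
    return (
        ordered[0],
        statistics.median(lower),
        statistics.median(ordered),
        statistics.median(upper),
        ordered[-1],
    )


summary = five_number_summary(sample)
for name, value in zip(["Min", "Q1", "Median", "Q3", "Max"], summary):
    print(f"{name:>6}: {value}")
```

A box-plot then draws the box from Q1 to Q3, marks the median inside the box, and extends the whiskers to the minimum and maximum. Spreadsheet and plotting tools may compute quartiles by a different rule, so their boxes can differ slightly from this summary.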

