Sunday, August 26, 2018

1. Intro to Data Visualization



    For qualitative data, if we are just looking at one column worth of data,
    We have these 4 common visuals:
    1. Histogram
    1. Normal Quantile Plot
    2. Stem and Leaf Plot
    3. Box and Whisker Plot

    For categorical data, if we are looking at just one variable(column), we have three common visuals:
    1. Bar Chart
    1. Pie Chart
    2. Pareto Chart

    Scatter Plots
    • Correlation coefficient: the strength and direction of a linear relationship
    • Commonly denoted by r (-1 < r < 1)
      • +1: positive linear correlation
      • -1: negative linear correlation
      • 0: no linear relationship
    • Strength: Weak, Moderate, Strong
    • Direction: Positive, Negative, None

    Strong                               Moderate                               Weak
    0.7≤∣r∣≤1.00.7≤∣r∣≤1.0              0.3≤∣r∣<0.70.3≤∣r∣<0.7                     0.0≤∣r∣<0.30.0≤∣r∣<0.3



    Line Plots
    • Line plots are a common plot for viewing data over time.
    • Allow us to quickly identify overall trends, seasonal occurrences, peaks, and valleys in the data.
    Ex) stock prices over time

    Recap
    1. You motivated the need for data visualization by showing that summary statistics don't tell the full story. You saw datasets where the summary statistics were the same, but the actual data were very different!
    2. You did a review of data types. In general there are quantitative and categorical variables. Quantitative variables can be either discrete or continuous, while categorical variables are either ordinal or nominal.  
    3. You looked at univariate plots. In most cases a histogram should be used for quantitative data, while a bar chart should be used for categorical data. There are some cases where you might use one of the other plots.
    4. You then looked at bivariate plots, where you were comparing two variables to one another.  Scatter plots are the most common way to visualize two quantitative variables, while a line chart is common for data that you are watching over time. If you are comparing two categorical variables, the best choice is probably a side-by-side bar chart.
    5. You learned about correlation coefficients, which provide the strength and direction of linear relationships. You learned a rule of thumb for determining whether the relationship between two quantitative variables is strongmoderate, or weak.
    6. You then looked at cases where we had more than two variables. You learned that using these plots effectively is about building the plot that helps you see the insight that answers the question you have.
    7. You gained some insight into visual encodings and data dashboards, which will be a part of the next lessons!




6 comments:

  1. Water Hack Burns 2lb of Fat OVERNIGHT

    Over 160 thousand men and women are using a easy and SECRET "water hack" to drop 1-2 lbs each night while they sleep.

    It is easy and works on anybody.

    Here's how you can do it yourself:

    1) Grab a clear glass and fill it up with water half glass

    2) Proceed to follow this crazy hack

    so you'll become 1-2 lbs thinner when you wake up!

    ReplyDelete
  2. Film108 is one of the best movie promotion and marketing agency in Mumbai, always trying to adapt to innovative approaches for each brand and mark a difference in the digital world.

    ReplyDelete
  3. very good blog and Useful Information. plz keep sharing.
    If you are looking for website Designing
    Cheap SEO in Delhi
    Website Designing Company in Delhi
    Website Designing Services in Delhi

    ReplyDelete

SQL Joins

Database Normalization:  Are the tables storing logical groupings of the data? Can I make changes in a single location, rather than in...