You are here

Exploratory Data Analysis

Submitted by sbrewer on Fri, 03/08/2019 - 11:00

 

  • Install R and Rcmdr.
  • Refer to R Commander Installation Notes for details: http://socserv.socsci.mcmaster.ca/jfox/Misc/Rcmdr/installation-notes.html
  • Import the data:
    • In Rcmdr. under “Data” menu “Import data” from “text file”
    • Set the “Field Separator” to “Commas”.
    • Navigate to the CSV file and select it.
    • Click the “Edit data set” button to open the data set in a window.
    • Make a note of all outliers (to put in the legend of the figure).
    • Click on the number of each row with an outlier, then right-click and “Delete current row”.
    • Click OK to save edited data set.
  • Under “Graphs” choose “Scatterplot matrix...”
    • Select all three variables.
    • Click “Plot by groups”, select Gender, and click OK.
    • Click “Options” and select the checkbox for Least-squares line and click OK.
    • Check for outliers and repeat as necessary.
  • Each student independently choose one pair of variables to report in a scatter plot
    • Choose independent and dependent variables
    • Click “Plot by groups”, select Sex, and click OK.
    • Click “Options” and select the checkbox for Least-squares line and click OK.
    • Save as PDF (use this for posters)
    • Convert to PNG (use this to post at website)
    • Post as Image: write legend that includes title, description, and reports outliers.
    • (If time available, report relevant statistics, e.g. means, linear regression, or one-way ANOVA)

GIYF and here is a book for additional assistance: https://stellmack.dl.umn.edu/sites/g/files/pua2461/f/media/_the_rcmdr_gu...

Post: