GEOG 414/514:  Advanced Geographic Data Analysis
Topic:  Visualization and Data Analysis
Introduction to the course

Data Analysis vs "Statistics" and their roles in geography

  • Data analysis features continual iterations between the development of a conceptual model of reality (theory building or hypothesis generation) and the testing of that model (using formal or informal hypothesis testing).
  • Classical statistics has been more oriented toward assessment (hypothesis testing) than toward discovery of relationships within data sets.
  • Modern data analysis exploits recent developments in computing and "scientific visualization," but still uses traditional "statistical analysis" approaches when appropriate.
  • Roles in geography:  the "quantitative revolution," the "GIS revolution," and "geographic visualization."

Statistics is not mathematics; data analysis is not statistics; but visualization is data analysis
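
A brief illustration of why visualization counts as data analysis. This is only a sketch, and the choice of data set is an assumption of this example rather than part of the original notes: it uses the `anscombe` data frame that ships with R. The four x-y pairs have nearly identical means and correlations, yet plotting them shows four quite different relationships.

```r
# Anscombe's quartet ships with base R (datasets package).
data(anscombe)

# Summary statistics for the four x-y pairs are nearly identical ...
y_means <- sapply(anscombe[, c("y1", "y2", "y3", "y4")], mean)
xy_cors <- sapply(1:4, function(i) {
  cor(anscombe[[paste0("x", i)]], anscombe[[paste0("y", i)]])
})
print(round(y_means, 2))
print(round(xy_cors, 3))

# ... but scatter plots reveal four very different relationships.
old_par <- par(mfrow = c(2, 2))
for (i in 1:4) {
  plot(anscombe[[paste0("x", i)]], anscombe[[paste0("y", i)]],
       xlab = paste0("x", i), ylab = paste0("y", i))
}
par(old_par)
```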

Course Plan:  Visualization and Data Analysis Using "R"

  • lecture web pages with embedded R examples
  • exercises with other examples
  • R "packages" for advanced analyses
  • two take-home exams

Nature of Geographical Data

An implicit feature of most data sets that are examined by geographers is that individual "observations" have locational information attached to them.

Most statistical packages do not explicitly recognize those spatial attributes--i.e. they treat them as ordinary variables.

The principal exception is the software package we'll use here--R.

The "Data Cube" -- attributes, locations, occasions. The cube is made up of individual cells, or data values, each representing a single attribute or variable measured at a particular place and time (after Rummel, 1970, and many others).
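
One way to picture the data cube in R is as a three-dimensional array. The sketch below is only illustrative: the attribute, location, and occasion names are hypothetical, and the cell values are placeholders rather than real measurements.

```r
# Hypothetical labels for the three dimensions of the cube.
attr_names <- c("temperature", "precipitation")
loc_names  <- c("site_A", "site_B", "site_C")
occ_names  <- c("t1", "t2", "t3", "t4")

# Fill the cube with placeholder values 1..24 (not real data).
cube <- array(1:24,
              dim = c(length(attr_names), length(loc_names), length(occ_names)),
              dimnames = list(attribute = attr_names,
                              location  = loc_names,
                              occasion  = occ_names))

# A single cell: one attribute at one place and one time.
one_cell <- cube["temperature", "site_B", "t3"]
print(one_cell)

# A slice: all attributes at all locations for a single occasion.
one_occasion <- cube[, , "t1"]
print(one_occasion)
```

Fixing one occasion, as in the last step, leaves a two-dimensional attributes-by-locations table.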

The Rectangular Data Set -- Two Examples
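
As a rough sketch of what a rectangular data set looks like in R (this is not one of the two examples referred to in the heading, and every name and value below is invented for illustration), a data frame holds one row per observation and one column per variable. The coordinates appear here as ordinary numeric columns.

```r
# Hypothetical rectangular data set: rows are observations (places),
# columns are variables. All values are invented for illustration.
stations <- data.frame(
  name      = c("site_A", "site_B", "site_C"),
  lon       = c(-123.1, -121.3, -117.8),
  lat       = c(44.0, 44.1, 45.3),
  elevation = c(130, 1100, 820)
)

str(stations)                 # one row per place, one column per variable
print(summary(stations$elevation))

# The coordinates are ordinary numeric columns, so they can be used
# directly, e.g. to draw a crude map of the observations.
plot(stations$lon, stations$lat, xlab = "Longitude", ylab = "Latitude")
text(stations$lon, stations$lat, labels = stations$name, pos = 3)
```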

R:  Software for data analysis and visualization

R -- Back to the future?

  • Command-line interface as opposed to point-and-click GUI
  • Extensible to new analyses
  • Based on the "S" language
  • Open source -- many add-on "packages"
    http://www.r-project.org/ (home page)  http://cran.us.r-project.org/ (software)
     
  • Many ways to use
    • directly from the command line -- Rterm.exe (hard-core)
    • Windows RGui interface, along with a text-editor (recommended)
    • other GUIs (e.g. the Rcmdr package)
  • Basic idea:  Use a text editor (e.g. TextPad) and RGui together to write and debug "scripts"
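
A minimal sketch of what such a script might look like. The file name `example.R` is hypothetical, and the built-in `cars` data set is used only so that the script runs on any R installation. The script would be edited in the text editor and then run in RGui, either with `source("example.R")` or by pasting lines into the console.

```r
# example.R -- hypothetical script name; edit in a text editor, then
# run in R with source("example.R") or paste lines into the console.

# Use a data set that ships with R so the script runs anywhere.
data(cars)

# Numerical summary first ...
cars_summary <- summary(cars)
print(cars_summary)

# ... then a look at the data: a histogram and a scatter plot.
hist(cars$dist, xlab = "Stopping distance (ft)", main = "")
plot(cars$speed, cars$dist,
     xlab = "Speed (mph)", ylab = "Stopping distance (ft)")
```

Keeping the commands in a script file means that an analysis can be rerun or revised later without retyping it.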

An example R session

Exercise 1:  Getting and Using R

Readings:

Links to the readings can be found on the GeogR pages.  Click on the "documentation" link below.

Owen (The R Guide):  sections 1.1-1.8, scan section 3; Kuhnert & Venables (An Introduction...):  p. 13-20; Cleveland (Visualizing Data):  Ch. 1.

[Geog. 414/514] [syllabus] [lectures & exercises] | [GeogR] [topics] [data sets] [documentation]