1. Manipulating data in R
2013-02-22 @HSPH
Kazuki Yoshida, M.D.
MPH-CLE student
2. Manipulating data in R
n What are objects?
n What is the class attribute?
n Various data objects you will see in R.
3. Objects
n Just about everything named in R is an object
n An object is a container that:
n knows its class (a label for what’s inside).
n has contents (e.g., actual numbers).
4. Examples of objects
n datasets, which you use for analysis (various classes)
n functions, which perform analysis (function class)
n results, which come out of analysis (various classes)
n In effect, you always get a new dataset filled with results when you analyze data.
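The three kinds of objects above can be checked with class(). This is only a small sketch: the data frame dat and the model fit are made-up examples, not from the original slides.

```r
# A tiny made-up dataset (hypothetical values, for illustration only)
dat <- data.frame(x = c(1, 2, 3, 4), y = c(2.1, 3.9, 6.2, 7.8))

class(dat)   # the dataset is an object of class "data.frame"
class(lm)    # lm() itself is an object of class "function"

# Running an analysis returns yet another object holding the results
fit <- lm(y ~ x, data = dat)
class(fit)   # "lm"
names(fit)   # the pieces of the result, e.g. "coefficients"
```

The result object fit can be explored like any other data object, which is what the last bullet means by "a new dataset filled with results".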
5. Classes of data values inside data objects
n Numeric: Continuous variables
n Factor: Categorical variables
n Logical: TRUE/FALSE binary variables
n etc...
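As a quick illustration of these classes, the sketch below builds one vector of each kind and asks R for its class. The variable names (age, sex, smoker) and values are invented for this example.

```r
age    <- c(34.5, 51.2, 47.8)        # continuous variable
sex    <- factor(c("F", "M", "F"))   # categorical variable
smoker <- c(TRUE, FALSE, FALSE)      # binary TRUE/FALSE variable

class(age)      # "numeric"
class(sex)      # "factor"
class(smoker)   # "logical"
levels(sex)     # the categories a factor can take: "F" "M"
```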
6. Class?
n An object’s class tells R how the object should be handled.
n For example, summarizing data should work differently for numbers and categories!
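One way to see this is to call the same function, summary(), on a numeric vector and on a factor. The vectors weight and group below are hypothetical examples.

```r
weight <- c(61.2, 75.4, 58.9, 80.1)       # numeric
group  <- factor(c("a", "b", "a", "b"))   # factor

summary(weight)   # minimum, quartiles, mean, maximum
summary(group)    # counts per category: a = 2, b = 2
```

The call is identical in both cases; R looks at the class of the object and picks the appropriate way to summarize it.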
7. Object
Categorical variables inside!
Class attribute
http://en.wikipedia.org/wiki/File:3_D-Box.jpg
9. Data objects
n Vector (contains single class of data values)
n Array including Matrix
n List (contains multiple classes of data values)
n Data frame
10. Vector
n Smallest building block of data objects
n Single dimension
n Combination of values of same class
n vec1 <- c(2013, 2, 15, -10) # combine
n vec2 <- 1:16 # integers 1 to 16
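A short sketch of what "values of the same class" means in practice, reusing vec1 and vec2 from the slide. The vector mixed is an added example showing that R converts mixed input to a single class.

```r
vec1 <- c(2013, 2, 15, -10)   # combine values into a vector
vec2 <- 1:16                  # integers 1 to 16

length(vec1)   # 4
class(vec1)    # "numeric"
class(vec2)    # "integer"
vec1[1]        # first element: 2013

# Mixing classes in c() does not fail; everything is coerced to one class
mixed <- c(1, "a", TRUE)
class(mixed)   # "character"
```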
14. List
n Combination of any values or objects
n Can contain objects of multiple classes
n e.g., a list of two vectors, a matrix, and three arrays
n Access an element with the $ operator: list_name$element_name
n list1 <- list(first = 1:17, second = matrix(letters, 13, 2))
n list2 <- list(alpha = c(1,4,5,7), beta = c("h","s","p","h"))
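The $ operator can be tried on list2 from the slide. This is a minimal sketch; the [[ ]] form is shown as an equivalent alternative that is not mentioned on the slide.

```r
list2 <- list(alpha = c(1, 4, 5, 7), beta = c("h", "s", "p", "h"))

names(list2)         # "alpha" "beta"
list2$alpha          # the numeric vector stored under "alpha"
class(list2$alpha)   # "numeric"
class(list2$beta)    # "character"

# Double brackets extract the same element by name
list2[["beta"]][2]   # second value of beta: "s"
```

Each element keeps its own class, which is what distinguishes a list from a vector.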
15. List
Multi-part object
Can contain vectors,
arrays, or lists!
16. Data frame
n Special case of a list
n List of same-length vectors vertically aligned
n df1 <- data.frame(list2)
n list3 <- list(small = letters, large = LETTERS, number = 1:26)
n df2 <- data.frame(list3)
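To check that a data frame really is a list of same-length vectors, the sketch below rebuilds df1 from list2 and inspects it. Note one version-dependent detail: in R versions before 4.0, data.frame() converts character columns to factors by default, so the class reported for beta depends on your R version.

```r
list2 <- list(alpha = c(1, 4, 5, 7), beta = c("h", "s", "p", "h"))
df1 <- data.frame(list2)

class(df1)     # "data.frame"
is.list(df1)   # TRUE: a data frame is a special kind of list
dim(df1)       # 4 rows, 2 columns
df1$alpha      # columns are accessed with $, just like list elements

# Class of each column ("beta" is character or factor depending on R version)
sapply(df1, class)
```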