Change the Blank Cells to “NA”
My target is to assign "NA" to all blank cells irrespective of categorical or numerical values. I am using na.strings="". But it's not assigning NA to all blank cells.
## reading the datadat <- read.csv("data2.csv")
head(dat)
mon hr acc alc sex spd axles door reg cond1 drug1
1 8 21 No Control TRUE F 0 2 2 Physical Impairment (Eyes, Ear, Limb) A
2 7 20 No Control FALSE M 900 2 2 Inattentive D
3 3 9 No Control FALSE F 100 2 2 2004 Normal D
4 1 15 No Control FALSE M 0 2 2 Physical Impairment (Eyes, Ear, Limb) D
5 4 21 No Control FALSE 25 NA NA D
6 4 20 No Control NA F 30 2 4 Drinking Alcohol - Impaired D
inj1 PED_STATE st rac1
1 FatalF 2 ModerateF 3 ModerateM 4 ComplaintM 5 ComplaintF 6 ModerateM ## using na.strings
dat2 <- read.csv("data2.csv", header=T, na.strings="")
head(dat2)
mon hr acc alc sex spd axles door reg cond1 drug1
1 8 21 No Control TRUE F 0 2 2Physical Impairment (Eyes, Ear, Limb) A 2 7 20 No Control FALSE M 900 2 2Inattentive D 3 3 9 No Control FALSE F 100 2 2 2004 Normal D
4 1 15 No Control FALSE M 0 2 2Physical Impairment (Eyes, Ear, Limb) D 5 4 21 No Control FALSE 25 NA NA6 4 20 No Control NA F 30 2 4 D Drinking Alcohol - Impaired D inj1 PED_STATE st rac1
1 Fatal NA F NA
2 Moderate NA F NA
3 Moderate NA M NA
4 Complaint NA M NA
5 Complaint NA F NA
6 Moderate NA M NA
To replace blank with “NA”, you can use the na.strings argument inside the read.csv function while reading the file as follows:
dat2 <- read.csv("data2.csv", header=T, na.strings=c(""," ","NA"))