How to convert country codes to country names in a column in a dataframe using R?
dd$country
[1] US US US US GB US US HK US US US DE DE NL US US US US US CA CA FR FR DK CA GB AU AU IE LT PT AT US US US US US US US US US US US US US SG NL NL IT NL GB US US US NZ US GB GB US US US US ES IE ES
[66] GB IE US US US US IE GB GB GB GB DE DE US FR AU IE US US US US GB GB GB GB GB GB US US IE GB GB GB GB HK US GB GB FR EU FR GB SE FI GB SE FI DK IT IE SE DK GB GB GB GB GB GB GB GB IE GB GB US US
[131] US US US US CA GB GB NL IL US US US US US US US US US US US US US US US US US US US US US GB US US US US US US US US US US US US US US US US US US US US US US NL US US US US US US US US US US US
[196] US US US US US ES US GB US US GB GB TR US US ES ES
Levels: AT AU CA DE DK ES EU FI FR GB HK IE IL IT LT NL NZ PT SE SG TR US
You can use the countrycode package. Various coding schemes are supported. It looks like your data matches ISO 3166-1 alpha-2 (http://en.wikipedia.org/wiki/ISO_3166-1_alpha-2), which countrycode refers to as iso2c. Full country names are denoted by country.name:
library(countrycode)
myCodes <- c("AT", "AU", "CA", "DE", "DK", "ES", "EU",
"FI", "FR", "GB", "HK", "IE", "IL", "IT", "LT",
"NL", "NZ", "PT", "SE", "SG", "TR", "US")
countrycode(myCodes, "iso2c", "country.name")
[1] "Austria" "Australia" "Canada" "Germany"
[5] "Denmark" "Spain" NA "Finland"
[9] "France" "United Kingdom" "Hong Kong" "Ireland"
[13] "Israel" "Italy" "Lithuania" "Netherlands"
[17] "New Zealand" "Portugal" "Sweden" "Singapore"
[21] "Turkey" "United States"
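To apply this to the data frame column itself rather than to a separate vector of codes, something like the following should work. It is a minimal sketch: the small dd below is a stand-in for the data frame in the question, country_name is a hypothetical name for the new column, and the hand-written EU mapping passed through custom_match is an assumption about how you want that non-ISO code handled.

```r
library(countrycode)

# Stand-in for the data frame in the question; country is a factor there too
dd <- data.frame(country = factor(c("US", "GB", "DE", "EU", "HK")))

# Convert the factor to character, then translate the ISO 3166-1 alpha-2 codes.
# "EU" is not an ISO country code, so it is mapped by hand via custom_match
# (an assumption; drop custom_match to leave it as NA).
dd$country_name <- countrycode(as.character(dd$country),
                               origin = "iso2c",
                               destination = "country.name",
                               custom_match = c(EU = "European Union"))
```

The exact spelling of some names can differ between versions of the package, so it is worth checking the result against the names you expect before joining on it.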
jdharrison gives a great answer.
Using the Wikipedia page from his answer, here is an alternative way to match the codes. It may be useful in situations where the code table is available online but there is no R package to do the matching.
Using the XML package, you can extract the 3rd table from the Wikipedia page and then map the country codes to country names.
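library(XML)
# Wikipedia page listing the ISO 3166-1 alpha-2 codes
wiki <- "http://en.wikipedia.org/wiki/ISO_3166-1_alpha-2"
# Read the 3rd table on the page and keep its first two columns (code and name);
# the table position and column names depend on the page's current layout
country <- readHTMLTable(wiki, header=TRUE, which=3, stringsAsFactors=FALSE)[1:2]
# Look up each code's position in the table and return the matching name
country$'Country name'[match(myCodes, country$Code)]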
# [1] "Austria" "Australia" "Canada" "Germany"
# [5] "Denmark" "Spain" NA "Finland"
# [9] "France" "United Kingdom" "Hong Kong" "Ireland"
# [13] "Israel" "Italy" "Lithuania" "Netherlands"
# [17] "New Zealand" "Portugal" "Sweden" "Singapore"
# [21] "Turkey" "United States"
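If neither the package nor the web page is available, the same match() idea works against any lookup table you have locally. The sketch below assumes a small hand-typed table (lookup, with hypothetical column names Code and Name) and a stand-in dd; in practice the table would more likely be read from a local file.

```r
# Hypothetical lookup table; in practice read it from a local file
lookup <- data.frame(Code = c("DE", "GB", "HK", "US"),
                     Name = c("Germany", "United Kingdom",
                              "Hong Kong", "United States"),
                     stringsAsFactors = FALSE)

# Stand-in for the data frame in the question
dd <- data.frame(country = factor(c("US", "GB", "DE", "EU", "HK")))

# match() gives the row of lookup for each code, or NA when a code is absent
idx <- match(as.character(dd$country), lookup$Code)
dd$country_name <- lookup$Name[idx]

# Codes that found no match (here "EU") can be listed for manual review
unmatched <- unique(as.character(dd$country[is.na(idx)]))
```

Converting the factor with as.character() first keeps the comparison on the code strings themselves rather than on the factor's internal integer levels.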