Notes from the Kölner R meeting, 6 March 2015
Using R in Excel via R.NET
Günter Faes and Matthias Spix
In its current form the add-in is a proof of concept. It demonstrates in principal that functions based on R can be added to Excel. The version Günter and Michael demonstrated doesn’t have a reactive functionality yet, i.e. updating a cell will not update the output of an R function automatically at the moment. Feel free to get in touch with them if you would like to know more about their project. You find their contact details on the last slide of their presentation.
Text Mining with R
- From natural language processing (NLP) to text mining
- Building corpora
- Latent semantic analysis (LSA)
- Topic models/Latent Dirichlet allocation (LDA)
- Sentiment analysis
- Misc useful packages
My key take-aways were: text mining is a fairly recent and very active research topic, there is a lot more in text mining then pretty word clouds, and a good domain knowledge is crucial as many techniques don’t provide clear answers and require the user to interpret to results.
A nice and illustrative example Cornelius presented at the end of his talk was the package
gender by Linclon Mullen that uses historical US census data to predict the gender of people based on their first name. I have several colleagues with the name of ‘Leslie’ or ‘Lesley’. Thanks to
gender I know now that my male colleagues are more likely to be spelled ‘Leslie’ than ‘Lesley’ and that a person with either name is more likely to be female.
library(gender) L1 <- gender("Leslie") L2 <- gender("Lesley") cbind(L1, L2) L1 L2 name "Leslie" "Lesley" proportion_male 0.2222 0.0995 proportion_female 0.7778 0.9005 gender "female" "female" year_min 1932 1932 year_max 2012 2012
Drinks and NetworkingNo Cologne R user group meeting would be complete without drinks and schnitzel at the Lux.
|Photo: Günter Faes|
Next Kölner R meetingThe next meeting will be Friday, 26 June. Note the new larger venue: Startplatz, Im Mediapark 5, Köln. We will have two talks:
- Data Science at the Commandline (Kirill Pomogajko)
- An Introduction to RStan and the Stan Modelling Language (Paul Viefers)