Last Thursday the Cologne R user group came together again. This time, our two speakers arrived from Bavaria, to talk about Spark and R Server.
Introduction to Apache Spark
byin R, then it is most likely doable. The
byfunction in R splits a data set into several subsets and applies a specific function to each subgroup and collects the results in the end. In the world of Hadoop, this is called MapReduce. Spark has an advanced DAG (directed acyclic graph) execution engine that supports cyclic data flow and in-memory computing. Additionally, Spark has a direct API for R, which makes it relatively ease to write applications with Spark.
Microsoft R Server
Next Kölner R meetingThe next meeting will be scheduled in about three months time. Details will be published on our Meetup site. Thanks again to Microsoft for their support.
Please get in touch, if you would like to present at the next meeting.