Highcharts.js

highcharts

Let’s test some of javascprit libraries for infographics :) Start with Highcharts.js.

Finally, get it working now!


Spark + R

spark

A week ago, my subscription to the Apache Spark Youtube Channal recommanded me a Bay area meetup video. And it captured my attention immediately as it is about Spark + R. I watched the full video and headed to the site.

“SparkR is an R package that provides a light-weight frontend to use Apache Spark from R. SparkR exposes the Spark API through the RDD class and allows users to interactively run jobs from the R shell on a cluster.” — SparkR authors


Count The Evens

scala

As a Statistics background, I know I keep coming back to R because I can do this

a = c(1,2,3,4,5,6,7,8,9,10)
sum(a[a%%2==0])

Until I was once amazed by the simplicity of Lambda function in Scala and the List comprehension in Python:

in Scala

val a = List(1,2,3,4,5,6,7,8,9,10)
a.filter(_ % 2 == 0).sum

in Python

a = [1,2,3,4,5,6,7,8,9,10]
sum(filter(lambda x: x % 2, a))
sum(x for x in a if x % 2)

Mahout And Mllib

spark, scala

I am getting myself used to two of the most grandeur machine learning library in the Hadoop ecosystem, MLlib in Spark and Mahout in Hadoop. It would be interesting to test the accessibility and ability of the two packages. So I picked up a dataset from UCI ML database.


R In Scala

scala

You haven’t got Spark MLlib set up and you happen to need to run a quick logistic regression in scala, then you should go to ScalaNLP for Breeze and Nak. But if you just need to run a really quick one and you happen to have R installed.


Better To Have A Plan

game

You are offered to throw a die up to three times. And you will earn the face value of the die have you chosen to stop after each throw, or you go to the next round without any previous earnings. What is the optimal strategy and the expected payoff of this game?