Models with multiple output variables

July 5, 2016

I recently had an opportunity to work on a supervised machine learning problem where the target variables are real and multi valued. An initial search of the machine learning libraries revealed that popular Open source ML libraries have little support for the same.

Here's a quick intro in the topic, and (in a later post), a dive into some of the libraries from the JVM ecosystem.

Continue reading →

Open source in Analytics

July 2, 2016

Open source in the software industry is a well known phenomena that needs no mention. In the Analytics industry, while the use of Open source software (OSS) is extremely common, there is certainly a need for OSS platforms that will enable Analytics service providers to collaborate rather than compete.

Sharing an article that I wrote earlier, from the Bridgei2i Blog

Continue reading →

Embedding Gorilla-repl worksheet in a blog post

June 1, 2016

Gorilla-repl is a toolset from the Clojure stable, which enables us to interact, visualize and share Clojure code as a worksheet. For those familiar with IPython notebooks (now called Jupyter), Gorilla-repl is the Clojure equivalent of the same. I find that it is an excellent environment for sharing and discussing data science.

Continue reading →

Using Spark's ML Classifiers in Clojure

April 1, 2016

Using Spark's Machine Learning classifiers in Clojure

The Sparkling project enables Clojure developers to use Apache Spark (which is primarily developed in the Scala langauge) for big data processing and machine learning.

Continue reading →

Html Templating in Clojure using Enlive

January 6, 2016

Html Templating in Clojure.

Clojure's Lisp root (where code-is-data-is-code) lends itself nicely to generating HTML. Enlive is a promising library capable of templating, and here I present a tutorial for Enlive.

Continue reading →