“Give me data and I promise you clusters”: The case of the k-means algorithm

Introduction: The title of this week’s essay is derived from the famous speech (“Give me blood and I promise you freedom!”) delivered by the Indian nationalist Subhash Chandra Bose in Burma on July 4th, 1944. An essay makes more sense if its title relates to its contents. Thus, after considerable debate on how to aptly title it,…

A random forest approach to predicting breast cancer in working class women

What is a Random Forest? A random forest is an ensemble (a group or combination) of trees that collectively vote for the most popular class, so that the noise of the individual trees cancels out. In the context of machine learning, ensemble learning refers to methods that generate many classifiers…
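The voting idea described above can be illustrated with a small sketch. It is not the post's own code: the function name `majority_vote` and the five hypothetical tree predictions are assumptions made purely for illustration, and a real random forest would also train each tree on a bootstrap sample of the data.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the label that the largest number of trees voted for."""
    # most_common(1) returns a list holding the single (label, count) pair
    # with the highest count.
    return Counter(predictions).most_common(1)[0][0]


# Hypothetical votes from five trees for one sample (illustrative only).
tree_votes = ["malignant", "benign", "malignant", "malignant", "benign"]

forest_prediction = majority_vote(tree_votes)
print(forest_prediction)
```

Here three of the five trees vote "malignant", so the two dissenting trees are outvoted. This is the sense in which the ensemble cancels out the noise of individual trees.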

Clustering with Weka 3.6 part-1

1. Download and install Weka 3.6 from here.
2. Follow this blog to convert your data file to ARFF format.
3. Click on ‘open file’ and select the .arff file that you created in step 2. This will open the dataset in the Weka Preprocess window. Please look at the screenshot below.
4. Click on “Cluster” tab…
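Step 2 depends on having the data in ARFF format. As a rough sketch of what such a file contains, the snippet below builds a minimal ARFF document from numeric rows. The helper name `to_arff`, the relation name, and the sample values are hypothetical, and the sketch assumes every attribute is numeric; it is not the conversion method from the linked blog.

```python
def to_arff(relation, attributes, rows):
    """Build a minimal ARFF document, assuming all attributes are numeric."""
    lines = [f"@relation {relation}", ""]
    # One @attribute declaration per column, in column order.
    for name in attributes:
        lines.append(f"@attribute {name} numeric")
    # The @data section holds one comma-separated line per instance.
    lines += ["", "@data"]
    for row in rows:
        lines.append(",".join(str(value) for value in row))
    return "\n".join(lines) + "\n"


# Hypothetical two-column dataset (illustrative values only).
arff_text = to_arff("demo", ["x", "y"], [(1.0, 2.0), (3.5, 4.5)])
print(arff_text)
```

Writing `arff_text` to a file with an `.arff` extension should give something that can be opened from the Preprocess window as in step 3.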