Random forest

We will use the same dataset as in the Personal Cancer Diagnosis article that was published earlier.

We will skip the exploratory data analysis and go straight to implementing the machine learning model on the dataset.

In Random Forests we have

  • Decision…

Problem statement:

Classify the given genetic variations/mutations based on evidence from text-based clinical literature.
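To make the task concrete, here is a minimal sketch of a random forest voting on short text snippets. It is only an illustration under stated assumptions: the snippets and labels are made-up toy data (not taken from the MSKCC dataset), the features are simple word-presence flags, and the helper names (`train_forest`, `predict_forest`) are hypothetical. A real run would more likely use a library implementation on proper text features.

```python
import random
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def build_tree(rows, vocab, rng, n_features, depth=0, max_depth=3):
    """Grow a small tree; each split asks 'is this word present?'."""
    labels = [y for _, y in rows]
    majority = Counter(labels).most_common(1)[0][0]
    if depth == max_depth or len(set(labels)) == 1:
        return majority  # leaf: predict the majority class
    best = None
    # Random forests look at a random subset of features at each split.
    for word in rng.sample(vocab, n_features):
        left = [r for r in rows if word in r[0]]
        right = [r for r in rows if word not in r[0]]
        if not left or not right:
            continue
        score = (len(left) * gini([y for _, y in left])
                 + len(right) * gini([y for _, y in right])) / len(rows)
        if best is None or score < best[0]:
            best = (score, word, left, right)
    if best is None:
        return majority
    _, word, left, right = best
    return (word,
            build_tree(left, vocab, rng, n_features, depth + 1, max_depth),
            build_tree(right, vocab, rng, n_features, depth + 1, max_depth))

def predict_tree(node, words):
    """Walk down the tree until a leaf (a plain label) is reached."""
    while isinstance(node, tuple):
        word, present, absent = node
        node = present if word in words else absent
    return node

def train_forest(texts, labels, n_trees=25, seed=0):
    rng = random.Random(seed)
    rows = [(set(t.lower().split()), y) for t, y in zip(texts, labels)]
    vocab = sorted(set().union(*(words for words, _ in rows)))
    n_features = max(1, int(len(vocab) ** 0.5))
    forest = []
    for _ in range(n_trees):
        sample = [rng.choice(rows) for _ in rows]  # bootstrap sample
        forest.append(build_tree(sample, vocab, rng, n_features))
    return forest

def predict_forest(forest, text):
    """Majority vote over all trees."""
    words = set(text.lower().split())
    votes = Counter(predict_tree(tree, words) for tree in forest)
    return votes.most_common(1)[0][0]

# Made-up toy snippets and labels, for illustration only.
texts = [
    "mutation causes loss of protein function in tumor cells",
    "truncating variant leads to loss of function",
    "variant increases kinase activity gain of function",
    "activating mutation shows gain of function in assays",
    "no change in activity observed neutral variant",
    "variant behaves like wild type neutral effect",
]
labels = ["loss", "loss", "gain", "gain", "neutral", "neutral"]

forest = train_forest(texts, labels)
print(predict_forest(forest, "activating mutation with gain of function"))
```

Each tree sees a different bootstrap sample and a different random subset of words, so the trees disagree in places; the majority vote smooths that out.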


Source: https://www.kaggle.com/c/msk-redefining-cancer-treatment/

Data: Thanks to Memorial Sloan Kettering Cancer Center (MSKCC) for curating this dataset, a great help to humanity.

Business Objective and Constraints

  • There is no Latency requirement
  • High accuracy is required as we are dealing with human lives
  • Errors are…

Logistic regression ==> simple algorithm

Naive Bayes ==> No geometric intuition. We learnt it through a probabilistic technique.

Logistic Regression can be learnt from three perspectives

  • Geometric intuition
  • Probabilistic approach, which involves dense mathematics
  • Loss minimization framework
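The loss-minimization view can be sketched in a few lines: pick weights that minimize the average log loss, here with plain batch gradient descent. This is a rough illustration, assuming binary labels in {0, 1}, no regularization, and a made-up two-feature toy dataset; the function names (`fit`, `log_loss`, `predict_proba`) are hypothetical.

```python
import math

def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def predict_proba(w, b, x):
    """Probability of class 1 for a single point x."""
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)

def log_loss(w, b, X, y):
    """Average negative log-likelihood (the loss being minimized)."""
    eps = 1e-12  # keep log() away from 0
    total = 0.0
    for xi, yi in zip(X, y):
        p = min(max(predict_proba(w, b, xi), eps), 1.0 - eps)
        total -= yi * math.log(p) + (1 - yi) * math.log(1.0 - p)
    return total / len(X)

def fit(X, y, lr=0.1, epochs=1000):
    """Batch gradient descent on the average log loss."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = predict_proba(w, b, xi) - yi  # d(loss)/d(score)
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * gj / n for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b

# Made-up, linearly separable toy points, for illustration only.
X = [[0.0, 0.5], [1.0, 1.0], [1.5, 0.5], [3.0, 3.5], [4.0, 3.0], [3.5, 4.5]]
y = [0, 0, 0, 1, 1, 1]

w, b = fit(X, y)
print("loss before training:", log_loss([0.0, 0.0], 0.0, X, y))
print("loss after training: ", log_loss(w, b, X, y))
```

With all-zero weights every point gets probability 0.5, so the starting loss is ln 2; training should push it lower.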


  • Classes are linearly or almost linearly separable by a plane

Janardhanan a r

In the making Machine Learner programmer music lover
