Gaurav Dembla
Dec 3, 2021

Hi Naveen,

In my limited understanding, SMOTE is just used to ensure that we have "richer" data (specifically, class 1 records) for statistical model to be trained well, and hence, perform well later.

Remember that log-loss score of a statistical model just by itself has very little significance. It is only useful in comparative context. We can compare the log-loss score of two (or more) statistical models and conclude the best performing one, provided they were run on the same dataset i.e. the underlying data (and of course, its distribution) should be constant.

Also, even the best performing model should beat the baseline score (of the naive model) on the same dataset.

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

Gaurav Dembla
Gaurav Dembla

Written by Gaurav Dembla

Love for data and planning has enabled me to acquire skills in Anaplan and Data Analytics! www.linkedin.com/in/gauravdembla/

No responses yet

Write a response