gusl: (Default)
[personal profile] gusl
The way classification problems are usually framed, the algorithm outputs a class-label, and the loss function is a function of the number of errors of each type, such that all partial derivatives are positive (i.e. an extra error is always bad, regardless of its type, but some types may be worse than others).

But this is a lossy process: when an algorithm outputs a label, we don't know if was 99% confident or 60% confident. If we, it might often be better for the algorithm to output its full belief, especially when there is little data.

I propose that we generalize the classification framework, so that algorithms output a multinomial probability distribution, instead of a single label. The loss function now becomes defined by a scoring rule.

I came up with this idea because I've been thinking about how Robin Hanson's work (prediction markets, market scoring rules) may be relevant to Machine Learning.

Q: has anyone thought of this before?

A: yes! Here's a Google search.

This paper is from Wharton, Penn's business school, which seems like an unusual place for Machine Learning:
Andreas Buja, Werner Stuetzle, Yi Shen (2005) - Loss Functions for Binary Class Probability Estimation and Classification: Structure and Applications

(no subject)

Date: 2007-10-08 08:16 pm (UTC)
From: [identity profile] techstep.livejournal.com
It doesn't seem that odd for me. Given the number of business-related purposes machine learning has (credit scoring, fraud detection, predicting customer behavior for targeted marketing schemes), it makes sense that there would be at least some ML research coming out of b-schools.

(no subject)

Date: 2007-10-08 08:43 pm (UTC)
From: [identity profile] gustavolacerda.livejournal.com
Yeah, but this seems like a fundamental issue, something you'd expect to see before 1990, in the early days of machine learning.
In any case, I'm no longer who came up with this idea first.

February 2020

S M T W T F S
      1
2345678
9101112131415
16171819202122
23242526272829

Most Popular Tags

Style Credit

Expand Cut Tags

No cut tags