gusl | using scoring rules as loss functions

The way classification problems are usually framed, the algorithm outputs a class-label, and the loss function is a function of the number of errors of each type, such that all partial derivatives are positive (i.e. an extra error is always bad, regardless of its type, but some types may be worse than others).
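To make the usual framing concrete, here is a minimal sketch for the binary case. The function name `weighted_error_loss` and the costs `fp_cost` and `fn_cost` are made up for illustration, and a linear cost is only one of many functions with positive partial derivatives in both error counts.

```python
def weighted_error_loss(y_true, y_pred, fp_cost=1.0, fn_cost=5.0):
    """Loss as a function of the number of errors of each type (binary case).

    fp_cost and fn_cost are hypothetical per-error costs. Both are positive,
    so one more error of either type always increases the loss.
    """
    # False positives: true class 0, predicted class 1.
    false_positives = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    # False negatives: true class 1, predicted class 0.
    false_negatives = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return fp_cost * false_positives + fn_cost * false_negatives


if __name__ == "__main__":
    # One false positive and one false negative.
    print(weighted_error_loss([0, 1, 1, 0], [1, 0, 1, 0]))
```

Note that the loss only ever sees hard labels, so two algorithms that make the same errors are indistinguishable no matter how confident each one was.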

But this is a lossy process: when an algorithm outputs a label, we don't know if it was 99% confident or 60% confident. It might often be better for the algorithm to output its full belief, especially when there is little data.

I propose that we generalize the classification framework, so that algorithms output a multinomial probability distribution, instead of a single label. The loss function now becomes defined by a scoring rule.
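As a rough sketch of what this could look like, here are two standard proper scoring rules, the logarithmic score and the Brier (quadratic) score, written as losses over a predicted distribution. The function names and the example probabilities are my own choices for illustration, and this is not meant as the definitive formulation.

```python
import math


def log_loss(probs, true_label):
    """Logarithmic scoring rule as a loss: -log of the probability given to
    the true class. Assumes that probability is strictly positive."""
    return -math.log(probs[true_label])


def brier_loss(probs, true_label):
    """Brier (quadratic) scoring rule as a loss: squared distance between the
    predicted distribution and the one-hot vector of the true class."""
    return sum(
        (p - (1.0 if k == true_label else 0.0)) ** 2
        for k, p in enumerate(probs)
    )


def mean_loss(rule, predictions, labels):
    """Average a scoring-rule loss over a data set."""
    return sum(rule(p, y) for p, y in zip(predictions, labels)) / len(labels)


if __name__ == "__main__":
    confident = [0.99, 0.01]  # "99% confident" in class 0
    hesitant = [0.60, 0.40]   # "60% confident" in class 0
    for true_label in (0, 1):
        print(
            true_label,
            log_loss(confident, true_label),
            log_loss(hesitant, true_label),
            brier_loss(confident, true_label),
            brier_loss(hesitant, true_label),
        )
```

Both predictions would be turned into the same hard label (class 0), but under either rule the confident prediction is rewarded more when it is right and penalized more when it is wrong, which is exactly the information a label-only loss throws away.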

I came up with this idea because I've been thinking about how Robin Hanson's work (prediction markets, market scoring rules) may be relevant to Machine Learning.

Q: has anyone thought of this before?

A: yes! Here's a Google search.

This paper is from Wharton, Penn's business school, which seems like an unusual place for Machine Learning:
Andreas Buja, Werner Stuetzle, Yi Shen (2005) - Loss Functions for Binary Class Probability Estimation and Classification: Structure and Applications


From:

techstep.livejournal.com

It doesn't seem that odd to me. Given the number of business-related purposes machine learning has (credit scoring, fraud detection, predicting customer behavior for targeted marketing schemes), it makes sense that there would be at least some ML research coming out of b-schools.

gustavolacerda.livejournal.com

Yeah, but this seems like a fundamental issue, something you'd expect to see before 1990, in the early days of machine learning.
In any case, I'm no longer the one who came up with this idea first.


Gustavo Lacerda
