I feel so much safer now. Sarcasm? Ask the machine.
Although I was just a humble student assistant at the time, technically I did work on natural language classifiers. I am pretty sure that someone who knows about those things told someone who doesnāt that false positives were going to be a problem and this is a bureaucratic solution to a half-remembered problem.
Ah, got it, that explanation makes total sense.
They already have a system to suck up every email, text message and IM chat. That system already flags messages with ābomb,ā āpresident,ā āblow up congressā etc.
They know that the majority of these messages are silly (āsarcasm,ā although thatās not exactly the right word). But the algorithm isnāt smart enough to know that, so they get a lot of āfalse positivesā ā i.e. false matches that arenāt real threats.
So they are requesting a system that would ādetectā these false-positives. Itās poor wording, because they want to detect sarcasm to reduce false-positives.
Honestly this isnāt so dumb. They arenāt requesting a system will detect 100% of false-positives, they just want to reduce them.
I donāt like that they are collecting everything, of course, but since they are collecting everything they ought to be smarter about it. Whichever vendor sells them a system will almost certainly sell them a crappy system, but that doesnāt mean that the aim isnāt valid (in their world).
The Secret Service wants to detect sarcasm on the internet?
Oh no, what a personal disaster.
To be fair, Iām coming at this from the perspective of someone who has a lot of negative interactions with IT projects because of the insane way people write specifications. Iām pretty sure I know exactly what they mean and it makes sense. Iām also pretty sure that is not what they are going to get.
You are dam/damn/darn/durn right! Projects are mostly nightmares because nobody can read anybody elseās code. I can barely even read my own code from earlier today. Even with good comments and a plan. I was on the phone yesterday with a client and she was asking me simple questions about what I did and I finally just said, āWell fuck, Iām going to have to get back to you because I canāt describe it accurately right this second even though itās a simple, straightforward question. Sorry.ā
Hereās how Iād approach this whole false positive thing with this project.
First, read more on them:
A set of false positives here could be benchmarked in the testing phase:
After extensive cataloging and getting your detection system in place, run tests on PEOPLE. Presumably a range of people. Run tests with a known outcome, so, say, you KNOW itās a sarcasm. Then after many runs, you will know the error rate in your detection system. You can tune it to make it more sensitive, and therefore reduce the false positive rate. And to detect āfalse positivesā, you will be making a list of the stuff that causes you the most problems. When one comes up on the TwitTube, you have ādetected a false positive.ā Because you determined most of them a priori.
Thatās what I took it to mean, and how Iād deal with it.
But no, I wouldnāt ever have said ādetect false positivesā in the first place. Thatās unintelligent n00b speak.
What if my apparently earnest sarcasm is, in fact, sarcastic?
Then you belong in Inception, because you have couched your sarcasm in an earnest remark within an irony within an idiom. You are doing well and have passed from Padawan stage to full on Jedi Language Knight status; here is your Strunk & Light Saber.
[quote=āawjt, post:27, topic:33497ā]
You can tune it to make it more sensitive, and therefore reduce the false positive rate.[/quote]Other way round, isnāt it?
[quote=āawjt, post:27, topic:33497ā]And to detect āfalse positivesā, you will be making a list of the stuff that causes you the most problems. When one comes up on the TwitTube, you have ādetected a false positive.ā Because you determined most of them a priori.
[/quote]If I understand you correctly, then I am not sure that makes much sense. If you are able to predict false positives, then you just return a ānegativeā answer and avoid them. Otherwise you would end with the willfully obtuse system suggested by the phrasing in the requirements: a system that answers a binary question, ideally correctly, but sometimes incorrectly against better knowledge.
What I said was correct: If itās less sensitive, then there are more type 1 errors, or more false positives. If itās more sensitive, then the true positive rate increases and the type 1 error rate decreases because there are fewer false+.
For the second one:
Iām suggesting white, black and gray. You determine gray (false positives) a priori and by lack of fitting into white or black. Whenever you come across one of those annoying ones that are on your list from the testing, it isnāt positive or negativeā¦ it goes into the gray bin. Youāve detected it. OR, an alternate path is that something comes up that isnāt on ANY list; that also goes into the gray bin.
Something like this: I want to 1010110101100010010101010011 the 1010101011010101010111
What the heck is that? False positive? I donāt know, says the detector. Throw it in the gray bin.
No. Just look at the trivial classifiers.
recall (sensitivity) = true positives / (true positives + false negatives)
If you just return ānegativeā all the time, then itās 0/(0+FN), i.e. zero sensitivity and not a single false positive anywhere.
If you return āpositiveā all the time, then itās TP/(TP+0), i.e. perfect sensitivity but also also maximal FP.
Regarding the other one, I see now that you mean some kind of confidence measure. You can do that. I interpreted the whole thing as a binary classification task where withholding judgment is effectively a negative.
Sensitivity IS the true positive rate. Less sensitive = lower true positives and therefore higher false positive. Less sensitive does not mean a lower false positive rate. Less sensitive means a higher false positive rate!
if sensitivity is .8 then type 1 error is .2
If sensitivity is .9 then your type 1 error is .1
If the sensitivity is .95 then type 1 error is 0.05
As sensitivity increases, the error rate (FP rate) decreases.
The names of the boxes:
True Positive | False Negative
False Positive | True Negative
No. Really no. I am not sure what is going wrong here and I would like to stop.
Itās basic epidemiology. Youāre probably just coming at it from a different background. Thatās fine.
Computational linguistics. Itās just that the disagreement is so surprisingly fundamental. I do not even agree with your labels for your chart. I may be making some horrible mistake here, but I am really not seeing it.
I donāt think youāre making a mistake. I am talking probability, and you are talking counts. Use counts in your equations and your logic is sound, so itās the weird inverse space of putting things into a probability context that is screwing up this conversation. Sorry to confuse. We are both right.
ā¦and runs in IE8.
Canāt tell if serious, or ā¦
Ohhh, a SARCASM detector, well thatās a REAL useful invention!
Whoop! whoop! whoop! whoop!
What if my apparently earnest sarcasm is, in fact, sarcastic?
I detected a tinge of sarcasm in that statement.