Secret service developing a sarcasm detector. Oh great

rogowar · June 4, 2014, 5:07pm

I feel so much safer now. Sarcasm? Ask the machine.

KarlS · June 4, 2014, 5:28pm

Although I was just a humble student assistant at the time, technically I did work on natural language classifiers. I am pretty sure that someone who knows about those things told someone who doesn’t that false positives were going to be a problem and this is a bureaucratic solution to a half-remembered problem.

SamSam · June 4, 2014, 5:53pm

Ah, got it, that explanation makes total sense.

They already have a system to suck up every email, text message and IM chat. That system already flags messages with “bomb,” “president,” “blow up congress” etc.

They know that the majority of these messages are silly (“sarcasm,” although that’s not exactly the right word). But the algorithm isn’t smart enough to know that, so they get a lot of “false positives” – i.e. false matches that aren’t real threats.

So they are requesting a system that would “detect” these false-positives. It’s poor wording, because they want to detect sarcasm to reduce false-positives.

Honestly this isn’t so dumb. They aren’t requesting a system will detect 100% of false-positives, they just want to reduce them.

I don’t like that they are collecting everything, of course, but since they are collecting everything they ought to be smarter about it. Whichever vendor sells them a system will almost certainly sell them a crappy system, but that doesn’t mean that the aim isn’t valid (in their world).

daneel · June 4, 2014, 5:57pm

The Secret Service wants to detect sarcasm on the internet?

Oh no, what a personal disaster.

anon50609448 · June 4, 2014, 6:05pm

To be fair, I’m coming at this from the perspective of someone who has a lot of negative interactions with IT projects because of the insane way people write specifications. I’m pretty sure I know exactly what they mean and it makes sense. I’m also pretty sure that is not what they are going to get.

anon34812172 · June 4, 2014, 6:27pm

You are dam/damn/darn/durn right! Projects are mostly nightmares because nobody can read anybody else’s code. I can barely even read my own code from earlier today. Even with good comments and a plan. I was on the phone yesterday with a client and she was asking me simple questions about what I did and I finally just said, “Well fuck, I’m going to have to get back to you because I can’t describe it accurately right this second even though it’s a simple, straightforward question. Sorry.”

anon34812172 · June 4, 2014, 6:34pm

Here’s how I’d approach this whole false positive thing with this project.

First, read more on them:

A set of false positives here could be benchmarked in the testing phase:

After extensive cataloging and getting your detection system in place, run tests on PEOPLE. Presumably a range of people. Run tests with a known outcome, so, say, you KNOW it’s a sarcasm. Then after many runs, you will know the error rate in your detection system. You can tune it to make it more sensitive, and therefore reduce the false positive rate. And to detect “false positives”, you will be making a list of the stuff that causes you the most problems. When one comes up on the TwitTube, you have “detected a false positive.” Because you determined most of them a priori.

That’s what I took it to mean, and how I’d deal with it.

But no, I wouldn’t ever have said “detect false positives” in the first place. That’s unintelligent n00b speak.

daneel · June 4, 2014, 6:43pm

What if my apparently earnest sarcasm is, in fact, sarcastic?

anon34812172 · June 4, 2014, 6:46pm

Then you belong in Inception, because you have couched your sarcasm in an earnest remark within an irony within an idiom. You are doing well and have passed from Padawan stage to full on Jedi Language Knight status; here is your Strunk & Light Saber.

KarlS · June 4, 2014, 7:05pm

[quote=“awjt, post:27, topic:33497”]
You can tune it to make it more sensitive, and therefore reduce the false positive rate.[/quote]Other way round, isn’t it?

[quote=“awjt, post:27, topic:33497”]And to detect “false positives”, you will be making a list of the stuff that causes you the most problems. When one comes up on the TwitTube, you have “detected a false positive.” Because you determined most of them a priori.
[/quote]If I understand you correctly, then I am not sure that makes much sense. If you are able to predict false positives, then you just return a “negative” answer and avoid them. Otherwise you would end with the willfully obtuse system suggested by the phrasing in the requirements: a system that answers a binary question, ideally correctly, but sometimes incorrectly against better knowledge.

anon34812172 · June 4, 2014, 7:28pm

What I said was correct: If it’s less sensitive, then there are more type 1 errors, or more false positives. If it’s more sensitive, then the true positive rate increases and the type 1 error rate decreases because there are fewer false+.

For the second one:

I’m suggesting white, black and gray. You determine gray (false positives) a priori and by lack of fitting into white or black. Whenever you come across one of those annoying ones that are on your list from the testing, it isn’t positive or negative… it goes into the gray bin. You’ve detected it. OR, an alternate path is that something comes up that isn’t on ANY list; that also goes into the gray bin.

Something like this: I want to 1010110101100010010101010011 the 1010101011010101010111

What the heck is that? False positive? I don’t know, says the detector. Throw it in the gray bin.

KarlS · June 4, 2014, 8:21pm

No. Just look at the trivial classifiers.

recall (sensitivity) = true positives / (true positives + false negatives)

If you just return “negative” all the time, then it’s 0/(0+FN), i.e. zero sensitivity and not a single false positive anywhere.

If you return “positive” all the time, then it’s TP/(TP+0), i.e. perfect sensitivity but also also maximal FP.

Regarding the other one, I see now that you mean some kind of confidence measure. You can do that. I interpreted the whole thing as a binary classification task where withholding judgment is effectively a negative.

anon34812172 · June 4, 2014, 8:46pm

Sensitivity IS the true positive rate. Less sensitive = lower true positives and therefore higher false positive. Less sensitive does not mean a lower false positive rate. Less sensitive means a higher false positive rate!

if sensitivity is .8 then type 1 error is .2
If sensitivity is .9 then your type 1 error is .1
If the sensitivity is .95 then type 1 error is 0.05

As sensitivity increases, the error rate (FP rate) decreases.

The names of the boxes:
True Positive | False Negative
False Positive | True Negative

KarlS · June 4, 2014, 9:02pm

No. Really no. I am not sure what is going wrong here and I would like to stop.

anon34812172 · June 4, 2014, 9:05pm

It’s basic epidemiology. You’re probably just coming at it from a different background. That’s fine.

KarlS · June 4, 2014, 9:15pm

Computational linguistics. It’s just that the disagreement is so surprisingly fundamental. I do not even agree with your labels for your chart. I may be making some horrible mistake here, but I am really not seeing it.

anon34812172 · June 4, 2014, 9:17pm

I don’t think you’re making a mistake. I am talking probability, and you are talking counts. Use counts in your equations and your logic is sound, so it’s the weird inverse space of putting things into a probability context that is screwing up this conversation. Sorry to confuse. We are both right.

bardfinn · June 4, 2014, 10:38pm

…and runs in IE8.

Can’t tell if serious, or …

Cowicide · June 4, 2014, 11:11pm

Ohhh, a SARCASM detector, well that’s a REAL useful invention!

Whoop! whoop! whoop! whoop!

Cowicide · June 4, 2014, 11:14pm

What if my apparently earnest sarcasm is, in fact, sarcastic?

I detected a tinge of sarcasm in that statement.

Topic		Replies	Views
French invent sarcasm detector boing	15	4072	July 15, 2013
We need a sarcasm mark, happy mutant people! general topics	70	4189	February 18, 2017
Google's new product identifies whether a comment could be perceived as “toxic" to a discussion boing	59	4482	February 28, 2017
Company says facial features reveal terrorists and pedophiles 80% of the time boing	133	8777	May 30, 2016
ACLU sues TSA to make it explain junk science "behavioral detection" program boing	43	4456	March 29, 2015

Secret service developing a sarcasm detector. Oh great

Related topics