How can you say such things?!?: Recognizing disagreement in informal political argument

From AcaWiki
Jump to: navigation, search

Citation: Rob Abbott, Marilyn Walker, Pranav Anand, Jean E. Fox Tree, Robeson Bowmani, and Joseph King (2011) How can you say such things?!?: Recognizing disagreement in informal political argument. Proceedings of the Workshop on Language in Social Media (RSS)
Internet Archive Scholar (search for fulltext): How can you say such things?!?: Recognizing disagreement in informal political argument
Tagged: Computer Science (RSS) disagreement (RSS), online argumentation (RSS), NLP (RSS), natural language processing (RSS), Mechanical Turk (RSS), cue words (RSS), discourse analysis (RSS), hand annotation (RSS), discourse markers (RSS), forum posts (RSS)

Summary

This paper builds a manually annotated corpus of informal argumentation built from a political bulletin board discussion forum, 4forums.com. The ARGUE corpus consists of 11,216 discussions and 109,553 posts by 2764 authors, and was annotated using Mechanical Turk.

The authors define a "quote-response pair" (Q-R pair) where the response is a portion of a post directly following a quotation. 10,003 Q-R pairs were chosen, using 20 discourse markers/cue words (the 17 that occurred at least 50 times in the quote response were: actually, and, because, but, I believe, I know, I see, I think, just, no, oh, really, so, well, yes, you know, you mean).

Turkers were asked whether posters agree/disagree; use fact/emotion; attack or insult; use sarcasm; are nice/nasty.

Cue Word Findings

Marking disagreement

  • Really
  • No
  • Actually
  • But
  • So
  • You mean

Marking agreement

  • Yes
  • I know
  • I believe
  • I think
  • Just

Nearly even

  • And
  • Because
  • Oh
  • I see
  • You know
  • Well

Sarcasm Markers

Unsurprisingly, sarcasm was hard to detect; agreement was low. One interesting finding is that "oh", rather than indicating sarcasm "was the discourse marker with the highest rating of feelings over fact". Further, sarcasm was not correlated with disagreement. However, sarcasm is emotional, personal, and nastier.

Most sarcastic

That said, these are the most indicative of sarcasm:

  • You mean
  • Oh
  • Really
  • So
  • I see

Least sarcastic

  • I think
  • I believe
  • Actually

Methods

Besides the Mechanical Turk annotation, the authors used the Weka machine learning toolkit with Naive-Bayes and JRip.

Features investigated

  • MetaPost (posterid, time between posts, etc.)
  • Unigrams, Bigrams
  • Cue words (initial unigram, bigram, and trigram)
  • Punctuation (collapsed into 3 categories: ??, !!, ?!)
  • LIWC measures and frequencies
  • Dependencies (from the Stanford Parser)
  • Generalized dependencies (POS of the head word; opinion polarity of both words) -- see Somasundaran & Wiebe (2009)

Selected References


Discourse words