QUANTA

Monday, April 11, 2011


Automated Processing of Wikileaks Cables Reveals U.S. Friends, Foes

Natural Language Processing of nearly 4,000 U.S. diplomatic cables reveals fraying relations with traditional allies, and a few other surprises

CHRISTOPHER MIMS 04/11/2011

Software capable of determining the positive or negative sentiment of sentences written by humans has been unleashed on 3,891 U.S. diplomatic cables released by WikiLeaks, and the results are a systematic, if preliminary, analysis of which countries are our besties and which are in the doghouse.

The analysis was part of a class project (pdf) by a pair of computer science undergraduates at Stanford, Xuwen Cao and Beyang Li. By looking at how often a country was mentioned, as well as whether or not it was cast in a positive or negative light, Cao and Li identified four clusters to which countries could belong: countries we don't like that aren't mentioned very often (red), countries we sort-of don't like that aren't mentioned very often (teal), and countries spoken of positively that also aren't mentioned very often (blue).


Source and/or read more: http://goo.gl/x2I32

Publisher and/or Author and/or Managing Editor:__Andres Agostini ─ @Futuretronium at Twitter! Futuretronium Book at http://3.ly/rECc