{"id":6321,"date":"2019-06-24T09:00:50","date_gmt":"2019-06-24T00:00:50","guid":{"rendered":"https:\/\/www.waseda.jp\/inst\/wias\/?p=6321"},"modified":"2019-06-04T12:54:09","modified_gmt":"2019-06-04T03:54:09","slug":"%e9%87%8f%e7%9a%84%e3%83%86%e3%82%ad%e3%82%b9%e3%83%88%e5%88%86%e6%9e%90%e3%81%ab%e3%82%88%e3%82%8b%e5%9b%bd%e9%9a%9b%e6%94%bf%e6%b2%bb%e7%a0%94%e7%a9%b6%e3%80%80%e6%b8%a1%e8%be%ba%e8%80%95%e5%b9%b3-2","status":"publish","type":"post","link":"https:\/\/www.waseda.jp\/inst\/wias\/news-en\/2019\/06\/24\/6321\/","title":{"rendered":"International Political Research through Quantitative Text Analysis\u3000Kohei Watanabe, Assistant Professor"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-5838 size-thumbnail\" src=\"https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/15db2e3254ffa460ce7209d14ca04c1e-360x270.jpg\" alt=\"\" width=\"360\" height=\"270\" srcset=\"https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/15db2e3254ffa460ce7209d14ca04c1e-360x270.jpg 360w, https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/15db2e3254ffa460ce7209d14ca04c1e-720x540.jpg 720w\" sizes=\"auto, (max-width: 360px) 100vw, 360px\" \/><\/p>\n<ul>\n<li><a href=\"https:\/\/www.waseda.jp\/inst\/wias\/other-en\/2018\/06\/01\/5417\/\" target=\"_blank\" rel=\"noopener\"><u><span style=\"color: #0066cc;\">Kohei Watanabe,\u3000Assistant Professor<\/span><\/u><\/a><\/li>\n<\/ul>\n<h3>What is Quantitative Text Analysis?<\/h3>\n<p>Quantitative text analysis is a technique that uses computers to process written or spoken words, or \u201cnatural language,\u201d to extract information for the social sciences and humanities. 
In political communication research, which is my field of study, quantitative text analysis is used for many applications, including inferring the political ideologies of candidates by analyzing election manifestos and politicians\u2019 speeches, and uncovering concealed political bias in newspaper articles. Quantitative text analysis is also used for tasks such as classifying books according to themes and, in a different kind of application, identifying the authors of anonymously published literary works.<\/p>\n<p>Research on natural language processing began with work on automated translation as an application of decryption technology during the Second World War. In the 1990s, the technology advanced dramatically as computer performance improved, electronic data became more widely available, and the Internet emerged; as a result, statistical analysis techniques for large-scale text data were established. From the early 2000s to the present day, a variety of machine learning models that can be used for quantitative text analysis have been developed.<\/p>\n<h3>Development of the Quantitative Text Analysis Tool Quanteda<\/h3>\n<p>I obtained a Ph.D. from the London School of Economics and Political Science (LSE), and subsequently worked there as a Research Officer before returning to Japan in 2018. During that period, I became involved, and I remain involved today, in the development of <a href=\"https:\/\/quanteda.io\/\" target=\"_blank\" rel=\"noopener\">Quanteda<\/a> (an abbreviation of Quantitative Analysis of Textual Data), a text analysis package written in the R programming language.<br \/>\nQuanteda is a tool for quantitative text analysis that Kenneth Benoit from the LSE began developing in 2012 with support from the European Research Council, and Version 1.0 was announced in January 2018. 
A variety of software has been developed for text analysis to date, but until now none has been both multifunctional and efficient.<br \/>\nEven at its present stage, Quanteda is more multifunctional than tm and tidytext, the previous R packages, and gensim, a Python package. It is also superior in terms of processing speed and memory usage. It is easy for researchers in the humanities to use and has gained the support of leading political scientists in North America and Europe.<br \/>\nIt is further differentiated by its ability to support Asian languages such as Chinese, Japanese, and Korean, thanks to its compatibility with Unicode, which can represent text in virtually any language. By using the same analytic tools and models as English-speaking researchers, researchers working on these languages can make international comparisons and publish in leading journals. I think that researchers in Asia, at least those in Japan, have great expectations of Quanteda.<\/p>\n<div id=\"attachment_5842\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-5842 size-medium\" src=\"https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/d2a2527386ba3abfc7d314f5b582e3cd-610x409.jpg\" alt=\"\" width=\"610\" height=\"409\" srcset=\"https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/d2a2527386ba3abfc7d314f5b582e3cd-610x409.jpg 610w, https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/d2a2527386ba3abfc7d314f5b582e3cd-768x515.jpg 768w, https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/d2a2527386ba3abfc7d314f5b582e3cd-940x630.jpg 940w, https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/d2a2527386ba3abfc7d314f5b582e3cd.jpg 1600w\" sizes=\"auto, (max-width: 610px) 100vw, 610px\" \/><p class=\"wp-caption-text\"><center>The Quanteda development team. 
Professor Kenneth Benoit is to the right of the lecturer Kohei Watanabe in the center.<\/center><\/p><\/div>\n<h3>Analyzing Changes in Other Countries\u2019 Perceptions of the United States from Newspaper Articles<\/h3>\n<p>Using Quanteda, I analyzed newspaper articles published in Japan and Britain to examine how other countries\u2019 perceptions of the United States have correlated with the nation\u2019s domestic political situation over the past 30 years.<br \/>\nI performed a keyword search on a database of Japanese and British articles, using the terms (\u201cUnited States\u201d OR \u201cUS\u201d) AND (\u201cgovernment\u201d OR \u201cpolitics\u201d OR \u201cdiplomacy\u201d OR \u201cmilitary\u201d), and used a semi-supervised vector space model (<a href=\"https:\/\/github.com\/koheiw\/LSS\" target=\"_blank\" rel=\"noopener\">Latent Semantic Scaling<\/a>) to quantitatively analyze how the 141,746 articles extracted from <i>The Asahi Shimbun<\/i> and the 176,399 articles extracted from <i>The Guardian<\/i> covered the United States from the perspective of framing.<\/p>\n<div id=\"attachment_5843\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-5843 size-medium\" src=\"https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/5ddbd2f0dde3151c66238e0c645264ca-610x298.jpg\" alt=\"\" width=\"610\" height=\"298\" srcset=\"https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/5ddbd2f0dde3151c66238e0c645264ca-610x298.jpg 610w, https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/5ddbd2f0dde3151c66238e0c645264ca-768x375.jpg 768w, https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/5ddbd2f0dde3151c66238e0c645264ca-940x459.jpg 940w\" sizes=\"auto, (max-width: 610px) 100vw, 610px\" \/><p class=\"wp-caption-text\"><center>Framing of the United States by <i>The Asahi Shimbun<\/i><\/center><\/p><\/div>\n<p>&nbsp;<\/p>\n<div id=\"attachment_5844\" class=\"wp-caption aligncenter\"><img 
loading=\"lazy\" decoding=\"async\" class=\"size-medium wp-image-5844\" src=\"https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/efee1ed0bb1ea61d77eb5d917c387e08-610x297.jpg\" alt=\"\" width=\"610\" height=\"297\" srcset=\"https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/efee1ed0bb1ea61d77eb5d917c387e08-610x297.jpg 610w, https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/efee1ed0bb1ea61d77eb5d917c387e08-768x373.jpg 768w, https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/efee1ed0bb1ea61d77eb5d917c387e08-940x457.jpg 940w\" sizes=\"auto, (max-width: 610px) 100vw, 610px\" \/><p class=\"wp-caption-text\"><center>Framing of the United States by <i>The Guardian<\/i><\/center><\/p><\/div>\n<p>When the measured values are averaged over the terms of Presidents Reagan (1981 \u2013 1989), Bush (H.) (1989 \u2013 1993), Clinton (1993 \u2013 2001), Bush (W.) (2001 \u2013 2009), Obama (2009 \u2013 2017), and Trump (2017 \u2013 ), it is clear that perceptions of the United States have changed over a 30-year period.<\/p>\n<div id=\"attachment_5839\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-5839 size-large\" src=\"https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/cd2d5dfcfeb006e904abace87c055eca-940x846.jpg\" alt=\"\" width=\"940\" height=\"846\" srcset=\"https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/cd2d5dfcfeb006e904abace87c055eca-940x846.jpg 940w, https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/cd2d5dfcfeb006e904abace87c055eca-610x549.jpg 610w, https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/cd2d5dfcfeb006e904abace87c055eca-768x691.jpg 768w\" sizes=\"auto, (max-width: 940px) 100vw, 940px\" \/><p class=\"wp-caption-text\"><center>Changes in Perception of the United States in Japan<\/center><\/p><\/div>\n<p>&nbsp;<\/p>\n<div id=\"attachment_5840\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" 
decoding=\"async\" class=\"wp-image-5840 size-large\" src=\"https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/8c2afc83c445cf4613d845c6cd9bef9c-940x846.jpg\" alt=\"\" width=\"940\" height=\"846\" srcset=\"https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/8c2afc83c445cf4613d845c6cd9bef9c-940x846.jpg 940w, https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/8c2afc83c445cf4613d845c6cd9bef9c-610x549.jpg 610w, https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/8c2afc83c445cf4613d845c6cd9bef9c-768x691.jpg 768w\" sizes=\"auto, (max-width: 940px) 100vw, 940px\" \/><p class=\"wp-caption-text\"><center>Changes in Perception of the United States in the United Kingdom<\/center><\/p><\/div>\n<h3>Analyzing Russia\u2019s International Internet Propaganda Strategy<\/h3>\n<p>Another example of research using Quanteda is its application in analyzing articles from the state-controlled Russian website Sputnik News (<a href=\"https:\/\/sputniknews.com\/\" target=\"_blank\" rel=\"noopener\">https:\/\/sputniknews.com\/<\/a>). 
Sputnik News has editorial departments in more than 30 languages, including Japanese, English, Spanish, French, and German.<br \/>\nI classified the 51,651 English-language articles that have appeared on the website since July 2017 into six topics: economy, politics, society, diplomacy, military, and nature. The seed words for each topic are listed below.<\/p>\n<table style=\"width: 100%; border-collapse: collapse;\" border=\"1\">\n<tbody>\n<tr style=\"height: 25px;\">\n<td style=\"width: 15.45%; height: 25px;\"><strong>Topic<\/strong><\/td>\n<td style=\"width: 84.42%; height: 25px;\"><strong>Seed words<\/strong><\/td>\n<\/tr>\n<tr style=\"height: 25px;\">\n<td style=\"width: 15.45%; height: 25px;\">economy<\/td>\n<td style=\"width: 84.42%; height: 25px;\">market*, money, bank*, stock*, bond*, industry, company, shop*<\/td>\n<\/tr>\n<tr style=\"height: 25px;\">\n<td style=\"width: 15.45%; height: 25px;\">politics<\/td>\n<td style=\"width: 84.42%; height: 25px;\">parliament*, congress*, party leader*, party member*, voter*, lawmaker*, politician*<\/td>\n<\/tr>\n<tr style=\"height: 25px;\">\n<td style=\"width: 15.45%; height: 25px;\">society<\/td>\n<td style=\"width: 84.42%; height: 25px;\">police, prison*, school*, hospital*<\/td>\n<\/tr>\n<tr style=\"height: 25px;\">\n<td style=\"width: 15.45%; height: 25px;\">diplomacy<\/td>\n<td style=\"width: 84.42%; height: 25px;\">ambassador*, diplomat*, embassy, treaty<\/td>\n<\/tr>\n<tr style=\"height: 25px;\">\n<td style=\"width: 15.45%; height: 25px;\">military<\/td>\n<td style=\"width: 84.42%; height: 25px;\">military, soldier*, air force, marine, navy, army<\/td>\n<\/tr>\n<tr style=\"height: 25px;\">\n<td style=\"width: 15.45%; height: 25px;\">nature<\/td>\n<td style=\"width: 84.42%; height: 25px;\">water, wind, sand, forest, mountain, desert, animal, human<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p>The graph below shows how the articles are distributed across topics for each country covered.<\/p>\n<p>There are 
many articles on \u201csociety\u201d and \u201cpolitics\u201d for the United States (us) and Britain (gb) because Sputnik News targets these two countries for propaganda. The \u201cnature\u201d topic is somewhat larger for Japan (jp) because there are many articles related to the environment, such as those covering nuclear power. As one would expect, \u201cmilitary\u201d is a prominent topic in the coverage of Syria (sy) and Afghanistan (af), but the larger \u201csociety\u201d topic for Sweden (se) gives a sense of Russia\u2019s intention to prevent Sweden from joining the North Atlantic Treaty Organization (NATO). I presented this research at the <a href=\"https:\/\/popularmobilization.net\/2018\/08\/14\/new-research-paper-on-sputnik-news-to-be-presented-at-ecpr-hamburg\/\" target=\"_blank\" rel=\"noopener\">ECPR General Conference<\/a> held in Hamburg, Germany, in August 2018.<\/p>\n<div id=\"attachment_5845\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-5845 size-medium\" src=\"https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/3524272b960bd3815fc73f3420561240-610x408.jpg\" alt=\"\" width=\"610\" height=\"408\" srcset=\"https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/3524272b960bd3815fc73f3420561240-610x408.jpg 610w, https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/3524272b960bd3815fc73f3420561240-768x513.jpg 768w, https:\/\/www.waseda.jp\/inst\/wias\/assets\/uploads\/2018\/11\/3524272b960bd3815fc73f3420561240-940x628.jpg 940w\" sizes=\"auto, (max-width: 610px) 100vw, 610px\" \/><p class=\"wp-caption-text\"><center>Share of Countries and Topics in the English-language Edition of Sputnik News<\/center><\/p><\/div>\n<h3>Media and the Future of Quantitative Text Analysis<\/h3>\n<p>Media research is my original field of study. 
Our information environment is set to shift steadily in an undesirable direction if the media is left unchecked, a process that is exemplified by the proliferation of so-called fake news and the increasing partisanship of newspapers. In order to halt this shift, I believe that more researchers must conduct analyses from diverse perspectives and identify problems.<br \/>\nIn this respect, I am aware of the significance of developing Quanteda as \u201ca tool that individuals can use\u201d to efficiently process the large volumes of data that continue to be created every day, without relying on large-scale computers or huge amounts of manpower.<\/p>\n<p style=\"text-align: right;\">Interview and Composition: Ayako Yamamoto<br \/>\nIn cooperation with: Waseda University Graduate School of Political Science J-School<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Kohei Watanabe,\u3000Assistant Professor What is Quantitative Text Analysis? Quantitative text analysis is a techni [&hellip;]<\/p>\n","protected":false},"author":6,"featured_media":5838,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[95],"tags":[73,107],"class_list":["post-6321","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-news-en","tag-research-en","tag-spotlight-en"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.waseda.jp\/inst\/wias\/wp-json\/wp\/v2\/posts\/6321","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.waseda.jp\/inst\/wias\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.waseda.jp\/inst\/wias\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.waseda.jp\/inst\/wias\/wp-json\/wp\/v2\/users\/6"}],"replies":[{"embeddable":true,"href":"https:\/\/www.waseda.jp\/inst\/wias\/wp-json\/wp\/v2\/comments?post=6321"}],"version-history":[{"count":0,"href":"https:\/\/www.wased
a.jp\/inst\/wias\/wp-json\/wp\/v2\/posts\/6321\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.waseda.jp\/inst\/wias\/wp-json\/wp\/v2\/media\/5838"}],"wp:attachment":[{"href":"https:\/\/www.waseda.jp\/inst\/wias\/wp-json\/wp\/v2\/media?parent=6321"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.waseda.jp\/inst\/wias\/wp-json\/wp\/v2\/categories?post=6321"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.waseda.jp\/inst\/wias\/wp-json\/wp\/v2\/tags?post=6321"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}