Project

KEDS data sets

These data were gathered by Phil Schrodt and his team at the University of Kansas.

The Global Terrorism Database

The Global Terrorism Database (GTD), part of the START project housed at the University of Maryland, is an open-source database including terrorist events around the world from 1970 to 2007. We obtained raw data from the GTD contact site for our analysis.

The Institute for the Study of Violent Groups

The Institute for the Study of Violent Groups (ISVG) has an extensive, human-curated collection of 130,000 terrorist events dating from January 2003 to the present. We analyzed the version of the ISVG database as of July 2008. To request this data, please contact them directly.

The Reuters Corpus Volumes 1 and 2

The Reuters Corpus, released in 2000, is a collection of news stories for use in research and development of natural-language processing, and machine learning systems. This corpus is available from NIST. Tokenized versions of the corpus are available at David Lewis' resource page.

The Ares News Database

The Ares news data set contains all stories about countries in the Gulf and the Levant from the Associated Press Archives (1998-2007), BBC Archives (1998-2007), Agence France Presse archives (1998 - 2007), Washington Post archives (1977-2007), Boston Globe (1979-2007), Scripps Howard newswire (1990-2007), and Houston Chronicle (1985-2007). We developed a relational database system using Microsoft SQL Server for archiving and querying these stories.

This project was funded by the National Science Foundation ITR program from 2002-2008.