We constructed a corpus of digitized texts containing about 4% of all books ever printed. Analysis of this corpus enables us to investigate cultural trends quantitatively. We survey the vast terrain of ‘culturomics,’ focusing on linguistic and cultural phenomena that were reflected in the English language between 1800 and 2000. We show how this approach can provide insights about fields as diverse as lexicography, the evolution of grammar, collective memory, the adoption of technology, the pursuit of fame, censorship, and historical epidemiology. Culturomics extends the boundaries of rigorous quantitative inquiry to a wide array of new phenomena spanning the social sciences and the humanities.
The Psychology of Coordination and Common Knowledge. Journal of Personality and Social Psychology, 107 (4), 657-676 (2014).
The Source of Bad Writing. The Wall Street Journal (2014).
Why academics stink at writing. The Chronicle of Higher Education (2014).
"The Decline of War and Conceptions of Human Nature" in The Forum: The Decline of War. International Studies Review, 15 (3), 400-405 (2013).
Why It Is Hard to Find Genes Associated With Social Science Traits: Theoretical and Empirical Considerations. American Journal of Public Health, 103 (No. S1), 152-166 (2013).
Obituaries - George A. Miller. American Psychologist, 68 (6), 467-468 (2013).
The Forum: The Decline of War. International Studies Review, 15, 396-419 (2013).
Taming the Devil within Us. Nature, 478, 309-311 (2011).
Quantitative analysis of culture using millions of digitized books. Science, 331, 176-182 (2011).