Text as Data: A New Framework for Machine Learning and the Social Sciences
Spiral-Bound | March 29, 2022
Justin Grimmer, Margaret E. Roberts, Brandon M. Stewart
★★★★☆+
from Up to 30 ratings
$36.87-Free Shipping
Text as Data: A New Framework for Machine Learning and the Social Sciences
1 / of1
A guide for using computational text analysis to learn about the social world
From social media posts and text messages to digital government documents and archives, researchers are bombarded with a deluge of text reflecting the social world. This textual data gives unprecedented insights into fundamental questions in the social sciences, humanities, and industry. Meanwhile new machine learning tools are rapidly transforming the way science and business are conducted. Text as Data shows how to combine new sources of data, machine learning tools, and social science research design to develop and evaluate new insights.
Text as Data is organized around the core tasks in research projects using text—representation, discovery, measurement, prediction, and causal inference. The authors offer a sequential, iterative, and inductive approach to research design. Each research task is presented complete with real-world applications, example methods, and a distinct style of task-focused research.
Bridging many divides—computer science and social science, the qualitative and the quantitative, and industry and academia—Text as Data is an ideal resource for anyone wanting to analyze large collections of text in an era when data is abundant and computation is cheap, but the enduring challenges of social science remain.
Overview of how to use text as data
Research design for a world of data deluge
Examples from across the social sciences and industry
Publisher: Princeton University Press
Original Binding: Trade Paperback
Pages: 360 pages
ISBN-10: 0691207550
Item Weight: 1.94 lbs
Dimensions: 7.0 x 1.38 x 10.0 inches
Customer Reviews: 4 out of 5 stars Up to 30 ratings
"Among the metaverse of possible books on Text as Data that could have been published . . . I was pleased that my universe produced this one. I will assign this book as a critical part of my own course on content analysis for years to come, and it has already altered and improved the coherence of my own vocabulary and articulation for several critical choices underlying the process of turning text into data. . . . Highly recommend."---James Evans, Sociological Methods & Research
Justin Grimmer is professor of political science and a senior fellow at the Hoover Institution at Stanford University. Twitter @justingrimmer Margaret E. Roberts is associate professor in political science and the Halıcıoğlu Data Science Institute at the University of California, San Diego. Twitter @mollyeroberts Brandon M. Stewart is assistant professor of sociology and Arthur H. Scribner Bicentennial Preceptor at Princeton University. Twitter @b_m_stewart
Quick shop
Choosing a selection results in a full page refresh.