Smart speakers are becoming increasingly ubiquitous in society and are now used for satisfying a variety of information needs, from asking about the weather or traffic to accessing the latest breaking news information. Their growing use for news and information consumption presents new questions related to the quality, source diversity, and comprehensiveness of the news-related information they convey. These questions have significant implications for voice assistant technologies acting as algorithmic information intermediaries, but systematic information quality audits have not yet been undertaken. To address this gap, we develop a methodological approach for evaluating information quality in voice assistants for news-related queries. We demonstrate the approach on the Amazon Alexa voice assistant, first characterising Alexa's performance in terms of response relevance, accuracy, and timeliness, and then further elaborating analyses of information quality based on query phrasing, news category, and information provenance. We discuss the implications of our findings for future audits of information quality on voice assistants and for the consumption of news information via such algorithmic intermediaries more broadly.
This repository contains the source code necessary to facilitate replicating the data collection methods used in the paper described above.
Python 2+ required (for python build)
Install required modules (boto3, sqlite3, requests, speech_recognition, etc).
pip install -r requirements.txt
Create a config.ini file to securely store your private AWS access and secret keys
[auth]
AWSAccessKeyId=XXXXXXXXXX
AWSSecretKey=XXXXXXXXXX
-
Fetching Google's Top 20 Daily Search Trends
python google_trends.py
-
Synthesizing text queries to voice queries using Amazon Polly
python speech_synthesis.py --args
-
Issuing voice queries to the Alexa smart speaker and saving voice responses
python app.py --args
-
Uploading Alexa voice responses to S3 for transcription
python s3_upload.py
-
Transcribing Alexa voice responses with Amazon Transcribe
python speech_transcription.py
This work is supported by the National Science Foundation Grant, Award IIS-1717330. The authors would like to thank Sophie Liu, Victoria Cabales, Benjamin Scharf, and the Knight Lab Studio at Northwestern University for their support and assistance in a related pilot study.