programming_voice_interfaces.pdf

(18161 KB) Pobierz
Programming
Voice Interfaces
GIVING CONNECTED DEVICES A VOICE
Walter Quesada &
Bob Lautenbach
Programming Voice Interfaces
Giving Connected Devices a Voice
Walter Quesada and Bob Lautenbach
Beijing
Boston Farnham Sebastopol
Tokyo
Programming Voice Interfaces
by Walter Quesada and Bob Lautenbach
Copyright © 2018 Walter Quesada, Bob Lautenbach. All rights reserved.
Printed in the United States of America.
Published by O’Reilly Media, Inc., 1005 Gravenstein Highway North, Sebastopol, CA 95472.
O’Reilly books may be purchased for educational, business, or sales promotional use. Online editions are
also available for most titles (http://oreilly.com/safari). For more information, contact our corporate/insti‐
tutional sales department: 800-998-9938 or
corporate@oreilly.com.
Editors:
Susan Conant and Jeff Bleiel
Production Editor:
Shiny Kalapurakkel
Copyeditor:
Jasmine Kwityn
Proofreader:
Kim Cofer
October 2017:
First Edition
Indexer:
Angela Howard
Interior Designer:
David Futato
Cover Designer:
Karen Montgomery
Illustrator:
Rebecca Demarest
Revision History for the First Edition
2017-10-04:
First Release
The O’Reilly logo is a registered trademark of O’Reilly Media, Inc.
Programming Voice Interfaces,
the cover
image, and related trade dress are trademarks of O’Reilly Media, Inc.
While the publisher and the authors have used good faith efforts to ensure that the information and
instructions contained in this work are accurate, the publisher and the authors disclaim all responsibility
for errors or omissions, including without limitation responsibility for damages resulting from the use of
or reliance on this work. Use of the information and instructions contained in this work is at your own
risk. If any code samples or other technology this work contains or describes is subject to open source
licenses or the intellectual property rights of others, it is your responsibility to ensure that your use
thereof complies with such licenses and/or rights.
978-1-491-95600-7
[LSI]
Table of Contents
Preface. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii
1.
Introduction to Voice Interfaces and the IoT. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Welcome to a NUI World
Voice All the Things!
What Is NLP?
Speech-to-Text (STT)
Text-to-Speech (TTS)
PLS, SSML, and Other Acronyms
Experience Design
Purpose
One-Off Versus Conversational
Conversation Flows
Sample Utterances
Speech Synthesis Markup Language (SSML)
When to Use Visual Cues
Additional Design Considerations
Decisions, Decisions...
2
3
5
6
7
7
10
11
11
12
14
15
19
20
21
2.
Existing APIs and Libraries. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
Amazon Alexa
Alexa Skills Kit (ASK)
Alexa Voice Service (AVS)
Amazon Lex
Amazon Polly
Microsoft Cognitive Services, Cortana, and More
Google Cloud Speech API
Other Notable Services
Technical Architecture
Conceptual Architecture
23
25
26
29
30
31
32
33
35
35
iii
Zgłoś jeśli naruszono regulamin