1. 程式人生 > >Amazon.com: The Voice in the Machine: Building Computers That Understand Speech (The MIT Press) (9780262533294): Roberto Pieracc

Amazon.com: The Voice in the Machine: Building Computers That Understand Speech (The MIT Press) (9780262533294): Roberto Pieracc

I enjoyed reading this book! It is a comprehensive description of the evolution of the speech technologies focused on the major results of research and the changes of directions that the technology had in the last decades. The last chapter is about the advent of Siri and what will happen in the next future. Reading the book you will encounters many and many protagonists with their anecdotes, ideas and achievements.

I see two main categories of people that might gain great advantage by reading this book. The first are those not involved in the evolution of speech technologies, the second are the insiders, who were involved either in research or at any level, even non technical, in the speech industry. For the former the book explains how a complex technology evolves in reality with all the roadblocks, turns, and steep paths while the author puts all his effort in explaining very complex engineering problems without formulas or technicalities, but using simple and enlightening analogies and examples. The book will help them to understand what is behind Siri, Google Voice, or every other speaking machine. For the latter, the professionals of the voice science and industry, it is very interesting to see how the author assembles a map of the past and current technology, the motivations and the forces behind it, and shows how all the pieces fit together in a technological landscape of the area in which they are currently engaged. For them it is like stepping out for a minute to gain a vantage point perspective and different points of view.

I belong to the second category because I spent 20 years in R&D in the research lab in Italy where Roberto Pieraccini moves his first steps and then I was deep involved in the newborn speech industry.

A last little advice is for the readers who would like to move from the author's examples to more technical readings. I found the Notes section very interesting, like a book inside the book. You might read it from the top to the bottom and you will find there some formulas, pointers to literature and complementary thoughts.

Now, I'll eagerly wait a continuation from Roberto Pieraccini to look forward instead of backward, but I strongly suggest to read this marvelous book now.

相關推薦

Amazon.com: The Voice in the Machine: Building Computers That Understand Speech (The MIT Press) (9780262533294): Roberto Pieracc

I enjoyed reading this book! It is a comprehensive description of the evolution of the speech technologies focused on the major results of research and the

ssh: Could not resolve hostname git.*****-inc.com : Temporary failure in name resolution fatal: The remote end hung up unexpectedly

配置 soft mic target clas 無法執行 ssh pull 開發   問題出現的情景:使用git pull拉取開發的代碼到測試服務器,報錯:   ssh: Could not resolve hostname git.****-inc.com : Tempo

The fusion of AI, ML, and Voice in the Contact Center

Powerful technologies are fusing in the contact center with artificial intelligence (AI), machine learning, and voice recognition being the focal point in

Algorithmia Survey: Large Enterprises Have Taken the Lead in Machine Learning

Companies of all sizes are not satisfied with their machine learning process and various challenges to widespread adoption remain. SEATTLE, Oct. 16, 2018 (

A machine and human’s perception of the world in Augmented Reality

A computer’s understanding of space for Augmented RealityThe goal of Augmented Reality is to superimpose the computer’s perception of space with human’s un

Errors while building APK. You can find the errors in the 'Messages' view.

最近在用Android Studio打包簽名apk時遇到了一個問題,經過查詢資料,順利解決。 問題一:Messages報錯如下: Error:Execution failed for task ':app:lintVitalRelease'. >

Building with Watson: Connect the dots in your domain-specific content

IBM Watson can extract helpful insights about your data out of the box. Like a knowledgeable friend, it “reads” through data to show you its themes and im

Hyperion: Building the Largest In memory Search Tree

Introduction   索引在資料管理中起到很重要的作用,很多索引結構都會採用訪問速度快而且記憶體消耗少的trie樹,但一般常見的trie樹索引結構都強調效率而忽視記憶體的效率,他們的效率雖然高,但記憶體的消耗比較大。這篇文章提出了一種新的樹形結構----Hyperion,在效率上做到對範圍查詢和點

關於jmeter命令行執行.jmx文件出現Error in NonGUIDriver java.lang.RuntimeException: Could not find the TestPlan class的問題

使用 lang exception ava 出現 問題 drive test bug jmeter命令行執行.jmx文件時,有時回出現Error in NonGUIDriver java.lang.RuntimeException: Could not find the T

scale the service in the swarm

docker swarm 一旦你部署了一個服務到swarm集群中,你就可以使用docker命令行來伸縮擴容運行該服務的容器數量。運行在多個容器的一個服務叫做tasks 任務。$docker machine ssh manager1$ docker service scale <SERVICE-I

【二分】Petrozavodsk Winter Training Camp 2017 Day 1: Jagiellonian U Contest, Monday, January 30, 2017 Problem A. The Catcher in the Rye

什麽 不同 stdin n) clas sqrt ios 這份 std 一個區域,垂直分成三塊,每塊有一個速度限制,問你從左下角跑到右上角的最短時間。 將區域看作三塊折射率不同的介質,可以證明,按照光路跑時間最短。 於是可以二分第一個入射角,此時可以推出射到最右側邊界上的位

C語言考題:Find the key in the picture,good luck..

int c語言 bsp pict fin find print str1 bin str1="Find the key in the picture,good luck.." for i in range(256): for j in range(39):

[Angular] Read Custom HTTP Headers Sent by the Server in Angular

conf nor update names fault pan table tom color By default the response body doesn’t contain all the data that might be needed in y

1503.02531-Distilling the Knowledge in a Neural Network.md

gets 任務 其中 不一致 ans softmax special abi use 原來交叉熵還有一個tempature,這個tempature有如下的定義: $$ q_i=\frac{e^{z_i/T}}{\sum_j{e^{z_j/T}}} $$ 其中T就是temp

maven web項目的web.xml報錯The markup in the document following the root element must be well-formed.

utf-8 style sta 元素 nbsp 地形 很好 ati instance maven項目裏面的web.xml開頭約束是這樣的 <?xml version="1.0" encoding="UTF-8"?> <web-app xmlns:xsi=

tomcat啟動時,內存溢出,Exception: java.lang.OutOfMemoryError thrown from the UncaughtExceptionHandler in thread "main"

通過 per memory tomcat配置 -xmx ... nbsp ont ron 問題原因   通過tomcat啟動項目,也許是因為項目太大,配置的內存不夠用了。老是報內存溢出的問題。 解決辦法 1.選中項目 右鍵 run as -》Run Configu

error LOADING Redis is loading the dataset in memory問題解決

分享一下我老師大神的人工智慧教程!零基礎,通俗易懂!http://blog.csdn.net/jiangjunshow 也歡迎大家轉載本篇文章。分享知識,造福人民,實現我們中華民族偉大復興!        

Advice On Purchasing The Best Concrete Block Machine On The Market

Once you buy a concrete block machine, you normally have a construction business of some sort. You are accountable for laying the building blocks

The Stock in Chart

Use chart to show stock chart void createchart() { // Create a chart and specify its location. chart1.Series.Clear(); chart1.Cha

蒸餾神經網路(Distill the Knowledge in a Neural Network) 論文筆記 蒸餾神經網路(Distill the Knowledge in a Neural Network) 論文筆記

轉 蒸餾神經網路(Distill the Knowledge in a Neural Network) 論文筆記 2017年08月06日 16:19:48 haoji00