Microsoft is nailing speech recognition, so Cortana's future looks bright

Cortana on Windows 10

Microsoft is boasting that it's top of the tree when it comes to speech recognition these days, and of course that bodes well for digital assistant Cortana.

Apparently, according to Microsoft's chief speech scientist, Xuedong Huang, the company just set a new record in terms of the industry standard Switchboard speech recognition benchmark, hitting a word error rate (WER) of 6.3%.

Deep neural nets

Both Microsoft and IBM are driving further ahead with better speech recognition thanks to deep neural networks which are really paying dividends these days, and helping to develop the technology at speed.

Recent advances in the deep neural net field have been critical in terms of smoothing over these systems, and they include a new type of cross-layer network connection, along with the use of Microsoft's Computational Network Toolkit (CNTK).

The CNTK bristles with optimizations that enable these networks to run much faster, making use of the power of many GPUs in parallel to hone routines further.

Microsoft stated: "CNTK is already used by the team that helps Microsoft's virtual assistant, Cortana. By combining the use of CNTK and GPU clusters, Cortana's speech training is now able to ingest 10 times more data in the same amount of time."

The end goal is, of course, to have Cortana be able to understand every word someone is saying just as effectively as a real person can. And just maybe that's not as far off as we might imagine…

Via: WinBeta

Darren is a freelancer writing news and features for TechRadar (and occasionally T3) across a broad range of computing topics including CPUs, GPUs, various other hardware, VPNs, antivirus and more. He has written about tech for the best part of three decades, and writes books in his spare time (his debut novel - 'I Know What You Did Last Supper' - was published by Hachette UK in 2013).