Connect with us

Technology

Microsoft builds world’s first human-like speech recognition system

Published

on

“Even five years ago, I wouldn’t have thought we could have achieved this. I just wouldn’t have thought it would be possible,” said Harry Shum, executive vice president who heads the Microsoft Artificial Intelligence and Research group.

microsoft-logo

New York, Oct 19 : In a major breakthrough in the field of speech recognition, Microsoft researchers have created a technology that accurately recognises the words in a conversation like humans do.

The team from Microsoft Artificial Intelligence and Research reported a speech recognition system that makes the same or fewer errors than professional transcriptionists.

The researchers reported a word error rate (WER) of 5.9 percent, down from the 6.3 percent WER the team reported just last month.

The 5.9 percent error rate is about equal to that of people who were asked to transcribe the same conversation, and it’s the lowest ever recorded against the industry standard “Switchboard” speech recognition task.

“We’ve reached human parity. This is an historic achievement,” said Xuedong Huang, the company’s chief speech scientist in a Microsoft blog post.

The milestone means that, for the first time, a computer can recognise the words in a conversation as well as a person would.

In doing so, the team has beat a goal they set less than a year ago – and greatly exceeded everyone else’s expectations as well.

“Even five years ago, I wouldn’t have thought we could have achieved this. I just wouldn’t have thought it would be possible,” said Harry Shum, executive vice president who heads the Microsoft Artificial Intelligence and Research group.

The research milestone comes after decades of research in speech recognition, beginning in the early 1970s with DARPA, the US agency tasked with making technology breakthroughs in the interest of national security.

“This accomplishment is the culmination of over 20 years of effort,” said Geoffrey Zweig, who manages the Speech & Dialog research group.

The milestone will have broad implications for consumer and business products that can be significantly augmented by speech recognition. That includes consumer entertainment devices like the Xbox, accessibility tools such as instant speech-to-text transcription and personal digital assistants such as Cortana.

“This will make Cortana (Microsoft personal assistant) more powerful, making a truly intelligent assistant possible,” Shum said.

To reach the human parity milestone, the team used Microsoft’s Computational Network Toolkit (CNTK), a home-grown system for deep learning that the research team has made available on GitHub via an open source license.

CNTK’s ability to quickly process deep learning algorithms across multiple computers running a specialised chip called a graphics processing unit vastly improved the speed at which the team was able to do research and, ultimately, reach human parity.

Moving forward, the researchers are working on ways to make sure that speech recognition works well in more real-life settings.

That includes places where there is a lot of background noise, such as at a party or while driving on the highway.

In the longer term, researchers will focus on ways to teach computers not just to transcribe the acoustic signals that come out of people’s mouths, but instead to understand the words they are saying.

“The next frontier is to move from recognition to understanding,” Zweig said.

India

Gmail, Google Docs restored for some users after major outage

Published

on

By

New Delhi, 14 Dec : Google-owned services including Gmail and Google Docs on Monday has been partially restored after experiencing a major outage in India and several other countries. According to Downdetector, Google Meet and Google Play and were also inaccessible for some users. Most other Google services including Google Classroom also got affected by the outage.

With Gmail, problems reported by users ranged from accessing website, logging and messages issues. Google has updated that Gmail, Google Drive, Google Calendar and most other services hit by the outage has been restored for some users.

“Gmail service has already been restored for some users, and we expect a resolution for all users in the near future,” Google said in a statement.

Continue Reading

Technology

India bans 43 Chinese mobile apps over national security concerns

Published

on

By

New Delhi, 24 Nov : Continuing with its ban orders on Chinese apps, the Indian government on Tuesday blocked 43 new mobile apps, including shopping website AliExpress owned by e-commerce behemoth Alibaba.

Continue Reading

Mobiles

APPLE HOMEPOD MINI LAUNCHED AT $99, IPHONE 12 STARTING $699, IPHONE 12 PRO FROM $999

Published

on

By

Apple Event 2020 LIVE Updates: Apple has finally removed the curtain from its iPhone 12 series. The company launched the iPhone 12, iPhone 12 Mini, iPhone 12 Pro and iPhone 12 Pro Max at the ‘Hi Speed’ event at the Apple headquarters in Cupertino, California. At the event, the company has also launched Smart HomePod Speaker.

The company says that the iPhone 12 Mini is also the world’s thinnest and lightest 5G smartphone. Let us tell that this event was once again started by Team Cook, CEO of the company.

Continue Reading

Trending Live

Advertisement

Latest Post

Big News

Advertisement

Sports

Advertisement

Entertainment

Trending