Speech Processing

Say Hello, Facebook


Do you remember my article on wit.ai, a clever SaaS provider for automatic speech recognition? The speech technology company was recently bought by Facebook. Here is what wit.ai says on its blog about the acquisition:

It is an incredible acceleration in the execution of our vision. Facebook has the resources and talent to help us take the next step. Facebook’s mission is to connect everyone and build amazing experiences for the over 1.3 billion people on the platform – technology that understands natural language is a big part of that, and we think we can help.

The platform will remain open and become entirely free for everyone. Developers are the life of our project and the energy, enthusiasm and passion of the community has helped turn what was once just a lofty dream, into a reality. We want to continue to build with you.


Turn speech into actionable data as a service

Nice approach: the California-based startup Wit.ai offers speech recognition for any kind of task, from mobile apps to robotics. Instead of building such components themselves, developers can use the Wit.ai technology: they send it spoken language and get back structured data. If this works out, I think it will have a major impact on the number of apps, and even household items, that understand spoken natural-language commands. Nice!
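To make "structured data" a bit more concrete, here is a minimal Python sketch of what an app might do with such a result. The response shape (the `text`, `intent`, `confidence` and `entities` fields), the sample values and the `to_command` helper are all assumptions made up for illustration; they are not taken from the Wit.ai documentation, and no network call is made.

```python
import json

# Hypothetical response, for illustration only.
# The field names are assumptions, not the actual Wit.ai format.
SAMPLE_RESPONSE = """
{
  "text": "turn on the kitchen lights",
  "intent": "lights_on",
  "confidence": 0.92,
  "entities": {"room": "kitchen"}
}
"""


def to_command(raw, min_confidence=0.5):
    """Turn a structured recognition result into an app-level command.

    Returns None when the recognizer's confidence is below the threshold,
    so the app can ask the user to repeat instead of guessing.
    """
    data = json.loads(raw)
    if data.get("confidence", 0.0) < min_confidence:
        return None
    return {"action": data["intent"], "args": data.get("entities", {})}


command = to_command(SAMPLE_RESPONSE)
print(command)
```

The point is that the app never touches audio or raw text itself; it only has to map an intent and its entities to an action.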

Wit.ai makes it easy for developers to build applications and devices that you can talk to. Our vision is to empower developers with an open and extensible natural language platform. Wit.ai learns human language from every interaction, and leverages the community: what’s learned is shared across developers.

Wit.ai

via Technology Review: Sprache für das Internet der Dinge (Speech for the Internet of Things)

Dialect-speaking devices

Just the other day I was thinking about devices that understand and speak dialects. Well, here it is:

Scientists in Vienna, Austria, have created a speech synthesizer that speaks in dialects. It also provides an on-screen avatar that shows lip and facial movements.

Hackread: Next big thing: your phone will speak your dialect with Austrian speech synthesis

IBM: Big Data, Speech Processing and Machine Translation

For a machine to truly process speech data, it needs cognitive computing – a system with architecture that imitates how the human brain understands information. IBM Watson’s ability to understand natural language is just a first piece to a complex cognitive computing puzzle. But as cognitive computing is applied to Big Data, it will also revolutionize speech recognition and speech translation.

IBM Research: Dimitri Kanevsky Translating Big Data