Speech and Sound: The Next "Killer Paradigm Shift"..?

Speech recognition could impact the business at a variety of levels

There was a time, not so very long ago, when IT directors and chief information officers dismissed the Internet as something of a passing fad. Somehow though, things took off pretty well with the whole web thing, didn't they? Mobile telephony has also grown to a level of dominance that we could never have predicted when it first started appearing around 30 years ago.

Then came the tablet... just another fad, right? Well, the first few were, but then "Magic Steve" produced the tablet we all love and cherish, didn't he? (OK yes - I know Android is doing well in this space too, you don't need to write in)... so what's coming next?

What Is Our Next Killer Paradigm?
Many believe that "sound" will be the next killer element of "social computing" in terms of information sharing. After all, we share text in various forms, images and video all the time. Shouldn't this mean that "sound" is our next most logically interesting data-share element?

What kind of sound? Our own spoken voice, recorded speech, random commentary, music, environmental recordings -- it's a long list and you can certainly add at least one of your own if you give it a moment's thought. Yes, we can link to each other's podcasts already, but we are talking about a level beyond that.

The next tier for sound is allied to its close first cousin "speech" and both could (arguably) be about to move from the playground to the boardroom and therefore potentially move into the CIO's line of sight.

The Speech Steeplechase
The problem is that in its early years, speech/voice recognition technology was something of a novelty. But look at the facts: fingerprint recognition biometrics only surfaced toward the end of the last millennium and now we have "secure USB flash drives" that work by a finger-swipe. So development of surface-level, highly user-facing technologies has been in overdrive for the last decade, if not more.

Speech recognition companies such as Nuance, which produces the off-the-shelf Dragon NaturallySpeaking product, see a future in several corporate deployment scenarios for a technology that is grounded in individual user suitability. The company is something of a market leader, with manufacturers from HP to Apple to IBM all working with its technology.

Nuance describes the human voice as an "incredibly rich, natural and efficient means of communication" - and the industry is now working to build solutions that enable computers, phones, tablets, automobiles, TVs and consumer electronics to understand the human voice, providing a "natural interface" between man and machine.

Speech recognition could impact the business at a variety of levels:

  • Speech is used in CRM analytics inside call center deployment scenarios so that customer conversations can be analyzed and filtered in order to discover what keywords customers are using.
  • Healthcare CIOs will already know that CLU (Clinical Language Understanding) technology has a huge role to play in helping healthcare enterprises overcome challenges with "Big Data" and the ensuing difficulty of collecting, processing, interpreting and then utilising information.
  • Nuance is not alone... Google is also said to be attempting to "pioneer" technology that will ultimately enable users to search by the spoken word. Microsoft has similar plans with Bing.
  • Mobile applications (at both the consumer and enterprise level) offer a large number of opportunities for speech recognition. From simple voice commands used to control smartphones, to more powerful voice-driven in-car entertainment and/or so-called "infotainment systems," speech arguably has a strong new role to play.
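
To make the call center item in the list above a little more concrete, here is a minimal Python sketch of keyword discovery. It assumes the calls have already been turned into text by a speech recognizer; the transcripts, the stopword list and the function name keyword_counts are all invented for illustration and do not describe any vendor's product.

```python
import re
from collections import Counter

# Hypothetical stopword list: common words we do not want to report as keywords.
STOPWORDS = {"the", "a", "i", "my", "is", "to", "and", "it", "for", "on", "was"}

def keyword_counts(transcripts, top_n=3):
    """Count non-stopword terms across call transcripts and return the most frequent."""
    counts = Counter()
    for text in transcripts:
        # Lowercase the transcript and split it into simple word tokens.
        words = re.findall(r"[a-z']+", text.lower())
        counts.update(w for w in words if w not in STOPWORDS)
    return counts.most_common(top_n)

# Invented example transcripts, standing in for recognized customer calls.
calls = [
    "My refund is late",
    "I want a refund for the late delivery",
    "Delivery was fine",
]
print(keyword_counts(calls))
```

A real deployment would need far more than word counting (speaker separation, phrase detection, handling of recognition errors), but the basic idea of filtering conversations down to recurring terms is the same.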

How Does It Work? Nuance Explains...

  1. A user speaks a command into a microphone
  2. The system converts the sound input into a digital signal
  3. The signal is analyzed and chopped into component speech sounds called "phonemes"
  4. Each phoneme is examined in context with those around it, and statistical probability algorithms are used to determine the intended word from a stored list. This happens for each word
  5. Each word is examined in context with those around it, and statistical probability algorithms are used to determine the intended command
  6. The appropriate response for the command is triggered
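
Steps 4 to 6 above can be sketched as a toy Python example. This is not Nuance's algorithm: the lexicon, the phoneme symbols, the probabilities and the command table are all invented, and the "statistical" part is reduced to a simple match score multiplied by an assumed word prior. Steps 1 to 3 (capturing audio, digitizing it and splitting it into phonemes) are skipped, so the input is already a list of phoneme groups.

```python
# Hypothetical stored word list: word -> (phoneme sequence, assumed prior probability).
LEXICON = {
    "call": (("K", "AO", "L"), 0.6),
    "cold": (("K", "OW", "L", "D"), 0.4),
    "home": (("HH", "OW", "M"), 0.7),
    "hum": (("HH", "AH", "M"), 0.3),
}

# Hypothetical stored commands: word sequence -> response to trigger.
COMMANDS = {
    ("call", "home"): "dial:home",
    ("play", "music"): "player:start",
}

def phoneme_match(heard, expected):
    """Fraction of positions where the phonemes agree, penalizing length differences."""
    same = sum(1 for h, e in zip(heard, expected) if h == e)
    return same / max(len(heard), len(expected))

def best_word(heard):
    """Step 4: pick the lexicon word with the highest match score times prior."""
    scores = {
        word: phoneme_match(heard, phonemes) * prior
        for word, (phonemes, prior) in LEXICON.items()
    }
    return max(scores, key=scores.get)

def best_command(words):
    """Step 5: pick the stored command that agrees with the most recognized words."""
    def score(command):
        same = sum(1 for a, b in zip(words, command) if a == b)
        return same / max(len(words), len(command))
    return max(COMMANDS, key=score)

def recognize(phoneme_groups):
    """Steps 4 to 6: phoneme groups -> words -> command -> response."""
    words = [best_word(group) for group in phoneme_groups]
    return words, COMMANDS[best_command(words)]

# The second word is "misheard" (AH instead of OW); the assumed prior still favors "home".
print(recognize([("K", "AO", "L"), ("HH", "AH", "M")]))
```

The misheard vowel in the usage line shows why probabilities matter: an exact phoneme match is not required as long as the most likely word, and then the most likely command, can still be picked out from the stored lists.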

The CIO's Central Message
It seems that many real-world scenarios could be using not only speech recognition technologies, but also their sister disciplines, i.e., text-to-speech technology, document imaging and electronic dictation services, which do of course throw up their own data storage challenges.

Nuance VP Peter Mahoney has suggested that really robust industrial-grade speech recognition in the space-age style as depicted in Hollywood movies (or to give it its proper name - "robust natural language" technology) is not far off at all - and that we should see six to ten languages fully supported by this technology as soon as the end of this year.

It's not Star Trek quite yet, but we're close!

•   •   •

This post was first published on the Enterprise CIO Forum.

More Stories By Adrian Bridgwater

Adrian Bridgwater is a freelance journalist and corporate content creation specialist focusing on cross platform software application development as well as all related aspects of software engineering, project management and technology as a whole.
