Microsoft on Wednesday used the annual SpeechTEK International Exposition and Educational Conference as a platform to unwrap the beta 2 release of the .NET Speech Software Development Kit (SDK), a technical preview of the .NET Speech platform, and a new Joint Development Program (JDP).
The SDK is a developer tool based on the Speech Application Language Tags (SALT) specification, which defines a set of lightweight tags as extensions to common Web-based programming languages, allowing developers to add speech functionality to existing Web applications. The SDK is designed to integrate with the Visual Studio .NET development environment, and will allow developers to write combined speech and visual Web applications in a single code base.
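To illustrate the approach the article describes, the sketch below shows how SALT-style tags might sit alongside ordinary HTML in a Web page. The element names (`prompt`, `listen`, `grammar`, `bind`) follow the published SALT specification, but this fragment is an illustrative sketch, not code taken from Microsoft's SDK; the grammar file and field names are hypothetical.

```
<html xmlns:salt="http://www.saltforum.org/2002/SALT">
<body>
  <!-- An ordinary visual form field -->
  <input name="txtCity" type="text" />

  <!-- Speech added via SALT tags: play a prompt, then listen -->
  <salt:prompt id="askCity">Which city are you flying to?</salt:prompt>
  <salt:listen id="getCity">
    <salt:grammar src="cities.grxml" />        <!-- hypothetical grammar file -->
    <salt:bind targetelement="txtCity" value="//city" />
  </salt:listen>
</body>
</html>
```

Because the speech tags wrap around existing markup rather than replacing it, the same page can serve both visual and voice interaction, which is the "single code base" point made above.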
Microsoft unveiled the first beta of the .NET Speech SDK in May. The beta 2 release adds enhancements to grammar and prompt creation, editing tools, and debugging tools for telephony and multimodal applications. The new features include W3C standards-compatible formats for grammar authoring, a prebuilt library of reusable speech telephony and application controls, and grammar libraries.
To add to the momentum, Microsoft showed off a technical preview of the .NET Speech platform -- slated for release in the middle of 2003. The platform is the core speech recognition engine in Microsoft's strategy. Designed to support both telephones and multimodal-enabled devices like PCs, PDAs and Tablet PCs, the platform contains the SALT interpreter software, SALT-enabled ASP.NET controls, a SALT-based voice browser and a text-to-speech engine provided by strategic partner SpeechWorks.
Finally, building on the speech-centric strategic relationships it has formed with SpeechWorks, Intervoice, and long-standing partner Intel, Microsoft unveiled the JDP, which aims to bring together enterprise customers and partners looking to build and deploy applications from the .NET Speech SDK on the .NET Speech platform.
JDP participants will receive access to the .NET Speech platform technical preview, as well as the opportunity to test it in real-world production environments.
Microsoft believes the value proposition of speech technology is clear: it stands to reduce costs associated with call center agents. A typical customer service call costs $5 to $10 to support, while an automated voice recognition system can lower that to 10 cents to 30 cents per call. Additionally, voice recognition technology can be used to give employees access to critical information while on the move.
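The savings implied by those per-call figures can be worked through with a quick calculation. The call volume here is an assumed figure for illustration only; the per-call costs are the ones cited above.

```python
# Per-call costs cited in the article, in dollars.
AGENT_COST_LOW, AGENT_COST_HIGH = 5.00, 10.00   # live call-center agent
AUTO_COST_LOW, AUTO_COST_HIGH = 0.10, 0.30      # automated voice recognition

calls_per_year = 1_000_000  # hypothetical call volume, for illustration

# Best case: cheapest automation replaces the most expensive agent calls.
max_savings = (AGENT_COST_HIGH - AUTO_COST_LOW) * calls_per_year
# Worst case: priciest automation replaces the cheapest agent calls.
min_savings = (AGENT_COST_LOW - AUTO_COST_HIGH) * calls_per_year

print(f"Annual savings range: ${min_savings:,.0f} to ${max_savings:,.0f}")
```

Even at the conservative end, automation would shave millions of dollars a year off a call center of that size, which is the economic argument Microsoft is making.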
Earlier this year, market research firm the Kelsey Group projected worldwide spending on voice recognition will reach $41 billion by 2005.
But Microsoft is by no means alone in the space. It is likely to face stiff competition from IBM, a pioneer in the voice recognition space. In April, IBM announced it had assigned about 100 speech researchers from IBM Research to an eight-year project dubbed the Super Human Speech Recognition Initiative, intended to revolutionize voice technologies.
Currently IBM offers solutions based on VoiceXML and Java, and has helped develop a new specification, X+V (a combination of XHTML and VoiceXML) for multimodal access. For instance, it crafted a system for investment management firm T. Rowe Price, which allows customers to access and manage their accounts through natural conversations by utilizing IBM WebSphere Voice Server with Natural Language Understanding.
Smaller, specialized players, like Mountain View, Calif.-based start-up TuVox, are also in the space. TuVox, founded by two Apple Computer alumni, uses a combination of artificial intelligence and VoiceXML to help firms automate their technical support call centers. It has already automated the after-hours technical support lines for both Handspring and Activision.
But while the ball is already rolling in the voice recognition space, IBM says there are still significant hurdles to overcome, and those hurdles are what spurred it to create the Super Human Speech Recognition Initiative.
Noise, punctuation and grammar, and accents all continue to pose problems for speech recognition.