Foodline's new telephone system, which was rolled out in New York in November 1999, is hosted on three custom-built servers running under Windows NT leased from Automated Financial Systems Inc. of New York.
| Talk it up|
Projected sales of telephone-based automatic speech recognition systems indicate big boom ahead.
Source: Voice Information Associates Inc.
Restaurants connect to the servers via Foodline's ISP, which is Digex Inc. of Beltsville, Md. With the speech recognition solution and hosting services outsourced, Infantino and his colleagues can devote most of their efforts to their core business challenges: selling restaurants and diners on Foodline and revving up revenue streams.
The company's business model incorporates multiple sources of revenue, Infantino says. First, Foodline charges restaurants a fee of $100 per month to lease the Foodline Reservation Solution, which is designed to replace the paper-based systems that many restaurants still use. Second, restaurants pay Foodline $1 per diner for each reservation made with the system. Third, Foodline plans to sell audio advertisements for the telephone system and presently markets display ads on the companion Web site.
Still, Infantino is aware of the fiscal challenges his startup faces. "The biggest issue we have with the telephone [system] is the cost of hosting the calls. It's hard to pay for that with the existing revenue streams." Building customer loyalty
American Airlines Inc. has been making an investment of a different sort in speech recognition. American's business challenge is to maintain and build customer loyalty in an age when air travel is a commodity, and service often takes a back seat to profit margins.
|Lessons learned about telephone-based ASR|
| Sample a variety of live, telephone-based ASR systems. Bring your likes and dislikes to the table when you meet with prospective speech vendors. Choose vendors that know how to design call flows and write prompts that elicit the desired information from end users. If the system is awkward, its value is nil. Test the speech system on a small portion of the intended audience before putting it into production. Check for accuracy of speech recognition, user-friendliness, and compatibility with existing telephony systems and databases. Consider outsourcing the systems-integration piece to a vendor experienced in the esoteric field of speech recognition. Probe the limitations of the current technology. If it's not good enough for your application today, reevaluate speech technology in a year or two. The field is advancing rapidly. |
So American, a subsidiary of Fort Worth, Texas-based AMR Corp., decided to boost the customer service experience by incorporating speech recognition into several of its key systems, including the call center that services its top tier of frequent flyers, AAdvantage Executive Platinum members.
American's best customers were annoyed when they were required to enter a long and cumbersome frequent flyer number at the beginning of a call to the customer service desk. So in November 1998 the airline introduced a simple improvement designed to keep Executive Platinum members a bit happier: When they call, they can now speak their frequent flyer numbers naturally rather than straining to enter the alphanumeric sequence on the telephone keypad.
Such a simple idea, but the implementation was not trivial. "Working with multiple vendors is a challenge," says Carline Smith, the Fort Worth, Texas-based manager of reservations planning for American. Smith says that one of the key issues was, "If there was a problem, whose problem was it?" But working with several vendors over eight months, American put the system into production in November 1998.
The key vendor in American's project was Sabre Inc., also of Forth Worth, which acted as systems integrator. Since American's parent AMR also owns 82% of Sabre, the partnership was close. With the new speech recognition telephony system, "the caller calls [an 800 number] and the call is routed to the IVR system," explains Kevin Smilie, senior manager for product distribution at Sabre. "The caller is then asked to speak their AAdvantage number." The spoken alphanumeric sequence is then digitized by the speech recognition software and pops up on the screen of a customer service agent, together with the customer's frequent flyer account information, which has been cached to reduce the wait.
If the speech recognition system misunderstands the caller, the system prompts the caller to pronounce the AAdvantage number again. If the system and the caller reach an impasse, the call is forwarded to a customer service agent.
American and Sabre created the call flow with the PeriProducer development platform from Periphonics Inc., in Bohemia, N.Y.; the interactive voice system runs on Periphonics' VPS/is software. For speech recognition, the American ASR system uses Recognizer software from Nuance of Menlo Park, Calif. The whole system runs on SPECS servers from Sun Microsystems Inc. of Palo Alto, Calif.
"These vendors have a process of getting [speech recognition] systems into pilot and production," says Smilie. "They've done a good job." Carline Smith of American adds: "Nuance goes into pilot and analyzes how the customer responds to the application and changes prompts that don't work."
|VXML: A Web standard for telephone-based speech applications?|
| If the telephone-based ASR customer-service application open standard works as promised, IT shops will be able to save development time and money on ASR systems since VXML-based and HTML-based applications can share critical databases with relative ease. |
But the development environment has a ways to go, according to Smilie. "Deployments are still infantile, and the development tools are still lacking," he says. Nevertheless, Smilie is bullish on speech recognition telephony. "This is the way people will interact with call centers in the future," he says.
Smith agrees, having implemented another speech recognition project in December 1998 that lets anyone dial American Airlines and get flight information by answering a series of voice prompts. "Improved service is one of the ways that airlines can differentiate themselves," she concludes. Improving the telephone experience
Chuck Forgue of Unity Healthcare also wanted to improve the telephone experience by making it easier for people to connect with colleagues via telephone. Forgue, director of communications services in the IT department at Unity, took a tall order from his boss at the St. Louis division of the Sisters of Mercy Health System. Following a corporate combination of six hospitals plus other healthcare operations under the Unity Healthcare brand, Forgue was given one week in April 1999 to implement a better way for the organization's 4,500 employees to contact colleagues at the company's new headquarters.
With multiple telephone systems coming together, the list of employee names was stable, but their telephone numbers were not. Management wanted to avoid delays in connecting calls. They also wanted to spare human operators the frustration of paging through a telephone directory. The company envisioned a system that would allow an employee to call a colleague simply by speaking his or her name into the telephone, and allowing the computer telephony system to handle the rest.
IBM Corp. proposed to Unity's management that the company install its ViaVoice Directory Dialer software to solve the problem. The Armonk, N.Y.-based computer giant flew in engineers to work with Forgue on the implementation. "We put this in down and dirty, real quick," using a cannibalized IBM Netfinity 5500 server, Forgue says. The system was up and running within a couple of days.
"We're taking about 8,000 calls a month [with the system], and it looks like we're saving about 80% of a full-time employee," Forgue says. "But the reason we did it was to facilitate communication and to make sure people had access to headquarters." Forgue's assistant administers the ViaVoice system, stepping in to update the telephone directory when employees join or leave the company and to correct ViaVoice when the system has difficulty matching the sound of an employee's name with its written form.
"Our next step is to roll this out to include the management at the hospitals," Forgue says. "Then we want to widen it to include all our locations." Now there's a project that will have people talking. // John Rossheim is a freelance editor and writer specializing in privacy issues and speech recognition technology. He lives in Providence, R.I., and can be reached at firstname.lastname@example.org.