Conversational AI is closer to the artificial intelligence we have long been promised but have yet to see in finished products. The closest thing we have today is digital assistants like the Amazon Echo, which is mostly just a good speech-to-text engine sitting on top of a search engine. A conversational AI, by contrast, promises not only an AI that can hold a conversation with you, but one that can display non-verbal cues, like facial expressions, as well.
At NVIDIA’s GTC this week, CEO Jensen Huang showcased the latest iteration of Jarvis, the company’s conversational AI engine, and it could transform how our technology interacts with us. Done right, this kind of AI interaction at scale could vastly improve automated sales close rates, increase customer satisfaction with automated help systems, and even allow robots to become customer-facing interfaces at physical venues. That last point is essential because, right now, with COVID-19, customer-facing employees may be the ones at greatest risk.
Even more interesting is that this technology is on the path to replacing physical talking heads. You could, in theory, use it so that a long-dead CEO could still give keynote speeches at company events. Or, recalling Ronald McDonald and Jack from the Jack In The Box commercials, you could create a virtual spokesperson that could scale to talk to millions, if not billions, of customers.
The Importance Of Non-Verbal Cues
When we communicate in person, we don’t just speak. We use facial expressions to emphasize and contextualize what we are saying. The same sentence, “you look good,” can be a compliment or a sarcastic critique depending on intonation and expression.
One of the things that makes interfacing with a computer less efficient than interfacing with a person is that computers are emotionally barren: they currently cannot use the full set of tools a human uses for complex expression and communication.
So an AI that can make fuller use of physical expressions should, if the technology is applied effectively, communicate better and build a bond with the human interfacing with it. As we move to more verbal interfaces, a trend the pandemic is now starting to drive, we’ll need those interfaces to be far more capable than they are today, and giving them the ability to converse and emote would go a long way toward getting that done.
The Importance Of Scale
I once had a conversation with a very frustrated Steve Ballmer at Microsoft. Steve indicated that, given the massive number of customers the firm had, it was difficult for him to take direction from them, let alone keep them all straight. He argued that he couldn’t be customer-driven because there were so many customers with such diverse needs that he couldn’t translate those needs into actions, even though he agreed Microsoft needed to be more customer-focused.
What an AI can do that a human can’t is scale. We humans are limited in the number of people we can collect data from, in our ability to retain that data unaltered, and in our ability to turn that data into information that genuinely reflects our market.
And when backed with in-depth customer data, an AI can better understand what is likely to trigger someone to buy and how best to deal with them if they are upset, and, drawing on social media, it may even know a great deal about what the customer is dealing with personally.
AIs don’t get mad, don’t pull pranks, don’t have substance abuse problems, don’t make inappropriate comments (unless they are improperly trained), and don’t get tired. That scale would allow a conversational AI to interact with every customer a company has, whether just to keep those customers informed or to help them through a problem.
For instance, this week I got a notice that UPS had delivered a package meant for me to someplace on the other side of the county. I tried to reach someone at UPS four times: I got cut off, got busy signals, and was finally told the UPS line was disconnected when I attempted to transfer from their scripted bot to a real person.
Now, Jarvis isn’t itself a conversational AI; it is part of a toolset that allows you to create one. As it would likely be implemented, it would be layered on top of something like NVIDIA’s Merlin application framework for deep learning recommender systems, which was also announced this week. You would also use NeMo, an open-source toolkit for building conversational AI models; Megatron-BERT, which improves reading comprehension, enhancing response accuracy; TensorRT 7.1, which speeds up AI inference; and Flowtron, a state-of-the-art speech synthesis model that allows the system to talk and emote convincingly.
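To make the layering concrete, here is a minimal sketch of how those pieces could compose into one pipeline: speech recognition feeds language understanding, which feeds a recommender that picks both the words and the non-verbal cue, which speech synthesis would then voice. All class and method names below are illustrative placeholders, not the actual Jarvis, NeMo, or Merlin APIs.

```python
from dataclasses import dataclass


@dataclass
class Reply:
    text: str
    expression: str  # non-verbal cue the avatar would render


class ConversationalPipeline:
    """Toy illustration of the layered stack described above."""

    def transcribe(self, audio: bytes) -> str:
        # A real system would run a speech-to-text model here
        # (e.g. one built with NeMo). We just pretend the audio
        # bytes are already text.
        return audio.decode("utf-8")

    def understand(self, text: str) -> str:
        # A reading-comprehension model (the article cites
        # Megatron-BERT) would map the utterance to an intent.
        return "complaint" if "lost" in text or "late" in text else "question"

    def recommend(self, intent: str) -> Reply:
        # A recommender layer (the article cites Merlin) would pick
        # the best response and the matching facial expression.
        if intent == "complaint":
            return Reply("I'm sorry. Let me track that package.", "concerned")
        return Reply("Happy to help. What do you need?", "friendly")

    def respond(self, audio: bytes) -> Reply:
        # Speech synthesis (the article cites Flowtron) would voice
        # the returned text with the chosen emotion.
        return self.recommend(self.understand(self.transcribe(audio)))


pipeline = ConversationalPipeline()
reply = pipeline.respond(b"my package is lost")
print(reply.text, "/", reply.expression)
```

The point of the sketch is the composition, not the components: each layer can be swapped for a stronger model without changing the pipeline’s shape, which is exactly what makes a toolkit approach like Jarvis attractive.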
The combination of these technologies, plus future development efforts, should result in a conversational AI that revolutionizes how we interact with computers en masse. The result should be a massive increase in the use of AIs in customer-facing roles and a strengthened ability to safely provide what seems like in-person support at a time when many of us can’t leave the house.
It is potentially one of the bigger game changers announced at NVIDIA’s GTC this year.
Right now, the future of computing is evolving toward an interactive speech interface. With Jarvis and Merlin, NVIDIA showcased a unique and powerful solution this week that could result in a digital assistant that is more like its namesake and less like a verbal front end to web search. This technology is also one path to digital immortality: such an AI could learn to look and act like you over time and, once you are gone, continue to interface with your loved ones.
But with the COVID-19 event, the need to replace humans who interface with lots of people during a workday with a system that can’t get sick has never been greater. Jarvis and Merlin, along with the other technologies noted, could bridge that gap and take us to these new speech-based interfaces far more quickly.