SoundHound AI Delivers Faster, Smarter Voice Assistants At Scale with NVIDIA AI Enterprise
By integrating SoundHound’s world-class voice AI technology with NVIDIA NIM and NeMo microservices— part of the NVIDIA AI Enterprise software platform—SoundHound is driving low-latency AI processing, real-time retrieval-augmented generation (RAG), and scalable model optimization—helping businesses deploy more responsive and cost-efficient AI solutions.
This expanded collaboration reinforces SoundHound’s commitment to delivering industry-leading voice AI solutions by enhancing the company’s AI-powered offerings across industries, including automotive, restaurant, and customer service. The integration reduces inference latency, improves response accuracy, and optimizes AI deployments for both cloud and edge environments. Key NVIDIA AI Enterprise innovations coming to SoundHound’s advanced voice platform include:
-
Optimized AI Inference using NVIDIA NIM Microservices
- NIM microservices enable containerized and high-speed inference for LLMs, significantly reducing response latency.
-
Smarter Retrieval with NVIDIA NeMo Retriever Microservices for RAG
- NeMo Retriever microservices enhance context-aware RAG by quickly surfacing highly relevant responses, improving AI-driven interactions.
-
Scalable Model Fine-Tuning with NVIDIA NeMo
-
With NeMo,
SoundHound can fine-tune and optimize LLMs across multi-GPU environments, ensuring efficient, high-performance AI at scale.
-
With NeMo,
SoundHound’s integration of NVIDIA AI Enterprise software is already playing a role in automotive AI, including in Lucid’s voice AI system. Currently, select modules within the Lucid Assistant leverage
“The AI industry has made massive strides in model training over the last two years, but the real challenge now is deploying these models at scale efficiently,” said
“The AI industry is at an exciting turning point, where the focus is shifting from breakthrough model training to deploying these advancements efficiently at scale,” said
This collaboration builds on SoundHound’s prior work with NVIDIA. Last year, the company announced on-chip voice AI, running on NVIDIA DRIVE AGX to deliver in-vehicle generative AI responses with no cloud connectivity required. The companies also shared a joint demonstration at CES 2025.
Learn more about SoundHound’s voice and conversational AI solutions.
About
View source version on businesswire.com: https://www.businesswire.com/news/home/20250319393806/en/
Media Contact
201-815-9852
[email protected]
Source: