Conversational AI

Jan 09, 2025

Announcing Nemotron-CC: A Trillion-Token English Language Dataset for LLM Pretraining

NVIDIA is excited to announce the release of Nemotron-CC, a 6.3-trillion-token English language Common Crawl dataset for pretraining highly accurate large...

4 MIN READ

Jan 09, 2025

Upcoming Livestream: NVIDIA Developer Highlights from CES 2025

Tune in January 16th at 9:00 AM PT for a live recap, followed by a Q&A of the latest developer announcements at CES 2025.

1 MIN READ

A surgeon using a medical device in an operating room.

Dec 20, 2024

Build a Generative AI Medical Device Training Assistant with NVIDIA NIM Microservices

Innovation in medical devices continues to accelerate, with a record number authorized by the FDA every year. When these new or updated devices are introduced...

5 MIN READ

Dec 16, 2024

Sandboxing Agentic AI Workflows with WebAssembly

Agentic AI workflows often involve the execution of large language model (LLM)-generated code to perform tasks like creating data visualizations. However, this...

7 MIN READ

Dec 11, 2024

Three Building Blocks for Creating AI Virtual Assistants for Customer Service with an NVIDIA AI Blueprint

In today's fast-paced business environment, providing exceptional customer service is no longer just a nice-to-have—it's a necessity. Whether addressing...

10 MIN READ

Nov 22, 2024

Hymba Hybrid-Head Architecture Boosts Small Language Model Performance

Transformers, with their attention-based architecture, have become the dominant choice for language models (LMs) due to their strong performance,...

12 MIN READ

Nov 19, 2024

Create a Custom Slackbot LLM Agent with NVIDIA NIM and LangChain

In the dynamic world of modern business, where communication and efficient workflows are crucial for success, AI-powered solutions have become a competitive...

9 MIN READ

Oct 28, 2024

Creating RAG-Based Question-and-Answer LLM Workflows at NVIDIA

The rapid development of solutions using retrieval augmented generation (RAG) for question-and-answer LLM workflows has led to new types of system...

11 MIN READ

Oct 22, 2024

Scaling LLMs with NVIDIA Triton and NVIDIA TensorRT-LLM Using Kubernetes

Large language models (LLMs) have been widely used for chatbots, content generation, summarization, classification, translation, and more. State-of-the-art LLMs...

16 MIN READ

Oct 21, 2024

IBM’s New Granite 3.0 Generative AI Models Are Small, Yet Highly Accurate and Efficient

Today, IBM released the third generation of IBM Granite, a collection of open language models and complementary tools. Prior generations of Granite focused on...

5 MIN READ

Oct 16, 2024

Simplify AI Application Development with NVIDIA Cloud Native Stack

In the rapidly evolving landscape of AI and data science, the demand for scalable, efficient, and flexible infrastructure has never been higher. Traditional...

5 MIN READ

Avatars of a patient in a bed with a doctor sitting at a desk in another location, looking at a computer screen.

Oct 01, 2024

Evaluating Medical RAG with NVIDIA AI Endpoints and Ragas

In the rapidly evolving field of medicine, the integration of cutting-edge technologies is crucial for enhancing patient care and advancing research. One such...

11 MIN READ

Sep 26, 2024

Low Latency Inference Chapter 2: Blackwell is Coming. NVIDIA GH200 NVL32 with NVLink Switch Gives Signs of Big Leap in Time to First Token Performance

Many of the most exciting applications of large language models (LLMs), such as interactive speech bots, coding co-pilots, and search, need to begin responding...

8 MIN READ

Sep 25, 2024

Build a Digital Human Interface for AI Apps with an NVIDIA NIM Agent Blueprint

Providing customers with quality service remains a top priority for businesses across industries, from answering questions and troubleshooting issues to...

5 MIN READ

Sep 25, 2024

Deploying Accelerated Llama 3.2 from the Edge to the Cloud

Expanding the open-source Meta Llama collection of models, the Llama 3.2 collection includes vision language models (VLMs), small language models (SLMs), and an...

6 MIN READ

Sep 24, 2024

Accelerating Leaderboard-Topping ASR Models 10x with NVIDIA NeMo

NVIDIA NeMo has consistently developed automatic speech recognition (ASR) models that set the benchmark in the industry, particularly those topping the Hugging...

13 MIN READ