NGC Catalog
All You Need to Build AI. All in One Place.
Welcome to the NGC Catalog: GPU-accelerated AI models and SDKs that help you infuse AI into your applications at the speed of light.
Explore Use Cases
NVIDIA NIM
Mixtral-8x7B-Instruct-v0.1
Container
NVIDIA NIM for GPU-accelerated Mixtral-8x7B-Instruct-v0.1 inference through OpenAI-compatible APIs
nim-dev
NVIDIA NIM
DiffDock
Container
DiffDock predicts the 3D structure of the interaction between a molecule and a protein.
nim-dev
NVIDIA NIM
Llama-3.1-Swallow-8B-Instruct-v0.1
Container
NVIDIA NIM for GPU-accelerated Llama 3.1 Swallow 8B inference through OpenAI-compatible APIs
nim-dev
NVIDIA NIM
Llama-3.1-70b-instruct
Container
NVIDIA NIM for GPU-accelerated Llama 3.1 70B inference through OpenAI-compatible APIs
nim-dev
NVIDIA NIM
Getting Started
NeMo - Automatic Speech Recognition
Collection - Automatic Speech Recognition
This collection contains NeMo models for Automatic Speech Recognition (ASR): Speech to Text, Speech Classification, Speaker Diarization, Speaker Verification, Speaker Recognition, Command Recognition, and Voice Activity Detection.
DeepStream - CV Deployment
Collection - Intelligent Video Analytics
The DeepStream SDK delivers a complete streaming analytics toolkit for AI-based video and image understanding and multi-sensor processing. It brings deep neural networks and other complex processing tasks into a stream processing pipeline.
Language Modelling
Collection - Natural Language Processing
A collection of easy-to-use, highly optimized deep learning models for language modelling. Deep Learning Examples provides data scientists and software engineers with recipes to train, fine-tune, and deploy state-of-the-art models.
LLMs optimized for RTX PCs
Collection - Windows RTX Accelerated Models
A collection of TensorRT-LLM accelerated Windows RTX PC LLM models.
Runs on RTX
Command Line Interface
Want to get more from NGC? Everything you see here can be used and managed via our powerful CLI tools.
Download Now
Documentation
We've got a whole host of documentation, covering the NGC UI and our powerful CLI. You can find out more here.
Go to Documentation
AI Enterprise Documentation
Learn how to virtualize any application with NVIDIA virtual GPU technology.
Go to Documentation
Enterprise Support
Get access to knowledge base articles and support cases.
File a Ticket
Licensing Portal
Access the software & licensing portal for your products.
Get Your Licenses
NGC Private Registry
Private Registries from NGC allow you to secure, manage, and deploy your own assets to accelerate your journey to AI.
Learn More
Getting Started with NVIDIA AI Enterprise
NIM Agent Blueprint for Vulnerability Analysis
Collection - Natural Language Processing
The Vulnerability Analysis for Container Security is a NIM Agent Blueprint that dramatically accelerates vulnerability detection and mitigation with generative AI and the Morpheus cybersecurity SDK.
NVIDIA NIM
NVIDIA AI Enterprise Infra 5
Collection - Infrastructure
Access infrastructure and workload management software, exclusively available with your NVIDIA AI Enterprise subscription.
nv-ai-enterprise
NVIDIA AI Enterprise Supported
Production Branch - October 2024 (PB 24h2)
Collection - Deep Learning
Access the production branches of AI frameworks and SDKs. Supported for 9 months with monthly security patches.
nv-ai-enterprise
NVIDIA AI Enterprise Supported
Getting Started with NVIDIA Omniverse Enterprise
Production Branch - December 2024 (PB 24h2)
Collection - Advanced
Access the production branches of Omniverse frameworks and SDKs. Supported for 9 months with monthly security patches.
omniverse
NVIDIA Omniverse Enterprise Supported
Kit SDK
Collection - Advanced
Kit SDK is a toolkit for building native Omniverse applications and microservices.
omniverse
NVIDIA Omniverse Enterprise Supported
Omniverse Kit App Streaming
Collection - Infrastructure
Omniverse Kit App Streaming
omniverse
USD Search API
Collection - Deep Learning
AI-powered search for OpenUSD data, 3D models, images, and assets using text or image-based inputs.
omniverse
NVIDIA Omniverse Enterprise Supported
Popular Collections
Code Llama
Advanced
Code Llama is an LLM capable of generating code, and natural language about code, from both code and natural language prompts.
Llama 2
Advanced
Llama 2 is a large language AI model capable of generating text and code in response to prompts.
Build an AI Chatbot with RAG
Machine Learning
Use a reference application to build a fully functional retrieval-augmented generation (RAG) AI chatbot with NVIDIA NIM™ microservices.
Automatic Speech Recognition
Automatic Speech Recognition
A collection of easy-to-use, highly optimized deep learning models for Recommender Systems. Deep Learning Examples provides data scientists and software engineers with recipes to train, fine-tune, and deploy state-of-the-art models.
NVIDIA Holoscan
Healthcare
The AI sensor processing platform
Clara Discovery
Healthcare
Clara Discovery is a collection of frameworks, applications, and AI models that enable GPU-accelerated computational drug discovery.
Clara NLP
Healthcare
Clara NLP is a collection of state-of-the-art (SOTA) biomedical pre-trained language models, along with highly optimized pipelines for training NLP models on biomedical and clinical text.
Clara Parabricks
Healthcare
Clara Parabricks is a collection of software tools and notebooks for next-generation sequencing, including short- and long-read applications. These tools are designed to be scalable, generating highly accurate results in an accelerated compute environment.
Popular Containers
Python Basic for AI Workbench
Python Basic - AI Workbench Default Container (Beta)
python-cuda122
Python with CUDA 12.2 - AI Workbench Default Container (Beta)
PyTorch for AI Workbench
PyTorch - AI Workbench Default Container (Beta)
DCGM
Manage and monitor GPUs in cluster environments.
Popular Models
ChatGLM3-6B Chat Int4
ChatGLM3-6B is the latest open-source model in the ChatGLM series. It introduces the following features: (1) a more powerful base model, (2) more comprehensive function support, and (3) a more comprehensive open-source series.
GPUNet-0 pretrained weights (PyTorch, AMP, ImageNet)
GPUNet-0 ImageNet pretrained weights
GPUNet-D2 pretrained weights (PyTorch, AMP, ImageNet)
GPUNet-D2 weights pretrained on ImageNet
Llama2-13b Chat Int4
Llama 2 is a large language AI model capable of generating text and code in response to prompts.
Popular Resources
tokkio_plugin_llm_rag
Resource for Tokkio LLM RAG plugin
Tokkio UI Web Assets
Prebuilt production-ready assets for the Tokkio UI
Endoscopy out of body Sample App Data
Holoscan sample app data for endoscopy out-of-body detection
Holoscan Cars Video
Video of cars for evaluating detection algorithms for Holoscan SDK.
Popular Helm Charts
RAG Application: Multimodal Chatbot
This example showcases a multimodal use case in a RAG pipeline. It can interpret images (such as graphs and plots) in PDF or .pptx files, alongside text and tables.
RAG Application: Multiturn Chatbot
This example showcases a RAG workflow with multi-turn conversation capabilities.
RAG Application: Structured Data Chatbot
A sample RAG application that handles question answering over tabular data stored in CSV format.
RAG Application: Langchain Text QA Chatbot
A Helm chart demonstrating a basic RAG pipeline built with LangChain, using NVIDIA NIM LLMs and retrievers deployed on-premises.