NVIDIA NIM · Postman Collection

NVIDIA NIM Chat Completions API

OpenAI-compatible chat completions endpoint served by NVIDIA NIM. Available as a hosted service at https://integrate.api.nvidia.com/v1 and on every self-hosted NIM LLM container on port 8000. A single contract serves 100+ foundation models — Llama, Mistral, NVIDIA Nemotron, DeepSeek, Qwen, Phi, Gemma, Granite — through the standard /v1/chat/completions surface. Contact Support: Name: NVIDIA Developer Support

1
Requests
3
Folders
View on GitHub Raw JSON AIArtificial IntelligenceInferenceMicroservicesLLMFoundation ModelsGPUKubernetesNVIDIAOpenAI CompatiblePostman Collection

Overview

NVIDIA NIM Chat Completions API is a Postman Collection published by NVIDIA NIM on the APIs.io network.

OpenAI-compatible chat completions endpoint served by NVIDIA NIM. Available as a hosted service at https://integrate.api.nvidia.com/v1 and on every self-hosted NIM LLM container on port 8000. A single contract serves 100+ foundation models — Llama, Mistral, NVIDIA Nemotron, DeepSeek, Qwen, Phi, Gemma, Granite — through the standard /v1/chat/completions surface. Contact Support: Name: NVIDIA Developer Support

The collection contains 1 request organised into 3 folders.

Tagged areas include AI, Artificial Intelligence, Inference, Microservices, and LLM.

Requests & Folders

v1

Related API Specs

NVIDIA NIM Chat Completions API (OpenAPI) NVIDIA NIM Completions API (OpenAPI) NVIDIA NIM Embeddings API (OpenAPI) NVIDIA NIM Reranking API (OpenAPI) NVIDIA NIM Models API (OpenAPI) NVIDIA NIM Vision Language Models API (OpenAPI) NVIDIA NIM Health API (OpenAPI) NVIDIA NIM Image Generation API (OpenAPI) NVIDIA NIM Speech API (OpenAPI) NVIDIA NIM Biology (BioNeMo) API (OpenAPI)
Back to NVIDIA NIM · All Collections · GitHub