Hugging Face · Postman Collection

Hugging Face Text Generation Inference API

High-performance toolkit for deploying and serving large language models with optimized inference. Provides both a custom TGI API and an OpenAI-compatible Messages API for chat completions. Supports streaming, tool calling, structured output, grammar constraints, and multi-modal inputs. Contact Support: Name: Hugging Face Support

9
Requests
11
Folders
View on GitHub Raw JSON Postman Collection

Overview

Hugging Face Text Generation Inference API is a Postman Collection published by Hugging Face on the APIs.io network.

High-performance toolkit for deploying and serving large language models with optimized inference. Provides both a custom TGI API and an OpenAI-compatible Messages API for chat completions. Supports streaming, tool calling, structured output, grammar constraints, and multi-modal inputs. Contact Support: Name: Hugging Face Support

The collection contains 9 requests organised into 11 folders.

Requests & Folders

generate
generate_stream
v1
info
health
metrics
tokenize

Related API Specs

Hugging Face Inference API (OpenAPI) Hugging Face Hub API (OpenAPI) Hugging Face Inference Endpoints API (OpenAPI) Hugging Face Inference Providers API (OpenAPI) Hugging Face Dataset Viewer API (OpenAPI) Hugging Face Text Generation Inference API (OpenAPI)
Back to Hugging Face · All Collections · GitHub