Hugging Face · Postman Collection

Hugging Face Text Generation Inference API

High-performance toolkit for deploying and serving large language models with optimized inference. Provides both a custom TGI API and an OpenAI-compatible Messages API for chat completions. Supports streaming, tool calling, structured output, grammar constraints, and multi-modal inputs. Contact Support: Name: Hugging Face Support

Requests

Folders

View on GitHub Raw JSON Postman Collection

Overview

Hugging Face Text Generation Inference API is a Postman Collection published by Hugging Face on the APIs.io network.

The collection contains 9 requests organised into 11 folders.

Requests & Folders

generate

generate_stream

info

health

metrics

tokenize

Related API Specs

Hugging Face Inference API (OpenAPI) Hugging Face Hub API (OpenAPI) Hugging Face Inference Endpoints API (OpenAPI) Hugging Face Inference Providers API (OpenAPI) Hugging Face Dataset Viewer API (OpenAPI) Hugging Face Text Generation Inference API (OpenAPI)

Back to Hugging Face · All Collections · GitHub