Hugging Face Text Generation Inference API
High-performance toolkit for deploying and serving large language models with optimized inference. Provides both a custom TGI API and an OpenAI-compatible Messages API for chat completions. Supports streaming, tool calling, structured output, grammar constraints, and multi-modal inputs. Contact Support: Name: Hugging Face Support
Overview
Hugging Face Text Generation Inference API is a Postman Collection published by Hugging Face on the APIs.io network.
High-performance toolkit for deploying and serving large language models with optimized inference. Provides both a custom TGI API and an OpenAI-compatible Messages API for chat completions. Supports streaming, tool calling, structured output, grammar constraints, and multi-modal inputs. Contact Support: Name: Hugging Face Support
The collection contains 9 requests organised into 11 folders.