LangChain4j integration library for Hugging Face inference capabilities including chat, language, and embedding models
Complete API reference for HuggingFaceLanguageModel (deprecated).
This class is deprecated since version 1.7.0-beta13 and scheduled for removal.
Migration: Use OpenAiChatModel from langchain4j-open-ai module with Hugging Face's OpenAI-compatible endpoint.
See Migration Guide for complete details.
Provides simple text generation from prompts using Hugging Face language models.
Package: dev.langchain4j.model.huggingface
Status: ⚠️ Deprecated (scheduled for removal)
Interfaces: LanguageModel
package dev.langchain4j.model.huggingface;
@Deprecated(forRemoval = true, since = "1.7.0-beta13")
public class HuggingFaceLanguageModel implements dev.langchain4j.model.language.LanguageModel {
// Construction
public static Builder builder();
public static HuggingFaceLanguageModel withAccessToken(String accessToken);
// Generation operations
public dev.langchain4j.model.output.Response<String> generate(String prompt);
public dev.langchain4j.model.output.Response<String> generate(dev.langchain4j.model.input.Prompt prompt);
}public static final class Builder {
public Builder baseUrl(String baseUrl);
public Builder accessToken(String accessToken);
public Builder modelId(String modelId);
public Builder timeout(java.time.Duration timeout);
public Builder temperature(Double temperature);
public Builder maxNewTokens(Integer maxNewTokens);
public Builder returnFullText(Boolean returnFullText);
public Builder waitForModel(Boolean waitForModel);
public HuggingFaceLanguageModel build();
}public static Builder builder()Example:
HuggingFaceLanguageModel model = HuggingFaceLanguageModel.builder()
.accessToken(System.getenv("HF_API_KEY"))
.modelId("microsoft/Phi-3-mini-4k-instruct")
.temperature(0.8)
.maxNewTokens(150)
.build();public static HuggingFaceLanguageModel withAccessToken(String accessToken)Quick construction with defaults.
public dev.langchain4j.model.output.Response<String> generate(String prompt)Parameters:
prompt - Input prompt for generationReturns: Response containing generated text
Throws: RuntimeException on API errors
Example:
Response<String> response = model.generate("Write a haiku:");
String text = response.content();public dev.langchain4j.model.output.Response<String> generate(dev.langchain4j.model.input.Prompt prompt)Parameters:
prompt - Prompt objectReturns: Response containing generated text
See Configuration Guide for all options.
Key Parameters:
accessToken (required) - API keymodelId (recommended) - Model identifiertemperature (optional) - Sampling temperature (0.0-2.0)maxNewTokens (optional) - Max tokens to generatereturnFullText (optional) - Include prompt (default: false)waitForModel (optional) - Wait if loading (default: true)timeout (optional) - Request timeout (default: 15s)baseUrl (optional) - Custom endpointpackage dev.langchain4j.model.input;
public class Prompt {
public String text();
public static Prompt from(String text);
}package dev.langchain4j.model.output;
public class Response<T> {
public T content();
public static <T> Response<T> from(T content);
}HuggingFaceLanguageModel model = HuggingFaceLanguageModel.builder()
.accessToken(System.getenv("HF_API_KEY"))
.modelId("tiiuae/falcon-7b-instruct")
.build();
Response<String> response = model.generate("Explain AI:");
String text = response.content();HuggingFaceLanguageModel model = HuggingFaceLanguageModel.builder()
.accessToken(System.getenv("HF_API_KEY"))
.modelId("microsoft/Phi-3-mini-4k-instruct")
.temperature(0.2) // Lower for deterministic code
.maxNewTokens(200)
.build();
String prompt = "Write a Java function to check if prime:\n";
String code = model.generate(prompt).content();HuggingFaceLanguageModel model = HuggingFaceLanguageModel.builder()
.accessToken(System.getenv("HF_API_KEY"))
.modelId("tiiuae/falcon-7b-instruct")
.temperature(1.0) // Higher for creativity
.maxNewTokens(300)
.build();
String story = model.generate("Write a story about AI:").content();import dev.langchain4j.model.input.*;
import java.util.Map;
PromptTemplate template = PromptTemplate.from(
"Write a {{length}} story about {{topic}}:"
);
Prompt prompt = template.apply(Map.of(
"length", "short",
"topic", "robots"
));
Response<String> response = model.generate(prompt);try {
Response<String> response = model.generate(prompt);
} catch (RuntimeException e) {
// Error format: "status code: <code>; body: <body>"
System.err.println("Generation failed: " + e.getMessage());
}Common Errors:
LanguageModel (this class):
ChatModel (HuggingFaceChatModel):
Both deprecated. Use OpenAiChatModel for new code.
| Model | Size | Use Case |
|---|---|---|
tiiuae/falcon-7b-instruct | 7B | General purpose |
microsoft/Phi-3-mini-4k-instruct | Small | Efficient, 4K context |
mistralai/Mistral-7B-Instruct-v0.2 | 7B | High quality |
HuggingFaceH4/zephyr-7b-beta | 7B | Conversational |
Install with Tessl CLI
npx tessl i tessl/maven-dev-langchain4j--langchain4j-hugging-face@1.11.0