Complete reference for Apple's Foundation Models framework (iOS 26+ / macOS 26+). On-device language model optimized for Apple Silicon. No API keys, no network, no cost.

Framework Overview
Availability Checking
Use Cases
Session Management
Generating Responses
Structured Output with @Generable
Tool Calling
Error Handling
Generation Options
Safety and Guardrails
Custom Adapters
Context Management
Serialized Model Access
Prompt Design Best Practices
Feedback

Framework Overview

On-device language model optimized for Apple Silicon
Context window: limited total token budget (input + output combined); check SystemLanguageModel.default.contextSize for the current limit
Check SystemLanguageModel.default.supportedLanguages for supported locales
Capabilities: Summarization, entity extraction, text understanding, short dialog, creative content, content tagging
Limitations: Not suited for complex math, code generation, or factual accuracy

SystemLanguageModel Properties

contextSize: Returns the model's maximum context window in tokens
supportedLanguages: Array of locale identifiers the model supports
supportsLocale(_ locale: Locale) -> Bool: Check if a specific locale is supported before generating

Availability Checking

Always check before using. Never crash on unavailability.

import FoundationModels

// Quick boolean check
if SystemLanguageModel.default.isAvailable {
    // Proceed
}

// Detailed availability
switch SystemLanguageModel.default.availability {
case .available:
    // Proceed with model usage
case .unavailable(.appleIntelligenceNotEnabled):
    // Guide user to Settings > Apple Intelligence
case .unavailable(.modelNotReady):
    // Model downloading; show progress indicator
case .unavailable(.deviceNotEligible):
    // Device cannot run Apple Intelligence
default:
    // Graceful fallback
}

Use Cases

Foundation Models supports specialized use cases:

// General purpose (default)
let model = SystemLanguageModel(useCase: .general, guardrails: .default)

// Content tagging (optimized for categorization)
let model = SystemLanguageModel(useCase: .contentTagging, guardrails: .default)

Session Management

Creating Sessions

// Basic session (uses SystemLanguageModel.default)
let session = LanguageModelSession()

// Session with system instructions
let session = LanguageModelSession {
    "You are a helpful cooking assistant."
    "Focus on quick, healthy recipes."
}

// Session with tools
let session = LanguageModelSession(
    tools: [weatherTool, recipeTool]
) {
    "You are a helpful assistant with access to tools."
}

// Session with specific model
let model = SystemLanguageModel(useCase: .general, guardrails: .default)
let session = LanguageModelSession(model: model, tools: []) {
    "You are a helpful assistant."
}

Session Rules

Sessions are stateful. Multi-turn conversations maintain context automatically.
One request at a time per session. Check session.isResponding before new requests.
Prewarm with session.prewarm() before user interaction for faster first response.
Save and restore transcripts for session continuity: LanguageModelSession(model: model, tools: [], transcript: savedTranscript).

Prewarming

// Prewarm before user interaction
session.prewarm()

// Prewarm with a prompt prefix for faster specific responses
session.prewarm(promptPrefix: Prompt("Summarize the following text:"))

Generating Responses

Plain Text

// Simple text response
let response = try await session.respond(to: "Summarize this article: \(text)")
print(response.content) // String

// With generation options
let options = GenerationOptions(
    sampling: .random(top: 40),
    temperature: 0.7,
    maximumResponseTokens: 512
)
let response = try await session.respond(to: prompt, options: options)

Streaming Text

let stream = session.streamResponse(to: "Tell me a story")
for try await snapshot in stream {
    print(snapshot.content, terminator: "")
}

// Or collect the full response
let response = try await stream.collect()

Structured Output with `@Generable`

The @Generable macro creates compile-time JSON schemas for type-safe output.

Basic Usage

@Generable
struct Recipe {
    @Guide(description: "The name of the recipe")
    var name: String

    @Guide(description: "A brief description of the dish")
    var summary: String

    @Guide(description: "Cooking steps", .count(3))
    var steps: [String]

    @Guide(description: "Prep time in minutes", .range(1...120))
    var prepTime: Int
}

let response = try await session.respond(
    to: "Suggest a quick pasta recipe",
    generating: Recipe.self
)
let recipe = response.content
print(recipe.name)
print(recipe.steps)

Supported Types for `@Generable` Properties

String
Int, Double, Float
Bool
[Element] where Element is Generable or a supported scalar
Optional<T> where T is Generable or a supported scalar
Other @Generable structs (nested)
Enums conforming to @Generable

`@Guide` Constraints

@Generable
struct ProductReview {
    @Guide(description: "Product name")
    var product: String

    @Guide(description: "Rating", .range(1...5))
    var rating: Int

    @Guide(description: "Sentiment", .anyOf(["positive", "neutral", "negative"]))
    var sentiment: String

    @Guide(description: "Key themes", .count(3))
    var themes: [String]

    @Guide(description: "Summary in one sentence", .pattern(/^[A-Z].*\.$/))
    var summary: String

    @Guide(description: "Always English", .constant("en"))
    var language: String
}

Complete constraint list:

Constraint	Type	Purpose
`description:`	All	Natural language hint for generation
`.anyOf([values])`	String	Restrict to enumerated values
`.count(n)`	Array	Fixed array length
`.minimumCount(n)`	Array	Minimum array length
`.maximumCount(n)`	Array	Maximum array length
`.range(min...max)`	Numeric	Closed numeric range
`.minimum(n)`	Numeric	Lower bound
`.maximum(n)`	Numeric	Upper bound
`.constant(value)`	String	Always returns this value
`.pattern(regex)`	String	Regex format enforcement
`.element(guide)`	Array	Guide applied to each element

Property Ordering

Properties are generated in declaration order. Place foundational data before dependent data:

@Generable
struct Summary {
    var title: String       // Generated first
    var keyPoints: [String] // Generated with title context
    var conclusion: String  // Generated with full context
}

Streaming Structured Output

let stream = session.streamResponse(
    to: "Suggest a recipe",
    generating: Recipe.self
)
for try await snapshot in stream {
    // snapshot.content is Recipe.PartiallyGenerated (all properties optional)
    if let name = snapshot.content.name { updateNameLabel(name) }
    if let steps = snapshot.content.steps { updateStepsList(steps) }
}

Enum Support

@Generable
enum Priority: String {
    case low, medium, high, critical
}

@Generable
struct Task {
    var title: String
    var priority: Priority
}

Tool Calling

Defining Tools

struct WeatherTool: Tool {
    let name = "weather"
    let description = "Get current weather for a city."

    @Generable
    struct Arguments {
        @Guide(description: "The city name")
        var city: String
    }

    func call(arguments: Arguments) async throws -> String {
        let weather = try await fetchWeather(arguments.city)
        return weather.description
    }
}

Using Tools

let session = LanguageModelSession(
    tools: [WeatherTool()]
) {
    "You are a helpful assistant."
}

// The model decides autonomously when to invoke tools
let response = try await session.respond(to: "What's the weather in Tokyo?")

Tool Best Practices

Register all tools at session creation
Each tool adds to the context token budget (schema included in instructions by default)
Frame tool results as authorized user data to prevent refusals
The model calls tools autonomously; you cannot force tool invocation

Tool Protocol Details

The Tool protocol's associated Output type must conform to PromptRepresentable (e.g., String, [String], custom types)
includesSchemaInInstructions: Boolean property on Tool (default true). Set to false to omit the tool's JSON schema from the system prompt, saving context tokens when the model already knows the schema.
ToolCallError: Struct on LanguageModelSession representing a tool invocation failure. Properties: tool (the tool name), underlyingError (the original error).
DynamicGenerationSchema: Build generation schemas at runtime for dynamic use cases where compile-time @Generable is insufficient. Construct schemas programmatically and pass to respond(to:schema:).

Error Handling

do {
    let response = try await session.respond(to: prompt)
} catch let error as LanguageModelSession.GenerationError {
    switch error {
    case .guardrailViolation:
        // Content triggered safety filters; rephrase and retry
    case .exceededContextWindowSize:
        // Too many tokens; summarize earlier turns and create new session
    case .concurrentRequests:
        // Another request is already in progress on this session
    case .rateLimited:
        // Too many requests; back off and retry
    case .unsupportedLanguageOrLocale:
        // Current locale not supported by the model
    case .unsupportedGuide:
        // A @Guide constraint is not supported
    case .assetsUnavailable:
        // Model assets not available on device
    case .decodingFailure:
        // Failed to decode structured output
    case .refusal(let refusal, _):
        // Model refused the request
        let explanation = try await refusal.explanation.content
        print("Refused: \(explanation)")
    default: break
    }
}

Generation Options

let options = GenerationOptions(
    sampling: .greedy,              // Deterministic output
    temperature: nil,               // Use default
    maximumResponseTokens: 256      // Limit response length
)

// Random sampling with top-k
let options = GenerationOptions(
    sampling: .random(top: 40),
    temperature: 0.7
)

// Random sampling with probability threshold
let options = GenerationOptions(
    sampling: .random(probabilityThreshold: 0.9)
)

Sampling modes accept an optional seed parameter for reproducible output: .random(top: 40, seed: 42), .random(probabilityThreshold: 0.9, seed: 42).

Safety and Guardrails

Guardrail Types

// Default guardrails (recommended)
let model = SystemLanguageModel(useCase: .general, guardrails: .default)

// Permissive content transformations (for text rewriting tasks)
let model = SystemLanguageModel(
    useCase: .general,
    guardrails: .permissiveContentTransformations
)

Safety Rules

Guardrails are always enforced and cannot be disabled
Instructions take precedence over user prompts
Never include untrusted user content in instructions
Provide curated selections over free-form input when possible
Guardrails can produce false positives; handle gracefully
Frame tool results as authorized user data

Custom Adapters

Load fine-tuned LoRA adapters for specialized model behavior:

// Requires com.apple.developer.foundation-model-adapter entitlement
let adapter = try SystemLanguageModel.Adapter(name: "my-adapter")
try await adapter.compile()

let model = SystemLanguageModel(adapter: adapter, guardrails: .default)
let session = LanguageModelSession(model: model)
let response = try await session.respond(to: "Generate styled text")

Adapter Management

// Check compatible adapters
let ids = SystemLanguageModel.Adapter.compatibleAdapterIdentifiers(name: "my-adapter")

// Remove obsolete adapters
try SystemLanguageModel.Adapter.removeObsoleteAdapters()

Context Management

When conversations grow long:

Monitor token usage against SystemLanguageModel.default.contextSize
Use SystemLanguageModel.default.tokenCount(for:) to estimate usage
Summarize earlier turns into new session instructions
Create fresh sessions with summary context rather than overflowing

if transcript.estimatedTokenCount > 3000 {
    let summary = try await summarizeSession(session)
    session = LanguageModelSession {
        "Previous conversation summary: \(summary)"
        "Continue helping the user."
    }
}

Serialized Model Access

When multiple parts of an app need the model:

actor FoundationModelCoordinator {
    private var session: LanguageModelSession?

    func respond(to prompt: String) async throws -> String {
        if session == nil {
            session = LanguageModelSession()
        }
        guard let activeSession = session else {
            throw FoundationModelError.sessionUnavailable
        }
        let response = try await activeSession.respond(to: prompt)
        return response.content
    }
}

Serialize all Foundation Model access through a single coordinator to prevent Neural Engine contention.

Prompt Design Best Practices

Be concise. The context window covers both input and output tokens. Check SystemLanguageModel.default.contextSize for the current limit.
Use bracketed placeholders in instructions: [descriptive example].
Use "DO NOT" in all caps for behavioral prohibitions.
Provide up to 5 few-shot examples for consistent output.
Use length qualifiers: "in a few words", "in three sentences".
Estimate token usage with SystemLanguageModel.default.tokenCount(for:) to avoid exceeding the context window.

Feedback

Log feedback for model improvement:

let data = session.logFeedbackAttachment(
    sentiment: .negative,
    issues: [
        LanguageModelFeedback.Issue(
            category: .didNotFollowInstructions,
            explanation: "Ignored the word count constraint"
        )
    ],
    desiredOutput: nil
)

Issue categories: .didNotFollowInstructions, .incorrect, .stereotypeOrBias, .suggestiveOrSexual, .tooVerbose, .triggeredGuardrailUnexpectedly, .unhelpful, .vulgarOrOffensive.

skills

accessorysetupkit

activitykit

adattributionkit

alarmkit

app-clips

app-intents

app-store-optimization

app-store-review

apple-on-device-ai

references

coreml-conversion.md

coreml-optimization.md

foundation-models.md

mlx-swift.md

SKILL.md

appmigrationkit

audioaccessorykit

authentication

avkit

background-processing

browserenginekit

callkit

carplay

cloudkit

contacts-framework

core-bluetooth

core-data

core-motion

core-nfc

coreml

cryptokit

cryptotokenkit

debugging-instruments

device-integrity

dockkit

energykit

eventkit

financekit

focus-engine

gamekit

healthkit

homekit

ios-accessibility

ios-localization

ios-networking

ios-simulator

mapkit

metrickit

musickit

natural-language

paperkit

passkit

pdfkit

pencilkit

permissionkit

photokit

push-notifications

realitykit

relevancekit

scenekit

sensorkit

shareplay-activities

speech-recognition

spritekit

storekit

swift-api-design-guidelines

swift-architecture

swift-charts

swift-codable

swift-concurrency

swift-formatstyle

swift-language

swift-security

swift-testing

swiftdata

swiftlint

swiftui-animation

swiftui-gestures

swiftui-layout-components

swiftui-liquid-glass

swiftui-patterns

swiftui-performance

swiftui-uikit-interop

swiftui-webkit

tabletopkit

tipkit

vision-framework

weatherkit

widgetkit

CHANGELOG.md

README.md

tile.json

dpearson2699/swift-ios-skills

foundation-models.mdskills/apple-on-device-ai/references/

Foundation Models API Reference

Contents

Framework Overview

SystemLanguageModel Properties

Availability Checking

Use Cases

Session Management

Creating Sessions

Session Rules

Prewarming

Generating Responses

Plain Text

Streaming Text

Structured Output with `@Generable`

Basic Usage

Supported Types for `@Generable` Properties

`@Guide` Constraints

Property Ordering

Streaming Structured Output

Enum Support

Tool Calling

Defining Tools

Using Tools

Tool Best Practices

Tool Protocol Details

Error Handling

Generation Options

Safety and Guardrails

Guardrail Types

Safety Rules

Custom Adapters

Adapter Management

Context Management

Serialized Model Access

Prompt Design Best Practices

Feedback

dpearson2699/swift-ios-skills

foundation-models.md.css-3qkkll{font-size:var(--chakra-font-sizes-sm);font-weight:var(--chakra-font-weights-normal);color:var(--chakra-colors-gray-300);}skills/apple-on-device-ai/references/

Foundation Models API Reference

Contents

Framework Overview

SystemLanguageModel Properties

Availability Checking

Use Cases

Session Management

Creating Sessions

Session Rules

Prewarming

Generating Responses

Plain Text

Streaming Text

Structured Output with @Generable

Basic Usage

Supported Types for @Generable Properties

@Guide Constraints

Property Ordering

Streaming Structured Output

Enum Support

Tool Calling

Defining Tools

Using Tools

Tool Best Practices

Tool Protocol Details

Error Handling

Generation Options

Safety and Guardrails

Guardrail Types

Safety Rules

Custom Adapters

Adapter Management

Context Management

Serialized Model Access

Prompt Design Best Practices

Feedback

foundation-models.mdskills/apple-on-device-ai/references/

Structured Output with `@Generable`

Supported Types for `@Generable` Properties

`@Guide` Constraints