Implement structured output generation for both LlamaLanguageModel / MLXLanguageModel #75

eastriverlee · 2025-12-24T15:45:59Z

Related to #27

Copilot

Pull request overview

This PR implements structured output generation for LlamaLanguageModel and MLXLanguageModel by adding constrained token sampling to generate JSON that conforms to a schema. The implementation includes comprehensive tests covering various data types and structures.

Key changes:

Added ConstrainedJSONGenerator that uses token-level sampling to generate schema-conformant JSON
Implemented TokenBackend protocol with adapters for both Llama and MLX models
Enhanced GenerationGuide to store constraint values for min/max on numbers and arrays
Extended GenerationSchema with character validation and schema prompt generation

Reviewed changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 8 comments.

Show a summary per file

File	Description
Tests/AnyLanguageModelTests/StructuredGenerationTests.swift	Comprehensive test suite covering simple types, nested structs, enums, arrays, and optionals across all supported model types
Tests/AnyLanguageModelTests/GenerableMacroTests.swift	Added round-trip tests for enums, nested structs, and arrays
Sources/AnyLanguageModelMacros/GenerableMacro.swift	Refactored guide extraction to use a structured Constraints type and properly parse numeric ranges and array count constraints
Sources/AnyLanguageModel/StructuredGeneration.swift	New file implementing token-level constrained JSON generation with TokenBackend protocol and ConstrainedJSONGenerator
Sources/AnyLanguageModel/Models/SystemLanguageModel.swift	Updated to use schema-based generation for non-String types and added conversion to FoundationModels.DynamicGenerationSchema
Sources/AnyLanguageModel/Models/MLXLanguageModel.swift	Implemented MLXTokenBackend and structured JSON generation with proper token sampling and repetition penalty handling
Sources/AnyLanguageModel/Models/LlamaLanguageModel.swift	Implemented LlamaTokenBackend and structured JSON generation with batch-based decoding and sampler integration
Sources/AnyLanguageModel/GenerationSchema.swift	Added schemaPrompt() method, character validation for JSON strings, improved node equality checking, and support for constraint propagation
Sources/AnyLanguageModel/GenerationGuide.swift	Made GenerationGuide store actual constraint values (min/max, minCount/maxCount) for use during schema generation

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Sources/AnyLanguageModel/StructuredGeneration.swift

Sources/AnyLanguageModel/GenerationSchema.swift

Sources/AnyLanguageModel/Models/MLXLanguageModel.swift

Sources/AnyLanguageModel/StructuredGeneration.swift

Sources/AnyLanguageModel/Models/LlamaLanguageModel.swift

Sources/AnyLanguageModel/StructuredGeneration.swift

mattt · 2026-01-05T14:27:48Z

@eastriverlee Thank you for your contribution! And thank you for your patience. I'll have a chance to look a this soon.

…nerator

mattt · 2026-01-20T14:31:02Z

@eastriverlee Thanks again for your patience. I just rebased, resolving the conflicts as best I could. I recently merged #59, which takes a slightly different approach for schema conversion. I'm working to harmonize these implementations now...

…rompt via JSONSchema

…hema

mattt requested a review from Copilot January 5, 2026 13:56

Copilot started reviewing on behalf of mattt January 5, 2026 13:56 View session

Copilot AI reviewed Jan 5, 2026

View reviewed changes

eastriverlee and others added 9 commits January 20, 2026 06:01

Fix SystemLanguageModel to pass schema for structured generation

ded134c

Implement logit-constrained structured generation for LlamaLanguageModel

0fa246d

Implement logit-constrained structured generation for MLXLanguageModel

be79024

Fix duplicate type crash in schema generation

887964a

Enforce count + numeric range guides

ecae1bd

Respect temperature for structured generation

9fb930b

Refactor Llama and MLX structured generation to shared constrained ge…

77950e3

…nerator

swift format -i -r .

c8fdb9d

Restore SystemLanguageModel.swift from HEAD of main

200886b

mattt force-pushed the main branch from 63bcd14 to 200886b Compare January 20, 2026 14:29

mattt added 3 commits January 20, 2026 06:41

Align structured generation prompts and defaults, and enrich schema p…

51294c6

…rompt via JSONSchema

Respect schema prompt flag and enhance structured prompts with JSONSc…

2488201

…hema

Add documentation comments to helper methods

04650e1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement structured output generation for both LlamaLanguageModel / MLXLanguageModel #75

Implement structured output generation for both LlamaLanguageModel / MLXLanguageModel #75

Uh oh!

eastriverlee commented Dec 24, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mattt commented Jan 5, 2026 •

edited

Loading

Uh oh!

mattt commented Jan 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Implement structured output generation for both LlamaLanguageModel / MLXLanguageModel #75

Are you sure you want to change the base?

Implement structured output generation for both LlamaLanguageModel / MLXLanguageModel #75

Uh oh!

Conversation

eastriverlee commented Dec 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mattt commented Jan 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mattt commented Jan 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

eastriverlee commented Dec 24, 2025 •

edited

Loading

mattt commented Jan 5, 2026 •

edited

Loading