Qwen3-VL-2B export by carinapeng · Pull Request #68 · apple/coreai-models

carinapeng · 2026-06-29T17:00:16Z

CoreAI export for Qwen3-VL-2B

Produce vlm bundle (main decoder + embedding + vision + tokenizer) conforming to the CoreAISequentialVLMEngine contract, tight co-design with Add VLM inference infrastructure: engine, protocol, and CLI support #65

Add Qwen3-VL vision-language model export support. - gpu/qwen3_vl.py: Qwen3VLForCausalLM (text decoder, input_ids) and Qwen3VLForCausalLMEmbeddings (inputs_embeds variant for VLM fusion) - primitives/macos/cache_scatter.py: slice_scatter-based explicit KV cache (avoids stateful mutable_slice_update Metal kernel issues) - export_qwen3vl_explicit_kv.py: text decoder export (inputs_embeds, explicit KV) - export_vision_encoder_224.py: vision encoder export (448x448 -> 196 visual tokens) - registry.py: register qwen3_vl model entry

stikves

Overall looks good, but we might want to move the model to a better home

models/qwen3-vl or models/vlm?

…models into carina/qwen3vl-export

stikves

Thank you, please run one final end to end test before merging

carinapeng · 2026-06-30T18:24:30Z

Thanks! There is a gap in our VLM runner to show perf metrics, let's take that as a todo, will file an issue

DawerG · 2026-07-01T21:35:10Z

+from coreai_models.primitives.macos.sdpa import SDPA
+
+
+class Attention(nn.Module):


Did we miss unit tests for this authored model?

carinapeng added 3 commits June 25, 2026 13:14

stateful-KV decoder + self-contained vision encoder

185996b

Linting

8efc99c

carinapeng requested a review from stikves June 29, 2026 17:35

Merge branch 'main' into carina/qwen3vl-export

29dc339

carinapeng marked this pull request as ready for review June 29, 2026 23:14

carinapeng mentioned this pull request Jun 29, 2026

Add VLM inference infrastructure: engine, protocol, and CLI support #65

Merged

stikves reviewed Jun 30, 2026

View reviewed changes

Comment thread python/src/coreai_models/primitives/macos/cache_scatter.py

stikves reviewed Jun 30, 2026

View reviewed changes

Comment thread python/export_qwen3vl.py Outdated

stikves reviewed Jun 30, 2026

View reviewed changes

Comment thread python/src/coreai_models/models/gpu/__init__.py

stikves reviewed Jun 30, 2026

View reviewed changes

carinapeng added 5 commits June 29, 2026 20:11

Comment

bc4133f

Merge remote-tracking branch 'upstream/main' into carina/qwen3vl-export

57af0de

Rename, platform specific

c4d88e6

Unit test VLM protocol

cb3cc1b

Merge branch 'carina/qwen3vl-export' of github.com:carinapeng/coreai-…

9295ac3

…models into carina/qwen3vl-export

stikves reviewed Jun 30, 2026

View reviewed changes

Comment thread .gitignore

carinapeng added 2 commits June 30, 2026 10:25

Abstraction for vlm support

57e8739

Minor

eb00a2e

stikves approved these changes Jun 30, 2026

View reviewed changes

carinapeng merged commit de896bf into apple:main Jun 30, 2026
3 checks passed

DawerG reviewed Jul 1, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Qwen3-VL-2B export#68

Qwen3-VL-2B export#68
carinapeng merged 11 commits into
apple:mainfrom
carinapeng:carina/qwen3vl-export

carinapeng commented Jun 29, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

stikves left a comment

Uh oh!

Uh oh!

stikves left a comment

Uh oh!

carinapeng commented Jun 30, 2026 •

edited by tjia1818

Loading

Uh oh!

Uh oh!

DawerG Jul 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		from coreai_models.primitives.macos.sdpa import SDPA


		class Attention(nn.Module):

Uh oh!

Conversation

carinapeng commented Jun 29, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

stikves left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

stikves left a comment

Choose a reason for hiding this comment

Uh oh!

carinapeng commented Jun 30, 2026 • edited by tjia1818 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

DawerG Jul 1, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

carinapeng commented Jun 30, 2026 •

edited by tjia1818

Loading