Skip to content

Add ortpackage recipes#491

Open
xiaoyu-work wants to merge 5 commits into
mainfrom
xiaoyu/modelpkg
Open

Add ortpackage recipes#491
xiaoyu-work wants to merge 5 commits into
mainfrom
xiaoyu/modelpkg

Conversation

@xiaoyu-work

Copy link
Copy Markdown
Contributor

This pull request adds new documentation, usage recipes, and supporting files to enable packaging both the Qwen3.5-0.8B and Phi-4-mini-reasoning models into ONNX Runtime .ortpackage format for GenAI workflows. It introduces step-by-step guides for exporting CPU and GPU variants, combining them, and running inference, as well as updating .gitignore and requirements for these workflows.

ONNX Runtime ortpackage support and documentation:

  • Added detailed README.md guides for both Qwen-Qwen3.5-0.8B/ortpackage and microsoft-Phi-4-mini-reasoning/ortpackage directories, describing how to export Hugging Face models to ONNX (CPU/GPU), combine them into a .ortpackage, and run inference using onnxruntime-genai. [1] [2]
  • Updated .gitignore files in Qwen-Qwen3.5-0.8B/ortpackage, microsoft-Phi-4-mini-reasoning/ortpackage, and microsoft-Phi-4-mini-reasoning/OpenVINO to exclude generated CPU/GPU folders and .ortpackage artifacts. [1] [2]
  • Added requirements.txt files specifying dependencies on olive-ai[mobius-ai] and onnxruntime-genai for both model directories.
  • Enhanced microsoft-Phi-4-mini-reasoning/OpenVINO/README.md with instructions for combining OpenVINO IR outputs into a .ortpackage for GenAI runtime.

Copilot AI review requested due to automatic review settings June 9, 2026 20:57

Copilot AI left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Adds new .ortpackage recipe documentation and supporting files to package and run Qwen3.5-0.8B and Phi-4-mini-reasoning with ONNX Runtime GenAI, including export (CPU/GPU), package generation, and inference usage.

Changes:

  • Added ortpackage usage guides + minimal dependency lists for Qwen/Qwen3.5-0.8B and microsoft/Phi-4-mini-reasoning.
  • Added per-recipe .gitignore rules to exclude generated CPU/GPU export folders and produced .ortpackage artifacts.
  • Extended the Phi-4-mini-reasoning OpenVINO recipe README with an ortpackage generation command, plus an ignore rule for the generated package.

Reviewed changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated 3 comments.

Show a summary per file
File Description
Qwen-Qwen3.5-0.8B/ortpackage/README.md New end-to-end ortpackage recipe guide (export CPU/GPU, package, run inference).
Qwen-Qwen3.5-0.8B/ortpackage/requirements.txt Adds dependencies needed for the ortpackage workflow.
Qwen-Qwen3.5-0.8B/ortpackage/.gitignore Ignores generated export outputs and package directory.
microsoft-Phi-4-mini-reasoning/ortpackage/README.md New end-to-end ortpackage recipe guide (export CPU/GPU, package, run inference).
microsoft-Phi-4-mini-reasoning/ortpackage/requirements.txt Adds dependencies needed for the ortpackage workflow.
microsoft-Phi-4-mini-reasoning/ortpackage/.gitignore Ignores generated export outputs and package directory.
microsoft-Phi-4-mini-reasoning/OpenVINO/README.md Adds instructions to combine OpenVINO outputs into a .ortpackage.
microsoft-Phi-4-mini-reasoning/OpenVINO/.gitignore Ignores the generated OpenVINO ortpackage output.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment thread Qwen-Qwen3.5-0.8B/ortpackage/README.md Outdated
Comment thread Qwen-Qwen3.5-0.8B/ortpackage/README.md Outdated
Comment thread Qwen-Qwen3.5-0.8B/ortpackage/README.md Outdated
xiaoyu-work and others added 3 commits June 9, 2026 14:00
Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants