Skip to content

[Feature]: Sample for direct PDF ingestion & structured extraction with GPT-4.1 in Azure OpenAI Service #121

@jamesmcroft

Description

@jamesmcroft

Feature Description

Demonstrate how to use Azure OpenAI GPT-4.1 models to accept PDF files as the content input (based on the OpenAI file inputs feature https://platform.openai.com/docs/guides/pdf-files?api-mode=chat) and return a JSON structured output.

Purpose is to evaluate both the effectiveness of GPT-4.1, as well as the effectiveness of processing an entire document using the file input type.

Use Case

With GPT-4.1 and direct file input:

  • Engineering effort should drop to a single GPT call inside existing solutions.
  • Newer GPT-4.1 models should enhance the accuracy and effectiveness of document data extraction solutions.

Motivation

  • Simplified solution – reduces the effort required to build document ingestion pipelines with GPT models

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions