PDF parser

smalot/pdfparser is a standalone PHP package that provides tools for extracting data from PDF files.

Maintenance status

This library is under limited maintenance. It is still kept compatible with supported PHP versions, and community contributions may be accepted. However, there is currently no active feature development and no guarantee that pull requests will be reviewed or merged in a timely manner. If you plan to contribute anything beyond a small, well-scoped fix, please read CONTRIBUTING.md first.

Features

Load and parse objects and headers
Extract metadata (author, description, ...)
Extract text from ordered pages
Support for compressed PDFs
Support for the Mac OS Roman charset encoding
Handling of hexadecimal and octal encoding in text sections
Create custom configurations (see CustomConfig.md)

Parsing secured documents and extracting form data are currently not supported.
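
As a minimal sketch of the custom configuration feature, the snippet below passes a Config object to the parser. It assumes a Composer install and uses a placeholder file path. The setRetainImageContent option is only one example, and available options may vary between library versions, so treat CustomConfig.md as the authoritative list.

```php
<?php

// Assumption: the library was installed with Composer.
require 'vendor/autoload.php';

// Build a configuration object and adjust one option.
$config = new \Smalot\PdfParser\Config();
// Do not keep image content in memory; this can lower memory usage
// when only text is needed.
$config->setRetainImageContent(false);

// The configuration is passed as the second constructor argument.
$parser = new \Smalot\PdfParser\Parser([], $config);

// Placeholder path; replace it with a real PDF file.
$pdf = $parser->parseFile('/path/to/document.pdf');
echo $pdf->getText();
```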

License

This library is released under the LGPLv3 license.

Install

Since v1, this library requires PHP 7.1 or later. You can install it via Composer:

composer require smalot/pdfparser

If you can't use Composer, include alt_autoload.php-dist instead. It loads all required files automatically.
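
A rough sketch of the Composer-free setup could look like this. The directory name pdfparser is a hypothetical location for a local copy of the library, and the PDF path is a placeholder; adjust both to your project.

```php
<?php

// Assumption: a copy of the library lives in ./pdfparser (hypothetical path).
// The alternative autoloader pulls in the library's classes without Composer.
require_once __DIR__ . '/pdfparser/alt_autoload.php-dist';

$parser = new \Smalot\PdfParser\Parser();

// Placeholder path; replace it with a real PDF file.
$pdf = $parser->parseFile('/path/to/document.pdf');
echo $pdf->getText();
```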

Quick example

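<?php

// Load Composer's autoloader (adjust the path to your project layout).
require 'vendor/autoload.php';

// Parse the PDF file and build the necessary objects.
$parser = new \Smalot\PdfParser\Parser();
$pdf = $parser->parseFile('/path/to/document.pdf');

// Extract the text of all pages and print it.
$text = $pdf->getText();
echo $text;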

Further usage information can be found in the documentation.
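
Besides full-text extraction, the feature list mentions metadata and page-ordered text. The following sketch shows one way to read both, assuming a Composer install and a placeholder file path. Which metadata keys are present depends on the PDF itself.

```php
<?php

// Assumption: the library was installed with Composer.
require 'vendor/autoload.php';

$parser = new \Smalot\PdfParser\Parser();

// Placeholder path; replace it with a real PDF file.
$pdf = $parser->parseFile('/path/to/document.pdf');

// getDetails() returns an associative array of metadata.
foreach ($pdf->getDetails() as $property => $value) {
    // Some properties hold several values; join them for display.
    if (is_array($value)) {
        $value = implode(', ', $value);
    }
    echo $property . ' => ' . $value . PHP_EOL;
}

// getPages() returns the pages in document order.
foreach ($pdf->getPages() as $index => $page) {
    echo '--- Page ' . ($index + 1) . ' ---' . PHP_EOL;
    echo $page->getText() . PHP_EOL;
}
```

Iterating over pages rather than calling getText() on the whole document keeps page boundaries visible, which helps when only part of a file is needed.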

Documentation

Documentation can be found in the doc folder.

