PDFlib TET PDF IFilter
Optimizes PDF resource handling and enhances language bindings for .NET 5, PHP 8, Perl 5.32 and Ruby 3.0.
Features
- Optimized PDF resource handling to improve performance for documents with excessive numbers of images, patterns or other resources.
- Security and performance updates of all third-party components.
- Harden processing of damaged and illegal PDF documents by testing the full Issue Tracker PDF corpus with tens of thousands of stressful PDF files.
- Expanded platform and CPU support including Apple macOS on ARM64 and Linux on ARM64.
- Timeout can be specified to limit processing time for large or complex files.
- Enhancements in all language bindings and updates for the latest language versions including Microsoft .NET 5, PHP 8, Perl 5.32 and Ruby 3.0.
- Support for native UTF-8, UTF-16 and UTF-32 Unicode strings in C++17 and C++20.
- Implement detection of certain kinds of attacks using legal PDF constructs with overly large data structures.
- Improved TETML output for edge cases.
- Improved word boundary, list and paragraph detection.
- Support for Unicode 13.
- Improved performance for Classic .NET binding.
- Many minor bug fixes and improvements.
- Updated CJK character collections and CMaps for PDF 2.0.