Improved merging of adjacent heading (H1..H6) tags

For accessible PDF (PDF/UA) (clean structured PDF) and for PDF files with "invisible" structure information, it is important that coherent texts are also displayed contiguously in the PDF structure. This does not always happen in the creation of the PDF.

The following PDF file contains a heading, which consists of two components.

Validate the PDF/UA-1 file

By clicking on the PDF/UA-1 button below, the result window will open to start the validation of the structure.
No problems are found during the validation. However, this is not yet sufficient for PDF/UA, which always requires an interactive analysis as well. For this, the "Semantic check" must be started. To do this, click on "Analyze".

Analyze the preflight report

Analyze the PDF/UA-1 file structure

Analyze the PDF/UA-1 file structure

During the analysis, the structure information of the PDF file is displayed. As you can see from the screenshot, the headline was split to two H1 "tags". To fix this problem, a predefined Fixup can be applied.

Fixup "Merge adjacent headings if heading level is the same"

Apply the Fixup "Merge adjacent headings if heading level is the same"

After you run this Fixup and saved the PDF file, click the PDF/UA button on the bottom menu bar again. Then execute PDF/UA validation again and click "Analyze" afterwards to view the structure.

Result: Analyze the PDF/UA-1 file structure

The previously separate parts of the header are now connected in one structure tag.

Analyze the PDF/UA-1 file structure