Improved merging of adjacent heading (H1..H6) tags (8.3)

For accessible PDF (PDF/UA) (clean structured PDF) and for PDF files with "invisible" structure information, it is important that coherent texts are also displayed contiguously in the PDF structure. This is not always happen in the creation of the PDF.

pdfToolbox 8.3 has a fix to merge adjacent heading tags in the PDF structure.

Launch pdfToolbox Desktop

Open the PDF file "Split headlines_Demo file.pdf"

  1. Go to "File".
  2. Click "Open" to launch the file load dialog box and navigate to the folder where the input PDF file "Split headlines_Demo file" is located.

Validate the PDF/UA-1 file

  1. Click "PDF/UA-1".

Analyze the preflight report

Analyze the preflight report

During validation of the PDF/UA-1 file no problems are found. But that is not sufficient for PDF/UA because it requires an extra analyze.

  1. Click "Analyze".

Analyze the PDF/UA-1 file structure

The PDF structure information is showed and distribute the header on two H1"tags".

Open the Fixups dialog

  1. Go to "Tools".
  2. Click "Fixups".

Apply the Fixup "Merge adjacent headings if heading level is the same"

Apply the Fixup "Merge adjacent headings if heading level is the same"
  1. In the search field search to "Merge adjacent headings if heading level is the same".
  2. Select the "Merge adjacent headings if heading level is the same" Fixup.
  3. Click "Fix" to apply the Fixup on the PDF file loaded in pdfToolbox.

Save the output PDF file

Save the output PDF file
  1. Save the output PDF file as "Split headlines_Demo file_Processed".
  2. Click "Save".

Analyze the preflight report

Analyze the preflight report

Validate the PDF/UA-1 file

  1. Click "PDF/UA-1".

Analyze the preflight report

Analyze the preflight report
  1. Click "Analyze".

Analyze the PDF/UA-1 file structure

The previously separate parts of the header are now connected in one structure tag.

0 Comments

Send Your Comment

E-Mail me when someone replies to this comment