When I use PDFBox and the PDFMergerUtility with either appendDocument or mergeDocuments, I noticed that the tag structure of each individual document is nested under a parent tag. If I merge 7 documents, the tag appears to be nested under another one 7 times. Is this by design? Is there a way to merge the documents so that the tags are flattened and not nested? The reason for merging the documents is to eventually load the result into a content management system using PPD, and the content management system will allow the user to retrieve each document separately. The JAWS reader can still read the document, but I noticed the Document tags are heavily nested, which could cause a performance issue.
PDFBox version 2.0.24 (released in June 2021) has improved ordering of the merged tag structure, similar to what Adobe does. This was discussed in PDFBOX-5198.
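
To make the nesting concern concrete, here is a toy model in plain Java. It does not use PDFBox and is not a reproduction of PDFBox internals in any version. `Tag`, `mergeNested`, `mergeFlat`, and `sampleDocs` are hypothetical names made up for illustration. It only compares how deep the tag tree gets when every append wraps the previous result in a new parent, versus when all documents are placed side by side under one parent.

```java
import java.util.ArrayList;
import java.util.List;

class TagNestingSketch {
    // Hypothetical stand-in for a structure element; NOT a PDFBox class.
    static final class Tag {
        final String name;
        final List<Tag> kids = new ArrayList<>();

        Tag(String name) { this.name = name; }

        Tag add(Tag kid) { kids.add(kid); return this; }
    }

    // Length of the longest chain of tags, counting the root as 1.
    static int depth(Tag t) {
        int max = 0;
        for (Tag k : t.kids) {
            max = Math.max(max, depth(k));
        }
        return 1 + max;
    }

    // Nested style (assumed model of the behaviour described in the question):
    // each append wraps the result so far in a new parent tag.
    static Tag mergeNested(List<Tag> docs) {
        Tag merged = docs.get(0);
        for (int i = 1; i < docs.size(); i++) {
            merged = new Tag("Document").add(merged).add(docs.get(i));
        }
        return merged;
    }

    // Flat style: a single parent holds every document's tree as a sibling.
    static Tag mergeFlat(List<Tag> docs) {
        Tag root = new Tag("Document");
        for (Tag d : docs) {
            root.add(d);
        }
        return root;
    }

    // Builds n small documents, each a Document tag with one P child.
    static List<Tag> sampleDocs(int n) {
        List<Tag> docs = new ArrayList<>();
        for (int i = 0; i < n; i++) {
            docs.add(new Tag("Document").add(new Tag("P")));
        }
        return docs;
    }

    public static void main(String[] args) {
        System.out.println("nested depth: " + depth(mergeNested(sampleDocs(7))));
        System.out.println("flat depth:   " + depth(mergeFlat(sampleDocs(7))));
    }
}
```

In this model the nested variant grows by one level per appended document, while the flat variant stays at a fixed depth no matter how many documents are merged. Whether a given PDFBox version behaves like either variant is best checked by inspecting the merged file's tag tree.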