Class ContentExtractor

    • Constructor Detail

      • ContentExtractor

        public ContentExtractor​(Content content)
                         throws java.io.IOException,
                                CorruptException

        Create a new content extractor

        Parameters:
        content - the content object of a page or group
        Throws:
        java.io.IOException - Error reading from the document
        CorruptException - The document is corrupt
        java.lang.IllegalArgumentException - if the document associated with the {@link com.pdftools.toolbox.pdf.content.ContentExtractor# content} object has already been closed
        java.lang.IllegalArgumentException - if the document associated with the content has already been closed
        java.lang.IllegalArgumentException - if the {@link com.pdftools.toolbox.pdf.content.ContentExtractor# content}'s document is an output document
        java.lang.IllegalArgumentException - if content is null
    • Method Detail

      • getUngrouping

        public UngroupingSelection getUngrouping()
        Configures the extractor's behavior regarding the selection of groups to be un-grouped. Default value: UngroupingSelection.NONE.
        Throws:
        java.lang.IllegalStateException - the object has already been closed
      • setUngrouping

        public void setUngrouping​(UngroupingSelection value)
        Configures the extractor's behavior regarding the selection of groups to be un-grouped. Default value: UngroupingSelection.NONE.
        Throws:
        java.lang.IllegalStateException - the object has already been closed
        java.lang.IllegalArgumentException - if value is null