Skip to main content

Archive PDF/A-1 workflow

This workflow is engineered specifically for preparing documents for archiving, specifically where PDF/A-2 or PDF/A-3 conformance is not allowed.

This workflow is disabled by default because it is recommended you use Archive PDF/A 2 workflow for archiving. Since PDF/A-2 is based on a newer version of the PDF standard, more PDF features are allowed such as transparency, layers, embedded files, and a less restrictive internal file structure.

For these reasons, achieve fewer conversion errors and better conversion quality with the Archive PDF/A-2 workflow. Particularly, if you use the Conversion Service in a Docker container where document content rasterization is unavailable, use PDF/A-2 workflow instead to avoid visual differences, removed transparency, or other potential issues.

info

More information on PDF/A-2, read PDF/A-2 overview.

Supported file formats for Archive PDF/A-1 workflow

This workflow supports these file formats:

Content typeFile type
Document formatsPDF 1.x, PDF 2.0, PDF/A-1, PDF/A-2, PDF/A-3
Image formatsJPEG, JPEG200, TIFF, BMP, GIF, JBIG2, PNG, HEIC, HEIF, WebP
EmailEML, MSG (without encryption)
WordDOC, DOT, DOCX, DOCM, DOTX, DOTM, RTF, XML (WordprocessingML 2003)
ExcelXLS, XLT, XLSX, XLSM, XLTX, XLTM, XML (SpreadsheetML 2003)
PowerPointPPT, PPS, PPTX, PPTM, PPSX, PPSM
OpenOfficeODT, ODS, ODP
OtherCSV, HTML, HTM (prepared for archiving), TXT, XML, ZIP (without password protection)
Note on OpenOffice formats

PDF conversion of OpenDocument Format depends on the rendering in Microsoft Word, Excel or PowerPoint. In particular, visual differences may occur with tables and tabs. The visual differences caused by the rendering of shapes are usually not acceptable.

Note on HTML format

HTML documents need to be self-contained (layout information and images are either inline or available on the web) and suited for portrait page layout. JavaScript content is disabled during processing.

Note on XML format

Layout information and images need to be available on the web.

The conversion of most file formats is enabled by default in the profile's Convert Mode Configuration for Child Documents (Attachments).

Features and limitations

Compared to the PDF/A-2 workflow, this workflow has the following features and limitations:

Convert to PDF/A-1

Collect mode configuration

The PDF/A-1 standard does not allow embedded files. Therefore, the only collect mode configuration available is Merge.

Document content rasterization

Certain graphical features of PDF documents are not allowed in PDF/A-1; for example, transparency. The removal of these features can lead to severe visual differences that may render the page's content unreadable.

The Windows version of the product can preserve the visual appearance of such documents by converting the pages' content to images (content rasterization).When doing so, a content rasterized warning is generated. If possible, content rasterization preserves the pages' extractable text, links, outlines and viewer preferences.

The Docker version cannot rasterize pages. Therefore, the conversion of such files leads to transparency removed or visual differences warning depending on whether the visual differences are caused by the removal of transparency.

PDF/A-1 specific conversion warnings

The following warnings occur more frequently when converting to PDF/A-1:

  • Annotation removed
  • Content rasterized
  • Layers removed
  • Transparency removed
  • Visual differences So the profile's conversion settings for these warnings is particularly important.
    Note: Content rasterized warning appears only in Windows where content rasterization is available; Transparency removed warning only appears in Docker image, where content rasterizaiton is not available.

Sign

Document signing is currently not implemented.

Job and document options for the PDF/A-1 workflow

The PDF/A-1 workflow lets you use job and document options to pass job- and document-specific values to be used when processing documents using the workflow.

Job options

Job options apply to all documents processed in the same job. Any subsequent jobs processed with the workflow profile use the profile's default settings.

TypeOptionDescription
Document compression and optimizationOPTIMIZETurn on or off document compression and optimization. All settings must be previously set up in the profile.
If true, documents included in the job are compressed and optimized according to the optimization profile set in the profile settings. If false, no document optimization is performed.
Documents can be optimized according to five profiles:
  • Web: Compresses the file without affecting viewing quality on digital devices
  • Print: Compresses the file without affecting print quality
  • Max: Removes redundant data and reduces image resolution to achieve a minimal viable file size
  • MRC: Profile designed to process mixed raster content
  • Archive: Prepares a document for archiving in PDF/A format
OCROCRTurn on and off optical character recognition for the job. All settings must be previously set up in the profile. If true, documents included in the job are processed to recognize any images as text (as appropriate). If false, no OCR is performed.
MetadataMETA.AUTHORThe author of the document
MetadataMETA.TITLEThe title of the document
MetadataMETA.SUBJECTThe subject of the document
MetadataMETA.KEYWORDSKeywords that apply to the document
note

Apart from the standard metadata properties, you can also set extended metadata properties.

Document options

Document options apply only to a specific input. It allows you to determine specific properties based on an individual document, rather than as a global setting (either determined by the job or the profile). Any subsequent jobs processed with the workflow profile use the profile's default settings.

TypeOptionDescription
Document propertyDOC.PASSWORDSet the password for the document.