• Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar
  • Skip to footer
TSB Alfresco Cobrand White tagline

Technology Services Group

  • Home
  • Products
    • Alfresco Enterprise Viewer
    • OpenContent Search
    • OpenContent Case
    • OpenContent Forms
    • OpenMigrate
    • OpenContent Web Services
    • OpenCapture
    • OpenOverlay
  • Solutions
    • Alfresco Content Accelerator for Claims Management
      • Claims Demo Series
    • Alfresco Content Accelerator for Policy & Procedure Management
      • Compliance Demo Series
    • OpenContent Accounts Payable
    • OpenContent Contract Management
    • OpenContent Batch Records
    • OpenContent Government
    • OpenContent Corporate Forms
    • OpenContent Construction Management
    • OpenContent Digital Archive
    • OpenContent Human Resources
    • OpenContent Patient Records
  • Platforms
    • Alfresco Consulting
      • Alfresco Case Study – Canadian Museum of Human Rights
      • Alfresco Case Study – New York Philharmonic
      • Alfresco Case Study – New York Property Insurance Underwriting Association
      • Alfresco Case Study – American Society for Clinical Pathology
      • Alfresco Case Study – American Association of Insurance Services
      • Alfresco Case Study – United Cerebral Palsy
    • HBase
    • DynamoDB
    • OpenText & Documentum Consulting
      • Upgrades – A Well Documented Approach
      • Life Science Solutions
        • Life Sciences Project Sampling
    • Veeva Consulting
    • Ephesoft
    • Workshare
  • Case Studies
    • White Papers
    • 11 Billion Document Migration
    • Learning Zone
    • Digital Asset Collection – Canadian Museum of Human Rights
    • Digital Archive and Retrieval – ASCP
    • Digital Archives – New York Philharmonic
    • Insurance Claim Processing – New York Property Insurance
    • Policy Forms Management with Machine Learning – AAIS
    • Liferay and Alfresco Portal – United Cerebral Palsy of Greater Chicago
  • About
    • Contact Us
  • Blog

Improving Viewing Performance of PDF Documents – PDF.js vs OpenAnnotate

You are here: Home / ECM Landscape / Improving Viewing Performance of PDF Documents – PDF.js vs OpenAnnotate

August 22, 2017

Over the past few years, TSG has leveraged PDF.js as a quick viewer for PDFs within our applications.  This post will present how clients are leveraging OpenAnnotate rather than PDF.JS to improve viewing times (while getting the ability to annotate PDF images).

To understand the performance differences of PDF.JS and OpenAnnotate requires a small understanding of how each work.

  • PDF.JS leverages 100% client side JavaScript to open a PDF image file for viewing.
  • OpenAnnotate relies on a server side component to break the PDF into separate image pages and allows viewing of individual pages. Once the pages are broken down, OpenAnnotate relies on 100% client side JavaScript to view and annotate an individual page.  The server side combines the annotations into a single annotation file for storage within the ECM system.

While the viewing performance with PDF.JS is often satisfactory, PDF.js falls short for a few key reasons:

  • 100% Client Side – PDF.js loads the entire PDF into the client via JavaScript. This works fine for moderately large documents (10 pages), however many of our clients have documents in the 300-700 page range. Larger files put a lot of strain on the network, and leaves minimal options when it comes to performance tuning.
  • Full PDF Download – PDF.js sends the entire PDF to the client, which provides the user full access to the document that may require more security and restrictions.

Leveraging OpenAnnotate, TSG has been able to create a new OpenAnnotate Streamline Viewer to optimize the load times for large PDFs, and restrict the requirement for the full PDF to be served to a client. Initial testing has shown an improvement of 45% in average load times for documents between 600-700 pages.  Moving the PDF processing into OpenAnnotate provides the following benefits:

  • Minimize Client Side Processing – PDFs are rendered into images which are streamed back to the browser proactively. This removes the requirement of requiring the browser to download the entire document and process it within the browser.
  • Increased Scalability and Tuning Options – When clients have had issues with opening large PDFs using PDF.js, there have been few options to improve performance. Moving the heavy lifting of image processing onto machines that are more powerful, removes the burden from the browser.
  • 1st Page Quick Load – OpenAnnotate proactively processes the first page of PDFs making them instantly viewable by a user while the rest of the pages are rendered for viewing. This process is ideal for users that need to quickly access large documents and don’t want to be forced to wait for an entire document to be downloaded before they can start their work – something not possible with PDF.js.
  • Limited PDF Download – Since the browser only receives the processed pages as images, clients do not need to worry about a PDF falling into the wrong hands, or being modified using tools like Adobe Acrobat.
  • Easily Configure Annotations – Use OpenAnnotate for fast viewing of PDFs, or enable the full set of robust annotation tools like: sticky notes, stamps, drawing, redaction and page manipulation. The functionality is available via simple configuration change.

Filed Under: ECM Landscape, ECM Solutions, OpenAnnotate

Reader Interactions

Comments

  1. Andrey Panfilov says

    August 22, 2017 at 12:59 pm

    > 100% Client Side – PDF.js loads the entire PDF into the client via JavaScript. This works fine for moderately large documents (10 pages), however many of our clients have documents in the 300-700 page range. Larger files put a lot of strain on the network, and leaves minimal options when it comes to performance tuning.

    That is definitely not true. PDF.js supports progressive loading since 2013.

    Reply
    • Ben Allen says

      September 8, 2017 at 10:21 am

      Andrey,

      You are correct in that pdf.js has some capabilities to load the PDF in chunks for some PDFs. What we see from clients are that many of their PDFs are not saved in a way that allows pdf.js or any PDF software to take advantage of the “Fast Web View” since they have to be saved in a special mode for “Fast Web View” which many PDF renderers embedded in Documentum/Alfresco do not do by default. Adobe has made this more evident in recent versions of Acrobat when saving PDFs, but many of the PDFs that we have seen over the years do not take advantage of it (https://helpx.adobe.com/acrobat/using/optimizing-pdfs-acrobat-pro.html#EnableFastWebViewinaPDF)

      Reply

Leave a Reply Cancel reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Primary Sidebar

Search

Related Posts

  • Alfresco Ranked a Leader in Independent Content Management Market Analysis
  • Alfresco – Do More With Redactions and Personally Identifiable Information
  • AWS with DynamoDB for Content Management – Reference Architecture & Cost Estimate
  • Google Cloud with BigTable for Content Management – Reference Architecture & Cost Estimate
  • Reference Architecture for Content Management on Azure HDInsight with HBase
  • Content Capture 2.0 – The Upcoming Disruptions – Thoughts and Analysis
  • DynamoDB 11 Billion Benchmark 11 Thousand Concurrent Users Success!!! – Lessons Learned
  • DynamoDB 11 Billion Benchmark Search Index Success!!! – Lessons Learned
  • Managing Security Videos – What are the issues?
  • Alfresco – Application Performance Monitoring

Recent Posts

  • Alfresco Content Accelerator and Alfresco Enterprise Viewer – Improving User Collaboration Efficiency
  • Alfresco Content Accelerator – Document Notification Distribution Lists
  • Alfresco Webinar – Productivity Anywhere: How modern claim and policy document processing can help the new work-from-home normal succeed
  • Alfresco – Viewing Annotations on Versions
  • Alfresco Content Accelerator – Collaboration Enhancements
stacks-of-paper

11 BILLION DOCUMENT
BENCHMARK
OVERVIEW

Learn how TSG was able to leverage DynamoDB, S3, ElasticSearch & AWS to successfully migrate 11 Billion documents.

Download White Paper

Footer

Search

Contact

22 West Washington St
5th Floor
Chicago, IL 60602

inquiry@tsgrp.com

312.372.7777

Copyright © 2022 · Technology Services Group, Inc. · Log in

This website uses cookies to improve your experience. Please accept this site's cookies, but you can opt-out if you wish. Privacy Policy ACCEPT | Cookie settings
Privacy & Cookies Policy

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these cookies, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may have an effect on your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. This category only includes cookies that ensures basic functionalities and security features of the website. These cookies do not store any personal information.
Non-necessary
Any cookies that may not be particularly necessary for the website to function and is used specifically to collect user personal data via analytics, ads, other embedded contents are termed as non-necessary cookies. It is mandatory to procure user consent prior to running these cookies on your website.
SAVE & ACCEPT