• Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar
  • Skip to footer
TSB Alfresco Cobrand White tagline

Technology Services Group

  • Home
  • Products
    • Alfresco Enterprise Viewer
    • OpenContent Search
    • OpenContent Case
    • OpenContent Forms
    • OpenMigrate
    • OpenContent Web Services
    • OpenCapture
    • OpenOverlay
  • Solutions
    • Alfresco Content Accelerator for Claims Management
      • Claims Demo Series
    • Alfresco Content Accelerator for Policy & Procedure Management
      • Compliance Demo Series
    • OpenContent Accounts Payable
    • OpenContent Contract Management
    • OpenContent Batch Records
    • OpenContent Government
    • OpenContent Corporate Forms
    • OpenContent Construction Management
    • OpenContent Digital Archive
    • OpenContent Human Resources
    • OpenContent Patient Records
  • Platforms
    • Alfresco Consulting
      • Alfresco Case Study – Canadian Museum of Human Rights
      • Alfresco Case Study – New York Philharmonic
      • Alfresco Case Study – New York Property Insurance Underwriting Association
      • Alfresco Case Study – American Society for Clinical Pathology
      • Alfresco Case Study – American Association of Insurance Services
      • Alfresco Case Study – United Cerebral Palsy
    • HBase
    • DynamoDB
    • OpenText & Documentum Consulting
      • Upgrades – A Well Documented Approach
      • Life Science Solutions
        • Life Sciences Project Sampling
    • Veeva Consulting
    • Ephesoft
    • Workshare
  • Case Studies
    • White Papers
    • 11 Billion Document Migration
    • Learning Zone
    • Digital Asset Collection – Canadian Museum of Human Rights
    • Digital Archive and Retrieval – ASCP
    • Digital Archives – New York Philharmonic
    • Insurance Claim Processing – New York Property Insurance
    • Policy Forms Management with Machine Learning – AAIS
    • Liferay and Alfresco Portal – United Cerebral Palsy of Greater Chicago
  • About
    • Contact Us
  • Blog

Migrating FileNet with Daeja Annotations to AWS S3

You are here: Home / Adobe / Migrating FileNet with Daeja Annotations to AWS S3

August 8, 2018

Currently TSG is working with several clients who are looking to move away from FileNet to AWS S3 and/or a modern ECM.  A unique requirements is that the migration includes both documents and annotations.  In a previous post, we wrote about migrating from FileNet to AWS S3 and focused on how to move the documents with conversion of proprietary TIFF to PDF.  For this post we will address moving the annotations for those clients with a Daeja viewer annotations to a more standard, non-proprietary XFDF Adobe standard.

Daeja Annotation Structure

Two Unique FileNet/ Daeja characteristics that add complexity to converting these annotations to the standard XFDF format are:

  1. FileNet can store multi-paged documents as individual TIFF image files or as multi page TIFF files
  2. To align with FileNet document storage, Deaja stores each annotation in an individual file per annotation while the standard XFDF format stores all annotations for a document in one file.

Daeja uses an XML-based notation to store their annotation data. The XML contains all the metadata such as the type of annotation, creator, page of document, positioning, and text.   The metadata is sometimes dependent on the type of annotation such as a rectangle may have coordinates and line thickness while a text note may have information such as font size and typeface.

To convert Daeja XML annotations to an XFDF annotation, a mapping exercise needs to be performance to make several conversions:

  1. Convert multiple XML files (one per annotation) into one XFDF file. This will greatly increase performance on load time as well as fragmentation on the server.
  2. Reorganize the format to be XFDF compliant
  3. Convert the individual TIFF files (one for each page of the document) which are stored in FileNet into a single PDF

In the below example you can see a before and after of a migrated annotation, as well as the mapping from the old XML format to the Adobe XFDF standard.  The image below is a representation of a square annotation in red:

While there is some metadata in Daeja that is not applicable to an Adobe XFSF, most attributes translate directly and other require some mapping:

With the XFDF annotation, the annotation can be displayed correctly TSG’s OpenAnnotate tool

 

OpenMigrate Platform

TSG has been moving customers from FileNet to AWS or a modern ECM system like Alfresco for over 10 years.  (https://tsgrp.wpengine.com/2017/09/20/filenet-what-is-the-business-case-to-replace-it/) Typically, clients will only move the file and the metadata to the new system.  In the case of FileNet documents and annotations, clients will need additional logic to occur to transform both the image and annotation files at the time of migration.  By leveraging OpenMigrate’s extensible architecture, additional logic to transform documents and annotations can be configured in the OpenMigrate listener, making the migrating of annotations a relatively simple task.

For this scenario, the logic to convert the annotations occurs on the Target Layer:

For each document:

  1. All TIFF Images are retrieved from FileNet
  2. Individual TIFF Images are converted into a single PDF file and stored in the Target ECM system
  3. All XML annotation objects for a document being migrated are retrieved from FileNet
  4. For each annotation object, the type of annotation and the metadata is identified and for the given annotation type, all required annotation data is written to the new XFDF object on the Target.
  5. The newly created XFDF object is related to the newly migrated PDF document.

After the migration is completed, no further updates are required.  TSG’s OpenAnnotate viewer (https://tsgrp.wpengine.com/products/openannotate/) can provide the capabilities to both view and add to the PDF annotations.

Summary

Preserving Daeja annotations when moving from FileNet to AWS S3 or a modern ECM system requires planning as well as the tools to convert from proprietary formats to a more open format like XFDF.  TSG can provide software products OpenMigrate and OpenAnnotate to help simplify the movement of both the documents and the annotations.

For additional information on things to consider when planning a migration see the other migration blog posts below:

  • Planning a Successful Migration: https://tsgrp.wpengine.com/2017/02/17/ecm-migrations-top-ten-planning-tips-for-a-successful-migration/
  • Why Migrations are not easy: https://tsgrp.wpengine.com/2018/06/07/ecm-migrations-10-reasons-there-will-never-be-an-easy-button-for-migrations/
  • Migrating to AWS and S3:
    • https://tsgrp.wpengine.com/2017/11/10/filenet-migrating-to-amazon-s3/
    • https://tsgrp.wpengine.com/2018/01/22/filenet-and-content-manager-cmod-migration-to-amazon-web-services-why-now/

Filed Under: Adobe, Amazon, FileNet, OpenAnnotate, OpenMigrate

Reader Interactions

Trackbacks

  1. FileNet Migration to Amazon Web Services – Why now more than ever? says:
    May 7, 2019 at 4:03 pm

    […] See our related post as we worked with one client to design how they could move current FileNet content to S3 while continuing to link the content to FileNet.   We also posted on how we could move Deaja Annotations to AWS S3. […]

    Reply
  2. Http://Www.askmetatron.Com/ says:
    April 23, 2020 at 3:43 am

    http://Www.askmetatron.Com/

    Migrating FileNet with Daeja Annotations to AWS S3 — Technology Services Group

    Reply

Leave a Reply to FileNet Migration to Amazon Web Services – Why now more than ever? Cancel reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Primary Sidebar

Search

Related Posts

  • Alfresco Consulting – Documentum Disruptor #2
  • 11 Billion Documents, 12 Months Later – Thoughts and best practices 1 year after our industry leading document benchmark.
  • Amazon S3 – Viewing content fast and securely in-browser with the Alfresco Enterprise Viewer
  • Alfresco – Do More with OpenAnnotate
  • FileNet AWS Cloud Native? Thoughts on recent announcement
  • FileNet Migration – Not as hard as you think?
  • FileNet Support – Migrating to mitigate the risk of an unsupportable product
  • FileNet Migrations – Best Practices for Large Migrations
  • File Formats Lessons Learned – Legacy ECM Migrations
  • FileNet COLD Migration – Cracking the proprietary format issue

Recent Posts

  • Alfresco Content Accelerator and Alfresco Enterprise Viewer – Improving User Collaboration Efficiency
  • Alfresco Content Accelerator – Document Notification Distribution Lists
  • Alfresco Webinar – Productivity Anywhere: How modern claim and policy document processing can help the new work-from-home normal succeed
  • Alfresco – Viewing Annotations on Versions
  • Alfresco Content Accelerator – Collaboration Enhancements
stacks-of-paper

11 BILLION DOCUMENT
BENCHMARK
OVERVIEW

Learn how TSG was able to leverage DynamoDB, S3, ElasticSearch & AWS to successfully migrate 11 Billion documents.

Download White Paper

Footer

Search

Contact

22 West Washington St
5th Floor
Chicago, IL 60602

inquiry@tsgrp.com

312.372.7777

Copyright © 2022 · Technology Services Group, Inc. · Log in

This website uses cookies to improve your experience. Please accept this site's cookies, but you can opt-out if you wish. Privacy Policy ACCEPT | Cookie settings
Privacy & Cookies Policy

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these cookies, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may have an effect on your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. This category only includes cookies that ensures basic functionalities and security features of the website. These cookies do not store any personal information.
Non-necessary
Any cookies that may not be particularly necessary for the website to function and is used specifically to collect user personal data via analytics, ads, other embedded contents are termed as non-necessary cookies. It is mandatory to procure user consent prior to running these cookies on your website.
SAVE & ACCEPT