• Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar
  • Skip to footer
TSB Alfresco Cobrand White tagline

Technology Services Group

  • Home
  • Products
    • Alfresco Enterprise Viewer
    • OpenContent Search
    • OpenContent Case
    • OpenContent Forms
    • OpenMigrate
    • OpenContent Web Services
    • OpenCapture
    • OpenOverlay
  • Solutions
    • Alfresco Content Accelerator for Claims Management
      • Claims Demo Series
    • Alfresco Content Accelerator for Policy & Procedure Management
      • Compliance Demo Series
    • OpenContent Accounts Payable
    • OpenContent Contract Management
    • OpenContent Batch Records
    • OpenContent Government
    • OpenContent Corporate Forms
    • OpenContent Construction Management
    • OpenContent Digital Archive
    • OpenContent Human Resources
    • OpenContent Patient Records
  • Platforms
    • Alfresco Consulting
      • Alfresco Case Study – Canadian Museum of Human Rights
      • Alfresco Case Study – New York Philharmonic
      • Alfresco Case Study – New York Property Insurance Underwriting Association
      • Alfresco Case Study – American Society for Clinical Pathology
      • Alfresco Case Study – American Association of Insurance Services
      • Alfresco Case Study – United Cerebral Palsy
    • HBase
    • DynamoDB
    • OpenText & Documentum Consulting
      • Upgrades – A Well Documented Approach
      • Life Science Solutions
        • Life Sciences Project Sampling
    • Veeva Consulting
    • Ephesoft
    • Workshare
  • Case Studies
    • White Papers
    • 11 Billion Document Migration
    • Learning Zone
    • Digital Asset Collection – Canadian Museum of Human Rights
    • Digital Archive and Retrieval – ASCP
    • Digital Archives – New York Philharmonic
    • Insurance Claim Processing – New York Property Insurance
    • Policy Forms Management with Machine Learning – AAIS
    • Liferay and Alfresco Portal – United Cerebral Palsy of Greater Chicago
  • About
    • Contact Us
  • Blog

Alfresco Bulk Import Tool Versus OpenMigrate – What are some of the differences?

You are here: Home / Alfresco / Alfresco Bulk Import Tool Versus OpenMigrate – What are some of the differences?

August 21, 2017

TSG has been conducting a number of large migrations for clients (upwards of 1 billion documents) with our OpenMigrate software product.  Alfresco offers the Bulk Import Tool as a free option for importing content into Alfresco.    While OpenMigrate with its direct file linking option incorporates all of the performance components of the Alfresco Bulk Import Tool, many times clients want to understand all of the differences between the two options.  This post will describe the differences between OpenMigrate and the Alfresco Bulk Import Tool for large file migrations.

Similarities Between Alfresco Bulk Import Tool and OpenMigrate

Both the Alfresco Bulk Import Tool and OpenMigrate support the concept of storing the file in the Alfresco content store and then “linking” the content location in Alfresco to dramatically improve content ingestion throughput.  This content linking process is also referred to an “in-place” load.  Content linking improves performance because it eliminates the need to stream content into Alfresco using the Alfresco API.  As performance is very tied to environment, both tools have the capability to hit 200 documents/second.  Alfresco recently reported a bulk load on AWS that achieved up to 500 documents/second.

OpenMigrate also supports direct content linking when migrating to Alfresco in order to optimize migration performance.

Key Differences between Alfresco Bulk Import Tool and OpenMigrate

To understand the key differentiators between the Alfresco Bulk Import Tool and OpenMigrate requires an understanding of how both tools were initially developed.  OpenMigrate was first developed by TSG to assist clients with Documentum migration and upgrade efforts.  As a migration tool rather than just an import tool, OpenMigrate contains both source and target components for not only extracting content and metadata from a variety of platforms (Documentum, Alfresco, FileNet, OpenText, SQL database…) but also target adapters for loading content into Alfresco, Documentum, Hadoop and Solr.  The Alfresco Bulk Import Tool was built more recently as an Alfresco Community project to be a fast way to load documents into Alfresco.  Based on their development histories, major differences between the two tools include:

  • Source Adapters – OpenMigrate has a full suite of source adapters for extracting documents from other repositories, including Documentum, FileNet, OpenText, SQL database, file system, Hummingbird, XML, and others. Content can be exported to a filesystem or database or can be real-time migrated to an Alfresco repository.  The Alfresco Bulk Import tool can only migrate content from a file system and has no other source adapter capabilities.
  • Types of Migrations – Because of both the source and target adapters, OpenMigrate can support a variety of different migration scenarios, including big bang, delta, hybrid, and on-demand/rolling. See our Webinar with Alfresco on migrating from Documentum to Alfresco for a more detailed understanding.  The Alfresco Bulk Import Tool only supports a big bang migration scenario.
  • Folder Structure – Built to quickly import documents, the Alfresco Bulk Import Tool assumes the folder structure that the files are placed in prior to import will be the folder structure that the objects will be loaded into in Alfresco. OpenMigrate allows for remapping of the folder structure or to create the folder structure on the fly based on other metadata.
  • Metadata – the Alfresco Bulk Import Tool can only pull metadata from a flat XML file (one per document to be imported) with very specific structure that must sit next to the content file. OpenMigrate can pull metadata from many different sources, including database tables, XML, Excel/CSV. OpenMigrate can also perform transformations on the metadata using its mapping layer.
  • Object Store – the Alfresco Bulk Import Tool can only do “in-place” migrations for content that’s in a filesystem-based Alfresco content store. OpenMigrate can do in-place migrations for content that is in an Alfresco filesystem-based store as well as other content store types, like S3 and Hitachi.
  • Fault Tolerance – the Alfresco Bulk Import Tool stops if a failure occurs at any point during the import process. The problem has to be fixed and then the bulk load must be run again. OpenMigrate tracks migrations failures but continues to run until all documents have been migrated, tracking any migration errors in database tables or log files.
  • Logging – OpenMigrate has a more sophisticated logging mechanisms, including the ability to log to CSV and/or database table. The Alfresco Bulk Import Tool provides only minimal logging to via Log4J.
  • Contentless Objects – OpenMigrate supports the migration of contentless objects, something not supported by the Alfresco Bulk Import Tool.
  • Version Numbering – the Alfresco Bulk Import Tool only supports major versioning (1.0, 2.0, 3.0) when migrating multiple versions. OpenMigrate supports migration any combination of major/minor versions (1.0, 1.1, 2.0, 3.0, 3.1) and can be customized to version documents that already exist, creating the version tree from initial to final.  See our latest post on TSG Chain Versioning for Alfresco in regards to additional capabilities that will affect migration speed.
  • Server Requirements – the Alfresco Bulk Import Tool must run directly within the Alfresco JVM. OpenMigrate can be run either as an embedded subsystem in the Alfresco JVM, or externally on a remote JVM using CMIS.
  • Renditions – the Alfresco Bulk Import Tool relies on the Alfresco Transformation server to create PDF renditions of the migrated content. By default, the Alfresco Transformation server converts documents synchronously during the migration, slowing migration times considerably.  TSG has created an add-on to OpenMigrate to move the Alfresco transformations to an asynchronous process to avoid a migration slow-down.

Summary

The Alfresco Bulk Import Tool  is a simple and efficient tool for moving documents from a file system into a simple structure in Alfresco.  OpenMigrate provides additional features and supports a variety of migration sources and scenarios appropriate for more complex migrations.

Let us know your thoughts below.

Filed Under: Alfresco, Migrations, OpenMigrate

Reader Interactions

Trackbacks

  1. ECM Migrations – 10 Reasons there will never be an “Easy Button” for migrations says:
    June 13, 2018 at 2:10 pm

    […] before the full production migration.  For example, as we pointed out earlier this year, Alfresco’s Bulk Import does not have an error logging capability or the ability to continue the migration if an error is […]

    Reply

Leave a Reply Cancel reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Primary Sidebar

Search

Related Posts

  • Alfresco – Do More with OpenMigrate Services
  • FileNet Migration – Recorded Alfresco/TSG Webinar – 05/29/2019
  • Migrating to Alfresco – Reducing Risk, Stress and Cost with a Rolling Migration
  • Redaction for AWS, Alfresco, Documentum and Hadoop – Bulk Redaction upon Ingestion or Migration
  • DocFinity – Migrating to Amazon S3
  • Alfresco Solutions of the Year 2017 – TSG wins Alfresco award for sixth year in a row
  • Data Visualization Dashboard for ECM Migrations
  • New York Philharmonic Throwback – Take A Trip Through 174 Years Of Classical Music
  • OpenText Migration to Alfresco – Best Practices with OpenMigrate
  • FileNet Migration to Alfresco with OpenMigrate

Recent Posts

  • Alfresco Content Accelerator and Alfresco Enterprise Viewer – Improving User Collaboration Efficiency
  • Alfresco Content Accelerator – Document Notification Distribution Lists
  • Alfresco Webinar – Productivity Anywhere: How modern claim and policy document processing can help the new work-from-home normal succeed
  • Alfresco – Viewing Annotations on Versions
  • Alfresco Content Accelerator – Collaboration Enhancements
stacks-of-paper

11 BILLION DOCUMENT
BENCHMARK
OVERVIEW

Learn how TSG was able to leverage DynamoDB, S3, ElasticSearch & AWS to successfully migrate 11 Billion documents.

Download White Paper

Footer

Search

Contact

22 West Washington St
5th Floor
Chicago, IL 60602

inquiry@tsgrp.com

312.372.7777

Copyright © 2023 · Technology Services Group, Inc. · Log in

This website uses cookies to improve your experience. Please accept this site's cookies, but you can opt-out if you wish. Privacy Policy ACCEPT | Cookie settings
Privacy & Cookies Policy

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these cookies, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may have an effect on your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. This category only includes cookies that ensures basic functionalities and security features of the website. These cookies do not store any personal information.
Non-necessary
Any cookies that may not be particularly necessary for the website to function and is used specifically to collect user personal data via analytics, ads, other embedded contents are termed as non-necessary cookies. It is mandatory to procure user consent prior to running these cookies on your website.
SAVE & ACCEPT