Technology Services Group

Hadoop – OpenContent/HPI Product Plans

January 28, 2015

The first step in supporting all of the TSG products on Hadoop is building our OpenContent REST Web Services layer to access Hadoop in the same manner we access Documentum, Alfresco and other content management systems.  This post will present our plans and timelines for OpenContent along with associated TSG solutions.

OpenContent for Hadoop – Phase 1 – Minimum Viable Product

Phase 1 focuses on a minimum viable product that supports content migration with OpenMigrate as well as basic access from HPI.  As a first vertical solution, we are targeting the functionality required for our insurance solution, which includes:

  • Add Documents and Metadata – index full text and metadata with Solr
  • Retrieval of Documents
  • Delete Documents
  • Update Documents (no versioning yet)
  • Annotate with OpenAnnotate
  • Rendition to PDF
  • Searching (using Solr as we mentioned in our previous blog)

To enable these capabilities, we are supporting the following web service calls:

  • createDocument
  • readDocument
  • updateDocument
  • deleteDocument
  • getProperties
  • addRelation
  • addRendition
  • removeRendition
  • getContentFormats
  • search
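
To make the shape of these calls concrete, here is a minimal sketch of a subset of them as an in-memory stand-in. It is not the OpenContent implementation: the class name, the snake_case method names, the object id format and the parameter lists are all assumptions made for illustration, and the exact-match `search` only stands in for the Solr query the real service would run.

```python
import itertools


class InMemoryOpenContent:
    """Hypothetical in-memory stand-in for some of the Phase 1 calls."""

    def __init__(self):
        self._docs = {}                  # object id -> {"content", "properties"}
        self._ids = itertools.count(1)   # assumed id scheme, for illustration only

    def create_document(self, content, properties):
        object_id = f"doc-{next(self._ids)}"
        self._docs[object_id] = {"content": content, "properties": dict(properties)}
        return object_id

    def read_document(self, object_id):
        return self._docs[object_id]["content"]

    def get_properties(self, object_id):
        # Return a copy so callers cannot mutate stored metadata.
        return dict(self._docs[object_id]["properties"])

    def update_document(self, object_id, content=None, properties=None):
        # Phase 1 has no versioning, so an update overwrites in place.
        doc = self._docs[object_id]
        if content is not None:
            doc["content"] = content
        if properties:
            doc["properties"].update(properties)

    def delete_document(self, object_id):
        del self._docs[object_id]

    def search(self, **criteria):
        # Exact-match metadata filter; a stand-in for a Solr query.
        return sorted(
            object_id
            for object_id, doc in self._docs.items()
            if all(doc["properties"].get(k) == v for k, v in criteria.items())
        )


repo = InMemoryOpenContent()
claim = repo.create_document(b"claim photo", {"claimNumber": "C-100", "docType": "estimate"})
repo.update_document(claim, properties={"docType": "invoice"})
matches = repo.search(docType="invoice")
```

Keeping the same small set of operations in front of every repository is what lets a client such as HPI or OpenMigrate stay unchanged when the back end is swapped.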

As of today (01/28/2015) we are feature complete for Search/Retrieval and Add/Delete.  Plans for the upcoming weeks include:

  • Addition of Transformation Server – on other platforms we typically rely on the vendor for this, so for Hadoop we are building our own.  It uses open source libraries to perform the transformations, and we might make it available to other users.
  • Addition of PDF Annotations with OpenAnnotate

OpenContent for Hadoop – Phase 2 – Versioning, Lifecycle and Security

The second phase of OpenContent will add versioning, lifecycles, and security.  Some of the unique options we are planning for Hadoop include:

  • Different properties supported on different versions.
  • Version Tree/Numbering consistent with Compliance Solution
  • ACLs, LDAP groups, as well as some content specific security
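
As a rough illustration of the ACL and group item above, the sketch below resolves a user's effective permission from an ACL keyed by user or group name. Everything here is assumed for the example: the permission levels, the "highest grant wins" rule, and the group names. The user's LDAP groups are taken as already looked up elsewhere.

```python
from enum import IntEnum


class Permission(IntEnum):
    # Hypothetical permission levels, ordered from weakest to strongest.
    NONE = 0
    READ = 1
    WRITE = 2
    DELETE = 3


def effective_permission(acl, user, groups):
    """Return the highest permission granted to the user directly or via a group."""
    principals = [user, *groups]
    return max(acl.get(principal, Permission.NONE) for principal in principals)


# ACL for one document: principal name (user or group) -> granted permission.
acl = {"claims-adjusters": Permission.WRITE, "auditors": Permission.READ}

adjuster_level = effective_permission(acl, "jsmith", ["claims-adjusters", "auditors"])
outsider_level = effective_permission(acl, "guest", [])
```

Other designs are possible, for example explicit deny entries that override grants; this sketch only shows the simplest additive model.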

We are planning for this phase to be complete by March ’15.  The additional OpenContent services required will include:

  • checkout
  • checkin
  • cancelCheckout
  • getLockOwner
  • getDocumentVersion
  • getAllVersions
  • setPermissions
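
The checkout and versioning calls listed above could behave roughly as in the sketch below. This is an assumption-laden illustration, not the planned implementation: the class, the snake_case method names and the lock rules are hypothetical, and version labels are supplied by the caller because the numbering scheme is not specified here. Each version stores its own property dictionary, which is one way to allow different properties on different versions.

```python
class LockError(Exception):
    """Raised when a checkout lock is missing or held by someone else."""


class VersionedDocument:
    def __init__(self, content, properties, label="1.0"):
        self._versions = [
            {"label": label, "content": content, "properties": dict(properties)}
        ]
        self._lock_owner = None

    def checkout(self, user):
        if self._lock_owner is not None:
            raise LockError(f"already checked out by {self._lock_owner}")
        self._lock_owner = user

    def cancel_checkout(self, user):
        self._require_lock(user)
        self._lock_owner = None

    def checkin(self, user, content, properties, label):
        # Only the lock holder may add a version; checkin releases the lock.
        self._require_lock(user)
        self._versions.append(
            {"label": label, "content": content, "properties": dict(properties)}
        )
        self._lock_owner = None

    def get_lock_owner(self):
        return self._lock_owner

    def get_document_version(self, label):
        for version in self._versions:
            if version["label"] == label:
                return version
        raise KeyError(label)

    def get_all_versions(self):
        return [version["label"] for version in self._versions]

    def _require_lock(self, user):
        if self._lock_owner != user:
            raise LockError(f"{user} does not hold the lock")


doc = VersionedDocument(b"draft", {"title": "SOP-12"})
doc.checkout("alice")
doc.checkin("alice", b"revised", {"title": "SOP-12", "reviewer": "bob"}, label="1.1")
labels = doc.get_all_versions()
```

Here version 1.1 carries a `reviewer` property that version 1.0 never had, without any schema change to the earlier version.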

OpenContent for Hadoop – Future Phases

  • Future plans will include full support for all our solutions:
    • Document Control and Compliance Solution for Regulated Industries
    • Digital Archive and Retrieval Solution
    • Policy and Claim Solution for Insurance
    • Engineering Construction Management Solution
    • Corporate Form Solution

Let us know your thoughts in the comments below.

Filed Under: Hadoop, HBase, Lucene, OpenAnnotate, OpenContent Management Suite, Product Suite, R&D Tagged With: ECM, Hadoop, HPI, OpenAnnotate, OpenContent

Comments

  1. Rob Lancaster says

    January 29, 2015 at 5:25 pm

    Hadoop as a document store is a natural progression for those early adopter companies that are starting to standardize on HDFS as the file system within an enterprise data hub. Increasingly, Hadoop users are recognizing the power of the big data platform as a “single source of the truth” for enterprise information, not just data, and not just things like log files. That said, I believe this is an opportunity for the ECM vendors that are agile enough to understand that they have a ton of value to add on top of document storage. It's early days for sure, but the Hadoop ecosystem is evolving so quickly it will be fun to watch…
