• Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar
  • Skip to footer
TSB Alfresco Cobrand White tagline

Technology Services Group

  • Home
  • Products
    • Alfresco Enterprise Viewer
    • OpenContent Search
    • OpenContent Case
    • OpenContent Forms
    • OpenMigrate
    • OpenContent Web Services
    • OpenCapture
    • OpenOverlay
  • Solutions
    • Alfresco Content Accelerator for Claims Management
      • Claims Demo Series
    • Alfresco Content Accelerator for Policy & Procedure Management
      • Compliance Demo Series
    • OpenContent Accounts Payable
    • OpenContent Contract Management
    • OpenContent Batch Records
    • OpenContent Government
    • OpenContent Corporate Forms
    • OpenContent Construction Management
    • OpenContent Digital Archive
    • OpenContent Human Resources
    • OpenContent Patient Records
  • Platforms
    • Alfresco Consulting
      • Alfresco Case Study – Canadian Museum of Human Rights
      • Alfresco Case Study – New York Philharmonic
      • Alfresco Case Study – New York Property Insurance Underwriting Association
      • Alfresco Case Study – American Society for Clinical Pathology
      • Alfresco Case Study – American Association of Insurance Services
      • Alfresco Case Study – United Cerebral Palsy
    • HBase
    • DynamoDB
    • OpenText & Documentum Consulting
      • Upgrades – A Well Documented Approach
      • Life Science Solutions
        • Life Sciences Project Sampling
    • Veeva Consulting
    • Ephesoft
    • Workshare
  • Case Studies
    • White Papers
    • 11 Billion Document Migration
    • Learning Zone
    • Digital Asset Collection – Canadian Museum of Human Rights
    • Digital Archive and Retrieval – ASCP
    • Digital Archives – New York Philharmonic
    • Insurance Claim Processing – New York Property Insurance
    • Policy Forms Management with Machine Learning – AAIS
    • Liferay and Alfresco Portal – United Cerebral Palsy of Greater Chicago
  • About
    • Contact Us
  • Blog

Documentum Cross-Repository Searching – an integrated open source approach

You are here: Home / Documentum / Documentum Cross-Repository Searching – an integrated open source approach

January 15, 2014

We just completed a very interesting integration for one of our clients to enable searching across multiple Documentum repositories.  The client was struggling with the Federated Search from Documentum and was looking for a better solution to access two separate Documentum instances with a cleaner search.  This post will detail our approach and some of the benefits of leveraging an integrated open source solution over a packaged solution from Documentum.

World Browse but limited Read Access

One interesting requirement was that, for most content, all of the users would be able to find the document in search, but may or may not be able to view the content.  This requirement was difficult to accomplish with standard Documentum as we didn’t want to update an already complex ACL structure (the client has pretty involved ACLs) to world Browse as well as add users to each docbase.

The TSG solution was constructed in such a way as to allow all users to see search results for all documents in the repositories, but upon clicking on the requested document, either the PDF rendition of the document would be launched or, if the user didn’t have read access, the user would be advised how to request read access.

Single Sign-on

Another interesting element was how the company’s current single sign-on could be leveraged to access the read-only listing and then leveraged again to access the document itself.  The process went as follows:

  • User would log on to the company environment as they normally would.
  • The user would select the new cross repository search that would not require a Documentum login.
  • The user would search across multiple repositories and receive a listing of documents.
  • If the user selected a document, the cross repository search would attempt to log the user into Documentum via the single sign-on and retrieve the PDF document in a browser if the user had access.  If the user wanted to edit the native content or other Documentum function, the browser contained a deep link to the Webtop location of the document and Webtop could be launched in another browser window.
  • If the user did not have access to the Documentum instance they would be advised how to obtain access.

Cross Repository Infrastructure and Approach

The components for the cross repository search included the following items:

  • Lucene/Solr – we installed a separate instance of Lucene/Solr to hold both the meta-data and full-text search indexes.
  • OpenMigrate  – was responsible for indexing all the content from the repositories for the initial Lucene index as well as ongoing indexing of new or revised content.
  • HPI  – was leveraged for the configurable search interface for retrieval as well as passing the link to Webtop instances.

Cross Repository Search

Advantages to the approach

The approach has a couple of key advantages over leverage of traditional Federated Search components from Documentum including:

  • User Friendly Search Interface – We have discussed this on this forum many times but the typical “build a search” Documentum search from Webtop or other Documentum tools is not very user friendly.  See related post comparing Documentum Webtop, D2, XCP and HPI.
  • Webtop Isolation – With the approach, nothing in the multiple Webtop instances need to change in any way.
  • Retrieval Only/Deep Dive into Webtop only when needed – The Federated Search approach from Documentum assumes an author role that someone would want to search and edit documents so they get Webtop or other interfaces in their full-function mode.  Most users are more interested in quick retrieval and can be hampered by the full-featured interfaces.
  • Performance – We have seen this many times with our complete Caching model, but typically performance of Webtop and Documentum in general can be tied back to expensive queries.  By leveraging a separated Lucene Instance, Cross Repository searches are isolated from Documentum without any performance impacts.
  • Future Expansion – Other Repositories – While the focus of the first implementation was on multiple Documentum instances, the environment can be configured to capture and index multiple non-Documentum documents as well.
  • Full Text Indexing – The ability to conduct full text searches is not lost in the cross-repository search.  Open Migrate  triggers an indexing event in Solr every time a document is created, modified, or deleted in Documentum so that index captures both full text in addition to all document metadata.

Summary

Leveraging an integration and open source approach for cross-repository searching has several unique and interesting advantages over full-function tools from Documentum regarding user interface, performance and expansion capabilities.

Let us know your thoughts below:

Filed Under: Documentum, Lucene, OpenContent Management Suite, OpenMigrate, Product Suite, Webtop

Reader Interactions

Leave a Reply Cancel reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Primary Sidebar

Search

Related Posts

  • Documentum 6.5 Upgrade – Character Encoding Issues
  • Documentum – Top 12 Tips
  • Documentum Search – Why the Google appliance just doesn’t cut it
  • Documentum – Replacing External SharePoint Sites with a Simple Cached Approach
  • Documentum Full Text Search with Lucene – Honoring ACL Security
  • TSG Open Source Product Plans
  • Redaction for AWS, Alfresco, Documentum and Hadoop – Bulk Redaction upon Ingestion or Migration
  • TSG Announces Creation of Hadoop Practice
  • Hadoop – Why Hadoop as a Content Store when Caching Content for ECM Consumers
  • Documentum – Momentum EMC World 2014 Recap – Some bunts, hits as well as some swings

Recent Posts

  • Alfresco Content Accelerator and Alfresco Enterprise Viewer – Improving User Collaboration Efficiency
  • Alfresco Content Accelerator – Document Notification Distribution Lists
  • Alfresco Webinar – Productivity Anywhere: How modern claim and policy document processing can help the new work-from-home normal succeed
  • Alfresco – Viewing Annotations on Versions
  • Alfresco Content Accelerator – Collaboration Enhancements
stacks-of-paper

11 BILLION DOCUMENT
BENCHMARK
OVERVIEW

Learn how TSG was able to leverage DynamoDB, S3, ElasticSearch & AWS to successfully migrate 11 Billion documents.

Download White Paper

Footer

Search

Contact

22 West Washington St
5th Floor
Chicago, IL 60602

inquiry@tsgrp.com

312.372.7777

Copyright © 2023 · Technology Services Group, Inc. · Log in

This website uses cookies to improve your experience. Please accept this site's cookies, but you can opt-out if you wish. Privacy Policy ACCEPT | Cookie settings
Privacy & Cookies Policy

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these cookies, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may have an effect on your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. This category only includes cookies that ensures basic functionalities and security features of the website. These cookies do not store any personal information.
Non-necessary
Any cookies that may not be particularly necessary for the website to function and is used specifically to collect user personal data via analytics, ads, other embedded contents are termed as non-necessary cookies. It is mandatory to procure user consent prior to running these cookies on your website.
SAVE & ACCEPT