• Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar
  • Skip to footer
TSB Alfresco Cobrand White tagline

Technology Services Group

  • Home
  • Products
    • Alfresco Enterprise Viewer
    • OpenContent Search
    • OpenContent Case
    • OpenContent Forms
    • OpenMigrate
    • OpenContent Web Services
    • OpenCapture
    • OpenOverlay
  • Solutions
    • Alfresco Content Accelerator for Claims Management
      • Claims Demo Series
    • Alfresco Content Accelerator for Policy & Procedure Management
      • Compliance Demo Series
    • OpenContent Accounts Payable
    • OpenContent Contract Management
    • OpenContent Batch Records
    • OpenContent Government
    • OpenContent Corporate Forms
    • OpenContent Construction Management
    • OpenContent Digital Archive
    • OpenContent Human Resources
    • OpenContent Patient Records
  • Platforms
    • Alfresco Consulting
      • Alfresco Case Study – Canadian Museum of Human Rights
      • Alfresco Case Study – New York Philharmonic
      • Alfresco Case Study – New York Property Insurance Underwriting Association
      • Alfresco Case Study – American Society for Clinical Pathology
      • Alfresco Case Study – American Association of Insurance Services
      • Alfresco Case Study – United Cerebral Palsy
    • HBase
    • DynamoDB
    • OpenText & Documentum Consulting
      • Upgrades – A Well Documented Approach
      • Life Science Solutions
        • Life Sciences Project Sampling
    • Veeva Consulting
    • Ephesoft
    • Workshare
  • Case Studies
    • White Papers
    • 11 Billion Document Migration
    • Learning Zone
    • Digital Asset Collection – Canadian Museum of Human Rights
    • Digital Archive and Retrieval – ASCP
    • Digital Archives – New York Philharmonic
    • Insurance Claim Processing – New York Property Insurance
    • Policy Forms Management with Machine Learning – AAIS
    • Liferay and Alfresco Portal – United Cerebral Palsy of Greater Chicago
  • About
    • Contact Us
  • Blog

OpenContent Solr Services – New TSG Product Offering

You are here: Home / Alfresco / OpenContent Solr Services – New TSG Product Offering

March 13, 2018

As part of our campaign to make Documentum better, TSG is announcing the formalization of our OpenContent Solr Services as well as some new features that we are adding to our work with Solr.  We have been working with Lucene and later Solr for over 12 years but have recently discovered several client scenarios and opportunities that are driving our expansion of our Solr capabilities for Documentum, Alfresco, Hadoop and other repositories.  This post will detail the different scenarios as well as our initial OpenContent Solr Services roadmap.

OpenContent Solr Services – Scenarios

Our initial work with Lucene and later Solr came from our work with Alfresco where Solr is bundled into the Alfresco repository.  For Documentum clients, we have been implementing a publishing approach to push content out of Documentum with full-text and meta-data pushed to a Solr repository for consumer access for over 10 years.  As stated here many times, the publishing approach for consumers provides many benefits regarding fault tolerance, performance and user experience.  Building on our experience, we are currently building our solutions to address additional scenarios including:

  • One to Many Indices – As clients look for maximum performance and security for data separation, one giant Solr or full-text index for all content in the repository can cause performance and security concerns. OpenContent Solr Services will provide the ability to create and manage separate, small or large potentially sharded indices tailored to the client’s specific scenario where access is controlled to the indices itself. Document access can either be to a published copy, direct access to object store or via the ECM itself depending on security needs.
  • Documentum Clients – Documentum clients have struggled with xPlore, an old xDB/Lucene mis-mash of products that hasn’t been updated to Solr. OpenContent Solr services will provide an alternative for xPlore to allow Documentum modern tools for efficient searching provided clients leverage the OpenContent Management Suite Interface. Documentum clients can either pick one direct replacement for xPlore as well as multiple indices for performance, security, and other scenarios.
  • Multi-Tenant Clients – Whether Documentum, Alfresco, Hadoop or other repositories, clients have always struggled with allowing access to a large repository but implementing the security to allow clients to perform efficient searches against only their content. OpenContent Solr Services will allow for indices to be created given client requirements that offer maximum performance while protecting documents from other parties.
  • Consumer Access – OpenContent Web Services will include the ability to publish the meta-data and full-text to a Solr index and either the content itself (typically PDF) or a link to an object store for consumers. For an energy client, TSG is currently publishing 12 plant subsets of documents from a large Documentum instance to 12 individual plants for performance and fault tolerance.  While TSG has been providing this service for years, new features added to OpenAdmin will allow customers to better monitor and manage the separate indices.
  • Multiple Repositories – OpenContent Solr Services will provide the ability to publish content to a single index from more than one repository with either access to published content or back to the source repository. For clients struggling with multiple authoring applications, OpenContent Solr Services will provide a consistent secure search interface with different indices created for different client scenarios.
  • Data Scientists – OpenContent Solr Services would provide access to the Solr Admin tool or any other tool that supports Solr. Data Scientists can set up any indices for their analysis “out of the box”.
  • Administration – Leveraging OpenAdmin, administrators will have the ability to create and maintain the many indices as well as create new indices when necessary.

OpenContent Solr Services – Offering components

OpenContent Solr Services are made of up several of our existing products.  The architecture could look something similar to the following:

Components of the solution and their capabilities include:

  • OpenMigrate – would be used to create and maintain the Solr index. OpenMigrate already provides an enterprise level high performance multi-threaded migration platform to move content between repositories.  OpenMigrate can be configured to build the initial indices as well as monitor the repository to update the indices on an interval basis when new content is added.
  • OpenContent Web Services – OpenContent Web Services is already used to query and update Documentum, Alfresco, Hadoop and Solr for add/delete capabilities. Additional capabilities would update the 1 to many Solr indices for add/update/delete in Documentum, Alfresco or Hadoop repository to maintain an up to date Solr indices.
  • OpenContent Management Suite – Would provide configurable access to the Solr indices as well as ECM repositories or cached content.
  • OpenAdmin – would provide the ability to identify both 1 to many indexes as well as identify properties of how the index would be accessed (security). Configurations could include whether the content is cached (published), accessed directly to the object store (add link) or accessed via the ECM repository.

Summary

Maintaining one “do all” index for a repository can result in security, performance and maintenance concerns.  OpenContent Solr Services adds administration, creation and maintenance of 1 to many Solr indices for addressing a variety of different user and performance scenarios.  Components of the solution have been in production for years at different clients.  Look for more posts here as this solution matures over the coming year.

Filed Under: Alfresco, Documentum, Hadoop, OpenContent Management Suite, OpenMigrate, OpenSearch, Search

Reader Interactions

Trackbacks

  1. Documentum – How to make it better – 2018 Blog Series says:
    March 13, 2018 at 11:13 am

    […] Improving Search with TSG’s new OpenContent Solr Services product […]

    Reply
  2. Artificial Intelligence, Machine Learning and Robotic Process Automation – What are some of the opportunities for ECM? says:
    June 13, 2018 at 2:20 pm

    […] TSG has already been implementing many of the underlying requirements for Analytics with our Solr as well as Elastic Search add-ons for ECM.  Scenarios in the roadmap […]

    Reply
  3. DynamoDB and Hadoop/HBase as a Document Store — Technology Services Group says:
    February 26, 2020 at 4:54 pm

    […] If the ability to perform deep analytics on the attributes and full-text content of documents is a requirement, we typically recommend separate indexes in Solr/Elasticsearch targeted for the specific use cases that the data scientists are requesting. TSG no longer recommends one massive index for all of the attributes and full-text content for all documents in the entire repository as it can be problematic, especially if this is the same index that is being used by end users. See our thoughts on creating separate indexes with Solr this post. […]

    Reply
  4. TSG joins the Alfresco Family — Technology Services Group says:
    March 22, 2020 at 6:01 am

    […] TSG Elastic Stack and Solr capabilities for customers looking for […]

    Reply
  5. Content Service Platform Scaling - How Good Key Design and NoSQL can avoid the need for Elastic/Solr or other indexes — Technology Services Group says:
    April 16, 2020 at 6:16 am

    […] we have posted from our Solr and Elasticsearch practices, a much more scalable approach to search is to create separate indexes […]

    Reply

Leave a Reply Cancel reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Primary Sidebar

Search

Related Posts

  • Documentum – Top 12 Tips
  • Redaction for AWS, Alfresco, Documentum and Hadoop – Bulk Redaction upon Ingestion or Migration
  • Ephesoft Accounts Payable Solution for Alfresco, Documentum and Hadoop
  • Hadoop – Why Hadoop as a Content Store when Caching Content for ECM Consumers
  • Next Generation ECMS – Architecture Thoughts
  • Third Annual TSG Client Briefing – June 3rd – 2010
  • Content Service Platform Scaling – How Good Key Design and NoSQL can avoid the need for Elastic/Solr or other indexes
  • ECM 2.0 – Can you build it yourself?
  • Migrating to Alfresco – Reducing Risk, Stress and Cost with a Rolling Migration
  • Adobe Acrobat Alternative – Doing more with just a web browser

Recent Posts

  • Alfresco Content Accelerator and Alfresco Enterprise Viewer – Improving User Collaboration Efficiency
  • Alfresco Content Accelerator – Document Notification Distribution Lists
  • Alfresco Webinar – Productivity Anywhere: How modern claim and policy document processing can help the new work-from-home normal succeed
  • Alfresco – Viewing Annotations on Versions
  • Alfresco Content Accelerator – Collaboration Enhancements
stacks-of-paper

11 BILLION DOCUMENT
BENCHMARK
OVERVIEW

Learn how TSG was able to leverage DynamoDB, S3, ElasticSearch & AWS to successfully migrate 11 Billion documents.

Download White Paper

Footer

Search

Contact

22 West Washington St
5th Floor
Chicago, IL 60602

inquiry@tsgrp.com

312.372.7777

Copyright © 2023 · Technology Services Group, Inc. · Log in

This website uses cookies to improve your experience. Please accept this site's cookies, but you can opt-out if you wish. Privacy Policy ACCEPT | Cookie settings
Privacy & Cookies Policy

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these cookies, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may have an effect on your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. This category only includes cookies that ensures basic functionalities and security features of the website. These cookies do not store any personal information.
Non-necessary
Any cookies that may not be particularly necessary for the website to function and is used specifically to collect user personal data via analytics, ads, other embedded contents are termed as non-necessary cookies. It is mandatory to procure user consent prior to running these cookies on your website.
SAVE & ACCEPT