LinkedIn


Mike will be speaking about RDM and Master Data Management at Collaborate 2011.

Overview

This white paper focuses on the principals of Master Data Management (MDM) within the context of an overall Data Governance Strategy. It qualifies the value proposition MDM introduces while empowering organizations with the agility to be reactive in today’s economy. It dives into the principals of MDM using the framework of Oracle’s Data Relationship Manager (DRM) tool as a backdrop. In so doing, terms such as Versions, Hierarchies, Nodes, Properties, Validations, Verifications, and Exports are discussed.

Master Data Management Background

A key component to an organization’s Data Governance and Stewardship Strategy is Master Data Management (MDM).  Above all other information management disciplines, MDM promotes one of the highest value-add benefits.  Consequently, is it also one of the most resource intensive to implement.  MDM spans enterprise jurisdictions, political fiefdoms, organizational hierarchies, technical architectures, business processes, and applications.  No wonder the institution of an MDM strategy is as much about people and process as it is about technology.  And, despite their best efforts, IT often struggles with realizing the promise of MDM while at the same time not crippling the business with bureaucracy and inflexibility.  Assisting with this dilemma are technologies such as Oracle’s Data Relationship Manager (DRM). And, although data governance is not addressed by technology alone, DRM offers a foundation for establishing the disciplines, processes, and governance required to administer an Enterprise MDM Strategy.

What Is Master Data Management

Master Data Management (MDM) encompasses the people, processes, standards, and technology for ensuring the consistency and use of data throughout the enterprise and across business processes.  Said another way - “The ability to analyze data across business functions and business processes is dependent upon linking data from disparate systems, applications and external sources. This ability to effectively link data from disparate sources is dependent upon master data.” – Jonathan Wu, DM Review

When we breakdown the information administered by MDM principals we find the following metadata components:

  • Standard Naming Conventions
  • Data Definitions
  • Dimensions and/or Hierarchical Relationships
  • Properties
  • Mappings
  • Verifications/Validations
  • Versioning

Standard Naming Conventions. Naming standard for a master data element (node).  In most cases long and short names are represented.

Data Definitions. Technical and Business description of the node being represented.

Dimension and/or Hierarchical Relationships. Represents the relationship one node has with others within its context (Dimension).  This is usually denoted in a hierarchical fashion.

Property. Attributes that describe the nature of the node.  Properties are organized into categories and can be leveraged during data extraction, mapping, verifications, and validations.

Mappings. Describes the relationship a node has in its context (Dimension/Hierarchy) with node(s) in alternative Dimensions/Hierarchies.

Verifications/Validation. Pre-defined rules that are executed upon altering the state of a node or performed in batch.

Versioning. Records the history and lineage of changes made to the master data records.

The Current Challenge

Part of the problem is the lack of understanding as to the deliverables from an MDM Strategy, and more importantly articulating the positive affect they have on the business’ capability to execute.  In order to address this, one must first appreciate that most medium and all large organizations support multiple systems/application in order to conduct business.  A bank for example may have separate credit card systems from their deposit systems, and yet others for processing mortgages, and still more for servicing Home Equity Lines.  Even with the advent of the Enterprise Resource Planning (ERP) system, organizations must contend with capturing, storing, moving, integrating, and reporting information from multiple disparate systems. 

This challenge is compounded by the fact that many of these systems are owned by siloed business units, with different leaders, often with individualistic and in the worst case conflicting charges.  Additionally, the owners of these solutions are not rewarded for maintaining consistency across the organization and business processes.  In fact, they are often encouraged to “make an exception” and bypass policies in the interest of “getting the business what they need ASAP; no matter the cost.  We’ll fix it later…” – sound familiar?

This is not to suggest that exception handling is not are reality of doing business.  In truth, every MDM Strategy must have a process for dealing with exceptions to the governance process.  Unfortunately without a well vetted MDM process, exceptions become the norm and their employment becomes “just another day at the office”.  Here in lies the problem.  An exception should be a BIG DEAL.  One in which the proverbial alarms sound up and down the corporate corridors.  An extension of this occurrence is an “override” of the MDM processes administered by Oracle’s DRM solution.

Oracle’s Data Relationship Manager (DRM) as a Solution

Although master data challenges are not addressed by simply throwing technology at the problem, organizations are turning to software solutions to serve as the foundation for the establishment of the standards, disciplines, and governances required to maintain master data constructs. Oracle’s Data Relationship Manager (DRM) helps organizations solve the costly, labor-intensive problem of administering master data that is otherwise managed through phone calls, spreadsheets, and e-mails.  Ultimately, Oracle DRM empowers the Information Worker by enabling them to become a vital/integral part of the master data management process.

Key benefits introduced by Oracle’s DRM solution consist of:

  • Empowers business users to manage master data
  • Improves change management processes and communications
  • Ensures reporting integrity
  • Controls and secures master data across the enterprise
  • Establishes audit controls and accountability
  • Centralizes, automates, and simplifies complex business rule management
  • Delivers master data publishing, viewing, and downloading capabilities
  • Accelerates financial reporting and eliminate redundancies
  • Supports Versioning (pre-merger, reorganization, etc.)
  • Promotes validation & verification rules
  • Enforces balanced hierarchy (if required)
  • Allows business users to become direct contributors to the iterative process of managing complex, rapidly changing metadata

In short, DRM’s value lies in systematically defining a set of governances that allow for the dissemination of master data management outside of IT and push it into the hands of business owners.  Most of us in IT would suggest this proposition is “scary” at best and “suicide” at worst.  Why – Because the business does not always have a clear understanding of the implications associated with master data changes and the systems for which this information is subscribed.  But, with DRM the safeguards are systematically represented.  For example, a Knowledge Worker is not allowed to add a new account to the Oracle R12 GL Chart of Accounts (COA) master record unless they provide representation for every Segment Field and map the new account to a corresponding node in the Hyperion Planning COA master record.

Master Data Management – The Big Picture

Serving as the foundation of an organization’s Master Data Management Strategy, Oracle’s DRM solution has multiple components.  Each concept has relevance with regards to enforcing governance and oversight in the management of master data.  Some key terminologies are…

  • Dimensions, Hierarchies and Nodes: The master data “skeleton”.  Provides the context for which each master record (node) is defined and correlated with other nodes.
  • Versions: Identifies current versus prior manifestations of master records.  Tracks history and lineage of changes.
  • Properties and Property Categories: Qualifies and organizes metadata attributes that describe each node.
  • Validations and Verifications: Rules that enforce data standards, process alignment, and system integration.
  • Node Types: Special property that qualifies the disposition of the node from a business use perspective.  i.e. Account Node, Entity Node, Product Node, etc.
  • Security and Node Access Groups: Security provisioning that defines Group based access privileges to the master data 
  • Hierarchy Compares: Process by which one master data structure (Dimension) is compared with another whereby difference are reported.
  • Imports: Facility for importing master record layouts in mass.
  • Exports: Facility for publishing master data to subscribing systems.

As mentioned earlier, Oracle’s DRM solution is an excellent foundation for administering the disciplines necessary to support a Master Data Strategy, but this is only a component of the equation.  People, process, and standards are equally important.

Some guiding principles to consider when deploying Master Data Management principals consist of:

Process before technology. Business processes rather than technology are the driving factor to MDM solutions.

Business ownership. Solution should be owned and driven by the business with IT playing a supporting role in ensuring that best practice and standards are adhered to.

Incremental deployment. Do not underestimate the time and expertise needed to develop a “foundational” MDM Strategy. A transformational approach is warranted.  With the Goal State in mind, limit the scope of the initial deployment (i.e. Product Master Data).

Consider process impacts. Recognize that the application of MDM controls will bring up-front process performance penalties the dividends of which are realized strategically downstream.

Institute data governance policies and processes. Allow time and funding for people and process change management.  The “politics” of MDM (Data Governance) will pose a challenge.  For example - Finance may see the value of centralizing product master data management, but other factions may be reluctant to give up data ownership and control.

System impact analysis. Re-engineering Master Data structures will have an impact on critical processes and subscribing systems. Plan on a transition strategy that allows for scheduled and on-demand metadata synchronization.

Oracle DRM – Master Data Management Catalyst

When approaching Master Data Management initiatives using Oracle’s DRM technology, the solution is represented by four (4) Layers:

  1. Data Management Layer – Naming Standards, Hierarchies, Dimensions, Properties, Version Control, Mappings
  2. Business Logic Layer – Business Rules/Validations, Node Types
  3. System Interface Layer – Data Imports, Data Exports, Compares
  4. Governance – Security, Administration, Processes

As is so often the case with Master Data initiatives, IT and the Business often do not know what they want from the solution until it is shown to them (“playback”).  The DRM utility is intended not only as the foundation for the Master Data Strategy, but as a tool for supporting the iterative nature of requirements gathering, design, and development.  Within the context of each Layer, DRM promotes an iterative, graphical interface that facilitates the capture, integration, and playback of the business and technical requirements.

Is it conceivable to institute a Master Data Management Strategy in the absence of technology such as Oracle DRM?  Yes.  But, recognize that adoption is one of the largest hurdles to overcome with MDM initiative.  Leveraging a tool that intuitively facilitates the capture and presentation of standards and rules for which master data will be governed is an enormous benefit.  Also, realize that if the MDM Strategy includes the dissemination of master data control outside of IT, this vision will never be fully realized in the absence of the systemic controls and validations that can only be enforced with technology.   

In the end, there needs to be an appreciation that the institution of MDM principals is a journey for which there is never truly a destination.  As technical, business, economic, policy, and compliance mandates change, so will the need for your MDM solution to adapt.  Using technology such as Oracle’s DRM solution reduces project risk, accelerate MDM initiatives, and extend master data management capabilities into the enterprise

“Pre-Installation Requirements”

In installment #1 of this guide, we reviewed the architecture considerations and defined a simplistic architecture to use as a reference moving forward.  I recommend you read the previous post before you pick up this one.  I also recommend reading

Oracle Hyperion Enterprise Performance Management System Installation Start Here Release 11.1.1.2.pdf (128 pages)” from the Oracle Documentation Library.

To reiterate our general approach, the Hyperion architecture establishment and installation activities in our organization cover the following five areas.

  1. Defining an Architecture – Work with the client to define the hardware, software, and the distribution of Hyperion components
  2. Provide Pre-Installation Requirements – Provide the client with a detailed list of activities prior to the installation
  3. Installation – Running the installation and configuration utilities
  4. Validation – Perform all functional activities necessary to validate the environment readiness
  5. Documentation – Provide the client with all the details of the environment as it is configured.

In this post, I will go through step 2 that the Hyperion architect, should deliver.  Steps 3-5 will be available in the coming weeks.  For the sake of simplicity I will be using the example of a common installation, primarily Hyperion Planning, Hyperion Financial Management (HFM), and the core BI applications.

As part of any installation, some items need to take place before the Fusion Installer is started.  I like to create a checklist of things that need to be done.  Often times these things are out of my control and I will rely on Database Administrator, Network Administrators, and other System Administrators.  This checklist contains the following elements.

  • Web Application Server Specifications
  • Relational Repository Information
  • General System Administration
  • Network Information
  • Additional Components
  • DCOM Configuration
  • IIS and .NET Configuration

I’ll start with the Web Application Server Specification.  Once the web application server platform is chosen from the table below, the installation and configuration often falls on System Administrators.  Items such as clustering, system account management, and JVM setting are managed outside of the Hyperion installation.  Other times, I’ll get admin access and manage it myself.  The first item to do is to validate the application server is certified.  This is directly from Oracle Enterprise Performance Management System - Supported Platforms Matrices “Oracle Enterprise Performance Management System, Fusion Edition Release 11.1.1.2)” in the Oracle document library.  I recommend reviewing this document.  It can change from release to release.

 

Server

Notes

Oracle Application Server 10g (10.1.3.3.x) a

If Oracle Application Server is used as the Web application server, Oracle HTTP Server is also required.  Profitability and Cost Management supports only Oracle Application Server 10.1.3.x.

Oracle WebLogic Server 9.2 (MP1 minimum) / 9.2.xb

Shared Services requires WebLogic Server patch CR283953” for all platforms. You can obtain this patch at the BEA web site.

IBM WebSphere 6.1.0.17 / 6.1.x C

 


Embedded Java container d

 


a Supports these editions: Java, Standard One, Standard & Enterprise. Includes support for Oracle Application Server Single Sign-On.

b WebLogic Express is supported for each supported version of WebLogic Server; non-base versions are supported only via manual deployment.

c WebSphere Express, ND, and XD Editions are supported for each supported version of WebSphere; ND and XD are supported only via manual deployment.

d For this release, Apache Tomcat 5.5.17 is the embedded Java container that is installed automatically on all platforms. Apache Tomcat is supported only in this capacity. If future EPM System releases embed different Java application servers, Apache Tomcat will no longer be supported. For deployments that require high availability or failover, Oracle recommends using a commercially supported Web application server that supports high availability and failover.

I request the URL and authentication information since this will be needed during the deployment.  If I am doing a manual deployment, I will request contact information from the web application server administrator and work in collaboration on the deployment.

The next item on my checklist is to get the Relational Repositories Information set up.  This is mostly straightforward.  In general, I like to create a tablespace/database for each component ((Hyperion Foundation, Essbase Admin Services / Business Rules, EPMA, Planning, Financial Management, etc).  A distinct tablespace/database for each component makes it easier to manage in my opinion.  Although it may not be strictly necessary, the documentation does not seem to be clear on the matter.  I say ‘better safe than sorry’.  For the installation and deployment, I’ll need credentials for each table.  Based upon some Q&A, I’ll make initial size recommendations. 

The target installation servers have a General System Administration checklist containing the information that I’ll need to execute the installation.  This is made of the following items.

  • Operating Systems version/build
  • Account on each server to run the Hyperion services and account requirements
  • External Authentication information (MSAD, LDAP, or OID if applicable)
  • Drive/Volume information identified for installation of the Hyperion software.
  • DCOM and .NET account information if HFM or FDM are to be installed

Next, I identify the Network Information necessities for appropriate communication between servers.  This includes IP addresses, DNS information, validation of name resolution, trace between servers, subnet configuration, etc.  This is vital so the components can communicate via Fully Qualified Domain Name, Short Name, and IP address.  Some components use different variations of name resolution probably because the components were developed separately and have not been fully standardized.

In addition to the Hyperion Software, Web Application Servers, and Relational Repositories there are a few Additional Components that need to be installed.  A PDF writer is needed for Reports Server to render .pdf reports in Workspace.  This can be GhostScript or Acrobat distiller.  I suggest referring to the “Start Here” documentation to see what is currently supported but we often go with GhostScript due to its cost.

For the Windows Administration, we provide the DCOM Configuration information needed to support FDM, EPMA, and FDM.  This includes the DCOM account information, permissions, and authentication information.  Although this is spelled out in detail in the “Start Here” manual, I like to provide step-by-step information with screen shots since DCOM is often confusing…well it is to me at least.

The last thing we review is the IIS and .NET Configuration.  IIS is often not installed as part of a standard OS build.  We make sure this requirement is specified, ensuring .NET is installed, and validate it is the appropriate version.

As with any installation, I recommend the Installation Architect read, and re-read, the Hyperion Manuals on there own rather than relying on this information or intuition.  It can always change and your installation may have some caveats that I have not covered.  For our purposes, with all the above activities completed and validated, we should be ready to start laying out the binaries and start the Hyperion Installation.  We will review the Fusion Installer and Hyperion Configuration Utility in our next installment. 

 

Hyperion Enterprise Performance Management System Installation and Configuration Guide Release 11.1.1.2 (252 Pages) and Oracle Hyperion Enterprise Performance Management System Manual Deployment Guide Release 11.1.1.2 (160 Pages) are a wealth of knowledge.  I know because I have read a great deal of it and keep them quite close during each engagement.  Most of what you need to know is available in the documentation and a good deal of additional knowledge can be found on the support forums.  I do not want to rehash the specifics of the installation and configuration.  There are just too many variables to cover everything without reposting most of the already prepared material.  What I hope to provide here is just a primer on architecture considerations and general installation activities with this release.

The Hyperion architecture establishment and installation activities in our organization cover the following five areas.

  1. Defining an Architecture – Work with the client to define the hardware, software, and the distribution of Hyperion components
  2. Provide Pre-Installation Requirements – Provide the client with a detailed list of activities prior to the installation
  3. Installation – Running the installation and configuration utilities
  4. Validation – Perform all functional activities necessary to validate the environment readiness
  5. Documentation – Provide the client with all the details of the environment as it is configured.

In this post, I will go through step 1 that you, as the Hyperion architect, should deliver.  Steps 2-5 will be available in the coming weeks.  For the sake of simplicity I will be using the example of a common installation, primarily Hyperion Planning, Hyperion Financial Management (HFM), and the core BI applications.

Defining the architecture is somewhat difficult, especially if you do not have a good understanding of your organizations’ usage patterns or platform preferences.  Below are the core questions that need to be answered to define the necessary architecture.

How many servers will be needed?

The number of servers depends on a lot of things.  Below is a sample of some of the questions I typically ask when recommending the server infrastructure.

  • How many users do you have?
  • Will you be leveraging an existing Relational Database Platform?
  • What products did you purchase?
  • What products do your users use most frequently?
  • Will you be using loadbalancing?

Let’s assume the answer is 100 users, the client has an existing Oracle database I can leverage, the client purchased HFM and Planning, and the client uses a lot of Financial Reports.

With these assumptions, you will want to make sure the Workspace and Reporting platforms are a decent size and the reporting application services reside on a separate server from the application servers.  In addition, since you are going to leverage an existing relational database, you can plan on having a stand-alone data tier for Planning and Essbase.  Just to further simplify the matter, let’s assume there is no load balancing and clustering.  For our example discussion, I will assume 3 servers will be included in the architecture.

What would be the system specifications and Operating System?

Once again, the answer to this question can vary widely based on the answers below.

  • What operating systems does your organization support?
  • How long do you intend to use this hardware?
  • Will you be using virtual machines?
  • Will you be using shared or local disks?
  • How much data do you have now and what does your growth look like?

Keep in mind that HFM requires a Windows Server so I will need at least one of those.  It may be simpler from an administrative perspective to have a homogeneous OS infrastructure but it is not necessary.  If you have no preference, this is what I would typically go with.  If you have a propensity for some flavor of UNIX, then I would go that route where applicable.  Many organizations keep hardware in place for up to 3 years.  With that in mind, I think the best approach is to buy the most robust and powerful hardware you can afford.  The cost of migrating in 24 months will often be greater than the hardware spend.  VMware is quickly growing in popularity and I see deployments in virtual environments quite often with success.  Although Essbase does not require a physical server, I recommend that it not reside on virtual machines due to higher disk I/O.  It is also the application that is expecting the most growth.

What Web Application Server should we use?  Should it be Oracle Application Server 10g, WebSphere, WebLogic, or the Embedded Java container (Apache Tomcat 5.5.17)?

  • What does your organization support?
  • Do you have a license for anything?
  • Will you need clustering?

First, some comments in favor of the Apache Tomcat server.  It is free and Hyperion does do some development on that platform.  So far, I have found it to be stable in most situations.  If your organization has a Web Application group, it may not be standardized on a version that is required by Hyperion.  In these instances, it may make sense to deploy Tomcat as opposed to a version of WebLogic or WebSphere that your organization does not support.  Additionally, if the organization is upgrading its application server farm in the future, Hyperion may be an outlier.

Now for the downside, there isn’t really any help desk you can call for Tomcat outside of Hyperion.  Unless your organization has a few other Tomcat servers, there is likely to be a learning curve in the administration and support.  Then, there is always the question of scalability.  Clustering with Tomcat is quite a bit more complex.  Our common recommendation on application servers is that you should use what you know.  If you do not know anything about application server administration and do not have a huge user base, Tomcat is probably your best choice due to its ease of installation.

One rule of thumb is that you can expect a single Tomcat instance to handle 25-30 concurrent threads comfortably.  The practical maximum is 50 per instance.  If you are using other sorts of Java servers, it is a good idea to review with those administrators their experience with free servers.  In a development environment, one could benchmark how Hyperion EAR and WAR files behave in comparison to those of other Java apps.

With all these assumptions, I deliver a Viso diagram detailing which components go on which servers with general recommendations for server sizing and operating systems. 

In our 2nd installment, I will review the pre-installation activates that need to occur before the installation begins.

Overview

As companies grow and evolve, it becomes essential to manage master data across information silos that result from mergers and acquisitions, departmental initiatives, or legacy system proliferation. Data consistency, integrity, quality and accuracy can suffer creating a process that lacks integrity.  Oracle Hyperion Data Relationship Management provides enterprises with a solution to build consistency within master data assets despite endless changes within the underlying transactional and analytical systems.

Product History

Data Relationship Management was developed in the mid-1990s by a small Texas company called Razza Solutions.  The product was then known as Razza Dimension Server and was used by about 30 Fortune 500 companies.  With the acquisition of Razza by Hyperion in 2005, the product was renamed to Hyperion Master Data Management (MDM) and integrated into Hyperion’s BI foundation.

Now part of Oracle’s Master Data Management Suite, the name has been changed to Hyperion Data Relationship Management (DRM).  While DRM is Oracle’s core product for Financial and Analytical master data management, Oracle also has other operational MDM applications available including Oracle Customer Hub and Product Hub.

Strengths / Benefits

  • Change management and control - Unify cross-functional perspectives to a master record while giving users the freedom to make changes and construct alternate departmental views of the data that are consistent and accurate.
  • Best-of-breed hierarchy management - Streamline master data changes using drag-and-drop hierarchy maintenance. Enable side-by-side comparison and one-click navigation across both, balanced and ragged hierarchies, with built-in referential integrity.
  • Audit and compliance - Provide a framework for query, comparison, and full history of all master data changes, including documented versioning and roll-back at any point in time or recorded modification to master data elements.
  • Workflow integration - Hot-pluggable integration with workflow tools using web services.

Resources