{"id":87,"date":"2022-09-27T20:12:32","date_gmt":"2022-09-27T20:12:32","guid":{"rendered":"https:\/\/sites.psu.edu\/jaredmcuevas\/?p=87"},"modified":"2022-09-27T20:12:32","modified_gmt":"2022-09-27T20:12:32","slug":"topic-3-1-data-management-the-data-warehouse-data-lake-data-hub-and-master-data-management","status":"publish","type":"post","link":"https:\/\/jaredmcuevas.com\/?p=87","title":{"rendered":"Topic 3.1 \u2013 Master Data Management"},"content":{"rendered":"<p><strong>Introduction<\/strong>.\u00a0 In this third blog installment (#3.1 and #3.2) at Penn State University\u2019s College of Information Sciences and Technology\u2019s graduate course, EA874 \u2013 Enterprise Information Technology Architecture, we focus on data architecture. We focus on defining and contrasting data management concepts:\u00a0 master data management (MDM), data warehouses, data lakes, and, briefly, data hubs.<\/p>\n<p><strong>Master Data Management (MDM) Definition<\/strong>.\u00a0 To open this discussion, we examine and define a critical component of modern data management &#8211; MDM.\u00a0 To quote Emad Yowakim&#8217;s LinkedIn OP-ED (<em>cited below<\/em>):<\/p>\n<p>&#8220;Master Data Management (MDM) refers to the process of creating and managing data that an organization must have as a single master copy, called the master data&#8230;[it] is important because it offers the enterprise a single version of the truth.&#8221;<\/p>\n<p><strong>Benefits of MDM<\/strong>.\u00a0 By focusing on and identifying a single version of data entities, MDM attempts to eliminate redundant data. This process creates efficiency and removes complications resulting from multiple versions of a single data entity.\u00a0 Redundant data entries can lead to inefficiencies because each COULD contain conflicting information, or have been updated by multiple sources at multiple times. Data-driven enterprises who don&#8217;t implement some form of MDM leave the potential for inconsistency and confusion. The use of consistent systems and processes throughout an enterprise is key to MDM, achievable through a solid and well-planned IT governance program.<\/p>\n<p><strong>MDM Applications &#8211; <em>Opinion<\/em><\/strong>.\u00a0 In Emad&#8217;s OP-ED, &#8220;MDM is typically more important in larger organizations. In fact, the bigger the organization, the more important the discipline of MDM is, because a bigger organization means that there are more disparate systems within the company, and the difficulty on providing a single source of truth, as well as the benefit of having master data, grows with each additional data source.&#8221; However, I would provide a counter-argument; while I acknowledge that larger organizations struggle with keeping single sources of data due to their higher data volume and larger number of data sources, I would offer that single source data can also be critical to smaller organizations. Redundant, incorrect data in smaller enterprises can be equally damaging to internal processes such as manufacturing and production, or problematic for external processes such as customer service or marketing. Can you imagine, as a customer, receiving conflicting answers (sets of data) from multiple sources in the same company, and how confusing and frustrating this can be? Or identical manufacturing pieces receiving two different sets of manufacturing data from their controlling machines? The principles of MDM are critical at every level of those organizations which rely on precision to operate effectively.<\/p>\n<p><strong>Challenges in MDM<\/strong>.\u00a0 Mr. Yowakim&#8217;s LinkedIn article addresses one of the primary challenges related to MDM in the business environment &#8211; mergers and acquisitions. He addresses a common challenge related to the requirement for enterprise architecture planning &#8211; &#8220;how to merge the two sets of data will be challenging.&#8221; In addition to leveraging MDM concepts, a solid enterprise architecture governance program, and general EA tenants such as consistency, enterprise-wide planning, and central repositories, Emad also suggests appointing a dedicated steward for MDM, which &#8220;can also be a group&#8230; such as a data governance committee or a data governance council.&#8221;<\/p>\n<p><strong>Personal Observations.<\/strong>\u00a0 In my experience as a user and sometimes contributor to large information systems in the U.S. government, a common complain encountered is the lack of a &#8220;system of record&#8221; for all functions of the organization.\u00a0 In recent years, my organization has taken a number of steps to address this and has instituted enterprise-wide services such as a single login throughout the organization.\u00a0 However, information management and establishing a &#8220;single truth&#8221; similar to MDM which eliminates redundancy among entities remains a challenge.\u00a0 Recently, our group discussed some advanced alternatives which would link and automatically consolidate redundant entities based on metadata, but these advanced tools have yet to be applied throughout the enterprise uniformly.<\/p>\n<p><strong>Sources<\/strong>.<br \/>\n(1) https:\/\/www.linkedin.com\/pulse\/master-data-management-vs-warehousing-emad-yowakim<br \/>\n(2) Gartner, &#8220;Data Hubs, Data Lakes and Data Warehouses: How They Are Different and Why They Are Better Together&#8221;, Refreshed 2 June 2021, Published 13 February 2020. https:\/\/www.gartner.com\/document\/3980938?ref=d-linkShare<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Introduction.\u00a0 In this third blog installment (#3.1 and #3.2) at Penn State University\u2019s College of Information Sciences and Technology\u2019s graduate course, EA874 \u2013 Enterprise Information Technology Architecture, we focus on data architecture. We focus on defining and contrasting data management concepts:\u00a0 master data management (MDM), data warehouses, data lakes, and, briefly, data hubs. Master Data [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-87","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/jaredmcuevas.com\/index.php?rest_route=\/wp\/v2\/posts\/87","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/jaredmcuevas.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/jaredmcuevas.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/jaredmcuevas.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/jaredmcuevas.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=87"}],"version-history":[{"count":0,"href":"https:\/\/jaredmcuevas.com\/index.php?rest_route=\/wp\/v2\/posts\/87\/revisions"}],"wp:attachment":[{"href":"https:\/\/jaredmcuevas.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=87"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/jaredmcuevas.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=87"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/jaredmcuevas.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=87"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}