insideAI News
Illuminating AI's Frontiers: Your Go-To News Destination.
Subscribe
  • Enterprise AI
  • GenAI/LLMs
  • Industries
    • AI for Agriculture
    • AI for Financial
    • AI for Healthcare, Medical and Life Sciences
    • AI for Manufacturing
    • AI for Security
    • AI for Transportation
  • White Papers
  • Resources
    • Book Reviews
    • Events Calendar
    • Podcasts
    • Reports
    • Video
  • Data Science
  • Deep Learning
  • Machine Learning
  • Subscribe
  • Search

What Data Governance Means to Cloud Computing (And Vice Versa)

July 16, 2021 by Editorial Team
Print Friendly, PDF & Email
  • share 
  • share 
  • share  
  • share  
  • email 

The symbiosis between data governance and cloud computing is apparent to any organization with significant cloud investments. The cloud has a plethora of resources that enhance data governance, which is critical for regulatory compliance, risk mitigation, and long term profitability of data assets.

Simultaneously, however, data governance hallmarks of metadata management, data cataloging, and data stewardship are requisite for maximizing the cloud’s utility by illustrating where data are and employing them for singular use cases like customer 360s, real-time product or service recommendations, and advanced analytics.

Although it’s difficult to say which of these capabilities is more advantageous to the enterprise, they clearly complement each other.

For data governance, the cloud’s chief value proposition is that “from a standpoint of connectivity, or data integration, or data quality, or cataloging, everything’s included so you have an all-in-one solution that’s possible because we’re in a microservices world where you can deploy efficiently and have things kind of seamlessly interoperable,” Informatica MDM General Manager Manouj Tahiliani revealed.

The cloud is also a significant contributor to dispersing resources across locations which, if left unchecked, could easily result in silos and ungoverned use. According to Profisee CTO Eric Melcher, the cloud creates “the much more distributed enterprise where you’ve got lots of applications: our HR applications, CRM applications, ERP applications, our customer experience platform. We’ve got applications across the line. And by the way, the data’s not in that server in the corner, it’s now all over the place. So, I need governance to understand where everything is.”

Data governance is essential for keeping cloud deployments orderly, while the cloud enriches the very means of effecting governance by allowing organizations to position resources wherever they’re most advantageous for modern computing—particularly when supported by governance staples of metadata management, data cataloguing, data modeling, and data stewardship.

Metadata Intelligence

Metadata is utilitarian for governing data in cloud settings. It’s particularly beneficial for assembling data across numerous sources to use in a single domain such as customer, product, supply chain information, and more. Tahiliani referred to the notion of “metadata intelligence” as foundational to implementing data quality and data integration. Coupling metadata intelligence with Master Data Management exploits the sundry of cloud sources by enabling organizations to govern how they “get data from different sources, curate it, and share it across the enterprise,” Tahiliani observed.

Metadata plays a couple different roles in this basic functionality. It’s the key to connecting data to various downstream systems through MDM’s logical model. Firms can also “use the power of metadata to help them discover elements that need to be mastered, rapidly provide integration between the system using that metadata knowledge and share that data, as well as use metadata to use machine learning capabilities around matching and data quality rules,” Tahiliani commented. These capabilities are instrumental for combining sources for customer insight, for example, in a well governed fashion.

Data Cataloging

Metadata also helps the data cataloging process Tahiliani described, which identifies what data are where in cloud settings. In fact, there are several cloud tools that reinforce this aspect of data governance by empowering organizations so “the first thing they do is catalog what they have,” Melcher noted. “Now that we’ve cataloged it, let’s start classifying and understanding what it is and figuring out what’s sensitive and who knows about the data here and there.” Once data are classified, it’s relatively straightforward to tag them and ascribe ownership to them, which provides the framework for suitably governing data across any variety of cloud sources.

Credible catalogs can point at sources to provide this functionality or allow sources to input their metadata for cataloging purposes. The latter is particularly effectual with MDM because it lets organizations push “the logical definitions of Master Data…and register assets to what the end user would actually expect,” Melcher mentioned. The benefit of this approach is data already adheres to MDM conventions of data quality, definitions, and other aspects of its logical model, which aids in creating catalogs and classifications that enhance enterprise use cases—while ensuring those use cases are well governed.

Data Modeling

MDM is also useful for unifying the different data models that abound throughout cloud sources so organizations can combine them for analytics, for example. Data modeling is a pivotal aspect of data governance that can be particularly time consuming (delaying time to value) with traditional methods. MDM, however, lets organizations create models for products or any other domain partly by using data discovery mechanisms. “As you’re creating the models and what attributes you need to have, the discovery aspect allows you to understand across the source systems for those different entities what’s the metadata that’s being held,” Tahiliani explained. “That provides information about how often it’s used and updated to really determine if that should be an attribute within your master data model.”

The most expressive data models unify terminology and include specific vocabularies to describe customers or supply chain needs, for instance. The metadata intelligence Tahiliani alluded to assists this facet of data modeling while allowing organizations to redress disparities in how entities are represented “when you know which source systems the metadata you’re pulling in is from, and what the vocabulary is across those source systems,” Tahiliani added.

Data Stewardship

All of the above dimensions of data governance—metadata intelligence, data cataloging, and data modeling—are invaluable to data stewards attempting to ensure governance standards are met across the enterprise in heterogeneous cloud, edge, and on-premise applications. The enhanced data modeling traits pertaining to definitions and attributes that are found in MDM (including information from data cataloging tools) are critical to data stewards validating data governance protocols.

For example, if a steward needs at-a-glance information about the data model for a customer entity, he’ll see “this is a description of a customer associated with this glossary term called business partner or whatever the case may be, and Bob in Sales is an expert on what customer is,” Melcher indicated. With this approach MDM “participates in governance by pushing…metadata into a catalog, but then, as governance occurs, the output of governance is then available for end users to consume back in [MDM],” Melcher concluded.

Master Data Governance

Data governance is indispensable for making good on the cloud’s premise of real-time access to a distributed data landscape with elastic scalability in a pay-per-use pricing model. Additionally, the cloud contains many tools—such as cloud native MDM solutions, data cataloging instruments, and others—that streamline governance capabilities to decrease data’s risk while boosting its enterprise value over the long term.

MDM sits between each of these constructs as a viable means of balancing the usefulness of one with the other. “You shouldn’t do governance in MDM,” Melcher cautioned. “You should do governance and MDM. You should be governing across more than just your Master Data Management platform.” 

About the Author

Jelani Harper is an editorial consultant servicing the information technology market. He specializes in data-driven applications focused on semantic technologies, data governance and analytics.

Sign up for the free insideAI News newsletter.

Join us on Twitter: @InsideBigData1 – https://twitter.com/InsideBigData1

  • share 
  • share 
  • share  
  • share  
  • email 
Filed Under: Featured, Google News Feed, News / Analysis, Opinion, Uncategorized Tagged With: cloud computing, data governance, metadata, Weekly Newsletter Articles
«
»
»
«

Sponsored Guest Articles

  • Re-Engineering Ethernet for AI Fabric

    [SPONSORED GUEST ARTICLE]   For years, InfiniBand has been the go-to networking technology for high-performance computing (HPC) and AI workloads due to its low latency and lossless transport. But as AI clusters grow to thousands of GPUs and demand open, scalable infrastructure, the industry is shifting. Leading AI infrastructure providers are increasingly moving ….

White Papers

From Legacy to Leading Edge: How Mainframe Data Can Transform AI and Analytics

From Legacy to Leading Edge: How Mainframe Data Can Transform AI and Analytics explores how enterprise data leaders can unlock untapped value by integrating mainframe data into modern AI and analytics ecosystems. Despite powering critical operations, mainframe data remains largely siloed. This whitepaper outlines strategic insights, common roadblocks, and a practical roadmap to help data leaders […]

Download
More White Papers

Join Us On Social Media

Featured From
InsideHPC

RSS Featured RSS Feed

  • AWS Announces AI Servers with NVIDIA Blackwell
    Amazon Web Services announced general availability of P6e-GB200 UltraServers with NVIDIA Grace Blackwell Superchips. The servers are designed for training and deploying the large-scale AI models, and they follow the launch earlier this year of P6-B200 instances, also with NVIDIA Blackwell GPUs, for AI and high-performance computing workloads. The new servers are AWS’s most powerful GPU […]

RSS More News from insideHPC

  • At ISC 2025: HPE Talks All Things ‘At Scale’ – AI Factories, Data Center Infrastructure, Top 500 HPC, Cloud
  • AWS Announces AI Servers with NVIDIA Blackwell
  • AI Forcing New Technology for Data Center Network Interconnectivity
  • Canadian Scientists Use D-Wave in Quantum-AI Simulations
  • Registration Opens for SC25 in St. Louis, Nov. 16-21
  • Denodo Announces Deep Research AI Capability
  • AI for Science: Livermore Lab Expands Deployment of Anthropic AI
  • About insideAI News
  • Contact
  • Advertise with insideAI News
  • Visit Our Other Site – insideHPC
  • Terms of Service & Copyright
  • Privacy Policy
insideAI News
Copyright © 2025