Michael presents a simple model showing the interaction between key business use cases, key business processes and core data components necessary to enable GenAI to drive business value
Organizations often jump into GenAI with exciting use cases while leaving Data Quality, Master Data Management (MDM), and Data Governance for “later.” Typically we see Commercial teams cleanse customer records, whilst Supply Chain focus on standardizing the definition of product, specifically finished goods here, bill of materials there as well as other derivatives of products including parts, components, materials and services. Meanwhile, Data Quality initiatives and MDM programs are run on an ad hoc basis, in silos, outside a single and unified Data Governance framework. The result? Different naming conventions, conflicting definitions, variable degrees of granularity, and ultimately data models that generate inconsistencies in reporting across different business functions and units.
Considering that LLMs will aim to provide the most probable answers as opposed to the right answer, lack of alignment between Data Governance, Data Quality and MDM typically leads to the following implications when deploying GenAI:
Michael Norejko, Data Engineering Lead, Cloud &Digital, PwC Poland
Michael Norejko brings 15 years of experience building data and analytics capabilities with a focus on aligning Data Quality, Master Data Management, and Data Governance initiatives as part of large digital transformation programmes. Successfully deploying LLMs is as much dependent on the compute as it is on the availability of data and a consistent ontology that defines the business.
GenAI only scales when Data Quality, and Master Data Management, fall within a single Data Governance framework, as opposed to running as three separate initiatives. If master records are inconsistent and quality is managed in silos, GenAI will just amplify the confusion and faster.
To mitigate instances of bias, hallucinations and spurious correlations in LLMs, ensure that master and meta data is continuously harmonized across multiple sources, standardized against a set standard and enriched where there are instances of missing attributes. The standards need to be provided as part of a single Data Governance framework shared across key business functions, domains and processes priority business. Consider the following point of interactions:
As an example, lets look at this from the perspective of a Commercial Manager responsible for overseeing Order-to-Cash process. Sales representative enters “Acme Ltd.”, Finance analyst sees “ACME” and Customer service agent uses “ACME Holdings UK.” Without MDM matching & merging logic and supporting Data Quality rules, GenAI may allocate discounts as per an agreed contracts against three different entities which in reality belong to one and the same customer. With standardized Customer names, IDs, address attributes, and harmonized master records from different sources, the same LLM can reconcile invoices to purchase orders and identify missing discounts against the same Customer.
To support this practical example, research shows that Deploying GenAI with robust Data Governance and supporting Master Data Management practices accelerates time-to-value up to 40% (Gartner, 2025). Furthermore, organizations that invest in data governance and master data management see up to 3x the return on their data investments versus those without governance.
If you want GenAI to move beyond experiments, do not treat Data Quality, Master Data Management and Data Governance as afterthought. Furthermore, do not run these as separate initiative and instead move towards a synergy focused approach where all three initiatives are consolidated into a single program.
For complementary viewpoints on turning GenAI into measurable results:
Mariusz Chudy
Marek Chlebicki
Mariusz Strzelecki