If there’s anything that’s defining thriving businesses today, it’s a strong understanding and strategizing the use of a company’s data.
However, it brings up a whole range of questions, from both users and stakeholders- What data exists in my company? Where is it stored? What is the best data for my problem? When you have figured that out, more questions arise- How do I access it? Can I trust it?
Providing and controlling data access, ensuring data quality and data protection – all come under data governance.
A data catalog takes care of finding and understanding your data part effectively. Now it is also combining the capabilities of a data governance toolset. The merger of data cataloging and data governance is very opportune. That is because their functions are so intertwined.
Here, I will discuss the framework of effective data governance. First, I will layout the functions of data governance and then the features required to support those functions.
Following are the functions of data governance.
For an organization to govern a variety of data, it needs to have specific policies in the following area.
Roles and responsibilities are a crucial part of data governance, which comes down to identifying and managing the roles of owners and stewards of data.
A data steward is a role to ensure the fitness of data elements – both the content and metadata. The tasks they do:
A data owner is an individual accountable for a data asset. The tasks they do:
A traditional company can move towards self-service, or in a reverse scenario, a fast-moving start-up can start having more controls. In both cases, we need a change management program. Data Governance team should be equipped to provide various training to adapt to these changes.
A company has to align overall data strategy with the business strategy of the company. Only then the data governance programs are successful.
Ethical data handling can increase the trustworthiness of an organization and the organization’s data and process outcomes. Like W. Edward Deming’s statement on quality, ethics means “doing it right when no one is looking.”
Data classification helps the teams find, organize, and secure relevant data. We can classify data as per various categories:
Source: DAMA International
Since each organization’s data is unique, a plan to data valuation needs to begin. That includes articulating general cost and benefit categories that can be applied consistently within an organization.
Policy Implementation Module
A module that ensures that the data policies are implemented.
A module through which roles like those of data steward, data owner, and data custodian can be assigned.
Data Lineage is a visual representation of where the data is coming from, where it moves and what transformations it undergoes over time. It provides the ability to track, manage, and view the data transformation along its path from source to destination. This a key feature in maintaining data quality.
Business glossary is a document which enables data stewards to build and manage a common business vocabulary. This vocabulary can be linked to the underlying technical metadata to provide a direct association between business terms and objects.
A workflow by which users can request for data and data owners can grant access for a specific time period.
Understanding the framework of data governance is the vital first step in data governance.