At a time when data-driven technology is increasingly becoming the core competitiveness, effective data governance is no longer an option, but a basic and strategic project for the government to improve governance efficiency and enterprises to optimize their operational decisions. In the face of common challenges such as data silos, uneven quality, security risks, and delayed decision-making, the key to building a data governance system from scratch lies in clear strategic positioning, a scientific top-level framework, and a pragmatic implementation path. This article aims to provide a set of practical framework ideas for product managers and PMs.
1. The urgency and core value of data governance
1. Analysis of typical pain points
Data silos hinder collaboration:Data fragmentation across departments/systems is the norm. Basic data (such as population and legal persons) are collected repeatedly but difficult to communicate between government lines, which affects the efficiency of policy implementation (such as differences in tax and market supervision data). Data barriers in sales, production, and financial systems within the enterprise cause information delays (such as sales orders not being able to drive production schedules in real time, causing inventory or delivery issues).
Data quality constraints trustworthiness:Errors and missing data entries and lagging updates can lead to irregularities. government statistics are distorted due to deviations in grassroots reporting; Outdated customer information (such as contact information and address) directly affects marketing reach and service experience; Credit data in the financial sector is not allowed to cause risk control failures (such as loan approval errors).
High security compliance risks:Data breaches are frequent, covering citizen privacy (government), trade secrets (enterprises), etc. Security threats come from external attacks and internal breaches, which can lead to serious trust crises and legal/financial losses.
Weak data support decision-making:The data is scattered and of poor quality, making it difficult for decision-makers to obtain panoramic, accurate and timely information support. Lack of real-time enterprise/industry insights into government industrial policy-making; Enterprises lack reliable analysis basis for market decision-making and miss opportunities.
2. Core business values
Enhancing Operational Efficiency: Break down silos and automate data sharing and processes. The government promotes “one network office” to reduce duplicate entry and improve service efficiency. Internal data integration optimizes production, supply and marketing coordination, shortens the cycle and reduces costs.
Driver Service Upgrade:High-quality data empowers accurate services. the government optimizes the allocation of public resources (such as hospital layout based on population and medical data); Enterprises achieve personalized product/service recommendations to improve customer satisfaction and loyalty (such as e-commerce accurate recommendation to increase conversion).
Enhanced Decision Intelligence:Complete, accurate and timely data is the cornerstone of scientific decision-making. the government uses big data to analyze the socio-economic situation and formulate policies; Enterprises analyze market trends and customer needs through data analysis, adjust strategies agilely, and enhance competitiveness.
To achieve these three challenges, product managers will only continue to appreciate
Good product managers are very scarce, and product managers who understand users, business, and data are still in demand when they go out of the Internet. On the contrary, if you only do simple communication, inefficient execution, and shallow thinking, I am afraid that you will not be able to go through the torrent of the next 3-5 years.
View details >
Ensure compliance and hedging:Systematic governance ensures that data processing activities comply with the requirements of the Data Security Law, the Personal Information Protection Law, and other laws and regulations, and significantly reduces legal and reputational risks (the financial industry meets regulatory audits, and the government protects citizens’ information privacy).
2. Strategic positioning and goal setting
1. Organizational strategy analysis
PM core actions:In-depth study of the core strategy of the organization (government: modernization of social governance, optimization of the business environment; enterprises: business growth, market share increase). Take the initiative to conduct multiple rounds of communication and discussion with senior management (CXO, department heads).
Key outputs:Clearly explain how data governance isDirect supportAchieving strategic goals (e.g., data interoperability supports the optimization of the business environment through “one network office”; master data consistently supports the improvement of enterprise supply chain efficiency). Use industry benchmarking practices to enhance persuasiveness.
Signs of success:Obtain the strategic value of high-level data governanceconsensusand the project initiatedClarify support and resource commitments。
2. Target quantitative tracking
Goals need to be SMART (specific, measurable, achievable, relevant, time-bound):
Unified view of core data:
- Target:Within X months, integrate core business domains (such as government: industry and commerce, taxation, social security related to enterprise services; enterprise: sales, production, inventory) data to a unified platform (data warehouse/data lake) to provide one-stop business data query and analysis capabilities.
- PM Focus:Clarify the scope of “core business domains”, data source system lists, and unified view of technical carriers (BI platforms?) Data catalog? ), user roles and access rights design.
Key Data Quality Improvements:
- Target:Set specific quality indicators (accuracy, completeness, consistency, timeliness) and improvement goals for selected key data items (e.g., customer contact number, financial transaction amount) and improvement goals (e.g., increase the accuracy rate of customer contact calls from 70% to 90% within 6 months; the financial reconciliation difference rate is reduced to less than 1%).
- PM Focus:Identify key data items and their business impact, define specific quality rules and measurement methods, design data quality monitoring dashboards, and develop a closed-loop process (discovery-report-rectification).
Core regulatory compliance:
- Target:In accordance with the Data Security Law, the Personal Information Protection Law and industry norms, achieve full compliance with data processing activities (collection, storage, use, sharing, and destruction) within the Y period, and pass key audit points.
- PM Focus:Sort out applicable regulatory provisions, identify current gaps (Gap Analysis), formulate compliance transformation checklists (such as data classification and grading, permission control, privacy statement updates, data lifecycle management policies), and plan compliance verification mechanisms.
3. Top-level frame design
1. Organizational structure
Data Governance Committee (Decision-Making Level):
- Constitute:It is led by senior leaders (CEO/mayor/leader in charge), and is composed of heads of core business departments, IT leaders, compliance/risk control leaders, and data management leaders (such as CDOs).
- Duty:Approve data governance strategies, policies, and standards; decision-making on material matters (e.g., data sharing dispute adjudication); approving budgets and resource allocation; Supervise the effectiveness of governance.
- PM Focus:Promote the formulation of the committee’s charter, clarify the rules of procedure, ensure high-level participation, and establish a regular reporting mechanism.
Data Owner:
- Role:The head of the business unit or their designated representative is primarily responsible for the quality, security, definition, and compliance of specific data domains such as “customer data”, “product data”, “employee data”.
- Duty:Define the meaning and rules of data business; Approve data access requests; leading data quality problem solving; Ensure data usage compliance.
- PM Focus:Clearly divide data fields and their owners; Formulate the list of responsibilities and assessment indicators of the owner; Provide necessary support (e.g., training, tools).
Data Steward (Executive Layer):
- Role:It is usually served by the data management team or IT department as the operational support of the data owner.
- Duty:perform data quality checks and cleaning; managing metadata; Maintaining data standards; Handle day-to-day data management work orders; Monitor data security policy enforcement.
- PM Focus:Clarify the collaboration process between the butler and the owner; Provide data management tools (such as data quality tools, metadata tools); Design a work order circulation mechanism (e.g., using JIRA).
2. Policy system
Policy is the “law” of governance. The following key policies are required to be developed and issued (subject to approval by the Governance Committee):
Data Security Policy:
Content:enforce data classification and grading (e.g., public, internal, secret, top secret); Define access control models (RBAC/ABAC), encryption requirements (TLS/SSL in transit, encryption at rest), storage and backup policies for different levels of data; clarify the requirements of audit logs; Define security incident response processes.
PM Focus:promote the formulation of classification and grading standards (you can refer to national standards or industry practices); Coordinate the implementation of technical control (IAM system, encryption gateway, DLP) by the security team; Design audit reports.
Privacy Policy:
Content:Strictly follow the principles of personal information processing (legal and legitimate necessity, purpose limitation, minimum sufficiency, etc.); Standardize the entire process of personal information collection (obtaining explicit consent), use (limited scope), storage (term management), sharing/transfer (security assessment and agreement); Protect user rights (inquiry, correction, deletion, withdrawal of consent); Develop a Privacy Impact Assessment (PIA) mechanism.
PM Focus:sort out business processes and systems involving personal information; design user consent management mechanism; planning the data subject rights response process; Promote the implementation of PIA templates and processes.
Data Quality Standards:
Content:Define the dimensions of core data quality (accuracy, completeness, consistency, uniqueness, timeliness, validity) and specific metrics; establish a data quality rule base; Formulate a closed-loop process for quality problem discovery, evaluation, reporting, assignment, rectification and verification; Clarify the quality assessment and accountability mechanism.
PM Focus:Formulate quality rules for key data items with business parties; Select and deploy data quality tools (e.g., Great Expectations, Talend DQ); Design quality monitoring boards; Establish an issue tracking process.
Metadata Management Specifications:
Content:Define metadata scopes (business, technical, operational); Standardize metadata collection (automated interface/manual entry), storage (metadata warehouse), maintenance (update process and responsible person), and use (data catalog, ancestry analysis) requirements; Ensure accuracy and consistency in metadata.
PM Focus:Promote the selection of metadata management tools (e.g., Apache Atlas, Collibra, Alation); Design business glossary; Plan the automatic collection of technical metadata (connect to databases, ETL tools, BI tools).
Data Lifecycle Management Policy:
Content:Define management requirements and retention periods for data at all stages from creation/collection, storage, use, archiving to destruction; Develop archiving strategies (hot and cold data layering); standardize destruction methods and audit requirements; Designed to balance data value with storage cost/compliance risk.
PM Focus:Determining data retention periods with legal/compliance; Coordinate storage teams to design archiving scenarios (such as object storage); Plan for automated lifecycle management processes (leveraging tools or scripts).
3. Key competency areas
Metadata Management:
- Core:Establish a centralized metadata warehouse to collect and manage business terms, technical sheet structures, field meanings, data lineage, data evolution, and other information.
- Value:Improve data discoverability, understandability, and trustworthiness; Support impact analysis (the impact of upstream changes on downstream), root cause analysis (data problem traceability).
- PM Focus: promote the implementation of tools; Ensure that metadata of critical business systems is collected; Design an easy-to-use data catalog for business personnel to query; Promote the application of pedigree analysis.
Data Standards Management:
- Core:Develop and enforce unified data definitions, coding rules (e.g., national administrative division codes, industry classification codes), data formats (e.g., date YYYY-MM-DD), and data models (e.g., customer master data models).
- Value:Eliminate ambiguity, ensure semantic consistency, and achieve cross-system interconnection and sharing.
- PM Focus:Identify key data domains (e.g., customers, products, suppliers) that need to be unified; Organize cross-departmental standard-setting workshops; promote the implementation of standards in the transformation of new systems and existing systems (through interface specifications and data exchange platform constraints); Manage standard versions.
Data Quality Management:
- Core:Based on data standards and quality policies, define specific quality rules (such as mobile phone number format verification, non-empty check, uniqueness constraints, value domain checks, logical consistency checks); Implement automated inspections (batch/real-time); perform data cleaning (correction, completion, deduplication) and standardization; Continuously monitor and report on quality conditions.
- Value:Increase data confidence directly for accurate analysis, automated processes, and compliance requirements.
- PM Focus:Define rules with the business owner; Selective and deploy DQ tools; Design quality scorecards and alarm mechanisms; Establish cleaning operation development and scheduling processes; Track quality problem resolution efficiency.
Master Data Management:
- Core:Identify the organization’s key business entities (e.g., customers, suppliers, products, materials, employees); Establish a master data management system (MDM) or governance process to ensure the uniqueness, accuracy, consistency, and authority of these core entity data across the organization.
- Value:Eliminate cross-system redundancy and conflicts to provide a consistent, reliable “single view” of business processes (CRM, ERP, SCM).
- PM Focus:prioritize master data domains; design master data model and distribution mechanism (publish-subscribe); Evaluate MDM tools (e.g., Informatica MDM, Reltio) or data center-based lightweight solutions; Formulate approval processes for master data creation, change, merger, and invalidation.
4. Implementation roadmap
1. Pilot breakthroughs and verification closed loops
Selection Principles:Select core business domains with high business value, prominent data problems, high-level concerns, and a certain willingness to cooperate (e.g., government: data involved in a high-frequency government service; Enterprise: sales and production data of core product lines).
Core Mission:In the pilot domain, fully implement the data governance process: clarify the owner-> sort out metadata – > formulate/apply standards – > deployment quality inspection and cleaning – > pilot security policies – > establish master data (if applicable) – > display of governance results (such as efficiency improvement and report accuracy improvement).
Key Responsibilities of a PM:Deep participation and coordination of resources; Design pilot schemes and metrics; Quickly iteratively solve the problems exposed in the pilot (organization, process, technology); summarizing reusable lessons learned (playbook); Cultivate core team capabilities.
Cycle:Typically 3-6 months, the goal is to run through the process and produce measurable local value.
2. Replicate experience and deepen coverage
Foundation: Based on the successful experience of the pilot and the optimized scheme.
Tactics:Develop a phased rollout plan based on business priorities (strategic importance, pain points) and data relevance (master data impact). Prioritize promoting areas that are closely related to or similar to the pilot domain.
Key supports:strengthen communication and training to enhance the cognition of all employees; solidify processes and tools to reduce the difficulty of promotion; Establish cross-domain collaboration mechanisms (such as cross-owner committees).
Key Responsibilities of a PM:Develop detailed promotion roadmaps and resource plans; Manage resistance to change and continuously communicate value; monitoring the progress and risks of promotion; coordinate and solve cross-domain problems; Apply governance tools at scale.
Cycle:Typically 6-12 months or more, depending on the size and complexity of the organization.
3. Continuous operation and dynamic optimization
Establish an operation system:Incorporate data governance activities such as metadata maintenance, quality monitoring, standards audits, security audits into daily operational processes. Clarify the daily responsibilities of each role.
Measurement and Improvement:Regularly (e.g., quarterly) data governance maturity and value contribution are assessed (vs. initial goals). Continuously optimize governance frameworks, policies, processes, and tools based on assessment results, changes in business needs (e.g., new business launches, new regulations), and technological developments (e.g., new tools and AI applications).
Key Responsibilities of a PM:design governance maturity assessment model and value measurement system; establish a regular review and improvement mechanism (Retro); Pay attention to industry dynamics and introduce best practices and new technologies; Promote the establishment of a governance culture.
4. Resource planning and risk management
Resource planning:
- Manpower:Dedicated data governance team (PM, architect, data engineer, data steward), business owner and representative, IT support (development, operation and maintenance, security), compliance and legal affairs. PMs need to clearly define their roles, responsibilities, and investment ratios.
- Fund:Tool procurement/licensing (metadata, data quality, MDM, data catalog, security tools), platform construction (data center/lakehouse), consulting fees, training fees, internal labor costs.
- Technology:Data platform infrastructure, network and security protection capabilities, and toolchain integration support.
Risk response plan:
- Departmental resistance/collaboration difficulties:strengthen high-level communication and endorsement; clarify value and common benefit; establish effective cross-departmental collaboration and communication mechanisms; Incorporate governance effectiveness into departmental/individual assessment (use with caution).
- High technical complexity/difficult integration:Fully evaluate existing technology stacks; Prioritize tools that are easy to integrate or offer open APIs; set up a technical research team; Consider phased technology implementation.
- Funds:invest in stages, giving priority to core tools and pilots; fully demonstrate ROI to strive for budget; Explore open source tools (maintenance costs need to be assessed); Seek external cooperation or funding (government projects).
5. The core value of product managers
In the Data Governance 0-1 project, the PM is not only the project manager, but also the bridge between business and technology, strategy and execution, and its key role is reflected in:
Demand Insights:
- Use methods such as user interviews, process sorting, and data analysis to dig deep into the real data needs and management pain points of various business departments.
- Core competencies: Accurately transform vague business demands (such as “inaccurate reports” and “difficult to find numbers”) into specific and executable data governance goals and tasks (such as “improving the integrity of core indicators of XX reports to 95%), establishing customer master data models and connecting A and B systems”.
Strategic Communication:
- To the upper (senior): Clearly understand and decode the strategic intent and value proposition of data governance, report and communicate with the top management concerns (efficiency, growth, risk control) in business language and strive for continuous support.
- Downwards (executive level): Effectively communicate high-level strategies and goals to the executive team (data team, business owner, IT) to ensure the same direction. At the same time, we should keenly capture the difficulties, risks and grassroots feedback in the implementation, provide timely and accurate feedback upwards, and promote strategy adjustment or resource coordination.
- Core competencies: excellent up-and-down management ability; excellent communication skills (written and oral); Information refinement and transformation capabilities.
Frame Design Coordination:
- In the top-level framework design stage, lead or deeply participate in the discussion of organizational structure design, policy system formulation, and implementation plans in core areas.
- Core competencies: Coordinate the different perspectives and demands of the technical team (feasibility, technology selection), business team (demand satisfaction, process adaptation), and management team (compliance, resources) to find the best balance between meeting business value, technical feasibility and management compliance, and promote consensus.
Cross-departmental resource integration:
- Data governance is naturally cross-domain. PMs need to proactively identify and integrate the necessary resources (human experts, operational knowledge, budgets, system authority) scattered across departments.
- Core competencies: strong influence and negotiation skills; building trusting relationships; design a win-win mechanism; Be brave and good at breaking down departmental walls and promote the establishment of a collaborative culture driven by the value of data.
Roadmap Planning and Prioritization:
- Based on the current organizational status, resource constraints, strategic priorities and business pain points, scientifically formulate a phased and value-driven implementation roadmap.
- Core Competencies: Use prioritization frameworks (such as value/complexity matrix, RICE model) to make decisive decisions on massive governance tasks, ensure that the most critical issues are prioritized under limited resources, maximize early value presentation, and maintain project momentum.
The construction of government and enterprise data governance systems from scratch is a systematic project that integrates strategy, organization, policy, technology and execution. The key to success lies in solving core pain points and driving business value as the starting point, clarifying measurable goals, designing and implementing an organization with clear rights and responsibilities, complete policies and focusing on key capability areas, and steadily advancing through a pragmatic path of pilot verification, step-by-step promotion, and continuous operation.