图书情报知识 ›› 2025, Vol. 42 ›› Issue (5): 19-30.doi: 10.13366/j.dik.2025.05.019

• 专题(1)·可信数据空间 • 上一篇    下一篇

可信数据空间协同治理下的高质量数据集建设与长效运营路径

林镇阳1,2, 胡鑫3, 郭明军4, 何玥5, 张艳6, 王明珠7   

  1. 1.深圳清华大学研究院多模态人工智能数据工程研发中心,深圳,518071;
    2.武汉数据智能研究院,武汉,430000;
    3.中国社会科学院大学国际政治经济学院,北京,102401;
    4.国家信息中心大数据发展部,北京,100045;
    5.中国人民大学法学院,北京,100872;
    6.北京大学法学院,北京,100871;
    7.首都师范大学管理学院,北京,100048
  • 出版日期:2025-09-10 发布日期:2025-11-13
  • 通讯作者: 郭明军(ORCID: 0000-0002-6767-2448),博士,高级经济师,研究方向:算力基础设施与数据治理协同,Email: guomjnature@126.com。
  • 作者简介:林镇阳(ORCID: 0000-0003-2119-9189),博士,高级工程师,研究方向:数字经济与技术创新,Email: lin-zy15@tsinghua.org.cn;胡鑫(ORCID: 0009-0002-6093-1475),硕士,研究方向:数字经济与数据要素市场化,Email: hh11690@163.com;何玥(ORCID: 0009-0003-6663-4479),博士研究生,研究方向:经济法、数字法,Email: heyue370950285@163.com;张艳(ORCID: 0009-0004-7198-4497),硕士,研究方向:电子商务法,个人信息保护法,Email: yanzhang15@tsinghua.org.cn;王明珠(ORCID: 0000-0001-9307-5084),博士,讲师,研究方向:社会博弈、公共政策、数据治理,Email: 7204@cnu.edu.cn。
  • 基金资助:
    本文系国家自然科学基金专项项目“数据交易场所的功能定位、运营机制与治理机制研究”(72442030)和湖北省数据局2024年研究课题项目“湖北省高质量数据集建设研究”(IM2409E085N1))的研究成果之一。

The Construction and Long-Term Operation Path of High-Quality Dataset Under Collaborative Governance in Trusted Data Spaces

LIN Zhenyang1,2, HU Xin3, GUO Mingjun4, HE Yue5, ZHANG Yan6, WANG Mingzhu7   

  1. 1. Multimodal Artificial Intelligence Data Engineering R&D Center, Tsinghua University Shenzhen Graduate School, Shenzhen, 518071;
    2. Wuhan Institute of Data Intelligence, Wuhan, 430000;
    3. School of International Politics and Economics, University of Chinese Academy of Social Sciences, Beijing, 102401;
    4. Big Data Development Department, National Information Center, Beijing, 100045;
    5. Renmin Law School, Renmin University of China, Beijing, 100872;
    6. Peking University Law School, Beijing, 100871;
    7. School of Management, Capital Normal University, Beijing, 100048
  • Online:2025-09-10 Published:2025-11-13
  • Contact: Correspondence should be addressed to GUO Mingjun, Email: guomjnature@126.com, ORCID: 0000-0002-6767-2448
  • Supported by:
    This is an outcome of the Special Project "Research on the Functional Positioning, Operational Mechanism and Governance Mechanism of Data Trading Venues"(72442030)supported by National Natural Science Foundation of China, and the 2024 Research Project "Research on the Construction of High-Quality Data Sets in Hubei Province"(IM2409E085N1)supported by Hubei Provincial Data Bureau.

摘要: [目的/意义]针对政企数据要素融通中存在的质量参差、流通壁垒与权责失衡等问题,以数智融合背景下高质量数据集建设及运营需求为导向,探索可信数据空间驱动的协同治理路径,旨在破解当前数据集数量少、质量差、难使用等多重困境。[研究设计/方法] 通过构建基于可信数据空间的“数据提质—数据集市—数算一体—数据众创”四位一体的高质量数据集综合运营平台架构,设计“城市—行业—企业”三级联动的长效运营机制,解析可信数据空间驱动的政企数据要素融通中的治理规则、技术适配与场景耦合逻辑,提出“场景驱动—机制协同—安全保障”三位一体的推进范式。[结论/发现]构建高质量数据集综合运营平台,实现数据集全流程的高效流通、价值释放和数据反哺的良性循环;打造数据质量“反哺”模型可形成“流通—增值—提质”闭环机制,验证“法规约束+技术支撑+商业激励”协同治理框架的有效性。[创新/价值]提出“四位一体”基于可信数据空间的高质量数据集建设模式的理论扩展与实践路径,突破传统的“重建设轻运营、重技术轻制度”模式的局限,为我国“数据要素×”和“人工智能+”行动计划的行业落地提供参考与决策支持。

关键词: 数据空间, 数据要素, 政企数据, 高质量数据集, 数据治理, 数智融合

Abstract: [Purpose/Significance] Aiming at the challenges of quality inconsistencies, circulation barriers, and unbalanced rights and responsibilities in the integration of data elements between government and enterprise, guided by the needs for high-quality dataset construction and operation in the context of digital-intelligent integration, this study explores a collaborative governance pathway driven by trusted data spaces, with the aim of resolving multiple dilemmas currently faced by datasets, such as limited quantity, poor quality, and difficulty in use. [Design/Methodology] By constructing a four-in-one integrated operation platform architecture for high-quality datasets based on trusted data spaces, encompassing "data quality improvement–data marketplace–data computation integration–data crowdsourcing innovation," this study designs a long-term operation mechanism with three-level linkage of "city-industry-enterprise". It analyzes the governance rules, technological adaptation, and scenario coupling logic in the integration of government and enterprise data elements driven by trusted data spaces. It proposes a three-pronged promotion paradigm of "scenario driven-mechanism coordination-security guaranteed". [Findings/Conclusion] The construction of the high-quality dataset integrated operation platform achieves the efficient circulation of the entire dataset process,value realization, and a virtuous cycle of data feedback. The development of a data quality "feedback" model forms a closed-loop mechanism of "circulation–value addition–quality improvement", validating the effectiveness of the collaborative governance framework of "regulatory constraints+technical support+commercial incentives". [Originality/Value] This study puts forward the theoretical expansion and practical pathways of "four-in-one" high-quality dataset construction model based on trusted data spaces, breaking through the limitations of the traditional model that "emphasizes construction over operation and technology over institutions". It provides references and decision support for the industry to implement the action plans of "data elements ×" and "artificial intelligence +" in China.

Keywords: Data spaces, Data elements, Government-enterprise data, High-quality datasets, Data governance, Digital-intelligence convergence