信息质量视角下AIGC虚假信息问题及根源分析

doi:10.13366/j.dik.2023.04.032

图书情报知识 ›› 2023, Vol. 40 ›› Issue (4): 32-40.doi: 10.13366/j.dik.2023.04.032

• 学术聚焦（1）· 人工智能生成内容（AIGC）治理 • 上一篇下一篇

信息质量视角下AIGC虚假信息问题及根源分析

莫祖英，盘大清，刘欢，赵悦名

郑州航空工业管理学院信息管理学院，郑州，450046

出版日期:2023-07-10 发布日期:2023-08-16
通讯作者: 莫祖英（ORCID：0000-0003-0661-9333）,博士,教授,研究方向：信息质量､虚假信息质量,Email:mozuying611@163.com.
作者简介:盘大清（ORCID：0000-0003-0712-4957）,硕士研究生,研究方向：虚假信息识别,Email:qcountpan@163.com;刘欢（ORCID：0009-0006-9571-2984）,硕士研究生,研究方向:虚假信息治理､用户行为,Email:979243359@qq.com;赵悦名（ORCID：0009-0000-5784-208X）,硕士研究生,研究方向:虚假信息治理,Email:1023200761@qq.com｡
基金资助:
本文系国家社科基金项目“社交媒体情境下网络虚假信息传播行为干预研究”（21BTQ049）的研究成果之一｡

Analysis on AIGC False Information Problem and Root Cause from the Perspective of Information Quality

MO Zuying PAN Daqing LIU Huan ZHAO Yueming

School of Information Management, Zhengzhou University of Aeronautics, Zhengzhou, 450046

Online:2023-07-10 Published:2023-08-16
Contact: Correspondence should be addressed to MO Zuying, Email:mozuying611@163.com, ORCID: 0000-0003-0661-9333
Supported by:
This is an outcome of the project "Research on the Intervention of Online False Information Dissemination Behavior in the Context of Social Media "（21BTQ049）supported by National Social Science Foundation of China.

摘要/Abstract

摘要： [目的/意义] 探讨AIGC中存在的各种虚假信息类型及其特征，对理解虚假信息产生的根源、减少AIGC中虚假信息的生成具有积极作用。[研究设计/方法] 采用数据测试实验方法，立足于信息质量视角，通过采集AIGC系统一手的测试数据和收集二手的AIGC虚假信息来剖析AIGC虚假信息类型及特征；以人工智能语言模型的信息生成过程为着力点，探析AIGC中虚假信息生成的根源。[结论/发现] AIGC虚假信息主要包括事实性虚假和幻觉性虚假两种类型，事实性虚假信息主要集中在数据错误、作者作品错误、客观事实错误、编程代码错误、机器翻译错误五个方面，而幻觉性虚假信息主要集中在虚假新闻事件、虚假学术信息、虚假健康信息和偏见与歧视方面；AIGC虚假信息产生的根源与大规模语言模型、预训练数据集和人工标注三个要素有关。[创新/价值] 采用了数据测试实验方法，并辅以二手数据的收集，全面分析了各种AIGC虚假信息的类型，并根据生成机理与表现形式将其划分为事实性虚假信息和幻觉性虚假信息，为AIGC虚假信息的进一步研究提供了理论基础。

关键词: 人工智能生成内容（AIGC）, 虚假信息, 信息质量, 事实性虚假信息, 幻觉性虚假信息, 根源分析

Abstract: [Purpose/Significance] This paper aims to analyze the types and characteristics of false information in AIGC,which has a positive role in understanding the root causes of false information and reducing its generation. [Design/Methodology] In this study, the method of data testing experiment was adopted. Based on the perspective of information quality, the types and characteristics of false information generated by AIGC were analyzed through collecting first-hand testing data of AI systems and second-hand false information of AIGC. Further, focusing on the information generation process of artificial intelligence language models, we explored the origins of false information generation in AIGC. [Findings/Conclusion] False information in AIGC mainly consists of two types: factual false information and hallucinatory false information. Factual false information is primarily focused on errors in five aspects: data errors, author and his works errors, errors in objective facts, programming code errors, and machine translation errors. On the other hand, hallucinatory false information is mainly concentrated in the areas of fake news events, false academic information, false health information, and bias and discrimination. The origins of false information in AIGC are related to three factors: large-scale language models, pre-training datasets, and human annotations. [Originality/Value] This study employes a data testing experimental approach, complemented by the collection of second-hand data, comprehensively analyzes various types of false information in AIGC, and divides false information into factual false information and hallucinatory false information based on the generation mechanisms and manifestations, which provides a theoretical foundation for further research on false information in AIGC.

Keywords: Artificial intelligence generated content（AIGC）, False information, Information quality, Factual false information, Hallucinatory false information, Root cause analysis

莫祖英, 盘大清, 刘欢, 赵悦名. 信息质量视角下AIGC虚假信息问题及根源分析[J]. 图书情报知识, 2023, 40(4): 32-40.

MO Zuying, PAN Daqing, LIU Huan, ZHAO Yueming. Analysis on AIGC False Information Problem and Root Cause from the Perspective of Information Quality[J]. Documentation, Informaiton & Knowledge, 2023, 40(4): 32-40.

[1]	张奎, 王秀伟. 生成式AI在传统文化传播中的媒介呈现与风险治理[J]. 图书情报知识, 2024, 41(4): 98-109.
[2]	龚芙蓉. ChatGPT类生成式AI对高校图书馆数字素养教育的影响探析[J]. 图书情报知识, 2023, 40(5): 97-106,156.
[3]	王鹏涛, 徐润婕. AIGC介入知识生产下学术出版信任机制的重构研究[J]. 图书情报知识, 2023, 40(5): 87-96.
[4]	詹希旎, 李白杨, 孙建军. 数智融合环境下AIGC的场景化应用与发展机遇[J]. 图书情报知识, 2023, 40(1): 75-85.
[5]	黄雨婷, 冯婕. 信息素养视域下的虚假信息甄别：国际进展与我国对策[J]. 图书情报知识, 2021, 38(2): 121-132.
[6]	张宁，袁勤俭. 用户视角下的学术社交网络信息质量影响因素研究——基于扎根理论方法[J]. 图书情报知识, 2018, 0(5): 105-113.
[7]	余梅，吴志强. 从失败学视角构建政府信息系统质量控制模型的研究[J]. 图书情报知识, 2016, 0(2): 83-91.
[8]	夏前龙，施国洪，张晓慧. 移动图书馆服务质量的内涵、结构及其测度[J]. 图书情报知识, 2015, 0(1): 47-55.
[9]	丁敬达. 维基百科词条信息质量启发式评价框架研究[J]. 图书情报知识, 2014, 0(2): 11-17.

信息质量视角下AIGC虚假信息问题及根源分析

Analysis on AIGC False Information Problem and Root Cause from the Perspective of Information Quality

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 9

编辑推荐

Metrics

本文评价