在线社区中人工智能生成内容的识别方法研究

doi:10.13366/j.dik.2024.02.028

Documentation, Informaiton & Knowledge ›› 2024, Vol. 41 ›› Issue (2): 28-38,149.doi: 10.13366/j.dik.2024.02.028

• Academic Focus: Artificial Intelligence Generated Content （AIGC）Governance • Previous Articles Next Articles

Identification Methods of Artificial Intelligence Generated Content in Online Communities

DENG Shengli， WANG Fan， WANG Haowei

School of Information Management, Wuhan University, Wuhan, 430072

Online:2024-03-10 Published:2024-05-14
Contact: Correspondence should be addressed to WANG Fan, E-mail:1161252028@qq.com, ORCID:0000-0003-0100-0320
Supported by:
This is an outcome of the project "Research on Evaluation and Prediction of User Contribution Behavior in Online Knowledge Community from the Perspective of Information Ecological Chain"（71974149）supported by National Natural Science Foundation of China and the Major Project "Research on Reconstruction and Application of Information Service System Driven by Humanistic Artificial Intelligence"（22&ZD324）supported by National Social Science Foundation of China.

Abstract

Abstract: [Purpose/significance] Generative artificial intelligence will cause a certain degree of AI information pollution in online communities. The various AIGC identification methods studied in this paper are of great significance to prevent the negative impact of rapidly evolving generated artificial intelligence. [Design/Methodology] This paper first constructed a HAC data set in multiple online community platforms with 54 major categories of topics of Sina Weibo, which contained 100,873 pieces of information written by humans and generated artificial intelligence respectively. Then it explored whether the current 6 kinds of mainstream deep learning and 7 kinds of machine learning methods can identify whether the information in the online community was written by human beings or generated by artificial intelligence. Finally, the BEM-RCNN method was proposed to further improve the recognition of AIGC precision. [Findings/Conclusion] From the perspective of constructed data set, it is found that generated artificial intelligence has a strong "human-like expression", which can simulate human beings to post and reply on social media platforms. The experimental results show that the method proposed in this paper has an accuracy of 96.4%, which can well identify whether the content on the online community is written by humans or AI. It is superior to the 13 other mainstream methods such as BERT, ERNIE, and TextRNN in terms of precision, recall rate, F1-value, and accuracy, verifying its performance advantages. At the same time, many exploratory experiments have also proved that although the current mainstream machine learning methods are less accurate than the method in this paper, they can also be competent for some AIGC recognition tasks. [Originality/Value] Multiple methods are used in this paper to identify AIGC on social media, and prevent information pollution caused by generative artificial intelligence on social media platforms.

Keywords: Generative artificial intelligence, Artificial Intelligence Generated Content（AIGC）, Online communities, Machine learning, AI information pollution

DENG Shengli, WANG Fan, WANG Haowei. Identification Methods of Artificial Intelligence Generated Content in Online Communities[J]. Documentation, Informaiton & Knowledge, 2024, 41(2): 28-38,149.

[1]	WANG Jun, XIE Qingling, LIU Chang. User's Interaction Behavior with Generative Artificial Intelligence in Daily Life Contexts [J]. Documentation, Informaiton & Knowledge, 2025, 42(2): 60-69, 93.
[2]	WU Xinyu, WU Zhenxin. Review on the Application of Artificial Intelligence in the Field of Long-term Preservation of Digital Resources [J]. Documentation, Informaiton & Knowledge, 2025, 42(1): 146-157.
[3]	DAI Wenyi, XIAO Dongmei. Legal Responses of Copyright Law to Generative Artificial Intelligence Training Data: Solutions to Copyright Compliance [J]. Documentation, Informaiton & Knowledge, 2025, 42(1): 89-100.
[4]	WEI Yuanshan. Copyright Law Response to Generative Artificial Intelligence Training Data: Is It Necessary to Set Fair Use Rules? [J]. Documentation, Informaiton & Knowledge, 2025, 42(1): 78-88.
[5]	ZHANG Kui, WANG Xiuwei. Media Presentation and Risk Management of Generative AI Within Traditional Culture Communication [J]. Documentation, Informaiton & Knowledge, 2024, 41(4): 98-109.
[6]	ZHANG Chunchun, SUN Ruiying. How to Get Out of AIGC's "Collingridge Dilemma": Full Process Dynamic Data Compliance Governance [J]. Documentation, Informaiton & Knowledge, 2024, 41(2): 39-49,66.
[7]	SONG Xiaokang, ZHAO Yuxiang, SONG Shijie, ZHU Qinghua. The Features, Theoretical Framework and Research Prospects of AI-enabled Surrogate Information Searching: A Sociotechnical System Paradigm [J]. Documentation, Informaiton & Knowledge, 2023, 40(4): 111-121.
[8]	ZHU Yu, CHEN Guanze, LU Yongrong, FAN Wei. Generative Artificial Intelligence Governance Action Framework:Content Analysis Based on AIGC Incident Report Texts [J]. Documentation, Informaiton & Knowledge, 2023, 40(4): 41-51.
[9]	MO Zuying, PAN Daqing, LIU Huan, ZHAO Yueming. Analysis on AIGC False Information Problem and Root Cause from the Perspective of Information Quality [J]. Documentation, Informaiton & Knowledge, 2023, 40(4): 32-40.
[10]	. The Technical Features and Aromorphosis of Artificial Intelligence Generated Content (AIGC) [J]. Documentation, Informaiton & Knowledge, 2023, 40(1): 66-74.
[11]	. Application Scenarios and Development Opportunities of AIGC in the Digital Intelligence Integration Environment [J]. Documentation, Informaiton & Knowledge, 2023, 40(1): 75-85.
[12]	. Cryptocurrency Terrorist Financing Regulation: Transaction Pattern Analysis and Abnormal Entity Identification [J]. Documentation, Informaiton & Knowledge, 2022, 39(6): 55-66.
[13]	. Adoption and Influence of Machine Learning Algorithms in Information Science Research in China: From the Perspective of CSSCI Journal Papers [J]. Documentation, Informaiton & Knowledge, 2022, 39(5): 96-108.
[14]	. Exploring the Factors Influencing LIS Scholars Citing Other's Works: An Empirical Research Based on Algorithmic Attribution [J]. Documentation, Informaiton & Knowledge, 2022, 39(2): 83-97.
[15]	. Bibliometric Analysis of Research on Artificial Intelligence in Information Science [J]. Document,Informaiton & Knowledge, 2020, 0(1): 53-62.

Identification Methods of Artificial Intelligence Generated Content in Online Communities

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

Comments