Documentation, Information & Knowledge ›› 2024, Vol. 41 ›› Issue (1): 80-91. doi: 10.13366/j.dik.2024.01.080

Inheritance and Application of Bibliographic Mechanism in the Structuring Process of Unstructured Data

PENG Xianzhe, ZHENG Jianming, LI Jiaxin, SHI Jin   

  1. School of Information Management, Nanjing University, Nanjing, 210023
  • Online: 2024-01-10 Published: 2024-03-04
  • Contact: Correspondence should be addressed to SHI Jin, Email: shijin@nju.edu.cn, ORCID: 0000-0002-1621-6944
  • Supported by:
    This is an outcome of the project "Research on Scientific Intelligence Situation Awareness for National Security" (21BTQ012), supported by the National Social Science Foundation of China.

Abstract: [Purpose/Significance] With the growing emergence of multi-type data in the era of information resources, analyzing the bibliographic ideas embedded in the process of data structuring helps in the management and application of unstructured data. [Design/Methodology] This study analyzes the essential process of data structuring, reveals the bibliographic mechanism and the indexing and classification thought contained in this process, and explains the feasibility of using bibliographic ideas to guide data structuring. Drawing on the literature description, indication, classification, and organization used in cataloging work, the study identifies the characteristics of unstructured data and guides the main structuring processes to completion, so as to realize "Distinguishing to Show the Academy, Researching to Define the Origins". [Findings/Conclusion] Data structuring largely inherits bibliographic ideas, chiefly classification and indexing, and reflects the continuation and development of bibliography as a practical science in the current environment. [Originality/Value] The above-mentioned approach presents a standard routine for data structuring and further strengthens the reusability of this process.

Key words: Bibliography, Unstructured data, Classification for data, Organization for data