Documentation, Information & Knowledge (图书情报知识), 2024, Vol. 41, Issue (1): 80-91. doi: 10.13366/j.dik.2024.01.080

• Books, Documents and Communication •


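PENG Xianzhe (彭贤哲), ZHENG Jianming (郑建明), LI Jiaxin (李佳新), SHI Jin (石进)

  1. School of Information Management, Nanjing University, Nanjing, 210023
  • Publication date: 2024-01-10 Release date: 2024-03-04
  • Corresponding author: SHI Jin (ORCID: 0000-0002-1621-6944), PhD, professor; research interests: intelligent bibliography, big data analysis; Email: shijin@nju.edu.cn.
  • About the authors: PENG Xianzhe (ORCID: 0000-0002-0131-8227), doctoral candidate; research interests: bibliography, big data analysis and technology; Email: pengxz_tm@163.com. ZHENG Jianming (ORCID: 0000-0002-7989-4435), PhD, professor; research interests: digital information resource management, fundamental theory of bibliography; Email: zhengjm@nju.edu.cn. LI Jiaxin (ORCID: 0000-0001-7780-3723), master's student; research interests: information organization, information science; Email: mf21140060@smail.nju.edu.cn.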

Inheritance and Application of Bibliographic Mechanism in the Structuring Process of Unstructured Data

PENG Xianzhe, ZHENG Jianming, LI Jiaxin, SHI Jin   

  1. School of Information Management, Nanjing University, Nanjing, 210023
  • Online: 2024-01-10 Published: 2024-03-04
  • Contact: Correspondence should be addressed to SHI Jin, Email: shijin@nju.edu.cn, ORCID: 0000-0002-1621-6944
  • Supported by:
    This is an outcome of the project "Research on Scientific Intelligence Situation Awareness for National Security" (21BTQ012), supported by the National Social Science Foundation of China.

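Abstract (from the Chinese): [Purpose/Significance] In the era of information resources, data types have become markedly diverse. Examining the bibliographic ideas embedded in the data structuring process helps address the difficulties of managing and using unstructured data. [Design/Methodology] The study first analyzes the essential process of data structuring and reveals the bibliographic mechanisms and the indexing and classification ideas it contains, showing that it is feasible to guide data structuring with bibliographic thought. Drawing on the traditional methods of bibliographic work (document description, compilation of bibliographies and indexes, document indexing and classification, and document organization), it then analyzes the characteristics of different types of unstructured data and uses these methods to guide the main structuring processes: association and integration, index-based indication, indexing and classification, and organization and reconstruction. The ultimate aim is to achieve, for unstructured data, "辨章学术、考镜源流" (distinguishing the branches of learning and tracing their origins and development). [Findings/Conclusion] Data structuring largely inherits bibliographic ideas centered on classification and indexing; in essence, it continues and extends bibliography, as a discipline of practical application, in the present environment. [Originality/Value] The study helps establish a paradigm workflow for data structuring and improves the reusability of the process of structuring and parsing unstructured data.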

Keywords (from the Chinese): Bibliography, Unstructured data, Data classification and indexing, Data organization

Abstract: [Purpose/Significance] With the increasing emergence of multi-type data in the era of information resources, analyzing the bibliographic ideas embedded in the process of data structuring helps with the management and application of unstructured data. [Design/Methodology] This study analyzes the essential process of data structuring, reveals the bibliographic mechanism and the indexing and classification ideas contained in this process, and explains the feasibility of using bibliographic ideas to guide data structuring. Drawing on the literature description, indication, classification, and organization methods used in catalog work, it identifies the characteristics of unstructured data and uses these methods to guide the main structuring processes, so as to realize "Distinguishing to Show the Academy, Researching to Define the Origins". [Findings/Conclusion] Data structuring largely inherits bibliographic ideas centered on classification and indexing, and reflects the continuation and development of bibliography as a practical science in the current environment. [Originality/Value] The approach helps define a standard routine for data structuring and further strengthens the reusability of this process.

Key words: Bibliography, Unstructured data, Classification for data, Organization for data