图书情报知识 ›› 2014, Vol. 0 ›› Issue (4): 94-105.doi: 10.13366/j.dik.2014.04.094

• 情报、信息与共享 • 上一篇    下一篇

大型文献数字化项目的信息资源整合研究

宋琳琳,李海涛   

Research on Information Resource Integration in Mass Digitization Project

摘要:

调查发现知名的大型文献数字化项目信息资源整合开展范围有限,实现方式单一。为了更好地满足用户的信息需求,文章提出根据整合范围采取差异性整合方式与技术。以原始文献的元数据为基础,实现机构内部的资源整合;以项目协作、检索协议和元数据互操作实现机构间的资源整合;以数据复用、页面解析和关联数据实现其与外部资源的整合。研究发现,要真正深化信息资源聚合的层次,必须从整合对象即数字资源的加工和组织着手,细化其描述粒度,揭示更为丰富的属性关系,以此为基础构建项目资源的数据模型,才能实现数字资源的深度集合。

关键词: 信息资源整合, 元数据互操作, 关联数据, 大型文献数字化项目, 深度聚合, 数据复用

Abstract:

The survey found that well-known mass digitization projects carried limited information resource integrations by a single way. In order to meet user’s information needs better, the paper proposed to take different integrated approach and technology according to the scope. Using original document’s metadata realized the integration within the organization; using retrieval protocols and metadata interoperability achieved the integration between agencies; and using data reuse, page parsing and linked data achieved the integration with external resources.The study found that to make deep aggregation of information resources, mass digitization projects should process from digital resources objects, refine its description granularity, and reveal richer property relationships as a basis to building the data model project for digital resources.

Key words: Information resource integration, Metadata interoperability, Linked data, Mass digitization project