图书情报知识 ›› 2015, Vol. 0 ›› Issue (5): 89-98.doi: 10.13366/j.dik.2015.05.089

• 知识、学习与管理 • 上一篇    下一篇

专利地图和知识图谱视角的大数据比较研究

刘桂锋,卢章平,宋新平   

  • 出版日期:2015-09-10 发布日期:2015-09-10

Comparative Study of Big Data Based on Patent Map and Knowledge Mapping

  • Online:2015-09-10 Published:2015-09-10

摘要:

最近大数据技术受到国内外学界和业界的广泛关注。为全面深入了解大数据的研究成果,以Derwent Innovations Index(德温特专利数据库)和Web of Science(WOS)数据库为数据源,利用专利地图和知识图谱方法,从年份、国家、研究机构、高被引文献、关键词五个方面进行专利和论文的可视化比较。专利和论文的视角均表明,大数据技术发展呈现两个明显的阶段,目前正处于快速发展阶段。美国在大数据研究领域优势突出,我国在大数据专利方面数量领先。无论是从专利还是论文的角度,IBM公司的数量显著,并且研究主题包括大数据技术的系统、获取、存储、分析、管理、应用等方面。在揭示大数据核心技术方面,共被引论文的角度比高被引专利更具优势。ThemeScape专利地图从微观的视角深入和具体的展示大数据的技术进展,关键词共现图谱从宏观的视角全面和系统的展示大数据的研究进展。总体来说,目前大数据研究呈现4个方面的特征:研究热潮正在袭来、美国实力超群、互联网企业引领研究方阵、核心技术集中在MapReduce、Hadoop、云计算等。

关键词: 大数据, 专利地图, 知识图谱, 信息可视化, 专利分析, 数据挖掘, 云计算, 共被引

Abstract:

Big data technology arouses widespread concern among academic circle and industries in recent years. In order to deeply understand the research achievements of big data, using patents and papers as the data source, we compared visualization of patents in Derwent Innovations Index (DII) and papers in Web of Science (WOS) database from five aspects using patent map and knowledge mapping method. From the view of patents and papers of big data, it has two distinct stages, and now develops rapidly. America leads first in papers of big data, while China does in patents of big data. IBM’s research topics include system, access, storage, analysis, management, and application of big data technology from the perspective of patents and papers. Cocitation of document is in advantage of highly cited patent in revealing the core technology of big data. ThemeScape patent map shows the technology development in big data deeply from the micro perspective, while keywords knowledge mapping shows the research development in big data systematically from the macro perspective. The result shows that: big data era is coming soon, US is a leading research country, Internet enterprise is far ahead, and the core technology of big data is MapReduce, Hadoop, cloud computing, etc.

Key words: Big data, Patent map, Knowledge mapping, Information visualization, Patent analysis, Data mining, Cloud computing, Co-citation