DOCUMENTATION,INFORMATION & KNOWLEDGE ›› 2014, Vol. 0 ›› Issue (4): 63-67.doi: 10.13366/j.dik.2014.04.063

Previous Articles     Next Articles

Study on the Generation Algorithm and Implementation of the Theme Abstract in Scientific and Technical Literature Based on hLDA:Taking Papers in Power Industry as Example

  

Abstract:

With the advent of the era of information explosion, and the rapid growth of the number of science and technology literature, the requirement of obtaining effective information for science and technology literature to the science and technology workers is becoming higher. This paper proposes a scientific literature theme abstract automatic generation algorithm. We make modeling theme to the data set of scientific literature by usinghLDA model, and automatically generate the abstract for the potential themes in science and technology literature through the choice of the candidate words and scoring strategy to the sentences of integrating multiple factors. In the experiment, we put forward the evaluation method based on topic coverage. The experimental results verify the validity of the generation algorithm for the theme abstract proposing in the paper.

Keywords: Scientific and technical literature, Theme abstract, Generation algorithm, hLDA