
Literature mining for non-coding base sequence
Online published: 2013-10-31
安建福 , 孟丽莉 . 非编码碱基序列文献的挖掘[J]. 上海交通大学学报(医学版), 2013 , 33(10) : 1343 . DOI: 10.3969/j.issn.1674-8115.2013.10.006
Objective To improve the recall rate and precision rate of non-coding base sequence literature retrieval with neural network algorithm. Methods The related literatures were obtained from PubMed as examples. After the sample literatures were dealt, the terms were selected with term frequency (TF) and inverse document frequency (IDF) methods, then the retrieval model based on back-propagation (BP) neural network algorithm was built. Results When 100 terms were selected, the precision rate, recall rate, area under the receiver operating characteristic curve (ROCAUC), specificity, sensitivity and accuracy rate were 91.49%, 71.23%, 0.823, 93.37%, 71.23% and 82.30% respectively. Conclusion Compared with common methods such as key words and MeSH retrieval, the retrieval model with neural network algorithm can effectively retrieve the literatures related tbo a particular topic.
/
| 〈 |
|
〉 |