RBP结合位点预测的深度学习方法进展
Development of deep learning methods for RBP binding sites prediction
-
摘要: 预测RNA结合蛋白(RBP)的结合位点对于理解RNA结合蛋白如何在基因调控中发挥作用起至关重要的作用。近年来, 随着高通量实验数据的大量积累和深度学习的快速发展, 深度学习方法在RBP结合位点预测领域上的应用越来越广泛。通过深度学习模型可以在海量的生物数据中挖掘隐藏的模式, 与传统实验方法相比, 具有低消耗、高速度、高鲁棒性的优势。研究实验证明, 深度学习方法已经取得了显著的性能, 并且在逐步完善。本文总结了常用的RNA-蛋白质结合位点数据库, 介绍了RNA序列的编码方法及经典的深度学习模型, 主要回顾了近年来深度学习在RBP结合位点预测领域上的成功应用, 然后进一步总结了RBP绑定模体挖掘方法。最后讨论了目前深度学习方法在RBP结合位点预测的应用上的局限性与潜力以及潜在的改进方向。Abstract: Predicting the binding sites of RNA-binding protein (RBP) plays a crucial role in understanding how RBP participates in gene regulation. In recent years, with the accumulation of a large amount of high-throughput experimental data and the rapid development of deep learning, deep learning methods have been more and more widely used in the field of RBP binding sites prediction. Deep learning models can detect hidden patterns in massive biological data, and has the advantages of high speed and high robustness compared with traditional experimental methods. The experiments of various studies have proved that deep learning methods have achieved remarkable performance and they are gradually improving. In this review, commonly used RNA-protein binding site databases are summarized, and classic deep learning models including RNA sequence coding methods are introduced, and the successful applications of deep learning in the field of RBP binding site prediction in recent years are mainly reviewed. Then the RBP binding motif mining methods are further summarized. Finally, the limitations and potentials of the current deep learning methods in the application of RBP binding site prediction, as well as potential improvements are discussed.