What is it about?
Despite the growing recognition of microRNAs (miRNAs) as critical biomarkers in cancer, current approaches to their analysis remain fragmented, disjointed, and poorly integrated with emerging computational advances. This lack of cohesion limits progress toward reproducible and clinically actionable biomarker discovery. To address this unmet need, we present a review that unifies the latest findings and tools in bioinformatics, machine learning (ML), and large language models (LLMs) for miRNA analysis in oncology, thereby bridging a significant methodological gap in the field. We begin by critically synthesizing, benchmarking, and evaluating algorithms, including miRDeep2 and DIANA-miRPath, within a functional pipeline that spans next-generation sequencing (NGS) data processing to multi-omics integration. Building on this foundation, we review ML-augmented layers incorporating supervised and deep learning (DL) algorithms, specifically support vector machines (SVMs), convolutional neural networks (CNNs), and recurrent neural networks (RNNs), to enable robust miRNA signature identification, classification, and target prediction. Furthermore, we explore the integration of generative models and LLMs to support hypothesis generation and enhance reproducibility in biomarker discovery workflows. This comprehensive framework enhanced with artificial intelligence (AI) is contextualized through cancer-specific datasets, with particular emphasis on translational applications for early detection, prognosis, and therapy selection. By systematically organizing fragmented methodologies into a scalable and reproducible pipeline, our work provides a strategic roadmap to accelerate the development of miRNA-based precision cancer.
Featured Image
Photo by Sangharsh Lohakare on Unsplash
Why is it important?
Despite the growing recognition of microRNAs (miRNAs) as critical biomarkers in cancer, current approaches to their analysis remain fragmented, disjointed, and poorly integrated with emerging computational advances. This lack of cohesion limits progress toward reproducible and clinically actionable biomarker discovery. To address this unmet need, we present a review that unifies the latest findings and tools in bioinformatics, machine learning (ML), and large language models (LLMs) for miRNA analysis in oncology, thereby bridging a significant methodological gap in the field. We begin by critically synthesizing, benchmarking, and evaluating algorithms, including miRDeep2 and DIANA-miRPath, within a functional pipeline that spans next-generation sequencing (NGS) data processing to multi-omics integration. Building on this foundation, we review ML-augmented layers incorporating supervised and deep learning (DL) algorithms, specifically support vector machines (SVMs), convolutional neural networks (CNNs), and recurrent neural networks (RNNs), to enable robust miRNA signature identification, classification, and target prediction. Furthermore, we explore the integration of generative models and LLMs to support hypothesis generation and enhance reproducibility in biomarker discovery workflows. This comprehensive framework enhanced with artificial intelligence (AI) is contextualized through cancer-specific datasets, with particular emphasis on translational applications for early detection, prognosis, and therapy selection. By systematically organizing fragmented methodologies into a scalable and reproducible pipeline, our work provides a strategic roadmap to accelerate the development of miRNA-based precision cancer.
Perspectives
Ultimately, the future of miRNA bioinformatics lies in building robust, interpretable, and regulation-ready pipelines. Strategies such as the creation of global consortia to harmonize data standards, incorporation of explainable AI to foster clinical trust, and proactive collaboration with regulatory agencies can accelerate translation into practice. By addressing these barriers, miRNA-based diagnostics and therapeutics can progress from research settings into reliable tools for early detection, prognosis, and personalized oncology.
Piotr Remiszewski
Maria Sklodowska- Curie National Research Institute of Oncology
Read the Original
This page is a summary of: MicroRNA bioinformatics in precision oncology: an integrated pipeline from NGS to AI-based target discovery, Journal of Applied Genetics, October 2025, Springer Science + Business Media,
DOI: 10.1007/s13353-025-01024-9.
You can read the full text:
Resources
doi
Dey, M., Remiszewski, P., Piątkowski, J. et al. MicroRNA bioinformatics in precision oncology: an integrated pipeline from NGS to AI-based target discovery. J Appl Genetics (2025). https://doi.org/10.1007/s13353-025-01024-9
link
Dey, M., Remiszewski, P., Piątkowski, J. et al. MicroRNA bioinformatics in precision oncology: an integrated pipeline from NGS to AI-based target discovery. J Appl Genetics (2025). https://doi.org/10.1007/s13353-025-01024-9
pubmed
Dey M, Remiszewski P, Piątkowski J, Golik P, Teterycz P, Czarnecka AM. MicroRNA bioinformatics in precision oncology: an integrated pipeline from NGS to AI-based target discovery. J Appl Genet. 2025 Oct 31. doi: 10.1007/s13353-025-01024-9. Epub ahead of print. PMID: 41168533.
Contributors
The following have contributed to this page







