Open Access Open Access  Restricted Access Subscription Access

Topic modelling-based analysis of COVID-19 vaccine articles published in the preprint server MedRxiv


Affiliations
1 CSIR-Unit for Research and Development of Information Products, Pune, Maharashtra, India., India
2 Department of Pharmacy Management, Manipal College of Pharmaceutical Science, MAHE, Manipal, Karnataka., India
 

Two thousand one hundred and ninety-eight research publications on COVID-19 vaccines in MedRxiv preprint repository during January 01, 2020 and December 31, 2021 were analyzed for topic modelling with unsupervised inference method. Latent Dirichlet Allocation (LDA) method was used to investigate the thematic structure of the preprints. It was observed that the published articles were related to either clinical trials or patient responses to vaccine or modelling for various applications such as infection transmission, vaccine allocation, vaccine hesitancy etc.

Keywords

COVID-19, Vaccine, Preprints, LDA, Topic modelling.
User
Notifications
Font Size

  • Canouï, E., and Launay O. , "Histoire et principes de la vaccination." Revue des maladies respiratoires36(1) (2019): 74-81. DOI: 10.1016/j.rmr.2018.02.015
  • Chung, K. J., "Preprints: What is their role in medical journals?." Archives of Plastic Surgery47(0)2 (2020): 115-117. DOI: 10.5999/aps.2020.00262
  • Fry, N K., Helina M., and Tasha M. C., "In praise of preprints." Access Microbiology1(2) (2019). DOI: 10.1099/acmi.0.000013
  • Älgå, A., Oskar E., and Nordberg M., "The development of preprints during the COVID‐19 pandemic." Journal of Internal Medicine290(2) (2021): 480. DOI: 10.1111/ joim.13240
  • Bloom, T. "Shepherding preprints through a pandemic." BMJ: British Medical Journal371 (2020): m4703. DOI: 10.1136/bmj.m4703
  • van Schalkwyk, M. C., Hird, T. R., Maani, N., Petticrew, M., and Gilmore, A. B. (2020). "The perils of preprints." BMJ: British Medical Journal (Online)370 (2020). DOI: 10.1002/alr.22732
  • Hopkins, Claire. "Preprints—expediting access or compromising quality?." International Forum of Allergy & Rhinology. 11(5). 2021.
  • Blei, D. M., Andrew Y. N., and Jordan M. I. "Latent dirichlet allocation." Journal of machine Learning research3.Jan (2003): 993-1022.
  • Maier, D., Waldherr, A., Miltner, P., Wiedemann, G., Niekler, A., Keinert, A., Pfetsch, B., Heyer, G., Reber, U., Häussler, T. and Schmid-Petri, H., "Applying LDA topic modelling in communication research: Toward a valid and reliable methodology." Communication Methods and Measures12(2-3) (2018): 93-118. DOI: 10.1080/ 19312458.2018.1430754
  • Elgesem, D., Steskal, L. and Diakopoulos, N., "Structure and content of the discourse on climate change in the blogosphere: The big picture." Environmental Communication9(2) (2015): 169-188. DOI: 10.1080/17524032.2014.983536
  • Melton, C. A., Olusanya, O. A., Ammar, N., & Shaban-Nejad, A. "Public sentiment analysis and topic modelling regarding COVID-19 vaccines on the Reddit social media platform: A call to action for strengthening vaccine confidence." Journal of Infection and Public Health14(10) (2021): 1505-1512. DOI: 10.1016/j.jiph.2021.08.010
  • Dong, M., Cao, X., Liang, M., Li, L., Liu, G., & Liang, H. "Understand research hotspots surrounding COVID-19 and other coronavirus infections using topic modelling." MedRxiv (2020): 2020-03. DOI: 10.1101/2020.03.26.20044164
  • Johansson, R., & Engström Heino, O. Topic propagation over time in internet security conferences: Topic modelling as a tool to investigate trends for future research. Bachelors Dissertation. Linköping University, Sweden, 2021.
  • McCallum, A. K., (2002), "MALLET: A Machine Learning for Language Toolkit." Available at https://mimno.github.io/Mallet/about (Accessed on 6 March 2023)
  • He J, Larson M, and De Rijke M."Using coherence-based measures to predict query difficulty." Advances in Information Retrieval: 30th European Conference on IR Research, ECIR 2008, Glasgow, UK, March 30-April 3, 2008. Proceedings 30. Springer Berlin Heidelberg, 2008.
  • Dong, M., Cao, X., Liang, M., Li, L., Liu, G. and Liang, H., "Understand research hotspots surrounding COVID-19 and other coronavirus infections using topic modelling." MedRxiv (2020): 2020-03. DOI: 10.1101/2020.03.26.20044164
  • Stevens, K., Kegelmeyer, P., Andrzejewski, D. and Buttler, D., "Exploring topic coherence over many models and many topics." Proceedings of the 2012 joint conference on empirical methods in natural language processing and computational natural language learning. 2012.
  • Abhishek, T., Rawat, D., Gupta, M. and Varma, V., "Transformer models for text coherence assessment." arXiv preprint arXiv:2109.02176(2021). DOI: 10.48550/ arXiv.2109.02176
  • Fraser, N., Brierley, L., Dey, G., Polka, J.K., Pálfy, M., Nanni, F. and Coates, J.A., "The evolving role of preprints in the dissemination of COVID-19 research and their impact on the science communication landscape." PLoS Biology19(4) (2021): e3000959. DOI: 10.1371/journal.pbio.3000959

Abstract Views: 70

PDF Views: 90




  • Topic modelling-based analysis of COVID-19 vaccine articles published in the preprint server MedRxiv

Abstract Views: 70  |  PDF Views: 90

Authors

Nishad Deshpande
CSIR-Unit for Research and Development of Information Products, Pune, Maharashtra, India., India
Virendra Ligade
Department of Pharmacy Management, Manipal College of Pharmaceutical Science, MAHE, Manipal, Karnataka., India
Shabib-Ahmed Shaikh
CSIR-Unit for Research and Development of Information Products, Pune, Maharashtra, India., India
Alok Khode
CSIR-Unit for Research and Development of Information Products, Pune, Maharashtra, India., India

Abstract


Two thousand one hundred and ninety-eight research publications on COVID-19 vaccines in MedRxiv preprint repository during January 01, 2020 and December 31, 2021 were analyzed for topic modelling with unsupervised inference method. Latent Dirichlet Allocation (LDA) method was used to investigate the thematic structure of the preprints. It was observed that the published articles were related to either clinical trials or patient responses to vaccine or modelling for various applications such as infection transmission, vaccine allocation, vaccine hesitancy etc.

Keywords


COVID-19, Vaccine, Preprints, LDA, Topic modelling.

References