Hindsight2020: Characterizing Uncertainty in the COVID-19 Scientific Literature

Kinga Dobolyi; George P. Sieniawski; David Dobolyi; Joseph Goldfrank; Zigfried Hampel-Arias

doi:10.1017/dmp.2023.82

Hindsight2020: Characterizing Uncertainty in the COVID-19 Scientific Literature

Published online by Cambridge University Press: 25 July 2023

Kinga Dobolyi ,

George P. Sieniawski ,

David Dobolyi ,

Joseph Goldfrank and

Zigfried Hampel-Arias

Show author details

Kinga Dobolyi*: Affiliation:
George Washington University, Department of Computer Science, Washington, DC, USA
George P. Sieniawski: Affiliation:
Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
David Dobolyi: Affiliation:
University of Notre Dame, Indiana, USA
Joseph Goldfrank: Affiliation:
George Washington University, Department of Computer Science, Washington, DC, USA
Zigfried Hampel-Arias: Affiliation:
Los Alamos National Laboratory, Los Alamos, New Mexico, USA
*: Corresponding author: Kinga Dobolyi; Email: kinga@gwu.edu

Article contents

Abstract
References

Get access

Rights & Permissions

Abstract

Following emerging, re-emerging, and endemic pathogen outbreaks, the rush to publish and the risk of data misrepresentation, misinterpretation, and even misinformation puts an even greater onus on methodological rigor, which includes revisiting initial assumptions as new evidence becomes available. This study sought to understand how and when early evidence emerges and evolves when addressing different types of recurring pathogen-related questions. By applying claim-matching by means of deep learning Natural Language Processing (NLP) of coronavirus disease 2019 (COVID-19) scientific literature against a set of expert-curated evidence, patterns in timing across different COVID-19 questions-and-answers were identified, to build a framework for characterizing uncertainty in emerging infectious disease (EID) research over time. COVID-19 was chosen as a use case for this framework given the large and accessible datasets curated for scientists during the beginning of the pandemic. Timing patterns in reliably answering broad COVID-19 questions often do not align with general publication patterns, but early expert-curated evidence was generally stable. Because instability in answers often occurred within the first 2 to 6 mo for specific COVID-19 topics, public health officials could apply more conservative policies at the start of future pandemics, to be revised as evidence stabilizes.

Keywords

SARS-CoV-2 uncertainty natural language processing public health pandemics

Information

Type: Concepts in Disaster Medicine
Information: Disaster Medicine and Public Health Preparedness , Volume 17 , 2023 , e437

DOI: https://doi.org/10.1017/dmp.2023.82 [Opens in a new window]
Copyright: © In-Q-Tel, Inc. and the Author(s), 2023. Published by Cambridge University Press on behalf of the Society for Disaster Medicine and Public Health

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

References

SeyedAlinaghi, S, Oliaei, S, Kianzad, S, et al. Reinfection risk of novel coronavirus (COVID-19): a systematic review of current evidence. World J Virol. 2020;9(5):79-90. doi: 10.5501/wjv.v9.i5.79.CrossRef Google Scholar

Savvides, C, Siegel, R. Asymptomatic and presymptomatic transmission of SARS-CoV-2: a systematic review. 2020. doi: 10.1101/2020.06.11.20129072 CrossRef Google Scholar

Udow-Phillips, M, Lantz, PM. Trust in public health is essential amid the COVID-19 pandemic. J Hosp Med. 2020;15(7):431-433. doi: 10.12788/jhm.3474 CrossRef Google Scholar PubMed

Berger, L, Berger, N, Bosetti, V, et al. Rational policymaking during a pandemic. Proc Natl Acad Sci USA. 2021;118(4):e2012704118. doi: 10.1073/pnas.2012704118.CrossRef Google Scholar PubMed

Soares-Weiser, K, Lasserson, T, Juhl Jorgensen, K, et al. Policy makers must act on incomplete evidence in responding to COVID-19. Cochrane Database Syst Rev. 2020;11: ED000149. doi: 10.1002/14651858.ED000149 CrossRef Google Scholar

US Department of Homeland Security. Master question list for COVID-19 (caused by SARS-CoV-2). Accessed December 21, 2020. https://www.dhs.gov/publication/st-master-question-list-COVID-19, 2022 Google Scholar

Schünemann, HJ, Santesso, N, Vist, GE, et al. Using GRADE in situations of emergencies and urgencies: certainty in evidence and recommendations matters during the COVID-19 pandemic, now more than ever and no matter what. J Clin Epidemiol. 2020;127:202-207. doi: 10.1016/j.jclinepi.2020.05.030 CrossRef Google Scholar

Jalali, R, Hosseinian-Far, A, Mohammadi, M. Contradictions in the promotion of publishing academic and scientific journal articles, and the inability to cope with the new coronavirus (COVID-19). Antimicrob Resist Infect Control. 2021;10(1):10. doi: 10.1186/s13756-021-00884-0 CrossRef Google Scholar PubMed

Odone, A, Galea, S, Stuckler, D, et al. The first 10 000 COVID-19 papers in perspective: are we publishing what we should be publishing? Eur J Public Health. 2020;30(5):849-850. doi: 10.1093/eurpub/ckaa170 CrossRef Google Scholar PubMed

Wang, LL, Lo, K, Chandrasekhar, Y, et al. CORD-19: The COVID-19 open research dataset. In: Proceedings of the 1st Workshop on NLP for COVID-19 at ACL 2020. 2020; arXiv:2004.10706v4. Association for Computational Linguistics.Google Scholar

Älgå, A, Eriksson, O, Nordberg, M. The development of preprints during the COVID-19 pandemic. J Intern Med. 2021;290(2):480-483. doi: 10.1111/joim.13240 CrossRef Google Scholar PubMed

Elgendy, IY, Nimri, N, Barakat, AF, et al. A systematic bias assessment of top-cited full-length original clinical investigations related to COVID-19. Eur J Intern Med. 2021;86:104-106. doi: 10.1016/j.ejim.2021.01.018 CrossRef Google Scholar PubMed

Raynaud, M, Zhang, H, Louis, K, et al. COVID-19-related medical research: a meta-research and critical appraisal. BMC Med Res Methodol. 2021;21(1):1. doi: 10.1186/s12874-020-01190-w CrossRef Google Scholar PubMed

Whitmore, KA, Laupland, KB, Vincent, CM, et al. Changes in medical scientific publication associated with the COVID-19 pandemic. Med J Australia. 2020;213(11):496-499. doi: 10.5694/mja2.50855 CrossRef Google Scholar PubMed

Palayew, A, Norgaard, O, Safreed-Harmon, K, et al. Pandemic publishing poses a new COVID-19 challenge. Nat Hum Behav. 2020;4(7):666-669. doi: 10.1038/s41562-020-0911-0 CrossRef Google Scholar PubMed

Kang, M, Gurbani, SS, Kempker, JA. The published scientific literature on COVID-19: an analysis of Pubmed abstracts. J Med Sys. 2020;45(1):3. doi: 10.1007/s10916-020-01678-4 CrossRef Google Scholar PubMed

Fiske, ST, Dupree, C. Gaining trust as well as respect in communicating to motivated audiences about science topics. Proc Natl Acad Sci USA. 2014;111(Suppl 4):13593-13597. doi: 10.1073/pnas.1317505111 CrossRef Google Scholar PubMed

Pearce, W. Trouble in the trough: how uncertainties were downplayed in the UK’s science advice on COVID-19. Humanit Soc Sci Commun. 2020. doi: 10.1057/s41599-020-00612-w CrossRef Google Scholar

Mohammed, M, Sha’aban, A, Jatau, AI, et al. Assessment of COVID-19 information overload among the general public. J Racial Ethn Health Disparities. 2021;9(1):184-192. doi: 10.1007/s40615-020-00942-0 CrossRef Google Scholar PubMed

Montani, I, Honnibal, M, Van Landeghem, S, et al. spaCy: industrial-strength natural language processing in Python. 2020. Accessed May 29, 2023. https://zenodo.org/record/4021943 Google Scholar

Reimers, N, Gurevych, I. Sentence-BERT: Sentence embeddings using Siamese BERT-Networks. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics. 2019. doi: 10.48550/arXiv.1908.10084 CrossRef Google Scholar

BioASQ.org. BioASQ releases continuous space word vectors obtained by Applying Word2Vec to PubMed Abstracts. Accessed December 3, 2021. http://bioasq.org/news/bioasq-releases-continuous-space-word-vectors-obtained-applying-word2vec-pubmed-abstracts Google Scholar

Meyers, B. meyersbs/uncertainty. Installation & usage. Accessed May 29, 2023. https://github.com/meyersbs/uncertainty/wiki/installation-&-usage.Google Scholar

Vincze, V. Uncertainty Detection in Natural Language Texts. University of Szeged. 2014. doi: 10.14232/phd.2291 Google Scholar

Bero, L, Lawrence, R, Leslie, L, et al. Cross-sectional study of preprints and final journal publications from COVID-19 studies: discrepancies in results reporting and spin in interpretation. BMJ Open. 2021;11(7):e051821.CrossRef Google Scholar PubMed

Article contents

Hindsight2020: Characterizing Uncertainty in the COVID-19 Scientific Literature

Abstract

Keywords

Information

Access options

Article purchase

Temporarily unavailable

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests