National Biomedical Information Services
National Library Of Medicine
Investigators
Linked publications & trials
Abstract
In FY 2023, NLM's national biomedical information services expanded and enhanced access to data and information to accelerate biomedical research and public health. Provided access to biomedical literature Indexed 1+ million MEDLINE citations with medical subject headings (MeSH). Added 1.6 million citations to PubMed, growing it to 36+ million citations to biomedical literature. Expanded LitCovid to provide enhanced access to 360,000+ SARS-CoV-2 article citations in PubMed. Added nearly 1 million articles to PubMed Central (PMC) to provide free public access to 9+ million full-text journal articles, 5+ million of which are downloadable in machine-readable forms to support automated analysis through bulk download or at scale in the cloud using machine learning and artificial intelligence. Provided immediate access to 350,000+ coronavirus articles in machine-readable formats through the PMC Public Health Emergency COVID-19 Collection. Expanded the NIH Preprint Pilot to provide access to 10,000+ preprints reporting on NIH-funded research. Digitized 372,000 pages from print books and journals, making them accessible worldwide through NLMs Digital Collections and PMC. Expanded and maintained other digital collections by adding 5,100+ web resources to a collection that now provides access to 20,000+ resources documenting the COVID-19 pandemic, including content from federal, state, and local governments, aid organizations, and NGOs websites on topics ranging from prevention measures and vaccines. Filled 56,000 interlibrary loan (ILL) requests. Enabled ILL activities between approximately 1,780 medical libraries to fulfill an estimated 545,000 requests through the NLM DOCLINE ILL request and routing system. Provided trusted information for biomedical research, public health, and health care management Registered 37,000 new clinical research studies and added 4,000+ new results summaries to ClinicalTrials.gov, which provides access to 466,000+ studies and 60,000+ results summaries, including 9,000+ COVID-19-related studies. Launched a modernized ClinicalTrials.gov website after a multiyear effort to improve the user experience. Added 968 million genetic sequences, including 1.5 million SARS-CoV-2 genome sequence records, to the GenBank database of all publicly available DNA sequences. Added 41.7 million records to RefSeq, a database of reference sequences including genomic, transcript, and protein data, which generates comprehensive human genome annotation results. Added 720,000+ annotated human genetic sequence variants to ClinVar, which reports on relationships among human genome variations and human phenotypes. Provided 25+ million SRA data records (87 petabytes of genetic sequence data) on two commercial cloud providers in three formats, including a post-processed version available through the AWS Open Data Program and a second post-processed version with a greatly reduced data storage footprint, available from commercial AWS and Google cloud platforms to enable the use of machine learning and artificial intelligence. These data included the COVID-19 Genome Sequence Dataset, providing free cloud-based access to SARS-CoV-2 SRA data and controlled access to human genome sequence data derived from research studies. Supported the SARS-CoV-2 Sequencing for Public Health Emergency Response, Epidemiology, and Surveillance consortium to provide publicly accessible SARS-CoV-2 sequence data in GenBank and SRA. Improved SARS-CoV-2 data submission and validation processes to SARS-CoV-2 virus resources and updated graphic interfaces for viral lineage data retrieval. Supported processes and infrastructure to monitor and evaluate emerging SARS-CoV-2 variants as part of NIH ACTIV TRACE, and standardized, gathered, and shared variant sequencing data. Supported data processing and analysis methods to track the frequency of SARS-CoV-2 sequence mutations and variants using different genome sequencing technologies and predict their impact on vaccine and biologic early interventions. Provided results of this standardized analysis to support the findability of individual samples and large-scale analysis across the entire dataset. Continued to develop the NIH Comparative Genomics Resource to facilitate comparative genomics analyses for eukaryotic organisms through an ecosystem of NLM repositories and community-created knowledgebases offering integrated cloud-ready data, tools, and interfaces compatible with community-provided organism resources. Conducted outreach to support a strategy of stakeholder feedback and community engagement. Submitted manuscripts and released tools to improve the quality of genomic data submitted to GenBank for comparative analyses, to assist users to find and download genomic data, and to visualize and identify sequences. Processed genome sequence data for 416,000+ samples via the Pathogen Detection Pipeline to identify sources of human illnesses such as Salmonella, E. coli, and Listeria. Provided real-time foodborne pathogen surveillance used by the U.S. FDA, resulting in 1,000+ actions to protect consumers from foodborne illness. Provided access to antimicrobial resistance information for nearly 1.2 million pathogens via AMRFinderPlus. Expanded DailyMed drug labeling information for 145,000+ drugs and added information for 460,000+ medical devices in the AccessGUDID database of more than 4 million medical devices. Provided trusted health information to an estimated 500 million visitors through MedlinePlus and nearly 200 million patients via electronic health record (EHR) systems via MedlinePlus Connect. Transitioned MedlinePlus Connect to the cloud for faster, more reliable access to critical health information. Provided information to evaluate online health information. Supported the development and use of health data standards Coordinated clinical data standards for HHS and provided resources to facilitate interoperability of clinical health data and confirmation of federal EHR interoperability requirements. Updated health data standards resources by expanding the SNOMED International Edition and the U.S. Extension with more than 7,000 and 300 concepts respectively. Added 600+ codes to LOINC and expanded the RxNorm drug terminology by 200+ concepts. Added 1,700+ federally recognized American Indian and Alaskan Native tribal names and ethnic and ethnolinguistic minority terms designated as "Population Groups" to MeSH. Enhanced the Value Set Authority Center with new features to support maintenance, comparison, tracking of changes, and usage statistics of value sets. Enhanced the NIH Common Data Element (CDE) Repository to encourage adoption of CDEs in NIH-funded biomedical research. Added 12 CDEs from the NHLBI Collaborating Network of Networks for Evaluating COVID-19 and Therapeutic Strategies program to the NIH-Endorsed CDEs and curated 4 COVID and 5 non-COVID-related CDE submissions in preparation for NIH endorsement, including submission of 30 CDEs from the NIH Science collaborative for Health disparities and Artificial intelligence bias Reduction. Enhanced public engagement and creation of a diverse workforce Supported minority-serving institution libraries knowledge and use of the All of Us Researcher Workbench. Provided training on user conduct and access to All of Us data and data engagement techniques via the All of Us data training and engagement for academic libraries program. Improved public awareness of NLM resources by hosting 7 Michael E. DeBakey Fellows in the History of Medicine to support research using NLM historical collections, an Ada Lovelace Computational Health Lecture highlighting computational innovation efforts to biomedical research, 9 History Talks promoting awareness and use of NLMs historical collections for research, and 38 online exhibitions.
View original record on NIH RePORTER →