NASA's Science Mission Directorate (SMD) manages over 150 petabytes (PB) of scientific data across 54,532 datasets distributed among 10 repositories, serving more than 53 million unique users annually.

SMD has five divisions: Astrophysics (APD), Biological and Physical Sciences (BPS), Earth Science (ESD), Heliophysics (HPD), and Planetary Science (PSD). Each collected comprehensive metrics on its data management practices. Annual publication of these metrics increases transparency around the stewardship of federally funded science data and measures progress toward strategic goals. The full metrics report for Fiscal Year 2024 is available to read online.

Read FY24 Science Data Metrics Report (PDF, 665 KB): https://assets.science.nasa.gov/content/dam/science/cds/about-us/ocsdo/reports/NASA%20SMD%20FY2024%20Science%20Data%20Repository%20Metrics.pdf

See key metrics from this report below.

Data Assets of the NASA SMD Repositories as of 2024

| Metric | APD | BPS | ESD | HPD | PSD | Totals |
|---|---|---|---|---|---|---|
| Unique User IPs | 8,800,000 | 50,000 | 28,400,000 | 8,000,000 | 7,990,000 | 53,240,000 |
| Data Repositories | 3 | 2 | 1 | 2 | 2 | 10 |
| Datasets | 46 | 1,086 | 13,500 | 11,500 | 27,460 | 54,532 |
| Dataset Volume | 15,700 TB | 240 TB | 127,665 TB | 3,151 TB | 3,860 TB | 150,616 TB |
| Growth | 2,100 TB/yr | 92 TB/yr | 28,000 TB/yr | 470 TB/yr | 785 TB/yr | 31,447 TB/yr |
| 2030 Projected Volume | 70,000 TB | 750 TB | 400,000 TB | 35,000 TB | 24,000 TB | 529,750 TB |

Accompanying Information for Data Assets Table

Unique User IPs (Number): Unique IP addresses accessing a repository or NASA website to download or perform computation each year. For APD, this is a rough estimate, because not all data distribution points reported this information and because the counts are contaminated by bots, crawlers, etc. For ESD, there were 8.4M unique IPs for Earthdata.gov (7.4M on-premises, about 1M cloud) and 20M unique IPs for Global Imagery Browse Services (GIBS)/Worldview.

Data Repositories (Number): Number of repositories for each division. For APD, there are only three data repositories with mission data.
Within ESD there are 11 distributed active archive centers and 11 science investigator-led processing systems. Within PDS, there are 6 science nodes and the Navigation and Ancillary Information Facility (NAIF) that host data.

Datasets (Number): Number of datasets. For APD, survey and observatory missions are considered one dataset.

Dataset Volume (TB): Volume of data holdings, excluding any duplicative data, in terabytes (TB).

Growth (TB/year): Total volume of new data expected, in TB per year, as of 2024. This rate is expected to increase significantly with future launches of high-volume missions.

2030 Projected Volume (TB): Expected 2030 volume of data holdings, in TB, excluding any duplicative data.

On-Premises Metrics as of 2024

The following tables represent a current state assessment of NASA SMD repositories that are stored on on-premises servers.

On-Premises Storage Metrics

| Metric | APD | BPS | ESD | HPD | PSD | Totals |
|---|---|---|---|---|---|---|
| NASA Mission Dataset Volume | 7,700 TB | 0 TB | 50,000 TB | 3,050 TB | 860 TB | 61,610 TB |
| NASA Investigator Dataset Volume | 900 TB | 0 TB | 6,000 TB | 35 TB | 930 TB | 7,865 TB |
| Non-NASA Dataset Volume | 6,600 TB | 0 TB | 11,121 TB | 0 TB | 0 TB | 17,721 TB |
| NCCS Dataset Volume (copied) | 0 TB | 0 TB | 0 TB | 1,800 TB | 0 TB | 1,800 TB |
| Total Dataset Volume | 15,200 TB | 0 TB | 67,121 TB | 3,085 TB | 1,790 TB | 87,196 TB |
| Growth | 2,100 TB/yr | 0 TB/yr | 15,000 TB/yr | 370 TB/yr | 515 TB/yr | 17,985 TB/yr |
| 2030 Projected Volume | 37,000 TB | 0 TB | 5,000 TB | 35,000 TB | 6,000 TB | 83,000 TB |

On-Premises Access Metrics

| Metric | APD | BPS | ESD | HPD | PSD | Totals |
|---|---|---|---|---|---|---|
| Unique User IPs | 8,800,000 | 0 | 7,400,000 | 8,000,000 | 7,990,000 | 32,190,000 |
| Data Access Volume | 5,000 TB/yr | 0 TB/yr | 14,287,000 TB/yr | Not Available | 2 TB/yr | 14,292,002 TB/yr |
| Data Download Volume | 5,000 TB/yr | 0 TB/yr | 56,000 TB/yr | 750 TB/yr | 2,210 TB/yr | 63,960 TB/yr |

On-Premises Dataset Metrics

| Metric | APD | BPS | ESD | HPD | PSD | Totals |
|---|---|---|---|---|---|---|
| Total Datasets | 958 | 0 | 7,800 | 8,000 | 13,620 | 30,378 |
| Datasets with DOIs | 890 | 0 | 7,800 | 2,000 | 1,565 | 12,255 |
| Datasets with API Access | 958 | 0 | 7,800 | 3,000 | 640 | 12,898 |

Accompanying Information for On-Premises Metrics Tables

NASA Mission Dataset Volume: Volume of datasets produced by NASA missions or guest observer programs, excluding any duplicative data, in TB.

NASA Investigator Dataset Volume: Volume of datasets produced by NASA-funded Principal Investigators (PIs), excluding any duplicative data, in TB. For ESD, this includes model, aircraft, and field measurements. For APD, this includes contributed high-level science products (HLSPs).

Non-NASA Dataset Volume: Volume of datasets, in TB, that are not from a mission or NASA-funded PI. This includes ancillary data necessary for processing mission and PI data, partner data that NASA has agreed to hold in a repository, and other data for which NASA has traditionally acted as the community repository.

Total Dataset Volume: NASA Mission, NASA PI, and Non-NASA data, excluding any duplicative data, in TB.

NCCS Dataset Volume (copied): Data copied to the NASA Center for Climate Simulation (NCCS) for use on High-End Computing, in TB.

Growth: Total volume of new data expected, in TB per year, as of 2024. This rate is expected to increase significantly with future launches of high-volume missions.

2030 Projected Volume: Expected 2030 volume of data holdings, in TB, excluding any duplicative data.

Unique User IPs: Unique IP addresses accessing a repository to download or perform computation each year. For APD, this is a rough estimate, because not all data distribution points reported this information and because the counts are contaminated by bots, crawlers, etc.

Data Access Volume: Total volume of data accessed, in TB/yr. ESD values include Earth Science Data and Information System (ESDIS) archive and visualization tools. HPD does not currently track Data Access Volume, but the metric can be added in the future.

Data Download Volume: Total volume of data downloaded, in TB/yr.

Total Datasets: Number of datasets. For APD, there is one dataset per survey/observatory mission and HLSP.

Datasets with DOIs: Number of datasets that have an assigned Digital Object Identifier (DOI).

Datasets with API Access: Number of datasets that are accessible through an API.
Cloud Metrics as of 2024

The following tables represent a current state assessment of NASA SMD repositories that are stored in the cloud.

Cloud Storage Metrics

| Metric | APD | BPS | ESD | HPD | PSD | Totals |
|---|---|---|---|---|---|---|
| NASA Mission Dataset Volume | 3,400 TB | 6 TB | 56,500 TB | 1,200 TB | 1,970 TB | 63,076 TB |
| NASA Investigator Dataset Volume | 700 TB | 166 TB | 15,000 TB | 55 TB | 100 TB | 16,021 TB |
| Non-NASA Dataset Volume | 0 TB | 68 TB | 10,544 TB | 66 TB | 0 TB | 10,678 TB |
| Total Dataset Volume | 4,100 TB | 240 TB | 82,044 TB | 1,321 TB | 2,070 TB | 89,775 TB |
| Growth | 500 TB/yr | 92 TB/yr | 39,000 TB/yr | 350 TB/yr | 270 TB/yr | 40,212 TB/yr |
| 2030 Projected Volume | 42,000 TB | 750 TB | 395,000 TB | 2,000 TB | 18,000 TB | 458,000 TB |

Cloud Access Metrics

| Metric | APD | BPS | ESD | HPD | PSD | Totals |
|---|---|---|---|---|---|---|
| Unique User IPs | 10,500 | 50,000 | 21,000,000 | Not Available | 30,000 | 21,090,500 |
| Data Access Volume | 4,200 TB/yr | Not Available | 147,650,000 TB/yr | Not Available | 1 TB/yr | 147,654,201 TB/yr |
| Data Egress Volume | Not Available | Not Available | 648,000 TB/yr | Not Available | 1,120 TB/yr | 649,120 TB/yr |

Cloud Dataset Metrics

| Metric | APD | BPS | ESD | HPD | PSD | Totals |
|---|---|---|---|---|---|---|
| Total Datasets | 46 | 1,068 | 5,700 | 3,500 | 13,840 | 24,154 |
| Datasets with DOIs | 12 | 704 | 5,700 | 2,000 | 3,970 | 12,386 |
| Datasets with API Access | 43 | 600 | 5,700 | 3,500 | 8,350 | 18,193 |

Accompanying Information for Cloud Metrics Tables

NASA Mission Dataset Volume: Volume of datasets produced by NASA missions or guest observer programs, excluding any duplicative data, in TB.

NASA Investigator Dataset Volume: Volume of datasets produced by NASA-funded Principal Investigators (PIs), excluding any duplicative data, in TB. For ESD, this includes model, aircraft, and field measurements. For APD, this includes contributed high-level science products (HLSPs).

Non-NASA Dataset Volume: Volume of datasets, in TB, that are not from a mission or NASA-funded PI. This includes ancillary data necessary for processing mission and PI data, partner data that NASA has agreed to hold in a repository, and other data for which NASA has traditionally acted as the community repository.

Total Dataset Volume: NASA Mission, NASA PI, and Non-NASA data, excluding any duplicative data, in TB.

Growth: Total volume of new data expected, in TB per year, as of 2024. This rate is expected to increase significantly with future launches of high-volume missions.

2030 Projected Volume: Expected 2030 volume of data holdings, in TB, excluding any duplicative data.

Unique User IPs: Unique IP addresses accessing a repository to download or perform computation each year. For APD, this is a lower limit, as not all repositories reported this information. HPD does not currently track this metric.

Data Access Volume: Total volume of data accessed for collocated analysis, in TB/yr. For APD, this is a lower limit, as not all data repositories track this metric. ESD values include ESDIS archive and visualization tools. For HPD, as with on-premises compute and storage, Data Access Volume is not currently tracked.

Data Egress Volume: Total volume of data egressed from cloud storage, including egress of both NASA data and subsequent analysis results, in TB/yr. For HPD, data egress is not tracked in a manner consistent with other divisions. APD does not track this information.

Total Datasets: Number of datasets. HPD does not currently track this metric. For APD, there is one dataset per survey/observatory mission and HLSP.

Datasets with DOIs: Number of datasets that have an assigned DOI.

Datasets with API Access: Number of datasets that are accessible through an API.
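The Growth definition says the 2024 rate is expected to increase significantly. One way to see this in the numbers is to extend the 2024 cloud growth rate in a straight line and compare the result with the 2030 Projected Volume. The sketch below does that arithmetic under two assumptions that are not stated in the report: that six years separate the 2024 figures from the 2030 projection, and that compound annual growth is a reasonable way to summarize the gap. The function names are illustrative.

```python
# Cloud totals copied from the Cloud Storage Metrics table (Totals column).
CLOUD_VOLUME_TB = 89_775          # Total Dataset Volume, 2024
CLOUD_GROWTH_TB_PER_YR = 40_212   # Growth, as of 2024
CLOUD_PROJECTED_2030_TB = 458_000  # 2030 Projected Volume

# Assumption (not from the report): 2024 to 2030 is treated as six years.
YEARS = 6


def linear_projection(volume_tb, growth_tb_per_yr, years):
    """Volume after `years` if the 2024 growth rate stayed constant."""
    return volume_tb + growth_tb_per_yr * years


def implied_annual_growth(volume_tb, projected_tb, years):
    """Compound annual growth rate that would reach the projection."""
    return (projected_tb / volume_tb) ** (1 / years) - 1


if __name__ == "__main__":
    flat = linear_projection(CLOUD_VOLUME_TB, CLOUD_GROWTH_TB_PER_YR, YEARS)
    rate = implied_annual_growth(
        CLOUD_VOLUME_TB, CLOUD_PROJECTED_2030_TB, YEARS
    )
    print(f"Constant-rate volume after {YEARS} years: {flat:,} TB")
    print(f"Reported 2030 projection: {CLOUD_PROJECTED_2030_TB:,} TB")
    print(f"Implied compound annual growth: {rate:.1%}")
```

Under these assumptions, the constant-rate figure falls short of the reported projection, which is consistent with the statement that growth will accelerate as high-volume missions launch.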
Access the Full Report

The full report, with more information about NASA SMD data holdings, is available to read online as a PDF.

Read Full FY24 Science Data Metrics Report (PDF, 665 KB): https://assets.science.nasa.gov/content/dam/science/cds/about-us/ocsdo/reports/NASA%20SMD%20FY2024%20Science%20Data%20Repository%20Metrics.pdf