Open Data Registry Project

Amazon Web Services and NASA have entered into a Space Act Agreement to explore best practices around discovery, access, and use of high-value NASA science datasets. Making analytics-optimized data stores available to the science community will minimize the need for data wrangling and preprocessing within the community, leading to a faster time to insight and quicker innovation.

Below are the NASA data products that are currently part of the Open Data Registry Project.

NASA Prediction of Worldwide Energy Resources (POWER)

Created by NASA’s Applied Sciences Program, the Prediction of Worldwide Energy Resources (POWER) project improves the accessibility of NASA Earth Observation solar and meteorological data. The focus is on supporting community research in the areas of renewable energy development, building energy efficiency, and agroclimatology applications. The datasets update frequently in near real time.

Explore the NASA POWER project here.

Multi-Scale Ultra High Resolution (MUR) Sea Surface Temperature (SST)

The MUR SST project is a global, gap-free, gridded, 1 km set of sea surface temperature data, updated daily. It was created by merging information from a variety of satellites and observatories, and contains data from 2002 to the present day.

Explore the MUR SST dataset.

NASA Earth Exchange (NEX) Data Collection

Through the NASA Earth Exchange (NEX), users can explore and analyze large Earth science datasets, run and share modeling algorithms, collaborate on new or existing projects, and exchange workflows and results within and among other science communities. Three NEX datasets are now available to all via Amazon S3, ranging from high-resolution climate change projections in the U.S. to global data on Earth’s surface and land.

Explore the NEX Data Collection.

Solar Dynamics Observatory (SDO) Machine Learning Dataset

The SDO Machine Learning Dataset is a curated dataset from the NASA Solar Dynamics Observatory mission in a format suitable for machine-learning research. It includes data from three different instruments and is intended for heliophysicists who wish to use machine learning in their own research, as well as machine-learning researchers who wish to develop models specialized for the physical sciences.

Explore the SDO Machine Learning Dataset.

Ozone Monitoring Instrument (OMI) / Aura NO2 Tropospheric Column Density

The OMI / Aura NO2 Tropospheric Column dataset provides a worldwide measure of nitrogen dioxide density in the lowest region of Earth’s atmosphere. Nitrogen dioxide is a key indicator of upcoming ozone production and poor air quality, and its measurements are useful for a variety of atmospheric studies.

Explore OMI / Aura NO2 Tropospheric Column Density data.

NASA / USGS Controlled THEMIS Mosaics

The NASA / USGS Controlled THEMIS Mosaics are infrared image mosaics generated using Thermal Emission Imaging System (THEMIS) images from the 2001 Mars Odyssey orbiter mission. The mosaic is generated at the full resolution of the THEMIS infrared dataset, which is approximately 100 meters/pixel, and covers almost 100% of the surface of Mars.

Explore the THEMIS Mosaics.

NASA / USGS Europa Controlled Observation Mosaics

A set of 92 image mosaics were generated from images of Jupiter’s moon Europa, taken by the Solid State Imager (SSI) on NASA’s Galileo spacecraft. These images provide users with nearly the entire Galileo Europa imaging dataset at its native resolution and with improved relative image locations.

Explore the Europa Mosaics.

NASA / USGS Europa Controlled Observations

A set of 481 minimally processed images of Jupiter’s moon Europa were taken by the Solid State Imager (SSI) on NASA’s Galileo spacecraft. These images provide users with nearly the entire Galileo Europa imaging dataset at its native resolution and with improved relative image locations. They were subsequently assembled into 92 image mosaics, which are available at the entry above.

Explore the Europa Observations.

NASA Earth Exchange Global Daily Downscaled Projections (NEX-GDDP-CMIP6)

The NEX-GDDP-CMIP6 dataset provides a set of global, high resolution, bias-corrected climate change projections. They can be used to evaluate climate change impacts on processes that are sensitive to finer-scale climate gradients and the effects of local topography on climate conditions.

Explore the NEX-GDDP-CMIP6 dataset.

NASA SOHO/LASCO2 comet challenge on AWS

The Solar and Heliospheric Observatory (SOHO)/Large Angle and Spectrometric COronagraph C2 (LASCO2) data set comprises approximately 36,000 images spread across 2,950 comet observations. The human eye is the only tool currently used to reliably detect new comets in SOHO data, particularly comets that are very faint and embedded in the instrument background noise. Bright comets can be easily detected in the LASCO data by relatively simple automated algorithms, but the majority of comets observed by the instrument are extremely faint, noise-level observations.

Explore the SOHO/LASCO data set.

Biological and Physical Sciences (BPS) Benchmark Training Datasets

NASA’s Biology and Physical Sciences division has created open datasets of biological information from mice that can be used for AI and machine learning projects related to this field of study.

Biological and Physical Sciences (BPS) Microscopy Benchmark Training Dataset

The Microscopy Benchmark Training Dataset contains fluorescence microscopy images of individual nuclei from mouse fibroblast cells, irradiated with Fe particles or X-rays, with fluorescent foci indicating markers of DNA damage.

Explore the BPS Microscopy Benchmark Training Dataset.

Biological and Physical Sciences (BPS) RNA Sequencing Benchmark Training Dataset

The RNA Sequencing Benchmark Training Dataset contains RNA sequencing data from spaceflown and control mouse liver samples, sourced from NASA GeneLab and augmented with generative adversarial network.

Explore the BPS RNA Sequencing Benchmark Training Dataset.

NASA Physical Sciences Informatics (PSI)

NASA’s Physical Sciences Research Program is pleased to offer the PSI data repository for physical science experiments performed in reduced-gravity environments such as the ISS, Space Shuttle flights, and free-flyers. PSI also includes data from some related ground-based studies. The PSI system is accessible and open to the public.

Explore the PSI data repository.

Terra Fusion Data Sampler

The Terra Basic Fusion dataset is a fused dataset of the original Level 1 radiances from the five instruments on Terra, the flagship satellite of NASA’s Earth Observing System (EOS). Data were taken between the years 2000-2015, but there is a planned future update for the years 2016-2020.

Explore the Terra Fusion Data Sampler.

Mars Spectrometry Data

NASA’s Curiosity rover carries the Sample Analysis at Mars (SAM) instrument, which includes a gas chromatograph that separates gases to aid in identifying them. NASA has made two SAM datasets freely available to the public.

Detect Evidence for Past Habitability

The Mars Spectrometry: Detect Evidence for Past Habitability dataset consists of SAM measurements specifically processed for a project aimed at building a model to automatically analyze evolved gas analysis mass spectrometry (EGA-MS) data to help scientists understand the past habitability of Mars.

Explore the SAM instrument Evidence for Past Habitability data.

Gas Chromatography for the Sample Analysis at Mars Data (SAM) Instrument

The Mars Spectrometry 2: Gas Chromatography dataset consists of SAM data specifically processed for a project aimed at building a model to automatically analyze gas chromatography mass spectrometry (GCMS) measurements to help scientists in their analysis of understanding the past habitability of Mars.

Explore the SAM Instrument Gas Chromatography data

Wide-field Infrared Survey Explorer (WISE) and NEOWISE Data

The Wide-field Infrared Survey Explorer (WISE) was a NASA Medium Explorer satellite in low-Earth orbit that conducted an all-sky astronomical imaging survey over four infrared bands from 2010-2011. The NEOWISE Reactivation mission began in 2013 when the original WISE satellite was brought out of hibernation to learn more about the population of near-Earth objects and comets that could pose an impact hazard to the Earth. The data are also used to study a wide range of astrophysical phenomena in the time domain including brown dwarfs, supernovae and active galactic nuclei.

3-Band Cryo Data

The 3-Band Cryo Data Release contains 3.4, 4.6 and 12 micron (W1, W2, W3) imaging data that were acquired by WISE between 6 Aug and 29 Sept 2010 while the detectors were cooled by the inner cryogen tank following the exhaustion of the outer tank.

Explore WISE 3-band cryo data.

All-Sky Data

The All-Sky Data Release includes all data taken during the WISE full cryogenic mission phase, 7 January 2010 to 6 August 2010, in the 3.4, 4.6, 12, and 22 micron bands (i.e., W1, W2, W3, W4) that were processed with improved calibrations and reduction algorithms.

Explore WISE All-Sky data.

AllWISE Data

The AllWISE Data Release combines data from all cryogenic and post-cryogenic survey phases and provides a comprehensive view of the mid-infrared sky. The Images Atlas includes 18,240 FITS image sets at 3.4, 4.6, 12 and 22 microns. The Source Catalog contains position, apparent motion, and flux information for over 747 million objects detected on the Atlas Images.

Explore AllWISE data.

NEOWISE Post-Cryo Data

The NEOWISE Post-Cryo Data Release contains 3.4 and 4.6 micron (W1 and W2) imaging data that were acquired by WISE between 29 September 2010 and 1 February 2011 following the exhaustion of the inner and outer cryogen tanks.

Explore NEOWISE post-cryo data.

NEOWISE Reactivation Data

Data from the reactivated NEOWISE mission, updated annually. The dataset contains over 20 million calibrated FITS image sets for the individual 7.7 sec NEOWISE survey exposures in the W1 and W2 bands.

Explore NEOWISE reactivation data.

NASA High Energy Astrophysics Mission Data

The High Energy Astrophysics Science Archive Research Center (HEASARC) has released NASA data for high energy astrophysics (generally x-ray and gamma-ray domains). HEASARC hosts the full data archives of over 30 different missions spanning 50 years.

Explore HEASARC data.

NASA Legacy Archive for Microwave Background Data Analysis (LAMBDA)

NASA data for cosmic microwave background (CMB) analysis is made available here by the Legacy Archive for Microwave Background Data Analysis (LAMBDA), which is a part of NASA’s High Energy Astrophysics Science Archive Research Center (HEASARC). LAMBDA hosts the data archives of over 30 different CMB missions spanning 30+ years.

Explore LAMBDA data.

OpenUniverse 2024 Matched Rubin and Roman Simulations: Preview

This release consists of simulated data products designed to mimic observations of the same region of the sky as seen by two astronomical facilities: the Nancy Grace Roman Telescope and the Vera C. Rubin Observatory.

Explore Roman and Rubin simulated data products.

Catalina Sky Survey (CSS) Subset Data

This release of raw CSS data aims to discover Near Earth Objects (NEOs) which potentially could impact Earth.

Explore Catalina Sky Survey Subset Data.