Manil Maskey
YOU?
Author Swipe
View article: Improving Label Error Detection and Elimination with Uncertainty Quantification
Improving Label Error Detection and Elimination with Uncertainty Quantification Open
Identifying and handling label errors can significantly enhance the accuracy of supervised machine learning models. Recent approaches for identifying label errors demonstrate that a low self-confidence of models with respect to a certain l…
View article: Finetuning AI Foundation Models to Develop Subgrid‐Scale Parameterizations: A Case Study on Atmospheric Gravity Waves
Finetuning AI Foundation Models to Develop Subgrid‐Scale Parameterizations: A Case Study on Atmospheric Gravity Waves Open
Global climate models parameterize a range of atmospheric‐oceanic processes, including gravity waves (GWs), clouds, moist convection, and turbulence, that cannot be sufficiently resolved. These subgrid‐scale closures for unresolved process…
View article: Advancing Hurricane Forecasting With AI Models for Track and Intensity Prediction
Advancing Hurricane Forecasting With AI Models for Track and Intensity Prediction Open
Hurricane forecasting has traditionally relied on numerical weather prediction (NWP) models. However, advancements in artificial intelligence (AI) offer new opportunities to improve forecasting accuracy. This study presents a novel evaluat…
View article: SuryaBench: Benchmark Dataset for Advancing Machine Learning in Heliophysics and Space Weather Prediction
SuryaBench: Benchmark Dataset for Advancing Machine Learning in Heliophysics and Space Weather Prediction Open
This paper introduces a high resolution, machine learning-ready heliophysics dataset derived from NASA's Solar Dynamics Observatory (SDO), specifically designed to advance machine learning (ML) applications in solar physics and space weath…
View article: Surya: Foundation Model for Heliophysics
Surya: Foundation Model for Heliophysics Open
Heliophysics is central to understanding and forecasting space weather events and solar activity. Despite decades of high-resolution observations from the Solar Dynamics Observatory (SDO), most models remain task-specific and constrained b…
View article: Prithvi-EO-2.0: A Versatile Multi-Temporal Foundation Model for Earth Observation Applications
Prithvi-EO-2.0: A Versatile Multi-Temporal Foundation Model for Earth Observation Applications Open
View article: WxC-Bench: A Novel Dataset for Weather and Climate Downstream Tasks
WxC-Bench: A Novel Dataset for Weather and Climate Downstream Tasks Open
High-quality machine learning (ML)-ready datasets play a foundational role in developing new artificial intelligence (AI) models or fine-tuning existing models for scientific applications such as weather and climate analysis. Unfortunately…
View article: Prithvi-EO-2.0: A Versatile Multi-Temporal Foundation Model for Earth Observation Applications
Prithvi-EO-2.0: A Versatile Multi-Temporal Foundation Model for Earth Observation Applications Open
This technical report presents Prithvi-EO-2.0, a new geospatial foundation model that offers significant improvements over its predecessor, Prithvi-EO-1.0. Trained on 4.2M global time series samples from NASA's Harmonized Landsat and Senti…
View article: Challenges in Guardrailing Large Language Models for Science
Challenges in Guardrailing Large Language Models for Science Open
The rapid development in large language models (LLMs) has transformed the landscape of natural language processing and understanding (NLP/NLU), offering significant benefits across various domains. However, when applied to scientific resea…
View article: AI Foundation Model for Heliophysics: Applications, Design, and Implementation
AI Foundation Model for Heliophysics: Applications, Design, and Implementation Open
Deep learning-based methods have been widely researched in the areas of language and vision, demonstrating their capacity to understand long sequences of data and their usefulness in numerous helio-physics applications. Foundation models (…
View article: Prithvi WxC: Foundation Model for Weather and Climate
Prithvi WxC: Foundation Model for Weather and Climate Open
Triggered by the realization that AI emulators can rival the performance of traditional numerical weather prediction models running on HPC systems, there is now an increasing number of large AI models that address use cases such as forecas…
View article: Croissant: A Metadata Format for ML-Ready Datasets
Croissant: A Metadata Format for ML-Ready Datasets Open
Presentation on Artificial Intelligence developments in the European Open Science Cloud (EOSC) took place at the EOSC Winter School 2025 in Seville, Spain. Recent advancements in semantic interoperability were highlighted, particularly in …
View article: INDUS: Effective and Efficient Language Models for Scientific Applications
INDUS: Effective and Efficient Language Models for Scientific Applications Open
Large language models (LLMs) trained on general domain corpora showed remarkable results on natural language processing (NLP) tasks. However, previous research demonstrated LLMs trained using domain-focused corpora perform better on specia…
View article: Improving Label Error Detection and Elimination with Uncertainty Quantification
Improving Label Error Detection and Elimination with Uncertainty Quantification Open
Identifying and handling label errors can significantly enhance the accuracy of supervised machine learning models. Recent approaches for identifying label errors demonstrate that a low self-confidence of models with respect to a certain l…
View article: NASA’s Open Science Platform VEDA (Visualization, Exploration and Data Analytics)
NASA’s Open Science Platform VEDA (Visualization, Exploration and Data Analytics) Open
VEDA is an open-source science cyberinfrastructure for data processing, visualization, exploration, and geographic information systems (GIS) capabilities (https://www.earthdata.nasa.gov/esds/veda, https://www.earthdata.nasa.gov/dashboard/)…
View article: Foundation Models for Science: Potential, Challenges, and the Path Forward
Foundation Models for Science: Potential, Challenges, and the Path Forward Open
Foundation models signify a significant shift in AI by creating large-scale machine learning models (FMs) pre-trained on wide-ranging datasets. These models act as flexible starting points, ready to be fine-tuned for various specialized ta…
View article: The U.S. Greenhouse Gas Center: Extending Accessible and Integrated GHG Information from U.S. Government and Non-Public Sources to meet user needs
The U.S. Greenhouse Gas Center: Extending Accessible and Integrated GHG Information from U.S. Government and Non-Public Sources to meet user needs Open
The newly established United States Greenhouse Gas Center (U.S. GHG Center) is a multi-agency partnership between the National Aeronautics and Space Administration (NASA), the Environmental Protection Agency (EPA), the National Oceanic and…
View article: High Performance and Disruptive Computing in Remote Sensing: The Third Edition of the School Organized by the HDCRS Working Group of the GRSS Earth Science Informatics Technical Committee [Technical Committees]
High Performance and Disruptive Computing in Remote Sensing: The Third Edition of the School Organized by the HDCRS Working Group of the GRSS Earth Science Informatics Technical Committee [Technical Committees] Open
Provides society information that may include news, reviews or technical notes that should be of interest to practitioners and researchers.
View article: Foundation Models for Generalist Geospatial Artificial Intelligence
Foundation Models for Generalist Geospatial Artificial Intelligence Open
View article: Rethinking AI for Science: An Evolution From Data‐Driven to Data‐Centric Framework
Rethinking AI for Science: An Evolution From Data‐Driven to Data‐Centric Framework Open
The rapid advancements in Artificial Intelligence (AI) have found applications across a multitude of disciplines, including the scientific domain. Yet, the dominant focus on model‐centric AI often overlooks the critical role that data play…
View article: DMLR: Data-centric Machine Learning Research -- Past, Present and Future
DMLR: Data-centric Machine Learning Research -- Past, Present and Future Open
Drawing from discussions at the inaugural DMLR workshop at ICML 2023 and meetings prior, in this report we outline the relevance of community engagement and infrastructure development for the creation of next-generation public datasets tha…
View article: Leveraging Citizen Science for Flood Extent Detection using Machine Learning Benchmark Dataset
Leveraging Citizen Science for Flood Extent Detection using Machine Learning Benchmark Dataset Open
Accurate detection of inundated water extents during flooding events is crucial in emergency response decisions and aids in recovery efforts. Satellite Remote Sensing data provides a global framework for detecting flooding extents. Specifi…
View article: Foundation Models for Generalist Geospatial Artificial Intelligence
Foundation Models for Generalist Geospatial Artificial Intelligence Open
Significant progress in the development of highly adaptable and reusable Artificial Intelligence (AI) models is expected to have a significant impact on Earth science and remote sensing. Foundation models are pre-trained on large unlabeled…
View article: A Summer School Session on Mastering Geospatial Artificial Intelligence: From Data Production to Artificial Intelligence Foundation Model Development and Downstream Applications [Technical Committees]
A Summer School Session on Mastering Geospatial Artificial Intelligence: From Data Production to Artificial Intelligence Foundation Model Development and Downstream Applications [Technical Committees] Open
Provides society information that may include news, reviews or technical notes that should be of interest to practitioners and researchers.
View article: Tropical Cyclone Wind Speed Estimation: A Large Scale Training Data Set and Community Benchmarking
Tropical Cyclone Wind Speed Estimation: A Large Scale Training Data Set and Community Benchmarking Open
Tropical cyclones (TCs) cause significant disruptions to infrastructure and livelihood. The scale of loss due to TCs may be mitigated by prompt and accurate advisories about TC wind speed. Current advisories are consensus based and have a …
View article: Urban Growth of Houston, Texas over the Last Two Decades: Implications for Heat Stress
Urban Growth of Houston, Texas over the Last Two Decades: Implications for Heat Stress Open
<p>Houston is also one of the coastal cities in the US that has shown considerable growth in urban land cover over the last two decades. We utilized the National Land Cover Database derived from Landsat satellite imagery and coherent…
View article: The Origins of NASA’s Visualization, Exploration and Data Analytics (VEDA) Platform: A platform for biomass research, NASA’s Earth Information System and the COVID-19 Dashboard
The Origins of NASA’s Visualization, Exploration and Data Analytics (VEDA) Platform: A platform for biomass research, NASA’s Earth Information System and the COVID-19 Dashboard Open
NASA's Visualization, Exploration, and Data Analysis (VEDA) project is an open-source science cyberinfrastructure for data processing, visualization, exploration, and geographic information systems (GIS) capabilities (https://www.earthdata…
View article: Programmatic Update for NASA’s Commercial Smallsat Data Acquisition (CSDA) Program
Programmatic Update for NASA’s Commercial Smallsat Data Acquisition (CSDA) Program Open
Established in 2017 as a pilot project, the NASA Commercial Smallsat Data Acquisition (CSDA) Program evaluates and acquires commercial datasets that compliment NASA Earth Science research and application goals. The success of the pilot and…
View article: Foundation AI Models for Science
Foundation AI Models for Science Open
Foundation Models (FM) are AI models that are designed to replace a task or an application specific model. These FM can be applied to many different downstream applications. These FM are trained using self supervised techniques and can be …
View article: The future of NASA Earth Science in the commercial cloud: Challenges and opportunities
The future of NASA Earth Science in the commercial cloud: Challenges and opportunities Open
NASA produces a large volume and variety of data products that are used every day to support research, decision making, and education. The widespread use of NASA’s Earth Science data is enabled by NASA’s Earth Science Data Syst…