Subhransu Maji
YOU?
Author Swipe
View article: Merlin L48 Spectrogram Dataset
Merlin L48 Spectrogram Dataset Open
In the single-positive multi-label (SPML) setting, each image in a dataset is labeled with the presence of a single class, while the true presence of other classes remains unknown. The challenge is to narrow the performance gap between thi…
View article: Pairing weather radars with eBird data for large-scale monitoring of swallow and martin communal roosts
Pairing weather radars with eBird data for large-scale monitoring of swallow and martin communal roosts Open
During their nonbreeding period, many species of swallows and martins (family: Hirundinidae) congregate in large communal roosts. Some of these roosts are well-known within local birdwatching communities; however, monitoring them at large …
View article: SIGMA-GEN: Structure and Identity Guided Multi-subject Assembly for Image Generation
SIGMA-GEN: Structure and Identity Guided Multi-subject Assembly for Image Generation Open
We present SIGMA-GEN, a unified framework for multi-identity preserving image generation. Unlike prior approaches, SIGMA-GEN is the first to enable single-pass multi-subject identity-preserved generation guided by both structural and spati…
View article: Reinforcement Learning for Quantum Network Control with Application-Driven Objectives
Reinforcement Learning for Quantum Network Control with Application-Driven Objectives Open
Optimized control of quantum networks is essential for enabling distributed quantum applications with strict performance requirements. In near-term architectures with constrained hardware, effective control may determine the feasibility of…
View article: Consensus-Driven Active Model Selection
Consensus-Driven Active Model Selection Open
The widespread availability of off-the-shelf machine learning models poses a challenge: which model, of the many available candidates, should be chosen for a given data analysis task? This question of model selection is traditionally answe…
View article: The iNaturalist Sounds Dataset
The iNaturalist Sounds Dataset Open
We present the iNaturalist Sounds Dataset (iNatSounds), a collection of 230,000 audio files capturing sounds from over 5,500 species, contributed by more than 27,000 recordists worldwide. The dataset encompasses sounds from birds, mammals,…
View article: Representation Learning for Long-Chain Hydrocarbon Adsorption in Zeolites
Representation Learning for Long-Chain Hydrocarbon Adsorption in Zeolites Open
Zeolites are a class of crystalline nanoporous materials known for their ability to discriminate molecules based on size and shape. Such molecular shape selectivity arises from the precise 3-dimensional arrangement of zeolite framework ato…
View article: Audio Geolocation: A Natural Sounds Benchmark
Audio Geolocation: A Natural Sounds Benchmark Open
Can we determine someone's geographic location purely from the sounds they hear? Are acoustic signals enough to localize within a country, state, or even city? We tackle the challenge of global-scale audio geolocation, formalize the proble…
View article: Automated extraction of right whale morphometric data from drone aerial photographs
Automated extraction of right whale morphometric data from drone aerial photographs Open
Aerial photogrammetry is a popular non‐invasive tool to measure the size, body morphometrics and body condition of wild animals. While the method can generate large datasets quickly, the lack of efficient processing tools can create bottle…
View article: CVOCSemRPL: Class-Variance Optimized Clustering, Semantic Information Injection and Restricted Pseudo Labeling based Improved Semi-Supervised Few-Shot Learning
CVOCSemRPL: Class-Variance Optimized Clustering, Semantic Information Injection and Restricted Pseudo Labeling based Improved Semi-Supervised Few-Shot Learning Open
Few-shot learning has been extensively explored to address problems where the amount of labeled samples is very limited for some classes. In the semi-supervised few-shot learning setting, substantial quantities of unlabeled samples are ava…
View article: Improving Satellite Imagery Masking Using Multitask and Transfer Learning
Improving Satellite Imagery Masking Using Multitask and Transfer Learning Open
Many remote sensing applications require masking of pixels in satellite imagery for further analysis. For instance, estimating water quality variables such as suspended sediment concentration (SSC) requires isolating pixels depicting water…
View article: WildSAT: Learning Satellite Image Representations from Wildlife Observations
WildSAT: Learning Satellite Image Representations from Wildlife Observations Open
Species distributions encode valuable ecological and environmental information, yet their potential for guiding representation learning in remote sensing remains underexplored. We introduce WildSAT, which pairs satellite images with millio…
View article: Improving Satellite Imagery Masking using Multi-task and Transfer Learning
Improving Satellite Imagery Masking using Multi-task and Transfer Learning Open
Many remote sensing applications employ masking of pixels in satellite imagery for subsequent measurements. For example, estimating water quality variables, such as Suspended Sediment Concentration (SSC) requires isolating pixels depicting…
View article: Combining Observational Data and Language for Species Range Estimation
Combining Observational Data and Language for Species Range Estimation Open
Species range maps (SRMs) are essential tools for research and policy-making in ecology, conservation, and environmental management. However, traditional SRMs rely on the availability of environmental covariates and high-quality species lo…
View article: OSLO: One-Shot Label-Only Membership Inference Attacks
OSLO: One-Shot Label-Only Membership Inference Attacks Open
We introduce One-Shot Label-Only (OSLO) membership inference attacks (MIAs), which accurately infer a given sample's membership in a target model's training set with high precision using just \emph{a single query}, where the target model o…
View article: Feedback in emerging extragalactic star clusters, FEAST: The relation between 3.3 $μ$m PAH emission and Star Formation Rate traced by ionized gas in NGC 628
Feedback in emerging extragalactic star clusters, FEAST: The relation between 3.3 $μ$m PAH emission and Star Formation Rate traced by ionized gas in NGC 628 Open
We present maps of ionized gas (traced by Pa$α$ and Br$α$) and 3.3 $μ$m Polycyclic Aromatic Hydrocarbon (PAH) emission in the nearby spiral galaxy NGC 628, derived from new JWST/NIRCam data from the FEAST survey. With this data, we investi…
View article: Task2Box: Box Embeddings for Modeling Asymmetric Task Relationships
Task2Box: Box Embeddings for Modeling Asymmetric Task Relationships Open
Modeling and visualizing relationships between tasks or datasets is an important step towards solving various meta-tasks such as dataset discovery, multi-tasking, and transfer learning. However, many relationships, such as containment and …
View article: DISCount: Counting in Large Image Collections with Detector-Based Importance Sampling
DISCount: Counting in Large Image Collections with Detector-Based Importance Sampling Open
Many applications use computer vision to detect and count objects in massive image collections. However, automated methods may fail to deliver accurate counts, especially when the task is very difficult or requires a fast response time. Fo…
View article: Modeling suspended sediment concentration using artificial neural networks, an effort towards global sediment flux observations in rivers from space
Modeling suspended sediment concentration using artificial neural networks, an effort towards global sediment flux observations in rivers from space Open
Harmonized Landsat Sentinel-2 (HLS) provides high-quality images every 2-3 days across Earth. However, HLS has not been widely used to measure Suspended Sediment Concentration (SSC) in rivers. Here, we used HLS to generate a fully open-sou…
View article: Improved Zero-Shot Classification by Adapting VLMs with Text Descriptions
Improved Zero-Shot Classification by Adapting VLMs with Text Descriptions Open
The zero-shot performance of existing vision-language models (VLMs) such as CLIP is limited by the availability of large-scale, aligned image and text datasets in specific domains. In this work, we leverage two complementary sources of inf…
View article: Human-in-the-Loop Visual Re-ID for Population Size Estimation
Human-in-the-Loop Visual Re-ID for Population Size Estimation Open
Computer vision-based re-identification (Re-ID) systems are increasingly being deployed for estimating population size in large image collections. However, the estimated size can be significantly inaccurate when the task is challenging or …
View article: PARTICLE: Part Discovery and Contrastive Learning for Fine-grained Recognition
PARTICLE: Part Discovery and Contrastive Learning for Fine-grained Recognition Open
We develop techniques for refining representations for fine-grained classification and segmentation tasks in a self-supervised manner. We find that fine-tuning methods based on instance-discriminative contrastive learning are not as effect…
View article: COSE: A Consistency-Sensitivity Metric for Saliency on Image Classification
COSE: A Consistency-Sensitivity Metric for Saliency on Image Classification Open
We present a set of metrics that utilize vision priors to effectively assess the performance of saliency methods on image classification tasks. To understand behavior in deep learning models, many methods provide visual saliency maps empha…
View article: LU-NeRF: Scene and Pose Estimation by Synchronizing Local Unposed NeRFs
LU-NeRF: Scene and Pose Estimation by Synchronizing Local Unposed NeRFs Open
A critical obstacle preventing NeRF models from being deployed broadly in the wild is their reliance on accurate camera poses. Consequently, there is growing interest in extending NeRF models to jointly optimize camera poses and scene repr…
View article: DISCount: Counting in Large Image Collections with Detector-Based Importance Sampling
DISCount: Counting in Large Image Collections with Detector-Based Importance Sampling Open
Many modern applications use computer vision to detect and count objects in massive image collections. However, when the detection task is very difficult or in the presence of domain shifts, the counts may be inaccurate even with significa…
View article: Inside back cover
Inside back cover Open
Permissions Request permissions Inside back cover J. Mater. Chem. A, 2023, 11, 17873 DOI: 10.1039/D3TA90179C This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in othe…
View article: Accidental Turntables: Learning 3D Pose by Watching Objects Turn
Accidental Turntables: Learning 3D Pose by Watching Objects Turn Open
We propose a technique for learning single-view 3D object pose estimation models by utilizing a new source of data -- in-the-wild videos where objects turn. Such videos are prevalent in practice (e.g., cars in roundabouts, airplanes near r…
View article: Using spatio-temporal information in weather radar data to detect and track communal bird roosts
Using spatio-temporal information in weather radar data to detect and track communal bird roosts Open
The exodus of swallows from communal nighttime roosts is often visible as an expanding ring-shaped pattern in weather radar data. The WSR-88D network operated by the National Weather Service archives more than 25 years of data across 143 s…
View article: MvDeCor: Multi-view Dense Correspondence Learning for Fine-grained 3D Segmentation
MvDeCor: Multi-view Dense Correspondence Learning for Fine-grained 3D Segmentation Open
We propose to utilize self-supervised techniques in the 2D domain for fine-grained 3D shape segmentation tasks. This is inspired by the observation that view-based surface representations are more effective at modeling high-resolution surf…
View article: Star Cluster Formation and Evolution in M101: An Investigation with the Legacy Extragalactic UV Survey
Star Cluster Formation and Evolution in M101: An Investigation with the Legacy Extragalactic UV Survey Open
We present Hubble Space Telescope WFC3/UVIS (F275W, F336W) and ACS/WFC optical (F435W, F555W, and F814W) observations of the nearby grand-design spiral galaxy M101 as part of the Legacy Extragalactic UV Survey (LEGUS). Compact sources dete…