Explanipedia

CoT Referring: Improving Referring Expression Tasks with Grounded Reasoning

Qi Dong, Luis Figueroa, Handong Zhao, Kushal Kafle, Jason Kuen, et al. · 2025

Referring Expression Comprehension and Segmentation are critical tasks for assessing the integration of language understanding and image comprehension, serving as benchmarks for Multimodal Large Language Models (MLLMs) capabilities. To add…

The Photographer Eye: Teaching Multimodal Large Language Models to Understand Image Aesthetics like Photographers

Daiqing Qi, Handong Zhao, Jing Shi, Simon Jenni, Yifei Fan, et al. · 2025

While editing directly from life, photographers have found it too difficult to see simultaneously both the blue and the sky. Photographer and curator, Szarkowski insightfully revealed one of the notable gaps between general and aesthetic v…

IgA nephropathy: Update on pathogenesis and treatment

Seshma Ramsawak, Scott Cohen, A. Linares, Corey Cavanaugh · 2025

The pathogenesis of immunoglobulin (Ig) A nephropathy is described through a "4-hit" model involving production of galactose-deficient IgA, production of autoantibodies to galactose-deficient IgA, and subsequent deposition of immune comple…

CompleteMe: Reference-based Human Image Completion

Luis Figueroa, Scott Cohen · 2025

Recent methods for human image completion can reconstruct plausible body shapes but often fail to preserve unique details, such as specific clothing patterns or distinctive accessories, without explicit reference images. Even state-of-the-…

“But Who Eats the Mosquitos?”: Deaf Learners’ Language Use and Translanguaging During STEAM Discussions

Jessica Scott, Patrick Enderle, Scott Cohen, Jasmine Smith, R E Hutchison · 2025

Science, technology, engineering, arts, and mathematics (STEAM) education represents an array of fields that have significant promise for the future careers of students. However, in deaf education, little research has been conducted to und…

MetaShadow: Object-Centered Shadow Detection, Removal, and Synthesis

Tianyu Wang, Jianming Zhang, Haitian Zheng, Zhihong Ding, Scott Cohen, et al. · 2024

Shadows are often under-considered or even ignored in image editing applications, limiting the realism of the edited results. In this paper, we introduce MetaShadow, a three-in-one versatile framework that enables detection, removal, and c…

Refine-by-Align: Reference-Guided Artifacts Refinement through Semantic Alignment

Yizhi Song, He Liu, Zhifei Zhang, Soo Ye Kim, Qi Chen, et al. · 2024

Personalized image generation has emerged from the recent advancements in generative models. However, these generated personalized images often suffer from localized artifacts such as incorrect logos, reducing fidelity and fine-grained ide…

Cardiovascular Outcomes Associated With Hypoplastic Left Heart Syndrome Versus Other Types of Single Right Ventricle: A Multicenter Study

Nabil Dib, Nancy Poirier, Michelle Samuel, Sèwanou Hermann Honfo, Ali N. Zaidi, et al. · 2024

Background: The univentricular heart with a predominant right ventricle morphology (uRV) has been associated with a higher rate of adverse cardiovascular events. It remains to be determined whether the specific type of uRV influences outcom…

FairDeDup: Detecting and Mitigating Vision-Language Fairness Disparities in Semantic Dataset Deduplication

Eric Slyman, Stefan Lee, Scott Cohen, Kushal Kafle · 2024

Recent dataset deduplication techniques have demonstrated that content-aware dataset pruning can dramatically reduce the cost of training Vision-Language Pretrained (VLP) models without significant performance losses compared to training o…

FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction

Hang Hua, Jing Shi, Kushal Kafle, Simon Jenni, Daoan Zhang, et al. · 2024

Recent progress in large-scale pre-training has led to the development of advanced vision-language models (VLMs) with remarkable proficiency in comprehending and generating multimodal content. Despite the impressive ability to perform comp…

IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation

Yizhi Song, Zhifei Zhang, Zhe Lin, Scott Cohen, Brian Price, et al. · 2024

Generative object compositing emerges as a promising new avenue for compositional image editing. However, the requirement of object identity preservation poses a significant challenge, limiting practical usage of most existing methods. In …

Cardiovascular Outcomes in Fontan Patients With Right vs Left Univentricular Morphology

Nabil Dib, Marie Chaix, Michelle Samuel, Sewanou Hermann Honfo, Robert M Hamilton, et al. · 2024

Fontan patients with uRV vs uLV morphology have a higher incidence of adverse cardiovascular events, including atrial arrhythmia, cardiac transplantation, and all-cause mortality.

The diversity of kidney biopsy findings among diabetic patients in the Cleveland Clinic Kidney Biopsy Epidemiology Project

Shane A. Bobart, Ahreum Kwon, Hadi Sawaf, Leal Herlitz, Scott Cohen, et al. · 2024

The Clinical and Pathological Characteristics of Patients with Oxalate Nephropathy

Maria Llanos, Alvin Kwon, Leal Herlitz, Tariq Shafi, Scott Cohen, et al. · 2023

Key Points: Oxalate nephropathy is an underrecognized cause of CKD and ESKD. We present one of the largest native oxalate nephropathy cohorts to date from a tertiary care institution in the United States. Oxalate nephropathy has multiple etio…

Latent Feature-Guided Diffusion Models for Shadow Removal

Kangfu Mei, Luis Figueroa, Zhe Lin, Zhihong Ding, Scott Cohen, et al. · 2023

Recovering textures under shadows has remained a challenging problem due to the difficulty of inferring shadow-free scenes from shadow images. In this paper, we propose the use of diffusion models as they offer a promising approach to grad…

Using a Language Community to Unlock the Abstractness of Signed Language

Scott Cohen · 2023

SCoRD: Subject-Conditional Relation Detection with Text-Augmented Data

Ziyan Yang, Kushal Kafle, Zhe Lin, Scott Cohen, Zhihong Ding, et al. · 2023

We propose Subject-Conditional Relation Detection (SCoRD), where conditioned on an input subject, the goal is to predict all its relations to other objects in a scene along with their locations. Based on the Open Images dataset, we propose a…

Endothelial dysfunction in autoimmune, pulmonary, and kidney systems, and exercise tolerance following SARS-CoV-2 infection

Sabyasachi Sen, Shikha Khosla, Omar Awan, Scott Cohen, Jared M. Gollie · 2023

Long COVID is characterized by persistent symptoms beyond 3 months of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection that last for at least 2 months and cannot be explained by an alternative diagnosis. Autonomic, im…

GamutMLP: A Lightweight MLP for Color Loss Recovery

Hoang Le, Brian Price, Scott Cohen, Michael S. Brown · 2023

Cameras and image-editing software often process images in the wide-gamut ProPhoto color space, encompassing 90% of all visible colors. However, when images are encoded for sharing, this color-rich representation is transformed and clipped…

TopNet: Transformer-based Object Placement Network for Image Compositing

Sijie Zhu, Zhe Lin, Scott Cohen, Jason Kuen, Zhifei Zhang, et al. · 2023

We investigate the problem of automatically placing an object into a background image for image compositing. Given a background image and a segmented object, the goal is to train a model to predict plausible placements (location and scale)…

Point-of-Care Ultrasound Use in Nephrology: A Survey of Nephrology Program Directors, Fellows, and Fellowship Graduates

David L. Cook, Samir Patel, Robert Nee, Dustin J. Little, Scott Cohen, et al. · 2023

Nephrology program directors, fellows, and graduates surveyed want POCUS training incorporated into the fellowship curriculum. No group felt sufficiently trained to confidently perform POCUS, and the major barrier to training was lack of s…

Structure-Guided Image Completion with Image-level and Object-level Semantic Discriminators

Haitian Zheng, Zhe Lin, Jingwan Lu, Scott Cohen, Eli Shechtman, et al. · 2022

Structure-guided image completion aims to inpaint a local region of an image according to an input guidance map from users. While such a task enables many practical applications for interactive editing, existing methods often struggle to h…

ObjectStitch: Generative Object Compositing

Yizhi Song, Zhifei Zhang, Zhe Lin, Scott Cohen, Brian Price, et al. · 2022

Object compositing based on 2D images is a challenging problem since it typically involves multiple processing stages such as color harmonization, geometry correction and shadow generation to generate realistic results. Furthermore, annota…

Physical Activity and Exercise for Cardiorespiratory Health and Fitness in Chronic Kidney Disease

Jared M. Gollie, Scott Cohen, Samir S. Patel · 2022

Chronic kidney disease (CKD) is associated with an increased risk for cardiovascular disease (CVD), major adverse CVD events, and cardiovascular mortality. Low levels of physical activity and reduced cardiorespiratory fitness further compo…

GALA: Toward Geometry-and-Lighting-Aware Object Search for Compositing

Sijie Zhu, Zhe Lin, Scott Cohen, Jason Kuen, Zhifei Zhang, et al. · 2022

Compositing-aware object search aims to find the most compatible objects for compositing given a background image and a query bounding box. Previous works focus on learning compatibility between the foreground object and background, but fa…

CM-GAN: Image Inpainting with Cascaded Modulation GAN and Object-Aware Training

Haitian Zheng, Zhe Lin, Jingwan Lu, Scott Cohen, Eli Shechtman, et al. · 2022

Recent image inpainting methods have made great progress but often struggle to generate plausible image structures when dealing with large holes in complex images. This is partially due to the lack of effective network structures that can …

Fatigability and the Role of Neuromuscular Impairments in Chronic Kidney Disease

Jared M. Gollie, Samir S. Patel, Michael O. Harris‐Love, Scott Cohen, Marc R. Blackman · 2022

Background: The combination of neuromuscular impairments plus psychosocial aspects of chronic kidney disease (CKD) may predispose these patients to greater risk for experiencing increased levels of fatigability. There has bee…

Front Matter

Mustafa Ahmad, K. Sheraz Fahad, Jafar Alsaid, Carmichael Angeles, Kisra Anis, et al. · 2022

Generalized Few-Shot Semantic Segmentation: All You Need is Fine-Tuning

Josh Myers-Dean, Yinan Zhao, Brian Price, Scott Cohen, Danna Gurari · 2021

Generalized few-shot semantic segmentation was introduced to move beyond only evaluating few-shot segmentation models on novel classes to include testing their ability to remember base classes. While the current state-of-the-art approach i…

Scott Cohen