Scott Cohen
YOU?
Author Swipe
View article: CoT Referring: Improving Referring Expression Tasks with Grounded Reasoning
CoT Referring: Improving Referring Expression Tasks with Grounded Reasoning Open
Referring Expression Comprehension and Segmentation are critical tasks for assessing the integration of language understanding and image comprehension, serving as benchmarks for Multimodal Large Language Models (MLLMs) capabilities. To add…
View article: The Photographer Eye: Teaching Multimodal Large Language Models to Understand Image Aesthetics like Photographers
The Photographer Eye: Teaching Multimodal Large Language Models to Understand Image Aesthetics like Photographers Open
While editing directly from life, photographers have found it too difficult to see simultaneously both the blue and the sky. Photographer and curator, Szarkowski insightfully revealed one of the notable gaps between general and aesthetic v…
View article: IgA nephropathy: Update on pathogenesis and treatment
IgA nephropathy: Update on pathogenesis and treatment Open
The pathogenesis of immunoglobulin (Ig) A nephropathy is described through a "4-hit" model involving production of galactose-deficient IgA, production of autoantibodies to galactose-deficient IgA, and subsequent deposition of immune comple…
View article: CompleteMe: Reference-based Human Image Completion
CompleteMe: Reference-based Human Image Completion Open
Recent methods for human image completion can reconstruct plausible body shapes but often fail to preserve unique details, such as specific clothing patterns or distinctive accessories, without explicit reference images. Even state-of-the-…
View article: “But Who Eats the Mosquitos?”: Deaf Learners’ Language Use and Translanguaging During STEAM Discussions
“But Who Eats the Mosquitos?”: Deaf Learners’ Language Use and Translanguaging During STEAM Discussions Open
Science, technology, engineering, arts, and mathematics (STEAM) education represents an array of fields that have significant promise for the future careers of students. However, in deaf education, little research has been conducted to und…
View article: MetaShadow: Object-Centered Shadow Detection, Removal, and Synthesis
MetaShadow: Object-Centered Shadow Detection, Removal, and Synthesis Open
Shadows are often under-considered or even ignored in image editing applications, limiting the realism of the edited results. In this paper, we introduce MetaShadow, a three-in-one versatile framework that enables detection, removal, and c…
View article: Refine-by-Align: Reference-Guided Artifacts Refinement through Semantic Alignment
Refine-by-Align: Reference-Guided Artifacts Refinement through Semantic Alignment Open
Personalized image generation has emerged from the recent advancements in generative models. However, these generated personalized images often suffer from localized artifacts such as incorrect logos, reducing fidelity and fine-grained ide…
View article: Cardiovascular Outcomes Associated With Hypoplastic Left Heart Syndrome Versus Other Types of Single Right Ventricle: A Multicenter Study
Cardiovascular Outcomes Associated With Hypoplastic Left Heart Syndrome Versus Other Types of Single Right Ventricle: A Multicenter Study Open
Background The univentricular heart with a predominant right ventricle morphology (uRV) has been associated with a higher rate of adverse cardiovascular events. It remains to be determined whether the specific type of uRV influences outcom…
View article: FairDeDup: Detecting and Mitigating Vision-Language Fairness Disparities in Semantic Dataset Deduplication
FairDeDup: Detecting and Mitigating Vision-Language Fairness Disparities in Semantic Dataset Deduplication Open
Recent dataset deduplication techniques have demonstrated that content-aware dataset pruning can dramatically reduce the cost of training Vision-Language Pretrained (VLP) models without significant performance losses compared to training o…
View article: FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction
FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction Open
Recent progress in large-scale pre-training has led to the development of advanced vision-language models (VLMs) with remarkable proficiency in comprehending and generating multimodal content. Despite the impressive ability to perform comp…
View article: IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation
IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation Open
Generative object compositing emerges as a promising new avenue for compositional image editing. However, the requirement of object identity preservation poses a significant challenge, limiting practical usage of most existing methods. In …
View article: Cardiovascular Outcomes in Fontan Patients With Right vs Left Univentricular Morphology
Cardiovascular Outcomes in Fontan Patients With Right vs Left Univentricular Morphology Open
Fontan patients with uRV vs uLV morphology have a higher incidence of adverse cardiovascular events, including atrial arrhythmia, cardiac transplantation, and all-cause mortality.
View article: The diversity of kidney biopsy findings among diabetic patients in the cleveland clinic kidney biopsy epidemiology project
The diversity of kidney biopsy findings among diabetic patients in the cleveland clinic kidney biopsy epidemiology project Open
View article: The Clinical and Pathological Characteristics of Patients with Oxalate Nephropathy
The Clinical and Pathological Characteristics of Patients with Oxalate Nephropathy Open
Key Points Oxalate nephropathy is an underrecognized cause of CKD and ESKD We present one of the largest native oxalate nephropathy cohorts to date from a tertiary care institution in the United States Oxalate nephropathy has multiple etio…
View article: Latent Feature-Guided Diffusion Models for Shadow Removal
Latent Feature-Guided Diffusion Models for Shadow Removal Open
Recovering textures under shadows has remained a challenging problem due to the difficulty of inferring shadow-free scenes from shadow images. In this paper, we propose the use of diffusion models as they offer a promising approach to grad…
View article: Using a Language Community to Unlock the Abstractness of Signed Language
Using a Language Community to Unlock the Abstractness of Signed Language Open
View article: SCoRD: Subject-Conditional Relation Detection with Text-Augmented Data
SCoRD: Subject-Conditional Relation Detection with Text-Augmented Data Open
We propose Subject-Conditional Relation Detection SCoRD, where conditioned on an input subject, the goal is to predict all its relations to other objects in a scene along with their locations. Based on the Open Images dataset, we propose a…
View article: Endothelial dysfunction in autoimmune, pulmonary, and kidney systems, and exercise tolerance following SARS-CoV-2 infection
Endothelial dysfunction in autoimmune, pulmonary, and kidney systems, and exercise tolerance following SARS-CoV-2 infection Open
Long COVID is characterized by persistent symptoms beyond 3-months of severe acute respiratory syndrome Coronavirus-2 (SARS-CoV-2) infection that last for at least 2 months and cannot be explained by an alternative diagnosis. Autonomic, im…
View article: GamutMLP: A Lightweight MLP for Color Loss Recovery
GamutMLP: A Lightweight MLP for Color Loss Recovery Open
Cameras and image-editing software often process images in the wide-gamut ProPhoto color space, encompassing 90% of all visible colors. However, when images are encoded for sharing, this color-rich representation is transformed and clipped…
View article: TopNet: Transformer-based Object Placement Network for Image Compositing
TopNet: Transformer-based Object Placement Network for Image Compositing Open
We investigate the problem of automatically placing an object into a background image for image compositing. Given a background image and a segmented object, the goal is to train a model to predict plausible placements (location and scale)…
View article: Point-of-Care Ultrasound Use in Nephrology: A Survey of Nephrology Program Directors, Fellows, and Fellowship Graduates
Point-of-Care Ultrasound Use in Nephrology: A Survey of Nephrology Program Directors, Fellows, and Fellowship Graduates Open
Nephrology program directors, fellows, and graduates surveyed want POCUS training incorporated into the fellowship curriculum. No group felt sufficiently trained to confidently perform POCUS, and the major barrier to training was lack of s…
View article: Structure-Guided Image Completion with Image-level and Object-level Semantic Discriminators
Structure-Guided Image Completion with Image-level and Object-level Semantic Discriminators Open
Structure-guided image completion aims to inpaint a local region of an image according to an input guidance map from users. While such a task enables many practical applications for interactive editing, existing methods often struggle to h…
View article: ObjectStitch: Generative Object Compositing
ObjectStitch: Generative Object Compositing Open
Object compositing based on 2D images is a challenging problem since it typically involves multiple processing stages such as color harmonization, geometry correction and shadow generation to generate realistic results. Furthermore, annota…
View article: Physical Activity and Exercise for Cardiorespiratory Health and Fitness in Chronic Kidney Disease
Physical Activity and Exercise for Cardiorespiratory Health and Fitness in Chronic Kidney Disease Open
Chronic kidney disease (CKD) is associated with an increased risk for cardiovascular disease (CVD), major adverse CVD events, and cardiovascular mortality. Low levels of physical activity and reduced cardiorespiratory fitness further compo…
View article: GALA: Toward Geometry-and-Lighting-Aware Object Search for Compositing
GALA: Toward Geometry-and-Lighting-Aware Object Search for Compositing Open
Compositing-aware object search aims to find the most compatible objects for compositing given a background image and a query bounding box. Previous works focus on learning compatibility between the foreground object and background, but fa…
View article: CM-GAN: Image Inpainting with Cascaded Modulation GAN and Object-Aware\n Training
CM-GAN: Image Inpainting with Cascaded Modulation GAN and Object-Aware\n Training Open
Recent image inpainting methods have made great progress but often struggle\nto generate plausible image structures when dealing with large holes in complex\nimages. This is partially due to the lack of effective network structures that\nc…
View article: CM-GAN: Image Inpainting with Cascaded Modulation GAN and Object-Aware Training
CM-GAN: Image Inpainting with Cascaded Modulation GAN and Object-Aware Training Open
Recent image inpainting methods have made great progress but often struggle to generate plausible image structures when dealing with large holes in complex images. This is partially due to the lack of effective network structures that can …
View article: Fatigability and the Role of Neuromuscular Impairments in Chronic Kidney Disease
Fatigability and the Role of Neuromuscular Impairments in Chronic Kidney Disease Open
Background: The combination of neuromuscular impairments plus psychosocial aspects of chronic kidney disease (CKD) may predispose these patients to greater risk for experiencing increased levels of fatigability. There has bee…
View article: Front Matter
Front Matter Open
View article: Generalized Few-Shot Semantic Segmentation: All You Need is Fine-Tuning
Generalized Few-Shot Semantic Segmentation: All You Need is Fine-Tuning Open
Generalized few-shot semantic segmentation was introduced to move beyond only evaluating few-shot segmentation models on novel classes to include testing their ability to remember base classes. While the current state-of-the-art approach i…