OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps Article Swipe
YOU?
·
· 2025
· Open Access
·
· DOI: https://doi.org/10.48550/arxiv.2509.19282
Despite steady progress in layout-to-image generation, current methods still struggle with layouts containing significant overlap between bounding boxes. We identify two primary challenges: (1) large overlapping regions and (2) overlapping instances with minimal semantic distinction. Through both qualitative examples and quantitative analysis, we demonstrate how these factors degrade generation quality. To systematically assess this issue, we introduce OverLayScore, a novel metric that quantifies the complexity of overlapping bounding boxes. Our analysis reveals that existing benchmarks are biased toward simpler cases with low OverLayScore values, limiting their effectiveness in evaluating model performance under more challenging conditions. To bridge this gap, we present OverLayBench, a new benchmark featuring high-quality annotations and a balanced distribution across different levels of OverLayScore. As an initial step toward improving performance on complex overlaps, we also propose CreatiLayout-AM, a model fine-tuned on a curated amodal mask dataset. Together, our contributions lay the groundwork for more robust layout-to-image generation under realistic and challenging scenarios. Project link: https://mlpc-ucsd.github.io/OverLayBench.
Related Topics
- Type
- preprint
- Language
- en
- Landing Page
- http://arxiv.org/abs/2509.19282
- https://arxiv.org/pdf/2509.19282
- OA Status
- green
- OpenAlex ID
- https://openalex.org/W4415251617
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W4415251617Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.48550/arxiv.2509.19282Digital Object Identifier
- Title
-
OverLayBench: A Benchmark for Layout-to-Image Generation with Dense OverlapsWork title
- Type
-
preprintOpenAlex work type
- Language
-
enPrimary language
- Publication year
-
2025Year of publication
- Publication date
-
2025-09-23Full publication date if available
- Authors
-
Bingnan Li, Chenyu Wang, Haiyang Xu, Xiang Zhang, Ethan J. Armand, Divyansh Srivastava, Xiaojun Shan, Zeyuan Chen, Jianwen Xie, Zhuowen TuList of authors in order
- Landing page
-
https://arxiv.org/abs/2509.19282Publisher landing page
- PDF URL
-
https://arxiv.org/pdf/2509.19282Direct link to full text PDF
- Open access
-
YesWhether a free full text is available
- OA status
-
greenOpen access status per OpenAlex
- OA URL
-
https://arxiv.org/pdf/2509.19282Direct OA link when available
- Cited by
-
0Total citation count in OpenAlex
Full payload
| id | https://openalex.org/W4415251617 |
|---|---|
| doi | https://doi.org/10.48550/arxiv.2509.19282 |
| ids.doi | https://doi.org/10.48550/arxiv.2509.19282 |
| ids.openalex | https://openalex.org/W4415251617 |
| fwci | |
| type | preprint |
| title | OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T10481 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9828000068664551 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1704 |
| topics[0].subfield.display_name | Computer Graphics and Computer-Aided Design |
| topics[0].display_name | Computer Graphics and Visualization Techniques |
| topics[1].id | https://openalex.org/T10531 |
| topics[1].field.id | https://openalex.org/fields/17 |
| topics[1].field.display_name | Computer Science |
| topics[1].score | 0.9768999814987183 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1707 |
| topics[1].subfield.display_name | Computer Vision and Pattern Recognition |
| topics[1].display_name | Advanced Vision and Imaging |
| topics[2].id | https://openalex.org/T10052 |
| topics[2].field.id | https://openalex.org/fields/17 |
| topics[2].field.display_name | Computer Science |
| topics[2].score | 0.9690999984741211 |
| topics[2].domain.id | https://openalex.org/domains/3 |
| topics[2].domain.display_name | Physical Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1707 |
| topics[2].subfield.display_name | Computer Vision and Pattern Recognition |
| topics[2].display_name | Medical Image Segmentation Techniques |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2509.19282 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2509.19282 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2509.19282 |
| locations[1].id | doi:10.48550/arxiv.2509.19282 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | cc-by |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article |
| locations[1].license_id | https://openalex.org/licenses/cc-by |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2509.19282 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5078314744 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | Bingnan Li |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Li, Bingnan |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5100385875 |
| authorships[1].author.orcid | https://orcid.org/0000-0002-6527-2897 |
| authorships[1].author.display_name | Chenyu Wang |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Wang, Chen-Yu |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5111023416 |
| authorships[2].author.orcid | |
| authorships[2].author.display_name | Haiyang Xu |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Xu, Haiyang |
| authorships[2].is_corresponding | False |
| authorships[3].author.id | https://openalex.org/A5100651136 |
| authorships[3].author.orcid | https://orcid.org/0000-0002-1017-742X |
| authorships[3].author.display_name | Xiang Zhang |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Zhang, Xiang |
| authorships[3].is_corresponding | False |
| authorships[4].author.id | https://openalex.org/A5027165698 |
| authorships[4].author.orcid | https://orcid.org/0000-0002-4516-6317 |
| authorships[4].author.display_name | Ethan J. Armand |
| authorships[4].author_position | middle |
| authorships[4].raw_author_name | Armand, Ethan |
| authorships[4].is_corresponding | False |
| authorships[5].author.id | https://openalex.org/A5007978190 |
| authorships[5].author.orcid | https://orcid.org/0000-0001-5755-7123 |
| authorships[5].author.display_name | Divyansh Srivastava |
| authorships[5].author_position | middle |
| authorships[5].raw_author_name | Srivastava, Divyansh |
| authorships[5].is_corresponding | False |
| authorships[6].author.id | https://openalex.org/A5034063217 |
| authorships[6].author.orcid | https://orcid.org/0000-0002-2569-7161 |
| authorships[6].author.display_name | Xiaojun Shan |
| authorships[6].author_position | middle |
| authorships[6].raw_author_name | Shan, Xiaojun |
| authorships[6].is_corresponding | False |
| authorships[7].author.id | https://openalex.org/A5104236856 |
| authorships[7].author.orcid | |
| authorships[7].author.display_name | Zeyuan Chen |
| authorships[7].author_position | middle |
| authorships[7].raw_author_name | Chen, Zeyuan |
| authorships[7].is_corresponding | False |
| authorships[8].author.id | https://openalex.org/A5104112430 |
| authorships[8].author.orcid | |
| authorships[8].author.display_name | Jianwen Xie |
| authorships[8].author_position | middle |
| authorships[8].raw_author_name | Xie, Jianwen |
| authorships[8].is_corresponding | False |
| authorships[9].author.id | https://openalex.org/A5001760915 |
| authorships[9].author.orcid | https://orcid.org/0000-0002-1900-2124 |
| authorships[9].author.display_name | Zhuowen Tu |
| authorships[9].author_position | last |
| authorships[9].raw_author_name | Tu, Zhuowen |
| authorships[9].is_corresponding | False |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2509.19282 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-16T00:00:00 |
| display_name | OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T10481 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9828000068664551 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1704 |
| primary_topic.subfield.display_name | Computer Graphics and Computer-Aided Design |
| primary_topic.display_name | Computer Graphics and Visualization Techniques |
| cited_by_count | 0 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:arXiv.org:2509.19282 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2509.19282 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2509.19282 |
| primary_location.id | pmh:oai:arXiv.org:2509.19282 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/2509.19282 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2509.19282 |
| publication_date | 2025-09-23 |
| publication_year | 2025 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 58, 102, 109, 131, 135 |
| abstract_inverted_index.As | 117 |
| abstract_inverted_index.To | 50, 95 |
| abstract_inverted_index.We | 18 |
| abstract_inverted_index.an | 118 |
| abstract_inverted_index.in | 3, 87 |
| abstract_inverted_index.of | 65, 115 |
| abstract_inverted_index.on | 124, 134 |
| abstract_inverted_index.we | 42, 55, 99, 127 |
| abstract_inverted_index.(1) | 23 |
| abstract_inverted_index.(2) | 28 |
| abstract_inverted_index.Our | 69 |
| abstract_inverted_index.and | 27, 39, 108, 153 |
| abstract_inverted_index.are | 75 |
| abstract_inverted_index.for | 146 |
| abstract_inverted_index.how | 44 |
| abstract_inverted_index.lay | 143 |
| abstract_inverted_index.low | 81 |
| abstract_inverted_index.new | 103 |
| abstract_inverted_index.our | 141 |
| abstract_inverted_index.the | 63, 144 |
| abstract_inverted_index.two | 20 |
| abstract_inverted_index.also | 128 |
| abstract_inverted_index.both | 36 |
| abstract_inverted_index.gap, | 98 |
| abstract_inverted_index.mask | 138 |
| abstract_inverted_index.more | 92, 147 |
| abstract_inverted_index.step | 120 |
| abstract_inverted_index.that | 61, 72 |
| abstract_inverted_index.this | 53, 97 |
| abstract_inverted_index.with | 10, 31, 80 |
| abstract_inverted_index.cases | 79 |
| abstract_inverted_index.large | 24 |
| abstract_inverted_index.link: | 157 |
| abstract_inverted_index.model | 89, 132 |
| abstract_inverted_index.novel | 59 |
| abstract_inverted_index.still | 8 |
| abstract_inverted_index.their | 85 |
| abstract_inverted_index.these | 45 |
| abstract_inverted_index.under | 91, 151 |
| abstract_inverted_index.across | 112 |
| abstract_inverted_index.amodal | 137 |
| abstract_inverted_index.assess | 52 |
| abstract_inverted_index.biased | 76 |
| abstract_inverted_index.boxes. | 17, 68 |
| abstract_inverted_index.bridge | 96 |
| abstract_inverted_index.issue, | 54 |
| abstract_inverted_index.levels | 114 |
| abstract_inverted_index.metric | 60 |
| abstract_inverted_index.robust | 148 |
| abstract_inverted_index.steady | 1 |
| abstract_inverted_index.toward | 77, 121 |
| abstract_inverted_index.Despite | 0 |
| abstract_inverted_index.Project | 156 |
| abstract_inverted_index.Through | 35 |
| abstract_inverted_index.between | 15 |
| abstract_inverted_index.complex | 125 |
| abstract_inverted_index.curated | 136 |
| abstract_inverted_index.current | 6 |
| abstract_inverted_index.degrade | 47 |
| abstract_inverted_index.factors | 46 |
| abstract_inverted_index.initial | 119 |
| abstract_inverted_index.layouts | 11 |
| abstract_inverted_index.methods | 7 |
| abstract_inverted_index.minimal | 32 |
| abstract_inverted_index.overlap | 14 |
| abstract_inverted_index.present | 100 |
| abstract_inverted_index.primary | 21 |
| abstract_inverted_index.propose | 129 |
| abstract_inverted_index.regions | 26 |
| abstract_inverted_index.reveals | 71 |
| abstract_inverted_index.simpler | 78 |
| abstract_inverted_index.values, | 83 |
| abstract_inverted_index.analysis | 70 |
| abstract_inverted_index.balanced | 110 |
| abstract_inverted_index.bounding | 16, 67 |
| abstract_inverted_index.dataset. | 139 |
| abstract_inverted_index.examples | 38 |
| abstract_inverted_index.existing | 73 |
| abstract_inverted_index.identify | 19 |
| abstract_inverted_index.limiting | 84 |
| abstract_inverted_index.progress | 2 |
| abstract_inverted_index.quality. | 49 |
| abstract_inverted_index.semantic | 33 |
| abstract_inverted_index.struggle | 9 |
| abstract_inverted_index.Together, | 140 |
| abstract_inverted_index.analysis, | 41 |
| abstract_inverted_index.benchmark | 104 |
| abstract_inverted_index.different | 113 |
| abstract_inverted_index.featuring | 105 |
| abstract_inverted_index.improving | 122 |
| abstract_inverted_index.instances | 30 |
| abstract_inverted_index.introduce | 56 |
| abstract_inverted_index.overlaps, | 126 |
| abstract_inverted_index.realistic | 152 |
| abstract_inverted_index.benchmarks | 74 |
| abstract_inverted_index.complexity | 64 |
| abstract_inverted_index.containing | 12 |
| abstract_inverted_index.evaluating | 88 |
| abstract_inverted_index.fine-tuned | 133 |
| abstract_inverted_index.generation | 48, 150 |
| abstract_inverted_index.groundwork | 145 |
| abstract_inverted_index.quantifies | 62 |
| abstract_inverted_index.scenarios. | 155 |
| abstract_inverted_index.annotations | 107 |
| abstract_inverted_index.challenges: | 22 |
| abstract_inverted_index.challenging | 93, 154 |
| abstract_inverted_index.conditions. | 94 |
| abstract_inverted_index.demonstrate | 43 |
| abstract_inverted_index.generation, | 5 |
| abstract_inverted_index.overlapping | 25, 29, 66 |
| abstract_inverted_index.performance | 90, 123 |
| abstract_inverted_index.qualitative | 37 |
| abstract_inverted_index.significant | 13 |
| abstract_inverted_index.OverLayScore | 82 |
| abstract_inverted_index.distinction. | 34 |
| abstract_inverted_index.distribution | 111 |
| abstract_inverted_index.high-quality | 106 |
| abstract_inverted_index.quantitative | 40 |
| abstract_inverted_index.OverLayBench, | 101 |
| abstract_inverted_index.OverLayScore, | 57 |
| abstract_inverted_index.OverLayScore. | 116 |
| abstract_inverted_index.contributions | 142 |
| abstract_inverted_index.effectiveness | 86 |
| abstract_inverted_index.systematically | 51 |
| abstract_inverted_index.layout-to-image | 4, 149 |
| abstract_inverted_index.CreatiLayout-AM, | 130 |
| abstract_inverted_index.https://mlpc-ucsd.github.io/OverLayBench. | 158 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 10 |
| citation_normalized_percentile |