MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation
2023 · Open Access
· DOI: https://doi.org/10.48550/arxiv.2305.15296
The recent popularity of text-to-image diffusion models (DM) can largely be attributed to the intuitive interface they provide to users. The intended generation can be expressed in natural language, with the model producing faithful interpretations of text prompts. However, expressing complex or nuanced ideas in text alone can be difficult. To ease image generation, we propose MultiFusion, which allows one to express complex and nuanced concepts with arbitrarily interleaved inputs of multiple modalities and languages. MultiFusion leverages pre-trained models and aligns them for integration into a cohesive system, thereby avoiding the need for extensive training from scratch. Our experimental results demonstrate the efficient transfer of capabilities from individual modules to the downstream model. Specifically, the fusion of all independent components allows the image generation module to utilize multilingual, interleaved multimodal inputs despite being trained solely on monomodal data in a single language.
Related Topics
- Type: preprint
- Language: en
- Landing Page: http://arxiv.org/abs/2305.15296, https://arxiv.org/pdf/2305.15296
- OA Status: green
- Cited By: 6
- Related Works: 10
- OpenAlex ID: https://openalex.org/W4378510419
Raw OpenAlex JSON
- OpenAlex ID: https://openalex.org/W4378510419 (canonical identifier for this work in OpenAlex)
- DOI: https://doi.org/10.48550/arxiv.2305.15296 (Digital Object Identifier)
- Title: MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation (work title)
- Type: preprint (OpenAlex work type)
- Language: en (primary language)
- Publication year: 2023 (year of publication)
- Publication date: 2023-05-24 (full publication date if available)
- Authors: Marco Bellagente, Manuel Brack, Hannah Teufel, F. Friedrich, Björn Deiseroth, Constantin Eichenberg, Andrew M. Dai, Robert Baldock, Souradeep Nanda, Koen Oostermeijer, Andrés Felipe Cruz-Salinas, Patrick Schramowski, Kristian Kersting, Samuel Weinbach (list of authors in order)
- Landing page: https://arxiv.org/abs/2305.15296 (publisher landing page)
- PDF URL: https://arxiv.org/pdf/2305.15296 (direct link to full text PDF)
- Open access: Yes (whether a free full text is available)
- OA status: green (open access status per OpenAlex)
- OA URL: https://arxiv.org/pdf/2305.15296 (direct OA link when available)
- Concepts: Computer science, Image (mathematics), Modal, Artificial intelligence, Scratch, Natural language processing, Interface (matter), Modalities, Human–computer interaction, Programming language, Bubble, Parallel computing, Social science, Sociology, Chemistry, Maximum bubble pressure method, Polymer chemistry (top concepts, i.e. fields/topics, attached by OpenAlex)
- Cited by: 6 (total citation count in OpenAlex)
- Citations by year (recent): 2025: 3, 2024: 3 (per-year citation counts, last 5 years)
- Related works (count): 10 (other works algorithmically related by OpenAlex)
Full payload
| id | https://openalex.org/W4378510419 |
|---|---|
| doi | https://doi.org/10.48550/arxiv.2305.15296 |
| ids.doi | https://doi.org/10.48550/arxiv.2305.15296 |
| ids.openalex | https://openalex.org/W4378510419 |
| fwci | |
| type | preprint |
| title | MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T10028 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9524000287055969 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1702 |
| topics[0].subfield.display_name | Artificial Intelligence |
| topics[0].display_name | Topic Modeling |
| topics[1].id | https://openalex.org/T11714 |
| topics[1].field.id | https://openalex.org/fields/17 |
| topics[1].field.display_name | Computer Science |
| topics[1].score | 0.9322999715805054 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1707 |
| topics[1].subfield.display_name | Computer Vision and Pattern Recognition |
| topics[1].display_name | Multimodal Machine Learning Applications |
| topics[2].id | https://openalex.org/T10181 |
| topics[2].field.id | https://openalex.org/fields/17 |
| topics[2].field.display_name | Computer Science |
| topics[2].score | 0.930400013923645 |
| topics[2].domain.id | https://openalex.org/domains/3 |
| topics[2].domain.display_name | Physical Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1702 |
| topics[2].subfield.display_name | Artificial Intelligence |
| topics[2].display_name | Natural Language Processing Techniques |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C41008148 |
| concepts[0].level | 0 |
| concepts[0].score | 0.8011695146560669 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[0].display_name | Computer science |
| concepts[1].id | https://openalex.org/C115961682 |
| concepts[1].level | 2 |
| concepts[1].score | 0.5948824882507324 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q860623 |
| concepts[1].display_name | Image (mathematics) |
| concepts[2].id | https://openalex.org/C71139939 |
| concepts[2].level | 2 |
| concepts[2].score | 0.5350372195243835 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q910194 |
| concepts[2].display_name | Modal |
| concepts[3].id | https://openalex.org/C154945302 |
| concepts[3].level | 1 |
| concepts[3].score | 0.5102221965789795 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[3].display_name | Artificial intelligence |
| concepts[4].id | https://openalex.org/C2781235140 |
| concepts[4].level | 2 |
| concepts[4].score | 0.5079715847969055 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q275131 |
| concepts[4].display_name | Scratch |
| concepts[5].id | https://openalex.org/C204321447 |
| concepts[5].level | 1 |
| concepts[5].score | 0.4817104935646057 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q30642 |
| concepts[5].display_name | Natural language processing |
| concepts[6].id | https://openalex.org/C113843644 |
| concepts[6].level | 4 |
| concepts[6].score | 0.4395506978034973 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q901882 |
| concepts[6].display_name | Interface (matter) |
| concepts[7].id | https://openalex.org/C2779903281 |
| concepts[7].level | 2 |
| concepts[7].score | 0.4141630530357361 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q6888026 |
| concepts[7].display_name | Modalities |
| concepts[8].id | https://openalex.org/C107457646 |
| concepts[8].level | 1 |
| concepts[8].score | 0.3371521830558777 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q207434 |
| concepts[8].display_name | Human–computer interaction |
| concepts[9].id | https://openalex.org/C199360897 |
| concepts[9].level | 1 |
| concepts[9].score | 0.24084314703941345 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q9143 |
| concepts[9].display_name | Programming language |
| concepts[10].id | https://openalex.org/C157915830 |
| concepts[10].level | 2 |
| concepts[10].score | 0.0 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q2928001 |
| concepts[10].display_name | Bubble |
| concepts[11].id | https://openalex.org/C173608175 |
| concepts[11].level | 1 |
| concepts[11].score | 0.0 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q232661 |
| concepts[11].display_name | Parallel computing |
| concepts[12].id | https://openalex.org/C36289849 |
| concepts[12].level | 1 |
| concepts[12].score | 0.0 |
| concepts[12].wikidata | https://www.wikidata.org/wiki/Q34749 |
| concepts[12].display_name | Social science |
| concepts[13].id | https://openalex.org/C144024400 |
| concepts[13].level | 0 |
| concepts[13].score | 0.0 |
| concepts[13].wikidata | https://www.wikidata.org/wiki/Q21201 |
| concepts[13].display_name | Sociology |
| concepts[14].id | https://openalex.org/C185592680 |
| concepts[14].level | 0 |
| concepts[14].score | 0.0 |
| concepts[14].wikidata | https://www.wikidata.org/wiki/Q2329 |
| concepts[14].display_name | Chemistry |
| concepts[15].id | https://openalex.org/C129307140 |
| concepts[15].level | 3 |
| concepts[15].score | 0.0 |
| concepts[15].wikidata | https://www.wikidata.org/wiki/Q6795880 |
| concepts[15].display_name | Maximum bubble pressure method |
| concepts[16].id | https://openalex.org/C188027245 |
| concepts[16].level | 1 |
| concepts[16].score | 0.0 |
| concepts[16].wikidata | https://www.wikidata.org/wiki/Q750446 |
| concepts[16].display_name | Polymer chemistry |
| keywords[0].id | https://openalex.org/keywords/computer-science |
| keywords[0].score | 0.8011695146560669 |
| keywords[0].display_name | Computer science |
| keywords[1].id | https://openalex.org/keywords/image |
| keywords[1].score | 0.5948824882507324 |
| keywords[1].display_name | Image (mathematics) |
| keywords[2].id | https://openalex.org/keywords/modal |
| keywords[2].score | 0.5350372195243835 |
| keywords[2].display_name | Modal |
| keywords[3].id | https://openalex.org/keywords/artificial-intelligence |
| keywords[3].score | 0.5102221965789795 |
| keywords[3].display_name | Artificial intelligence |
| keywords[4].id | https://openalex.org/keywords/scratch |
| keywords[4].score | 0.5079715847969055 |
| keywords[4].display_name | Scratch |
| keywords[5].id | https://openalex.org/keywords/natural-language-processing |
| keywords[5].score | 0.4817104935646057 |
| keywords[5].display_name | Natural language processing |
| keywords[6].id | https://openalex.org/keywords/interface |
| keywords[6].score | 0.4395506978034973 |
| keywords[6].display_name | Interface (matter) |
| keywords[7].id | https://openalex.org/keywords/modalities |
| keywords[7].score | 0.4141630530357361 |
| keywords[7].display_name | Modalities |
| keywords[8].id | https://openalex.org/keywords/human–computer-interaction |
| keywords[8].score | 0.3371521830558777 |
| keywords[8].display_name | Human–computer interaction |
| keywords[9].id | https://openalex.org/keywords/programming-language |
| keywords[9].score | 0.24084314703941345 |
| keywords[9].display_name | Programming language |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2305.15296 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2305.15296 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2305.15296 |
| locations[1].id | doi:10.48550/arxiv.2305.15296 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | cc-by |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article |
| locations[1].license_id | https://openalex.org/licenses/cc-by |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2305.15296 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5045754297 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | Marco Bellagente |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Bellagente, Marco |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5002883390 |
| authorships[1].author.orcid | |
| authorships[1].author.display_name | Manuel Brack |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Brack, Manuel |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5051845237 |
| authorships[2].author.orcid | |
| authorships[2].author.display_name | Hannah Teufel |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Teufel, Hannah |
| authorships[2].is_corresponding | False |
| authorships[3].author.id | https://openalex.org/A5014054034 |
| authorships[3].author.orcid | https://orcid.org/0000-0001-5791-9892 |
| authorships[3].author.display_name | F. Friedrich |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Friedrich, Felix |
| authorships[3].is_corresponding | False |
| authorships[4].author.id | https://openalex.org/A5077091864 |
| authorships[4].author.orcid | |
| authorships[4].author.display_name | Björn Deiseroth |
| authorships[4].author_position | middle |
| authorships[4].raw_author_name | Deiseroth, Björn |
| authorships[4].is_corresponding | False |
| authorships[5].author.id | https://openalex.org/A5036135663 |
| authorships[5].author.orcid | https://orcid.org/0000-0002-9973-2687 |
| authorships[5].author.display_name | Constantin Eichenberg |
| authorships[5].author_position | middle |
| authorships[5].raw_author_name | Eichenberg, Constantin |
| authorships[5].is_corresponding | False |
| authorships[6].author.id | https://openalex.org/A5101597225 |
| authorships[6].author.orcid | https://orcid.org/0009-0007-9200-8577 |
| authorships[6].author.display_name | Andrew M. Dai |
| authorships[6].author_position | middle |
| authorships[6].raw_author_name | Dai, Andrew |
| authorships[6].is_corresponding | False |
| authorships[7].author.id | https://openalex.org/A5023222103 |
| authorships[7].author.orcid | https://orcid.org/0000-0002-3222-4004 |
| authorships[7].author.display_name | Robert Baldock |
| authorships[7].author_position | middle |
| authorships[7].raw_author_name | Baldock, Robert |
| authorships[7].is_corresponding | False |
| authorships[8].author.id | https://openalex.org/A5060064566 |
| authorships[8].author.orcid | |
| authorships[8].author.display_name | Souradeep Nanda |
| authorships[8].author_position | middle |
| authorships[8].raw_author_name | Nanda, Souradeep |
| authorships[8].is_corresponding | False |
| authorships[9].author.id | https://openalex.org/A5058203980 |
| authorships[9].author.orcid | |
| authorships[9].author.display_name | Koen Oostermeijer |
| authorships[9].author_position | middle |
| authorships[9].raw_author_name | Oostermeijer, Koen |
| authorships[9].is_corresponding | False |
| authorships[10].author.id | https://openalex.org/A5076876303 |
| authorships[10].author.orcid | |
| authorships[10].author.display_name | Andrés Felipe Cruz-Salinas |
| authorships[10].author_position | middle |
| authorships[10].raw_author_name | Cruz-Salinas, Andres Felipe |
| authorships[10].is_corresponding | False |
| authorships[11].author.id | https://openalex.org/A5017166414 |
| authorships[11].author.orcid | https://orcid.org/0000-0003-1231-7120 |
| authorships[11].author.display_name | Patrick Schramowski |
| authorships[11].author_position | middle |
| authorships[11].raw_author_name | Schramowski, Patrick |
| authorships[11].is_corresponding | False |
| authorships[12].author.id | https://openalex.org/A5037636074 |
| authorships[12].author.orcid | https://orcid.org/0000-0002-2873-9152 |
| authorships[12].author.display_name | Kristian Kersting |
| authorships[12].author_position | middle |
| authorships[12].raw_author_name | Kersting, Kristian |
| authorships[12].is_corresponding | False |
| authorships[13].author.id | https://openalex.org/A5055789969 |
| authorships[13].author.orcid | |
| authorships[13].author.display_name | Samuel Weinbach |
| authorships[13].author_position | last |
| authorships[13].raw_author_name | Weinbach, Samuel |
| authorships[13].is_corresponding | False |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2305.15296 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2023-05-27T00:00:00 |
| display_name | MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T10028 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9524000287055969 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1702 |
| primary_topic.subfield.display_name | Artificial Intelligence |
| primary_topic.display_name | Topic Modeling |
| related_works | https://openalex.org/W2475116013, https://openalex.org/W2770018148, https://openalex.org/W2358308169, https://openalex.org/W2385135707, https://openalex.org/W2140315382, https://openalex.org/W2059109728, https://openalex.org/W322691623, https://openalex.org/W2494989134, https://openalex.org/W4301143707, https://openalex.org/W2952745240 |
| cited_by_count | 6 |
| counts_by_year[0].year | 2025 |
| counts_by_year[0].cited_by_count | 3 |
| counts_by_year[1].year | 2024 |
| counts_by_year[1].cited_by_count | 3 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:arXiv.org:2305.15296 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2305.15296 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2305.15296 |
| primary_location.id | pmh:oai:arXiv.org:2305.15296 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/2305.15296 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2305.15296 |
| publication_date | 2023-05-24 |
| publication_year | 2023 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 85, 139 |
| abstract_inverted_index.To | 50 |
| abstract_inverted_index.be | 10, 24, 48 |
| abstract_inverted_index.in | 26, 44, 138 |
| abstract_inverted_index.of | 3, 35, 70, 104, 116 |
| abstract_inverted_index.on | 135 |
| abstract_inverted_index.or | 41 |
| abstract_inverted_index.to | 12, 18, 60, 109, 125 |
| abstract_inverted_index.we | 54 |
| abstract_inverted_index.Our | 97 |
| abstract_inverted_index.The | 0, 20 |
| abstract_inverted_index.all | 117 |
| abstract_inverted_index.and | 63, 73, 79 |
| abstract_inverted_index.can | 8, 23, 47 |
| abstract_inverted_index.for | 82, 92 |
| abstract_inverted_index.one | 59 |
| abstract_inverted_index.the | 13, 30, 90, 101, 110, 114, 121 |
| abstract_inverted_index.(DM) | 7 |
| abstract_inverted_index.data | 137 |
| abstract_inverted_index.ease | 51 |
| abstract_inverted_index.from | 95, 106 |
| abstract_inverted_index.into | 84 |
| abstract_inverted_index.need | 91 |
| abstract_inverted_index.text | 36, 45 |
| abstract_inverted_index.that | 57 |
| abstract_inverted_index.them | 81 |
| abstract_inverted_index.they | 16 |
| abstract_inverted_index.with | 29, 66 |
| abstract_inverted_index.alone | 46 |
| abstract_inverted_index.being | 132 |
| abstract_inverted_index.ideas | 43 |
| abstract_inverted_index.image | 52, 122 |
| abstract_inverted_index.model | 31 |
| abstract_inverted_index.aligns | 80 |
| abstract_inverted_index.allows | 58, 120 |
| abstract_inverted_index.fusion | 115 |
| abstract_inverted_index.inputs | 69, 130 |
| abstract_inverted_index.model. | 112 |
| abstract_inverted_index.models | 6, 78 |
| abstract_inverted_index.module | 124 |
| abstract_inverted_index.recent | 1 |
| abstract_inverted_index.single | 140 |
| abstract_inverted_index.solely | 134 |
| abstract_inverted_index.users. | 19 |
| abstract_inverted_index.complex | 40, 62 |
| abstract_inverted_index.despite | 131 |
| abstract_inverted_index.express | 61 |
| abstract_inverted_index.largely | 9 |
| abstract_inverted_index.modules | 108 |
| abstract_inverted_index.natural | 27 |
| abstract_inverted_index.nuanced | 42, 64 |
| abstract_inverted_index.propose | 55 |
| abstract_inverted_index.provide | 17 |
| abstract_inverted_index.results | 99 |
| abstract_inverted_index.system, | 87 |
| abstract_inverted_index.thereby | 88 |
| abstract_inverted_index.trained | 133 |
| abstract_inverted_index.utilize | 126 |
| abstract_inverted_index.However, | 38 |
| abstract_inverted_index.avoiding | 89 |
| abstract_inverted_index.cohesive | 86 |
| abstract_inverted_index.concepts | 65 |
| abstract_inverted_index.faithful | 33 |
| abstract_inverted_index.intended | 21 |
| abstract_inverted_index.multiple | 71 |
| abstract_inverted_index.prompts. | 37 |
| abstract_inverted_index.scratch. | 96 |
| abstract_inverted_index.training | 94 |
| abstract_inverted_index.transfer | 103 |
| abstract_inverted_index.diffusion | 5 |
| abstract_inverted_index.efficient | 102 |
| abstract_inverted_index.expressed | 25 |
| abstract_inverted_index.extensive | 93 |
| abstract_inverted_index.interface | 15 |
| abstract_inverted_index.intuitive | 14 |
| abstract_inverted_index.language, | 28 |
| abstract_inverted_index.language. | 141 |
| abstract_inverted_index.leverages | 76 |
| abstract_inverted_index.monomodal | 136 |
| abstract_inverted_index.producing | 32 |
| abstract_inverted_index.attributed | 11 |
| abstract_inverted_index.components | 119 |
| abstract_inverted_index.difficult. | 49 |
| abstract_inverted_index.downstream | 111 |
| abstract_inverted_index.expressing | 39 |
| abstract_inverted_index.generation | 22, 123 |
| abstract_inverted_index.individual | 107 |
| abstract_inverted_index.languages. | 74 |
| abstract_inverted_index.modalities | 72 |
| abstract_inverted_index.multimodal | 129 |
| abstract_inverted_index.popularity | 2 |
| abstract_inverted_index.MultiFusion | 56 |
| abstract_inverted_index.MutliFusion | 75 |
| abstract_inverted_index.arbitrarily | 67 |
| abstract_inverted_index.demonstrate | 100 |
| abstract_inverted_index.generation, | 53 |
| abstract_inverted_index.independent | 118 |
| abstract_inverted_index.integration | 83 |
| abstract_inverted_index.interleaved | 68, 128 |
| abstract_inverted_index.pre-trained | 77 |
| abstract_inverted_index.capabilities | 105 |
| abstract_inverted_index.experimental | 98 |
| abstract_inverted_index.Specifically, | 113 |
| abstract_inverted_index.multilingual, | 127 |
| abstract_inverted_index.text-to-image | 4 |
| abstract_inverted_index.interpretations | 34 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 14 |
| sustainable_development_goals[0].id | https://metadata.un.org/sdg/4 |
| sustainable_development_goals[0].score | 0.7699999809265137 |
| sustainable_development_goals[0].display_name | Quality Education |
| citation_normalized_percentile | |
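
The `abstract_inverted_index.*` rows above store the abstract as a map from each token to the zero-based positions where it occurs. The following is a minimal sketch of how such rows could be turned back into running text; the helper names `parse_rows` and `rebuild_abstract` are hypothetical and introduced here only for illustration, and the row format is assumed to match the table exactly as it is rendered on this page.

```python
from typing import Dict, List

# Prefix used by the flattened payload table on this page (assumption:
# every abstract row starts with exactly this key prefix).
PREFIX = "abstract_inverted_index."


def parse_rows(rows: List[str]) -> Dict[str, List[int]]:
    """Parse table rows like '| abstract_inverted_index.of | 3, 35 |'."""
    index: Dict[str, List[int]] = {}
    for row in rows:
        cells = [cell.strip() for cell in row.strip().strip("|").split("|")]
        if len(cells) != 2 or not cells[0].startswith(PREFIX):
            continue  # skip rows that are not part of the inverted index
        token = cells[0][len(PREFIX):]
        index[token] = [int(p) for p in cells[1].split(",") if p.strip()]
    return index


def rebuild_abstract(index: Dict[str, List[int]]) -> str:
    """Place every token at each of its positions and join them in order."""
    by_position: Dict[int, str] = {}
    for token, positions in index.items():
        for position in positions:
            by_position[position] = token
    return " ".join(by_position[p] for p in sorted(by_position))


if __name__ == "__main__":
    # Hand-written sample covering only the first four positions.
    sample = {"The": [0], "recent": [1], "popularity": [2], "of": [3]}
    print(rebuild_abstract(sample))
```

Feeding all `abstract_inverted_index.*` rows through `parse_rows` and then `rebuild_abstract` should yield the abstract as stored in the payload, including any spelling that differs from the rendered abstract at the top of the page.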