You Only Look at Screens: Multimodal Chain-of-Action Agents Article Swipe
YOU?
·
· 2023
· Open Access
·
· DOI: https://doi.org/10.48550/arxiv.2309.11436
Autonomous graphical user interface (GUI) agents aim to facilitate task automation by interacting with the user interface without manual intervention. Recent studies have investigated eliciting the capabilities of large language models (LLMs) for effective engagement in diverse environments. To align with the input-output requirement of LLMs, most existing approaches are developed under a sandbox setting where they rely on external tools and application-specific APIs to parse the environment into textual elements and interpret the predicted actions. Consequently, those approaches often grapple with inference inefficiency and error propagation risks. To mitigate the challenges, we introduce Auto-GUI, a multimodal solution that directly interacts with the interface, bypassing the need for environment parsing or reliance on application-dependent APIs. Moreover, we propose a chain-of-action technique -- leveraging a series of intermediate previous action histories and future action plans -- to help the agent decide what action to execute. We evaluate our approach on a new device-control benchmark AITW with 30$K$ unique instructions, spanning multi-step tasks such as application operation, web searching, and web shopping. Experimental results show that Auto-GUI achieves state-of-the-art performance with an action type prediction accuracy of 90\% and an overall action success rate of 74\%. Code is publicly available at https://github.com/cooelf/Auto-GUI.
Related Topics
- Type
- preprint
- Language
- en
- Landing Page
- http://arxiv.org/abs/2309.11436
- https://arxiv.org/pdf/2309.11436
- OA Status
- green
- Cited By
- 3
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W4386976823
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W4386976823Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.48550/arxiv.2309.11436Digital Object Identifier
- Title
-
You Only Look at Screens: Multimodal Chain-of-Action AgentsWork title
- Type
-
preprintOpenAlex work type
- Language
-
enPrimary language
- Publication year
-
2023Year of publication
- Publication date
-
2023-09-20Full publication date if available
- Authors
-
Zhuosheng Zhang, Aston ZhangList of authors in order
- Landing page
-
https://arxiv.org/abs/2309.11436Publisher landing page
- PDF URL
-
https://arxiv.org/pdf/2309.11436Direct link to full text PDF
- Open access
-
YesWhether a free full text is available
- OA status
-
greenOpen access status per OpenAlex
- OA URL
-
https://arxiv.org/pdf/2309.11436Direct OA link when available
- Concepts
-
Computer science, Action (physics), Parsing, Benchmark (surveying), Task (project management), Sandbox (software development), Interface (matter), Human–computer interaction, Inference, Inefficiency, User interface, Code (set theory), Artificial intelligence, Machine learning, Software engineering, Programming language, Bubble, Economics, Maximum bubble pressure method, Set (abstract data type), Management, Parallel computing, Geodesy, Quantum mechanics, Microeconomics, Physics, GeographyTop concepts (fields/topics) attached by OpenAlex
- Cited by
-
3Total citation count in OpenAlex
- Citations by year (recent)
-
2025: 1, 2024: 2Per-year citation counts (last 5 years)
- Related works (count)
-
10Other works algorithmically related by OpenAlex
Full payload
| id | https://openalex.org/W4386976823 |
|---|---|
| doi | https://doi.org/10.48550/arxiv.2309.11436 |
| ids.doi | https://doi.org/10.48550/arxiv.2309.11436 |
| ids.openalex | https://openalex.org/W4386976823 |
| fwci | |
| type | preprint |
| title | You Only Look at Screens: Multimodal Chain-of-Action Agents |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T10028 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9922999739646912 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1702 |
| topics[0].subfield.display_name | Artificial Intelligence |
| topics[0].display_name | Topic Modeling |
| topics[1].id | https://openalex.org/T12128 |
| topics[1].field.id | https://openalex.org/fields/17 |
| topics[1].field.display_name | Computer Science |
| topics[1].score | 0.9779000282287598 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1702 |
| topics[1].subfield.display_name | Artificial Intelligence |
| topics[1].display_name | AI in Service Interactions |
| topics[2].id | https://openalex.org/T11636 |
| topics[2].field.id | https://openalex.org/fields/27 |
| topics[2].field.display_name | Medicine |
| topics[2].score | 0.9063000082969666 |
| topics[2].domain.id | https://openalex.org/domains/4 |
| topics[2].domain.display_name | Health Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/2718 |
| topics[2].subfield.display_name | Health Informatics |
| topics[2].display_name | Artificial Intelligence in Healthcare and Education |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C41008148 |
| concepts[0].level | 0 |
| concepts[0].score | 0.8311219215393066 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[0].display_name | Computer science |
| concepts[1].id | https://openalex.org/C2780791683 |
| concepts[1].level | 2 |
| concepts[1].score | 0.6354100108146667 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q846785 |
| concepts[1].display_name | Action (physics) |
| concepts[2].id | https://openalex.org/C186644900 |
| concepts[2].level | 2 |
| concepts[2].score | 0.5889145135879517 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q194152 |
| concepts[2].display_name | Parsing |
| concepts[3].id | https://openalex.org/C185798385 |
| concepts[3].level | 2 |
| concepts[3].score | 0.5737404227256775 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q1161707 |
| concepts[3].display_name | Benchmark (surveying) |
| concepts[4].id | https://openalex.org/C2780451532 |
| concepts[4].level | 2 |
| concepts[4].score | 0.5624116659164429 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q759676 |
| concepts[4].display_name | Task (project management) |
| concepts[5].id | https://openalex.org/C167981075 |
| concepts[5].level | 2 |
| concepts[5].score | 0.558518648147583 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q2667186 |
| concepts[5].display_name | Sandbox (software development) |
| concepts[6].id | https://openalex.org/C113843644 |
| concepts[6].level | 4 |
| concepts[6].score | 0.5539332628250122 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q901882 |
| concepts[6].display_name | Interface (matter) |
| concepts[7].id | https://openalex.org/C107457646 |
| concepts[7].level | 1 |
| concepts[7].score | 0.5522633194923401 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q207434 |
| concepts[7].display_name | Human–computer interaction |
| concepts[8].id | https://openalex.org/C2776214188 |
| concepts[8].level | 2 |
| concepts[8].score | 0.518779993057251 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q408386 |
| concepts[8].display_name | Inference |
| concepts[9].id | https://openalex.org/C2778869765 |
| concepts[9].level | 2 |
| concepts[9].score | 0.49507132172584534 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q6028363 |
| concepts[9].display_name | Inefficiency |
| concepts[10].id | https://openalex.org/C89505385 |
| concepts[10].level | 2 |
| concepts[10].score | 0.46380189061164856 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q47146 |
| concepts[10].display_name | User interface |
| concepts[11].id | https://openalex.org/C2776760102 |
| concepts[11].level | 3 |
| concepts[11].score | 0.4241068363189697 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q5139990 |
| concepts[11].display_name | Code (set theory) |
| concepts[12].id | https://openalex.org/C154945302 |
| concepts[12].level | 1 |
| concepts[12].score | 0.40269121527671814 |
| concepts[12].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[12].display_name | Artificial intelligence |
| concepts[13].id | https://openalex.org/C119857082 |
| concepts[13].level | 1 |
| concepts[13].score | 0.36348891258239746 |
| concepts[13].wikidata | https://www.wikidata.org/wiki/Q2539 |
| concepts[13].display_name | Machine learning |
| concepts[14].id | https://openalex.org/C115903868 |
| concepts[14].level | 1 |
| concepts[14].score | 0.31370121240615845 |
| concepts[14].wikidata | https://www.wikidata.org/wiki/Q80993 |
| concepts[14].display_name | Software engineering |
| concepts[15].id | https://openalex.org/C199360897 |
| concepts[15].level | 1 |
| concepts[15].score | 0.2083665430545807 |
| concepts[15].wikidata | https://www.wikidata.org/wiki/Q9143 |
| concepts[15].display_name | Programming language |
| concepts[16].id | https://openalex.org/C157915830 |
| concepts[16].level | 2 |
| concepts[16].score | 0.0 |
| concepts[16].wikidata | https://www.wikidata.org/wiki/Q2928001 |
| concepts[16].display_name | Bubble |
| concepts[17].id | https://openalex.org/C162324750 |
| concepts[17].level | 0 |
| concepts[17].score | 0.0 |
| concepts[17].wikidata | https://www.wikidata.org/wiki/Q8134 |
| concepts[17].display_name | Economics |
| concepts[18].id | https://openalex.org/C129307140 |
| concepts[18].level | 3 |
| concepts[18].score | 0.0 |
| concepts[18].wikidata | https://www.wikidata.org/wiki/Q6795880 |
| concepts[18].display_name | Maximum bubble pressure method |
| concepts[19].id | https://openalex.org/C177264268 |
| concepts[19].level | 2 |
| concepts[19].score | 0.0 |
| concepts[19].wikidata | https://www.wikidata.org/wiki/Q1514741 |
| concepts[19].display_name | Set (abstract data type) |
| concepts[20].id | https://openalex.org/C187736073 |
| concepts[20].level | 1 |
| concepts[20].score | 0.0 |
| concepts[20].wikidata | https://www.wikidata.org/wiki/Q2920921 |
| concepts[20].display_name | Management |
| concepts[21].id | https://openalex.org/C173608175 |
| concepts[21].level | 1 |
| concepts[21].score | 0.0 |
| concepts[21].wikidata | https://www.wikidata.org/wiki/Q232661 |
| concepts[21].display_name | Parallel computing |
| concepts[22].id | https://openalex.org/C13280743 |
| concepts[22].level | 1 |
| concepts[22].score | 0.0 |
| concepts[22].wikidata | https://www.wikidata.org/wiki/Q131089 |
| concepts[22].display_name | Geodesy |
| concepts[23].id | https://openalex.org/C62520636 |
| concepts[23].level | 1 |
| concepts[23].score | 0.0 |
| concepts[23].wikidata | https://www.wikidata.org/wiki/Q944 |
| concepts[23].display_name | Quantum mechanics |
| concepts[24].id | https://openalex.org/C175444787 |
| concepts[24].level | 1 |
| concepts[24].score | 0.0 |
| concepts[24].wikidata | https://www.wikidata.org/wiki/Q39072 |
| concepts[24].display_name | Microeconomics |
| concepts[25].id | https://openalex.org/C121332964 |
| concepts[25].level | 0 |
| concepts[25].score | 0.0 |
| concepts[25].wikidata | https://www.wikidata.org/wiki/Q413 |
| concepts[25].display_name | Physics |
| concepts[26].id | https://openalex.org/C205649164 |
| concepts[26].level | 0 |
| concepts[26].score | 0.0 |
| concepts[26].wikidata | https://www.wikidata.org/wiki/Q1071 |
| concepts[26].display_name | Geography |
| keywords[0].id | https://openalex.org/keywords/computer-science |
| keywords[0].score | 0.8311219215393066 |
| keywords[0].display_name | Computer science |
| keywords[1].id | https://openalex.org/keywords/action |
| keywords[1].score | 0.6354100108146667 |
| keywords[1].display_name | Action (physics) |
| keywords[2].id | https://openalex.org/keywords/parsing |
| keywords[2].score | 0.5889145135879517 |
| keywords[2].display_name | Parsing |
| keywords[3].id | https://openalex.org/keywords/benchmark |
| keywords[3].score | 0.5737404227256775 |
| keywords[3].display_name | Benchmark (surveying) |
| keywords[4].id | https://openalex.org/keywords/task |
| keywords[4].score | 0.5624116659164429 |
| keywords[4].display_name | Task (project management) |
| keywords[5].id | https://openalex.org/keywords/sandbox |
| keywords[5].score | 0.558518648147583 |
| keywords[5].display_name | Sandbox (software development) |
| keywords[6].id | https://openalex.org/keywords/interface |
| keywords[6].score | 0.5539332628250122 |
| keywords[6].display_name | Interface (matter) |
| keywords[7].id | https://openalex.org/keywords/human–computer-interaction |
| keywords[7].score | 0.5522633194923401 |
| keywords[7].display_name | Human–computer interaction |
| keywords[8].id | https://openalex.org/keywords/inference |
| keywords[8].score | 0.518779993057251 |
| keywords[8].display_name | Inference |
| keywords[9].id | https://openalex.org/keywords/inefficiency |
| keywords[9].score | 0.49507132172584534 |
| keywords[9].display_name | Inefficiency |
| keywords[10].id | https://openalex.org/keywords/user-interface |
| keywords[10].score | 0.46380189061164856 |
| keywords[10].display_name | User interface |
| keywords[11].id | https://openalex.org/keywords/code |
| keywords[11].score | 0.4241068363189697 |
| keywords[11].display_name | Code (set theory) |
| keywords[12].id | https://openalex.org/keywords/artificial-intelligence |
| keywords[12].score | 0.40269121527671814 |
| keywords[12].display_name | Artificial intelligence |
| keywords[13].id | https://openalex.org/keywords/machine-learning |
| keywords[13].score | 0.36348891258239746 |
| keywords[13].display_name | Machine learning |
| keywords[14].id | https://openalex.org/keywords/software-engineering |
| keywords[14].score | 0.31370121240615845 |
| keywords[14].display_name | Software engineering |
| keywords[15].id | https://openalex.org/keywords/programming-language |
| keywords[15].score | 0.2083665430545807 |
| keywords[15].display_name | Programming language |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2309.11436 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2309.11436 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2309.11436 |
| locations[1].id | doi:10.48550/arxiv.2309.11436 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article |
| locations[1].license_id | |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2309.11436 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5070962435 |
| authorships[0].author.orcid | https://orcid.org/0000-0002-4183-3645 |
| authorships[0].author.display_name | Zhuosheng Zhang |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Zhang, Zhuosheng |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5049841140 |
| authorships[1].author.orcid | |
| authorships[1].author.display_name | Aston Zhang |
| authorships[1].author_position | last |
| authorships[1].raw_author_name | Zhang, Aston |
| authorships[1].is_corresponding | False |
| has_content.pdf | True |
| has_content.grobid_xml | True |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2309.11436 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | You Only Look at Screens: Multimodal Chain-of-Action Agents |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T10028 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9922999739646912 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1702 |
| primary_topic.subfield.display_name | Artificial Intelligence |
| primary_topic.display_name | Topic Modeling |
| related_works | https://openalex.org/W2033352828, https://openalex.org/W2355810117, https://openalex.org/W3098313552, https://openalex.org/W70177500, https://openalex.org/W2546418048, https://openalex.org/W2076427967, https://openalex.org/W3212184609, https://openalex.org/W2499283203, https://openalex.org/W2795849205, https://openalex.org/W2111618996 |
| cited_by_count | 3 |
| counts_by_year[0].year | 2025 |
| counts_by_year[0].cited_by_count | 1 |
| counts_by_year[1].year | 2024 |
| counts_by_year[1].cited_by_count | 2 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:arXiv.org:2309.11436 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2309.11436 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2309.11436 |
| primary_location.id | pmh:oai:arXiv.org:2309.11436 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/2309.11436 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2309.11436 |
| publication_date | 2023-09-20 |
| publication_year | 2023 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 52, 95, 118, 123, 149 |
| abstract_inverted_index.-- | 121, 134 |
| abstract_inverted_index.To | 38, 88 |
| abstract_inverted_index.We | 144 |
| abstract_inverted_index.an | 179, 187 |
| abstract_inverted_index.as | 162 |
| abstract_inverted_index.at | 198 |
| abstract_inverted_index.by | 11 |
| abstract_inverted_index.in | 35 |
| abstract_inverted_index.is | 195 |
| abstract_inverted_index.of | 27, 44, 125, 184, 192 |
| abstract_inverted_index.on | 58, 112, 148 |
| abstract_inverted_index.or | 110 |
| abstract_inverted_index.to | 7, 64, 135, 142 |
| abstract_inverted_index.we | 92, 116 |
| abstract_inverted_index.aim | 6 |
| abstract_inverted_index.and | 61, 71, 84, 130, 167, 186 |
| abstract_inverted_index.are | 49 |
| abstract_inverted_index.for | 32, 107 |
| abstract_inverted_index.new | 150 |
| abstract_inverted_index.our | 146 |
| abstract_inverted_index.the | 14, 25, 41, 66, 73, 90, 102, 105, 137 |
| abstract_inverted_index.web | 165, 168 |
| abstract_inverted_index.90\% | 185 |
| abstract_inverted_index.AITW | 153 |
| abstract_inverted_index.APIs | 63 |
| abstract_inverted_index.Code | 194 |
| abstract_inverted_index.have | 22 |
| abstract_inverted_index.help | 136 |
| abstract_inverted_index.into | 68 |
| abstract_inverted_index.most | 46 |
| abstract_inverted_index.need | 106 |
| abstract_inverted_index.rate | 191 |
| abstract_inverted_index.rely | 57 |
| abstract_inverted_index.show | 172 |
| abstract_inverted_index.such | 161 |
| abstract_inverted_index.task | 9 |
| abstract_inverted_index.that | 98, 173 |
| abstract_inverted_index.they | 56 |
| abstract_inverted_index.type | 181 |
| abstract_inverted_index.user | 2, 15 |
| abstract_inverted_index.what | 140 |
| abstract_inverted_index.with | 13, 40, 81, 101, 154, 178 |
| abstract_inverted_index.(GUI) | 4 |
| abstract_inverted_index.30$K$ | 155 |
| abstract_inverted_index.74\%. | 193 |
| abstract_inverted_index.APIs. | 114 |
| abstract_inverted_index.LLMs, | 45 |
| abstract_inverted_index.agent | 138 |
| abstract_inverted_index.align | 39 |
| abstract_inverted_index.error | 85 |
| abstract_inverted_index.large | 28 |
| abstract_inverted_index.often | 79 |
| abstract_inverted_index.parse | 65 |
| abstract_inverted_index.plans | 133 |
| abstract_inverted_index.tasks | 160 |
| abstract_inverted_index.those | 77 |
| abstract_inverted_index.tools | 60 |
| abstract_inverted_index.under | 51 |
| abstract_inverted_index.where | 55 |
| abstract_inverted_index.(LLMs) | 31 |
| abstract_inverted_index.Recent | 20 |
| abstract_inverted_index.action | 128, 132, 141, 180, 189 |
| abstract_inverted_index.agents | 5 |
| abstract_inverted_index.decide | 139 |
| abstract_inverted_index.future | 131 |
| abstract_inverted_index.manual | 18 |
| abstract_inverted_index.models | 30 |
| abstract_inverted_index.risks. | 87 |
| abstract_inverted_index.series | 124 |
| abstract_inverted_index.unique | 156 |
| abstract_inverted_index.diverse | 36 |
| abstract_inverted_index.grapple | 80 |
| abstract_inverted_index.overall | 188 |
| abstract_inverted_index.parsing | 109 |
| abstract_inverted_index.propose | 117 |
| abstract_inverted_index.results | 171 |
| abstract_inverted_index.sandbox | 53 |
| abstract_inverted_index.setting | 54 |
| abstract_inverted_index.studies | 21 |
| abstract_inverted_index.success | 190 |
| abstract_inverted_index.textual | 69 |
| abstract_inverted_index.without | 17 |
| abstract_inverted_index.Auto-GUI | 174 |
| abstract_inverted_index.accuracy | 183 |
| abstract_inverted_index.achieves | 175 |
| abstract_inverted_index.actions. | 75 |
| abstract_inverted_index.approach | 147 |
| abstract_inverted_index.directly | 99 |
| abstract_inverted_index.elements | 70 |
| abstract_inverted_index.evaluate | 145 |
| abstract_inverted_index.execute. | 143 |
| abstract_inverted_index.existing | 47 |
| abstract_inverted_index.external | 59 |
| abstract_inverted_index.language | 29 |
| abstract_inverted_index.mitigate | 89 |
| abstract_inverted_index.previous | 127 |
| abstract_inverted_index.publicly | 196 |
| abstract_inverted_index.reliance | 111 |
| abstract_inverted_index.solution | 97 |
| abstract_inverted_index.spanning | 158 |
| abstract_inverted_index.Auto-GUI, | 94 |
| abstract_inverted_index.Moreover, | 115 |
| abstract_inverted_index.available | 197 |
| abstract_inverted_index.benchmark | 152 |
| abstract_inverted_index.bypassing | 104 |
| abstract_inverted_index.developed | 50 |
| abstract_inverted_index.effective | 33 |
| abstract_inverted_index.eliciting | 24 |
| abstract_inverted_index.graphical | 1 |
| abstract_inverted_index.histories | 129 |
| abstract_inverted_index.inference | 82 |
| abstract_inverted_index.interacts | 100 |
| abstract_inverted_index.interface | 3, 16 |
| abstract_inverted_index.interpret | 72 |
| abstract_inverted_index.introduce | 93 |
| abstract_inverted_index.predicted | 74 |
| abstract_inverted_index.shopping. | 169 |
| abstract_inverted_index.technique | 120 |
| abstract_inverted_index.Autonomous | 0 |
| abstract_inverted_index.approaches | 48, 78 |
| abstract_inverted_index.automation | 10 |
| abstract_inverted_index.engagement | 34 |
| abstract_inverted_index.facilitate | 8 |
| abstract_inverted_index.interface, | 103 |
| abstract_inverted_index.leveraging | 122 |
| abstract_inverted_index.multi-step | 159 |
| abstract_inverted_index.multimodal | 96 |
| abstract_inverted_index.operation, | 164 |
| abstract_inverted_index.prediction | 182 |
| abstract_inverted_index.searching, | 166 |
| abstract_inverted_index.application | 163 |
| abstract_inverted_index.challenges, | 91 |
| abstract_inverted_index.environment | 67, 108 |
| abstract_inverted_index.interacting | 12 |
| abstract_inverted_index.performance | 177 |
| abstract_inverted_index.propagation | 86 |
| abstract_inverted_index.requirement | 43 |
| abstract_inverted_index.Experimental | 170 |
| abstract_inverted_index.capabilities | 26 |
| abstract_inverted_index.inefficiency | 83 |
| abstract_inverted_index.input-output | 42 |
| abstract_inverted_index.intermediate | 126 |
| abstract_inverted_index.investigated | 23 |
| abstract_inverted_index.Consequently, | 76 |
| abstract_inverted_index.environments. | 37 |
| abstract_inverted_index.instructions, | 157 |
| abstract_inverted_index.intervention. | 19 |
| abstract_inverted_index.device-control | 151 |
| abstract_inverted_index.chain-of-action | 119 |
| abstract_inverted_index.state-of-the-art | 176 |
| abstract_inverted_index.application-specific | 62 |
| abstract_inverted_index.application-dependent | 113 |
| abstract_inverted_index.https://github.com/cooelf/Auto-GUI. | 199 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 2 |
| citation_normalized_percentile |