Explaining YOLO: Leveraging Grad-CAM to Explain Object Detections
Armin Kirchknopf, Djordje Slijepčević, Ilkay Wunderlich, Michael Breiter, Johannes Traxler, Matthias Zeppelzauer
2022 · Open Access · DOI: https://doi.org/10.48550/arxiv.2211.12108
We investigate the problem of explainability for visual object detectors. Specifically, we demonstrate on the example of the YOLO object detector how to integrate Grad-CAM into the model architecture and analyze the results. We show how to compute attribution-based explanations for individual detections and find that the normalization of the results has a great impact on their interpretation.
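
The abstract's point about normalization can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: it assumes the standard Grad-CAM formulation (channel weights from spatially averaged gradients, then a ReLU over the weighted sum of feature maps), and the arrays `acts`, `grads_strong`, and `grads_weak` are hypothetical stand-ins for the feature maps and gradients of two detections. The two normalization helpers are illustrative choices, not ones taken from the paper.

```python
import numpy as np

def grad_cam(activations, gradients):
    """Grad-CAM map for one score: ReLU(sum_k alpha_k * A_k).

    activations, gradients: arrays of shape (K, H, W).
    """
    # alpha_k: gradient averaged over the spatial dimensions, one per channel
    alphas = gradients.mean(axis=(1, 2))
    # Weighted sum over channels gives an (H, W) map
    cam = np.tensordot(alphas, activations, axes=1)
    return np.maximum(cam, 0.0)

def normalize_each(cams):
    """Scale every map by its own maximum (assumed per-detection scheme)."""
    return [c / c.max() if c.max() > 0 else c for c in cams]

def normalize_shared(cams):
    """Scale all maps by the largest value among them (assumed shared scheme)."""
    peak = max(c.max() for c in cams)
    return [c / peak if peak > 0 else c for c in cams]

rng = np.random.default_rng(0)
acts = rng.random((4, 8, 8))            # hypothetical feature maps (K, H, W)
grads_strong = np.full((4, 8, 8), 1.0)  # hypothetical gradients, detection A
grads_weak = np.full((4, 8, 8), 0.1)    # hypothetical gradients, detection B

cams = [grad_cam(acts, grads_strong), grad_cam(acts, grads_weak)]
per_map = normalize_each(cams)
shared = normalize_shared(cams)

# Peak of the weak detection's map under each normalization
print(per_map[1].max(), shared[1].max())
```

Under per-map scaling both detections peak at the same value, so a weak attribution looks as prominent as a strong one; under shared scaling the relative magnitudes are preserved. Which choice is appropriate depends on whether maps are meant to be compared across detections.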
Metadata
- Type: preprint
- Language: en
- Landing Page: http://arxiv.org/abs/2211.12108, https://arxiv.org/pdf/2211.12108
- OA Status: green
- Cited By: 1
- Related Works: 10
- OpenAlex ID: https://openalex.org/W4309872251
All OpenAlex metadata
- OpenAlex ID: https://openalex.org/W4309872251 (canonical identifier for this work in OpenAlex)
- DOI: https://doi.org/10.48550/arxiv.2211.12108 (Digital Object Identifier)
- Title: Explaining YOLO: Leveraging Grad-CAM to Explain Object Detections (work title)
- Type: preprint (OpenAlex work type)
- Language: en (primary language)
- Publication year: 2022 (year of publication)
- Publication date: 2022-11-22 (full publication date if available)
- Authors: Armin Kirchknopf, Djordje Slijepčević, Ilkay Wunderlich, Michael Breiter, Johannes Traxler, Matthias Zeppelzauer (list of authors in order)
- Landing page: https://arxiv.org/abs/2211.12108 (publisher landing page)
- PDF URL: https://arxiv.org/pdf/2211.12108 (direct link to full text PDF)
- Open access: Yes (whether a free full text is available)
- OA status: green (open access status per OpenAlex)
- OA URL: https://arxiv.org/pdf/2211.12108 (direct OA link when available)
- Concepts: Normalization (sociology), Computer science, Object (grammar), Detector, Interpretation (philosophy), Attribution, Artificial intelligence, Architecture, Computer vision, Psychology, Geography, Programming language, Social psychology, Anthropology, Telecommunications, Sociology, Archaeology (top concepts (fields/topics) attached by OpenAlex)
- Cited by: 1 (total citation count in OpenAlex)
- Citations by year (recent): 2023: 1 (per-year citation counts, last 5 years)
- Related works (count): 10 (other works algorithmically related by OpenAlex)
Full payload
| Field | Value |
|---|---|
| id | https://openalex.org/W4309872251 |
| doi | https://doi.org/10.48550/arxiv.2211.12108 |
| ids.doi | https://doi.org/10.3217/978-3-85125-869-1-13 |
| ids.openalex | https://openalex.org/W4309872251 |
| fwci | 0.19579882 |
| type | preprint |
| title | Explaining YOLO: Leveraging Grad-CAM to Explain Object Detections |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T12026 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9987999796867371 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1702 |
| topics[0].subfield.display_name | Artificial Intelligence |
| topics[0].display_name | Explainable Artificial Intelligence (XAI) |
| topics[1].id | https://openalex.org/T11689 |
| topics[1].field.id | https://openalex.org/fields/17 |
| topics[1].field.display_name | Computer Science |
| topics[1].score | 0.9987000226974487 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1702 |
| topics[1].subfield.display_name | Artificial Intelligence |
| topics[1].display_name | Adversarial Robustness in Machine Learning |
| topics[2].id | https://openalex.org/T10036 |
| topics[2].field.id | https://openalex.org/fields/17 |
| topics[2].field.display_name | Computer Science |
| topics[2].score | 0.9857000112533569 |
| topics[2].domain.id | https://openalex.org/domains/3 |
| topics[2].domain.display_name | Physical Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1707 |
| topics[2].subfield.display_name | Computer Vision and Pattern Recognition |
| topics[2].display_name | Advanced Neural Network Applications |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C136886441 |
| concepts[0].level | 2 |
| concepts[0].score | 0.8094578385353088 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q926129 |
| concepts[0].display_name | Normalization (sociology) |
| concepts[1].id | https://openalex.org/C41008148 |
| concepts[1].level | 0 |
| concepts[1].score | 0.6388735771179199 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[1].display_name | Computer science |
| concepts[2].id | https://openalex.org/C2781238097 |
| concepts[2].level | 2 |
| concepts[2].score | 0.5745640397071838 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q175026 |
| concepts[2].display_name | Object (grammar) |
| concepts[3].id | https://openalex.org/C94915269 |
| concepts[3].level | 2 |
| concepts[3].score | 0.5381428003311157 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q1834857 |
| concepts[3].display_name | Detector |
| concepts[4].id | https://openalex.org/C527412718 |
| concepts[4].level | 2 |
| concepts[4].score | 0.5363783836364746 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q855395 |
| concepts[4].display_name | Interpretation (philosophy) |
| concepts[5].id | https://openalex.org/C143299363 |
| concepts[5].level | 2 |
| concepts[5].score | 0.5290920734405518 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q900584 |
| concepts[5].display_name | Attribution |
| concepts[6].id | https://openalex.org/C154945302 |
| concepts[6].level | 1 |
| concepts[6].score | 0.5223849415779114 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[6].display_name | Artificial intelligence |
| concepts[7].id | https://openalex.org/C123657996 |
| concepts[7].level | 2 |
| concepts[7].score | 0.5193092226982117 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q12271 |
| concepts[7].display_name | Architecture |
| concepts[8].id | https://openalex.org/C31972630 |
| concepts[8].level | 1 |
| concepts[8].score | 0.3966265022754669 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q844240 |
| concepts[8].display_name | Computer vision |
| concepts[9].id | https://openalex.org/C15744967 |
| concepts[9].level | 0 |
| concepts[9].score | 0.18265819549560547 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q9418 |
| concepts[9].display_name | Psychology |
| concepts[10].id | https://openalex.org/C205649164 |
| concepts[10].level | 0 |
| concepts[10].score | 0.09707573056221008 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q1071 |
| concepts[10].display_name | Geography |
| concepts[11].id | https://openalex.org/C199360897 |
| concepts[11].level | 1 |
| concepts[11].score | 0.09190917015075684 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q9143 |
| concepts[11].display_name | Programming language |
| concepts[12].id | https://openalex.org/C77805123 |
| concepts[12].level | 1 |
| concepts[12].score | 0.051008403301239014 |
| concepts[12].wikidata | https://www.wikidata.org/wiki/Q161272 |
| concepts[12].display_name | Social psychology |
| concepts[13].id | https://openalex.org/C19165224 |
| concepts[13].level | 1 |
| concepts[13].score | 0.0 |
| concepts[13].wikidata | https://www.wikidata.org/wiki/Q23404 |
| concepts[13].display_name | Anthropology |
| concepts[14].id | https://openalex.org/C76155785 |
| concepts[14].level | 1 |
| concepts[14].score | 0.0 |
| concepts[14].wikidata | https://www.wikidata.org/wiki/Q418 |
| concepts[14].display_name | Telecommunications |
| concepts[15].id | https://openalex.org/C144024400 |
| concepts[15].level | 0 |
| concepts[15].score | 0.0 |
| concepts[15].wikidata | https://www.wikidata.org/wiki/Q21201 |
| concepts[15].display_name | Sociology |
| concepts[16].id | https://openalex.org/C166957645 |
| concepts[16].level | 1 |
| concepts[16].score | 0.0 |
| concepts[16].wikidata | https://www.wikidata.org/wiki/Q23498 |
| concepts[16].display_name | Archaeology |
| keywords[0].id | https://openalex.org/keywords/normalization |
| keywords[0].score | 0.8094578385353088 |
| keywords[0].display_name | Normalization (sociology) |
| keywords[1].id | https://openalex.org/keywords/computer-science |
| keywords[1].score | 0.6388735771179199 |
| keywords[1].display_name | Computer science |
| keywords[2].id | https://openalex.org/keywords/object |
| keywords[2].score | 0.5745640397071838 |
| keywords[2].display_name | Object (grammar) |
| keywords[3].id | https://openalex.org/keywords/detector |
| keywords[3].score | 0.5381428003311157 |
| keywords[3].display_name | Detector |
| keywords[4].id | https://openalex.org/keywords/interpretation |
| keywords[4].score | 0.5363783836364746 |
| keywords[4].display_name | Interpretation (philosophy) |
| keywords[5].id | https://openalex.org/keywords/attribution |
| keywords[5].score | 0.5290920734405518 |
| keywords[5].display_name | Attribution |
| keywords[6].id | https://openalex.org/keywords/artificial-intelligence |
| keywords[6].score | 0.5223849415779114 |
| keywords[6].display_name | Artificial intelligence |
| keywords[7].id | https://openalex.org/keywords/architecture |
| keywords[7].score | 0.5193092226982117 |
| keywords[7].display_name | Architecture |
| keywords[8].id | https://openalex.org/keywords/computer-vision |
| keywords[8].score | 0.3966265022754669 |
| keywords[8].display_name | Computer vision |
| keywords[9].id | https://openalex.org/keywords/psychology |
| keywords[9].score | 0.18265819549560547 |
| keywords[9].display_name | Psychology |
| keywords[10].id | https://openalex.org/keywords/geography |
| keywords[10].score | 0.09707573056221008 |
| keywords[10].display_name | Geography |
| keywords[11].id | https://openalex.org/keywords/programming-language |
| keywords[11].score | 0.09190917015075684 |
| keywords[11].display_name | Programming language |
| keywords[12].id | https://openalex.org/keywords/social-psychology |
| keywords[12].score | 0.051008403301239014 |
| keywords[12].display_name | Social psychology |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2211.12108 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2211.12108 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2211.12108 |
| locations[1].id | doi:10.48550/arxiv.2211.12108 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article-journal |
| locations[1].license_id | |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2211.12108 |
| locations[2].id | doi:10.3217/978-3-85125-869-1-13 |
| locations[2].is_oa | True |
| locations[2].source.id | https://openalex.org/S4306400660 |
| locations[2].source.issn | |
| locations[2].source.type | repository |
| locations[2].source.is_oa | False |
| locations[2].source.issn_l | |
| locations[2].source.is_core | False |
| locations[2].source.is_in_doaj | False |
| locations[2].source.display_name | TUGraz OPEN Library (Graz University of Technology) |
| locations[2].source.host_organization | https://openalex.org/I4092182 |
| locations[2].source.host_organization_name | Graz University of Technology |
| locations[2].source.host_organization_lineage | https://openalex.org/I4092182 |
| locations[2].license | |
| locations[2].pdf_url | |
| locations[2].version | |
| locations[2].raw_type | |
| locations[2].license_id | |
| locations[2].is_accepted | False |
| locations[2].is_published | |
| locations[2].raw_source_name | |
| locations[2].landing_page_url | https://doi.org/10.3217/978-3-85125-869-1-13 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5035648017 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | Armin Kirchknopf |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Kirchknopf, Armin |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5055814440 |
| authorships[1].author.orcid | https://orcid.org/0000-0002-2295-7466 |
| authorships[1].author.display_name | Djordje Slijepčević |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Slijepcevic, Djordje |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5077671985 |
| authorships[2].author.orcid | |
| authorships[2].author.display_name | Ilkay Wunderlich |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Wunderlich, Ilkay |
| authorships[2].is_corresponding | False |
| authorships[3].author.id | https://openalex.org/A5066764850 |
| authorships[3].author.orcid | https://orcid.org/0000-0002-7245-1623 |
| authorships[3].author.display_name | Michael Breiter |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Breiter, Michael |
| authorships[3].is_corresponding | False |
| authorships[4].author.id | https://openalex.org/A5017339363 |
| authorships[4].author.orcid | |
| authorships[4].author.display_name | Johannes Traxler |
| authorships[4].author_position | middle |
| authorships[4].raw_author_name | Traxler, Johannes |
| authorships[4].is_corresponding | False |
| authorships[5].author.id | https://openalex.org/A5060926433 |
| authorships[5].author.orcid | https://orcid.org/0000-0003-0413-4746 |
| authorships[5].author.display_name | Matthias Zeppelzauer |
| authorships[5].author_position | last |
| authorships[5].raw_author_name | Zeppelzauer, Matthias |
| authorships[5].is_corresponding | False |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2211.12108 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2022-11-29T00:00:00 |
| display_name | Explaining YOLO: Leveraging Grad-CAM to Explain Object Detections |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T12026 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9987999796867371 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1702 |
| primary_topic.subfield.display_name | Artificial Intelligence |
| primary_topic.display_name | Explainable Artificial Intelligence (XAI) |
| related_works | https://openalex.org/W2035546108, https://openalex.org/W2376361520, https://openalex.org/W2133328864, https://openalex.org/W2093949997, https://openalex.org/W2570200690, https://openalex.org/W2389726244, https://openalex.org/W3030478661, https://openalex.org/W2323536476, https://openalex.org/W2104624653, https://openalex.org/W2128730003 |
| cited_by_count | 1 |
| counts_by_year[0].year | 2023 |
| counts_by_year[0].cited_by_count | 1 |
| locations_count | 3 |
| best_oa_location.id | pmh:oai:arXiv.org:2211.12108 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2211.12108 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2211.12108 |
| primary_location.id | pmh:oai:arXiv.org:2211.12108 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/2211.12108 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2211.12108 |
| publication_date | 2022-11-22 |
| publication_year | 2022 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 52 |
| abstract_inverted_index.We | 0, 33 |
| abstract_inverted_index.of | 4, 16, 48 |
| abstract_inverted_index.on | 13, 55 |
| abstract_inverted_index.to | 22, 36 |
| abstract_inverted_index.we | 11 |
| abstract_inverted_index.and | 29, 43 |
| abstract_inverted_index.for | 6, 40 |
| abstract_inverted_index.has | 51 |
| abstract_inverted_index.how | 21, 35 |
| abstract_inverted_index.the | 2, 14, 17, 26, 31, 46, 49 |
| abstract_inverted_index.YOLO | 18 |
| abstract_inverted_index.find | 44 |
| abstract_inverted_index.into | 25 |
| abstract_inverted_index.show | 34 |
| abstract_inverted_index.that | 45 |
| abstract_inverted_index.great | 53 |
| abstract_inverted_index.model | 27 |
| abstract_inverted_index.their | 56 |
| abstract_inverted_index.impact | 54 |
| abstract_inverted_index.object | 8, 19 |
| abstract_inverted_index.visual | 7 |
| abstract_inverted_index.analyze | 30 |
| abstract_inverted_index.compute | 37 |
| abstract_inverted_index.example | 15 |
| abstract_inverted_index.problem | 3 |
| abstract_inverted_index.results | 50 |
| abstract_inverted_index.Grad-CAM | 24 |
| abstract_inverted_index.detector | 20 |
| abstract_inverted_index.results. | 32 |
| abstract_inverted_index.integrate | 23 |
| abstract_inverted_index.detections | 42 |
| abstract_inverted_index.detectors. | 9 |
| abstract_inverted_index.individual | 41 |
| abstract_inverted_index.demonstrate | 12 |
| abstract_inverted_index.investigate | 1 |
| abstract_inverted_index.architecture | 28 |
| abstract_inverted_index.explanations | 39 |
| abstract_inverted_index.Specifically, | 10 |
| abstract_inverted_index.normalization | 47 |
| abstract_inverted_index.explainability | 5 |
| abstract_inverted_index.interpretation. | 57 |
| abstract_inverted_index.attribution-based | 38 |
| cited_by_percentile_year.max | 94 |
| cited_by_percentile_year.min | 89 |
| countries_distinct_count | 0 |
| institutions_distinct_count | 6 |
| sustainable_development_goals[0].id | https://metadata.un.org/sdg/11 |
| sustainable_development_goals[0].score | 0.4399999976158142 |
| sustainable_development_goals[0].display_name | Sustainable cities and communities |
| citation_normalized_percentile.value | 0.5533489 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | False |
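
The `abstract_inverted_index.*` rows in the table above map each word to the positions where it occurs in the abstract. A minimal sketch of how such an index can be turned back into running text follows; the function name `rebuild_abstract` and the `excerpt` dictionary are illustrative, and the excerpt is deliberately limited to positions 0 to 5 rather than the full payload.

```python
def rebuild_abstract(inverted_index):
    """Rebuild text from a mapping of word -> list of word positions."""
    positions = {}
    for word, indices in inverted_index.items():
        for i in indices:
            positions[i] = word
    # Emit the words in position order, separated by single spaces
    return " ".join(positions[i] for i in sorted(positions))

# Excerpt of the index above, restricted to positions 0..5
excerpt = {
    "We": [0],
    "investigate": [1],
    "the": [2],
    "problem": [3],
    "of": [4],
    "explainability": [5],
}

print(rebuild_abstract(excerpt))
```

Applied to the complete index, the same loop would yield the full abstract, since punctuation is stored as part of each word token (for example `detectors.` and `Specifically,`).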