Attention-based Joint Detection of Object and Semantic Part Article Swipe
Keval Morabia
,
Jatin Arora
,
T. N. Vijaykumar
·
YOU?
·
· 2020
· Open Access
·
· DOI: https://doi.org/10.48550/arxiv.2007.02419
YOU?
·
· 2020
· Open Access
·
· DOI: https://doi.org/10.48550/arxiv.2007.02419
In this paper, we address the problem of joint detection of objects like dog and its semantic parts like face, leg, etc. Our model is created on top of two Faster-RCNN models that share their features to perform a novel Attention-based feature fusion of related Object and Part features to get enhanced representations of both. These representations are used for final classification and bounding box regression separately for both models. Our experiments on the PASCAL-Part 2010 dataset show that joint detection can simultaneously improve both object detection and part detection in terms of mean Average Precision (mAP) at IoU=0.5.
Related Topics
Concepts
Metadata
- Type
- preprint
- Language
- en
- Landing Page
- http://arxiv.org/abs/2007.02419
- https://arxiv.org/pdf/2007.02419
- OA Status
- green
- Cited By
- 10
- References
- 7
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W3039515117
All OpenAlex metadata
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W3039515117Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.48550/arxiv.2007.02419Digital Object Identifier
- Title
-
Attention-based Joint Detection of Object and Semantic PartWork title
- Type
-
preprintOpenAlex work type
- Language
-
enPrimary language
- Publication year
-
2020Year of publication
- Publication date
-
2020-07-05Full publication date if available
- Authors
-
Keval Morabia, Jatin Arora, T. N. VijaykumarList of authors in order
- Landing page
-
https://arxiv.org/abs/2007.02419Publisher landing page
- PDF URL
-
https://arxiv.org/pdf/2007.02419Direct link to full text PDF
- Open access
-
YesWhether a free full text is available
- OA status
-
greenOpen access status per OpenAlex
- OA URL
-
https://arxiv.org/pdf/2007.02419Direct OA link when available
- Concepts
-
Joint (building), Object (grammar), Computer science, Artificial intelligence, Natural language processing, Engineering, Structural engineeringTop concepts (fields/topics) attached by OpenAlex
- Cited by
-
10Total citation count in OpenAlex
- Citations by year (recent)
-
2024: 2, 2023: 4, 2022: 3, 2021: 1Per-year citation counts (last 5 years)
- References (count)
-
7Number of works referenced by this work
- Related works (count)
-
10Other works algorithmically related by OpenAlex
Full payload
| id | https://openalex.org/W3039515117 |
|---|---|
| doi | https://doi.org/10.48550/arxiv.2007.02419 |
| ids.doi | https://doi.org/10.48550/arxiv.2007.02419 |
| ids.mag | 3039515117 |
| ids.openalex | https://openalex.org/W3039515117 |
| fwci | |
| type | preprint |
| title | Attention-based Joint Detection of Object and Semantic Part |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T10036 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9993000030517578 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1707 |
| topics[0].subfield.display_name | Computer Vision and Pattern Recognition |
| topics[0].display_name | Advanced Neural Network Applications |
| topics[1].id | https://openalex.org/T11605 |
| topics[1].field.id | https://openalex.org/fields/17 |
| topics[1].field.display_name | Computer Science |
| topics[1].score | 0.9993000030517578 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1707 |
| topics[1].subfield.display_name | Computer Vision and Pattern Recognition |
| topics[1].display_name | Visual Attention and Saliency Detection |
| topics[2].id | https://openalex.org/T10627 |
| topics[2].field.id | https://openalex.org/fields/17 |
| topics[2].field.display_name | Computer Science |
| topics[2].score | 0.9987999796867371 |
| topics[2].domain.id | https://openalex.org/domains/3 |
| topics[2].domain.display_name | Physical Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1707 |
| topics[2].subfield.display_name | Computer Vision and Pattern Recognition |
| topics[2].display_name | Advanced Image and Video Retrieval Techniques |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C18555067 |
| concepts[0].level | 2 |
| concepts[0].score | 0.7344177961349487 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q8375051 |
| concepts[0].display_name | Joint (building) |
| concepts[1].id | https://openalex.org/C2781238097 |
| concepts[1].level | 2 |
| concepts[1].score | 0.5358990430831909 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q175026 |
| concepts[1].display_name | Object (grammar) |
| concepts[2].id | https://openalex.org/C41008148 |
| concepts[2].level | 0 |
| concepts[2].score | 0.525903582572937 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[2].display_name | Computer science |
| concepts[3].id | https://openalex.org/C154945302 |
| concepts[3].level | 1 |
| concepts[3].score | 0.40250104665756226 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[3].display_name | Artificial intelligence |
| concepts[4].id | https://openalex.org/C204321447 |
| concepts[4].level | 1 |
| concepts[4].score | 0.3989393711090088 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q30642 |
| concepts[4].display_name | Natural language processing |
| concepts[5].id | https://openalex.org/C127413603 |
| concepts[5].level | 0 |
| concepts[5].score | 0.10397031903266907 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q11023 |
| concepts[5].display_name | Engineering |
| concepts[6].id | https://openalex.org/C66938386 |
| concepts[6].level | 1 |
| concepts[6].score | 0.04326626658439636 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q633538 |
| concepts[6].display_name | Structural engineering |
| keywords[0].id | https://openalex.org/keywords/joint |
| keywords[0].score | 0.7344177961349487 |
| keywords[0].display_name | Joint (building) |
| keywords[1].id | https://openalex.org/keywords/object |
| keywords[1].score | 0.5358990430831909 |
| keywords[1].display_name | Object (grammar) |
| keywords[2].id | https://openalex.org/keywords/computer-science |
| keywords[2].score | 0.525903582572937 |
| keywords[2].display_name | Computer science |
| keywords[3].id | https://openalex.org/keywords/artificial-intelligence |
| keywords[3].score | 0.40250104665756226 |
| keywords[3].display_name | Artificial intelligence |
| keywords[4].id | https://openalex.org/keywords/natural-language-processing |
| keywords[4].score | 0.3989393711090088 |
| keywords[4].display_name | Natural language processing |
| keywords[5].id | https://openalex.org/keywords/engineering |
| keywords[5].score | 0.10397031903266907 |
| keywords[5].display_name | Engineering |
| keywords[6].id | https://openalex.org/keywords/structural-engineering |
| keywords[6].score | 0.04326626658439636 |
| keywords[6].display_name | Structural engineering |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2007.02419 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2007.02419 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2007.02419 |
| locations[1].id | doi:10.48550/arxiv.2007.02419 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article |
| locations[1].license_id | |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2007.02419 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5082373409 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | Keval Morabia |
| authorships[0].countries | US |
| authorships[0].affiliations[0].institution_ids | https://openalex.org/I157725225 |
| authorships[0].affiliations[0].raw_affiliation_string | ***University of Illinois at Urbana-Champaign |
| authorships[0].institutions[0].id | https://openalex.org/I157725225 |
| authorships[0].institutions[0].ror | https://ror.org/047426m28 |
| authorships[0].institutions[0].type | education |
| authorships[0].institutions[0].lineage | https://openalex.org/I157725225 |
| authorships[0].institutions[0].country_code | US |
| authorships[0].institutions[0].display_name | University of Illinois Urbana-Champaign |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Keval Morabia |
| authorships[0].is_corresponding | False |
| authorships[0].raw_affiliation_strings | ***University of Illinois at Urbana-Champaign |
| authorships[1].author.id | https://openalex.org/A5053555464 |
| authorships[1].author.orcid | https://orcid.org/0000-0002-4420-2486 |
| authorships[1].author.display_name | Jatin Arora |
| authorships[1].countries | US |
| authorships[1].affiliations[0].institution_ids | https://openalex.org/I157725225 |
| authorships[1].affiliations[0].raw_affiliation_string | ***University of Illinois at Urbana-Champaign |
| authorships[1].institutions[0].id | https://openalex.org/I157725225 |
| authorships[1].institutions[0].ror | https://ror.org/047426m28 |
| authorships[1].institutions[0].type | education |
| authorships[1].institutions[0].lineage | https://openalex.org/I157725225 |
| authorships[1].institutions[0].country_code | US |
| authorships[1].institutions[0].display_name | University of Illinois Urbana-Champaign |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Jatin Arora |
| authorships[1].is_corresponding | False |
| authorships[1].raw_affiliation_strings | ***University of Illinois at Urbana-Champaign |
| authorships[2].author.id | https://openalex.org/A5103145581 |
| authorships[2].author.orcid | https://orcid.org/0000-0001-6624-4372 |
| authorships[2].author.display_name | T. N. Vijaykumar |
| authorships[2].countries | US |
| authorships[2].affiliations[0].institution_ids | https://openalex.org/I157725225 |
| authorships[2].affiliations[0].raw_affiliation_string | ***University of Illinois at Urbana-Champaign |
| authorships[2].institutions[0].id | https://openalex.org/I157725225 |
| authorships[2].institutions[0].ror | https://ror.org/047426m28 |
| authorships[2].institutions[0].type | education |
| authorships[2].institutions[0].lineage | https://openalex.org/I157725225 |
| authorships[2].institutions[0].country_code | US |
| authorships[2].institutions[0].display_name | University of Illinois Urbana-Champaign |
| authorships[2].author_position | last |
| authorships[2].raw_author_name | Tara Vijaykumar |
| authorships[2].is_corresponding | False |
| authorships[2].raw_affiliation_strings | ***University of Illinois at Urbana-Champaign |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2007.02419 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Attention-based Joint Detection of Object and Semantic Part |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T10036 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9993000030517578 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1707 |
| primary_topic.subfield.display_name | Computer Vision and Pattern Recognition |
| primary_topic.display_name | Advanced Neural Network Applications |
| related_works | https://openalex.org/W4391375266, https://openalex.org/W2748952813, https://openalex.org/W2390279801, https://openalex.org/W2358668433, https://openalex.org/W2376932109, https://openalex.org/W2001405890, https://openalex.org/W2382290278, https://openalex.org/W2478288626, https://openalex.org/W2975200075, https://openalex.org/W3204019825 |
| cited_by_count | 10 |
| counts_by_year[0].year | 2024 |
| counts_by_year[0].cited_by_count | 2 |
| counts_by_year[1].year | 2023 |
| counts_by_year[1].cited_by_count | 4 |
| counts_by_year[2].year | 2022 |
| counts_by_year[2].cited_by_count | 3 |
| counts_by_year[3].year | 2021 |
| counts_by_year[3].cited_by_count | 1 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:arXiv.org:2007.02419 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2007.02419 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2007.02419 |
| primary_location.id | pmh:oai:arXiv.org:2007.02419 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/2007.02419 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2007.02419 |
| publication_date | 2020-07-05 |
| publication_year | 2020 |
| referenced_works | https://openalex.org/W2963403868, https://openalex.org/W2599765304, https://openalex.org/W2168356304, https://openalex.org/W2104408738, https://openalex.org/W2796347433, https://openalex.org/W2953106684, https://openalex.org/W2946411540 |
| referenced_works_count | 7 |
| abstract_inverted_index.a | 38 |
| abstract_inverted_index.In | 0 |
| abstract_inverted_index.at | 97 |
| abstract_inverted_index.in | 90 |
| abstract_inverted_index.is | 24 |
| abstract_inverted_index.of | 7, 10, 28, 43, 53, 92 |
| abstract_inverted_index.on | 26, 72 |
| abstract_inverted_index.to | 36, 49 |
| abstract_inverted_index.we | 3 |
| abstract_inverted_index.Our | 22, 70 |
| abstract_inverted_index.and | 14, 46, 62, 87 |
| abstract_inverted_index.are | 57 |
| abstract_inverted_index.box | 64 |
| abstract_inverted_index.can | 81 |
| abstract_inverted_index.dog | 13 |
| abstract_inverted_index.for | 59, 67 |
| abstract_inverted_index.get | 50 |
| abstract_inverted_index.its | 15 |
| abstract_inverted_index.the | 5, 73 |
| abstract_inverted_index.top | 27 |
| abstract_inverted_index.two | 29 |
| abstract_inverted_index.2010 | 75 |
| abstract_inverted_index.Part | 47 |
| abstract_inverted_index.both | 68, 84 |
| abstract_inverted_index.etc. | 21 |
| abstract_inverted_index.leg, | 20 |
| abstract_inverted_index.like | 12, 18 |
| abstract_inverted_index.mean | 93 |
| abstract_inverted_index.part | 88 |
| abstract_inverted_index.show | 77 |
| abstract_inverted_index.that | 32, 78 |
| abstract_inverted_index.this | 1 |
| abstract_inverted_index.used | 58 |
| abstract_inverted_index.(mAP) | 96 |
| abstract_inverted_index.These | 55 |
| abstract_inverted_index.both. | 54 |
| abstract_inverted_index.face, | 19 |
| abstract_inverted_index.final | 60 |
| abstract_inverted_index.joint | 8, 79 |
| abstract_inverted_index.model | 23 |
| abstract_inverted_index.novel | 39 |
| abstract_inverted_index.parts | 17 |
| abstract_inverted_index.share | 33 |
| abstract_inverted_index.terms | 91 |
| abstract_inverted_index.their | 34 |
| abstract_inverted_index.Object | 45 |
| abstract_inverted_index.fusion | 42 |
| abstract_inverted_index.models | 31 |
| abstract_inverted_index.object | 85 |
| abstract_inverted_index.paper, | 2 |
| abstract_inverted_index.Average | 94 |
| abstract_inverted_index.address | 4 |
| abstract_inverted_index.created | 25 |
| abstract_inverted_index.dataset | 76 |
| abstract_inverted_index.feature | 41 |
| abstract_inverted_index.improve | 83 |
| abstract_inverted_index.models. | 69 |
| abstract_inverted_index.objects | 11 |
| abstract_inverted_index.perform | 37 |
| abstract_inverted_index.problem | 6 |
| abstract_inverted_index.related | 44 |
| abstract_inverted_index.IoU=0.5. | 98 |
| abstract_inverted_index.bounding | 63 |
| abstract_inverted_index.enhanced | 51 |
| abstract_inverted_index.features | 35, 48 |
| abstract_inverted_index.semantic | 16 |
| abstract_inverted_index.Precision | 95 |
| abstract_inverted_index.detection | 9, 80, 86, 89 |
| abstract_inverted_index.regression | 65 |
| abstract_inverted_index.separately | 66 |
| abstract_inverted_index.Faster-RCNN | 30 |
| abstract_inverted_index.PASCAL-Part | 74 |
| abstract_inverted_index.experiments | 71 |
| abstract_inverted_index.classification | 61 |
| abstract_inverted_index.simultaneously | 82 |
| abstract_inverted_index.Attention-based | 40 |
| abstract_inverted_index.representations | 52, 56 |
| cited_by_percentile_year | |
| countries_distinct_count | 1 |
| institutions_distinct_count | 3 |
| citation_normalized_percentile |