Multi-label Open-set Audio Classification Article Swipe
YOU?
·
· 2023
· Open Access
·
· DOI: https://doi.org/10.48550/arxiv.2310.13759
Current audio classification models have small class vocabularies relative to the large number of sound event classes of interest in the real world. Thus, they provide a limited view of the world that may miss important yet unexpected or unknown sound events. To address this issue, open-set audio classification techniques have been developed to detect sound events from unknown classes. Although these methods have been applied to a multi-class context in audio, such as sound scene classification, they have yet to be investigated for polyphonic audio in which sound events overlap, requiring the use of multi-label models. In this study, we establish the problem of multi-label open-set audio classification by creating a dataset with varying unknown class distributions and evaluating baseline approaches built upon existing techniques.
Related Topics
- Type
- preprint
- Language
- en
- Landing Page
- http://arxiv.org/abs/2310.13759
- https://arxiv.org/pdf/2310.13759
- OA Status
- green
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W4387928492
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W4387928492Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.48550/arxiv.2310.13759Digital Object Identifier
- Title
-
Multi-label Open-set Audio ClassificationWork title
- Type
-
preprintOpenAlex work type
- Language
-
enPrimary language
- Publication year
-
2023Year of publication
- Publication date
-
2023-10-20Full publication date if available
- Authors
-
Sripathi Sridhar, Mark CartwrightList of authors in order
- Landing page
-
https://arxiv.org/abs/2310.13759Publisher landing page
- PDF URL
-
https://arxiv.org/pdf/2310.13759Direct link to full text PDF
- Open access
-
YesWhether a free full text is available
- OA status
-
greenOpen access status per OpenAlex
- OA URL
-
https://arxiv.org/pdf/2310.13759Direct OA link when available
- Concepts
-
Computer science, Class (philosophy), Set (abstract data type), Event (particle physics), Speech recognition, Context (archaeology), Sound (geography), Polyphony, Sound recording and reproduction, Artificial intelligence, Geography, Physics, Geomorphology, Archaeology, Programming language, Acoustics, Quantum mechanics, GeologyTop concepts (fields/topics) attached by OpenAlex
- Cited by
-
0Total citation count in OpenAlex
- Related works (count)
-
10Other works algorithmically related by OpenAlex
Full payload
| id | https://openalex.org/W4387928492 |
|---|---|
| doi | https://doi.org/10.48550/arxiv.2310.13759 |
| ids.doi | https://doi.org/10.48550/arxiv.2310.13759 |
| ids.openalex | https://openalex.org/W4387928492 |
| fwci | |
| type | preprint |
| title | Multi-label Open-set Audio Classification |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T11309 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9987999796867371 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1711 |
| topics[0].subfield.display_name | Signal Processing |
| topics[0].display_name | Music and Audio Processing |
| topics[1].id | https://openalex.org/T11220 |
| topics[1].field.id | https://openalex.org/fields/22 |
| topics[1].field.display_name | Engineering |
| topics[1].score | 0.9695000052452087 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/2205 |
| topics[1].subfield.display_name | Civil and Structural Engineering |
| topics[1].display_name | Water Systems and Optimization |
| topics[2].id | https://openalex.org/T13996 |
| topics[2].field.id | https://openalex.org/fields/12 |
| topics[2].field.display_name | Arts and Humanities |
| topics[2].score | 0.9610000252723694 |
| topics[2].domain.id | https://openalex.org/domains/2 |
| topics[2].domain.display_name | Social Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1210 |
| topics[2].subfield.display_name | Music |
| topics[2].display_name | Diverse Musicological Studies |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C41008148 |
| concepts[0].level | 0 |
| concepts[0].score | 0.7174963355064392 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[0].display_name | Computer science |
| concepts[1].id | https://openalex.org/C2777212361 |
| concepts[1].level | 2 |
| concepts[1].score | 0.6616451740264893 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q5127848 |
| concepts[1].display_name | Class (philosophy) |
| concepts[2].id | https://openalex.org/C177264268 |
| concepts[2].level | 2 |
| concepts[2].score | 0.5957716107368469 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q1514741 |
| concepts[2].display_name | Set (abstract data type) |
| concepts[3].id | https://openalex.org/C2779662365 |
| concepts[3].level | 2 |
| concepts[3].score | 0.5406212210655212 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q5416694 |
| concepts[3].display_name | Event (particle physics) |
| concepts[4].id | https://openalex.org/C28490314 |
| concepts[4].level | 1 |
| concepts[4].score | 0.5401095747947693 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q189436 |
| concepts[4].display_name | Speech recognition |
| concepts[5].id | https://openalex.org/C2779343474 |
| concepts[5].level | 2 |
| concepts[5].score | 0.5391578674316406 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q3109175 |
| concepts[5].display_name | Context (archaeology) |
| concepts[6].id | https://openalex.org/C203718221 |
| concepts[6].level | 2 |
| concepts[6].score | 0.5024023056030273 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q491713 |
| concepts[6].display_name | Sound (geography) |
| concepts[7].id | https://openalex.org/C128979739 |
| concepts[7].level | 2 |
| concepts[7].score | 0.49990105628967285 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q179465 |
| concepts[7].display_name | Polyphony |
| concepts[8].id | https://openalex.org/C128422554 |
| concepts[8].level | 2 |
| concepts[8].score | 0.42440640926361084 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q20077126 |
| concepts[8].display_name | Sound recording and reproduction |
| concepts[9].id | https://openalex.org/C154945302 |
| concepts[9].level | 1 |
| concepts[9].score | 0.38330286741256714 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[9].display_name | Artificial intelligence |
| concepts[10].id | https://openalex.org/C205649164 |
| concepts[10].level | 0 |
| concepts[10].score | 0.07081004977226257 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q1071 |
| concepts[10].display_name | Geography |
| concepts[11].id | https://openalex.org/C121332964 |
| concepts[11].level | 0 |
| concepts[11].score | 0.0 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q413 |
| concepts[11].display_name | Physics |
| concepts[12].id | https://openalex.org/C114793014 |
| concepts[12].level | 1 |
| concepts[12].score | 0.0 |
| concepts[12].wikidata | https://www.wikidata.org/wiki/Q52109 |
| concepts[12].display_name | Geomorphology |
| concepts[13].id | https://openalex.org/C166957645 |
| concepts[13].level | 1 |
| concepts[13].score | 0.0 |
| concepts[13].wikidata | https://www.wikidata.org/wiki/Q23498 |
| concepts[13].display_name | Archaeology |
| concepts[14].id | https://openalex.org/C199360897 |
| concepts[14].level | 1 |
| concepts[14].score | 0.0 |
| concepts[14].wikidata | https://www.wikidata.org/wiki/Q9143 |
| concepts[14].display_name | Programming language |
| concepts[15].id | https://openalex.org/C24890656 |
| concepts[15].level | 1 |
| concepts[15].score | 0.0 |
| concepts[15].wikidata | https://www.wikidata.org/wiki/Q82811 |
| concepts[15].display_name | Acoustics |
| concepts[16].id | https://openalex.org/C62520636 |
| concepts[16].level | 1 |
| concepts[16].score | 0.0 |
| concepts[16].wikidata | https://www.wikidata.org/wiki/Q944 |
| concepts[16].display_name | Quantum mechanics |
| concepts[17].id | https://openalex.org/C127313418 |
| concepts[17].level | 0 |
| concepts[17].score | 0.0 |
| concepts[17].wikidata | https://www.wikidata.org/wiki/Q1069 |
| concepts[17].display_name | Geology |
| keywords[0].id | https://openalex.org/keywords/computer-science |
| keywords[0].score | 0.7174963355064392 |
| keywords[0].display_name | Computer science |
| keywords[1].id | https://openalex.org/keywords/class |
| keywords[1].score | 0.6616451740264893 |
| keywords[1].display_name | Class (philosophy) |
| keywords[2].id | https://openalex.org/keywords/set |
| keywords[2].score | 0.5957716107368469 |
| keywords[2].display_name | Set (abstract data type) |
| keywords[3].id | https://openalex.org/keywords/event |
| keywords[3].score | 0.5406212210655212 |
| keywords[3].display_name | Event (particle physics) |
| keywords[4].id | https://openalex.org/keywords/speech-recognition |
| keywords[4].score | 0.5401095747947693 |
| keywords[4].display_name | Speech recognition |
| keywords[5].id | https://openalex.org/keywords/context |
| keywords[5].score | 0.5391578674316406 |
| keywords[5].display_name | Context (archaeology) |
| keywords[6].id | https://openalex.org/keywords/sound |
| keywords[6].score | 0.5024023056030273 |
| keywords[6].display_name | Sound (geography) |
| keywords[7].id | https://openalex.org/keywords/polyphony |
| keywords[7].score | 0.49990105628967285 |
| keywords[7].display_name | Polyphony |
| keywords[8].id | https://openalex.org/keywords/sound-recording-and-reproduction |
| keywords[8].score | 0.42440640926361084 |
| keywords[8].display_name | Sound recording and reproduction |
| keywords[9].id | https://openalex.org/keywords/artificial-intelligence |
| keywords[9].score | 0.38330286741256714 |
| keywords[9].display_name | Artificial intelligence |
| keywords[10].id | https://openalex.org/keywords/geography |
| keywords[10].score | 0.07081004977226257 |
| keywords[10].display_name | Geography |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2310.13759 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2310.13759 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2310.13759 |
| locations[1].id | doi:10.48550/arxiv.2310.13759 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article |
| locations[1].license_id | |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2310.13759 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5011188265 |
| authorships[0].author.orcid | https://orcid.org/0000-0001-9761-3564 |
| authorships[0].author.display_name | Sripathi Sridhar |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Sridhar, Sripathi |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5056532548 |
| authorships[1].author.orcid | https://orcid.org/0000-0002-5908-390X |
| authorships[1].author.display_name | Mark Cartwright |
| authorships[1].author_position | last |
| authorships[1].raw_author_name | Cartwright, Mark |
| authorships[1].is_corresponding | False |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2310.13759 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Multi-label Open-set Audio Classification |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T11309 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9987999796867371 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1711 |
| primary_topic.subfield.display_name | Signal Processing |
| primary_topic.display_name | Music and Audio Processing |
| related_works | https://openalex.org/W2411659965, https://openalex.org/W2387677326, https://openalex.org/W4200063482, https://openalex.org/W2357575019, https://openalex.org/W2370117122, https://openalex.org/W2360603947, https://openalex.org/W2371528275, https://openalex.org/W2375454309, https://openalex.org/W2374135200, https://openalex.org/W2318538434 |
| cited_by_count | 0 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:arXiv.org:2310.13759 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2310.13759 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2310.13759 |
| primary_location.id | pmh:oai:arXiv.org:2310.13759 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/2310.13759 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2310.13759 |
| publication_date | 2023-10-20 |
| publication_year | 2023 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 26, 67, 111 |
| abstract_inverted_index.In | 97 |
| abstract_inverted_index.To | 42 |
| abstract_inverted_index.as | 73 |
| abstract_inverted_index.be | 81 |
| abstract_inverted_index.by | 109 |
| abstract_inverted_index.in | 19, 70, 86 |
| abstract_inverted_index.of | 13, 17, 29, 94, 104 |
| abstract_inverted_index.or | 38 |
| abstract_inverted_index.to | 9, 53, 66, 80 |
| abstract_inverted_index.we | 100 |
| abstract_inverted_index.and | 118 |
| abstract_inverted_index.for | 83 |
| abstract_inverted_index.may | 33 |
| abstract_inverted_index.the | 10, 20, 30, 92, 102 |
| abstract_inverted_index.use | 93 |
| abstract_inverted_index.yet | 36, 79 |
| abstract_inverted_index.been | 51, 64 |
| abstract_inverted_index.from | 57 |
| abstract_inverted_index.have | 4, 50, 63, 78 |
| abstract_inverted_index.miss | 34 |
| abstract_inverted_index.real | 21 |
| abstract_inverted_index.such | 72 |
| abstract_inverted_index.that | 32 |
| abstract_inverted_index.they | 24, 77 |
| abstract_inverted_index.this | 44, 98 |
| abstract_inverted_index.upon | 123 |
| abstract_inverted_index.view | 28 |
| abstract_inverted_index.with | 113 |
| abstract_inverted_index.Thus, | 23 |
| abstract_inverted_index.audio | 1, 47, 85, 107 |
| abstract_inverted_index.built | 122 |
| abstract_inverted_index.class | 6, 116 |
| abstract_inverted_index.event | 15 |
| abstract_inverted_index.large | 11 |
| abstract_inverted_index.scene | 75 |
| abstract_inverted_index.small | 5 |
| abstract_inverted_index.sound | 14, 40, 55, 74, 88 |
| abstract_inverted_index.these | 61 |
| abstract_inverted_index.which | 87 |
| abstract_inverted_index.world | 31 |
| abstract_inverted_index.audio, | 71 |
| abstract_inverted_index.detect | 54 |
| abstract_inverted_index.events | 56, 89 |
| abstract_inverted_index.issue, | 45 |
| abstract_inverted_index.models | 3 |
| abstract_inverted_index.number | 12 |
| abstract_inverted_index.study, | 99 |
| abstract_inverted_index.world. | 22 |
| abstract_inverted_index.Current | 0 |
| abstract_inverted_index.address | 43 |
| abstract_inverted_index.applied | 65 |
| abstract_inverted_index.classes | 16 |
| abstract_inverted_index.context | 69 |
| abstract_inverted_index.dataset | 112 |
| abstract_inverted_index.events. | 41 |
| abstract_inverted_index.limited | 27 |
| abstract_inverted_index.methods | 62 |
| abstract_inverted_index.models. | 96 |
| abstract_inverted_index.problem | 103 |
| abstract_inverted_index.provide | 25 |
| abstract_inverted_index.unknown | 39, 58, 115 |
| abstract_inverted_index.varying | 114 |
| abstract_inverted_index.Although | 60 |
| abstract_inverted_index.baseline | 120 |
| abstract_inverted_index.classes. | 59 |
| abstract_inverted_index.creating | 110 |
| abstract_inverted_index.existing | 124 |
| abstract_inverted_index.interest | 18 |
| abstract_inverted_index.open-set | 46, 106 |
| abstract_inverted_index.overlap, | 90 |
| abstract_inverted_index.relative | 8 |
| abstract_inverted_index.developed | 52 |
| abstract_inverted_index.establish | 101 |
| abstract_inverted_index.important | 35 |
| abstract_inverted_index.requiring | 91 |
| abstract_inverted_index.approaches | 121 |
| abstract_inverted_index.evaluating | 119 |
| abstract_inverted_index.polyphonic | 84 |
| abstract_inverted_index.techniques | 49 |
| abstract_inverted_index.unexpected | 37 |
| abstract_inverted_index.multi-class | 68 |
| abstract_inverted_index.multi-label | 95, 105 |
| abstract_inverted_index.techniques. | 125 |
| abstract_inverted_index.investigated | 82 |
| abstract_inverted_index.vocabularies | 7 |
| abstract_inverted_index.distributions | 117 |
| abstract_inverted_index.classification | 2, 48, 108 |
| abstract_inverted_index.classification, | 76 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 2 |
| sustainable_development_goals[0].id | https://metadata.un.org/sdg/4 |
| sustainable_development_goals[0].score | 0.8100000023841858 |
| sustainable_development_goals[0].display_name | Quality Education |
| citation_normalized_percentile |