Preventing Adversarial Use of Datasets through Fair Core-Set Construction
Benjamin Spector, Ravi Kumar, Andrew Tomkins
2019 · Open Access · DOI: https://doi.org/10.48550/arxiv.1910.10871
We propose improving the privacy properties of a dataset by publishing only a strategically chosen "core-set" of the data containing a subset of the instances. The core-set allows strong performance on primary tasks, but forces poor performance on unwanted tasks. We give methods for both linear models and neural networks and demonstrate their efficacy on data.
Metadata
- Type
- preprint
- Language
- en
- Landing Page
- http://arxiv.org/abs/1910.10871
- https://arxiv.org/pdf/1910.10871
- OA Status
- green
- References
- 7
- Related Works
- 20
- OpenAlex ID
- https://openalex.org/W2982176728
All OpenAlex metadata
- OpenAlex ID
- https://openalex.org/W2982176728 (Canonical identifier for this work in OpenAlex)
- DOI
- https://doi.org/10.48550/arxiv.1910.10871 (Digital Object Identifier)
- Title
- Preventing Adversarial Use of Datasets through Fair Core-Set Construction (Work title)
- Type
- preprint (OpenAlex work type)
- Language
- en (Primary language)
- Publication year
- 2019 (Year of publication)
- Publication date
- 2019-10-24 (Full publication date if available)
- Authors
- Benjamin Spector, Ravi Kumar, Andrew Tomkins (List of authors in order)
- Landing page
- https://arxiv.org/abs/1910.10871 (Publisher landing page)
- PDF URL
- https://arxiv.org/pdf/1910.10871 (Direct link to full text PDF)
- Open access
- Yes (Whether a free full text is available)
- OA status
- green (Open access status per OpenAlex)
- OA URL
- https://arxiv.org/pdf/1910.10871 (Direct OA link when available)
- Concepts
- Adversarial system, Core (optical fiber), Computer science, Set (abstract data type), Data set, Data publishing, Data mining, Machine learning, Artificial intelligence, Theoretical computer science, Publishing, Programming language, Telecommunications, Political science, Law (Top concepts, i.e. fields/topics, attached by OpenAlex)
- Cited by
- 0 (Total citation count in OpenAlex)
- References (count)
- 7 (Number of works referenced by this work)
- Related works (count)
- 20 (Other works algorithmically related by OpenAlex)
Full payload
| Field | Value |
|---|---|
| id | https://openalex.org/W2982176728 |
| doi | https://doi.org/10.48550/arxiv.1910.10871 |
| ids.doi | https://doi.org/10.48550/arxiv.1910.10871 |
| ids.mag | 2982176728 |
| ids.openalex | https://openalex.org/W2982176728 |
| fwci | |
| type | preprint |
| title | Preventing Adversarial Use of Datasets through Fair Core-Set Construction |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T10764 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9998999834060669 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1702 |
| topics[0].subfield.display_name | Artificial Intelligence |
| topics[0].display_name | Privacy-Preserving Technologies in Data |
| topics[1].id | https://openalex.org/T11689 |
| topics[1].field.id | https://openalex.org/fields/17 |
| topics[1].field.display_name | Computer Science |
| topics[1].score | 0.9998000264167786 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1702 |
| topics[1].subfield.display_name | Artificial Intelligence |
| topics[1].display_name | Adversarial Robustness in Machine Learning |
| topics[2].id | https://openalex.org/T12026 |
| topics[2].field.id | https://openalex.org/fields/17 |
| topics[2].field.display_name | Computer Science |
| topics[2].score | 0.9940999746322632 |
| topics[2].domain.id | https://openalex.org/domains/3 |
| topics[2].domain.display_name | Physical Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1702 |
| topics[2].subfield.display_name | Artificial Intelligence |
| topics[2].display_name | Explainable Artificial Intelligence (XAI) |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C37736160 |
| concepts[0].level | 2 |
| concepts[0].score | 0.8893705010414124 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q1801315 |
| concepts[0].display_name | Adversarial system |
| concepts[1].id | https://openalex.org/C2164484 |
| concepts[1].level | 2 |
| concepts[1].score | 0.8294442892074585 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q5170150 |
| concepts[1].display_name | Core (optical fiber) |
| concepts[2].id | https://openalex.org/C41008148 |
| concepts[2].level | 0 |
| concepts[2].score | 0.7432308197021484 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[2].display_name | Computer science |
| concepts[3].id | https://openalex.org/C177264268 |
| concepts[3].level | 2 |
| concepts[3].score | 0.7353855967521667 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q1514741 |
| concepts[3].display_name | Set (abstract data type) |
| concepts[4].id | https://openalex.org/C58489278 |
| concepts[4].level | 2 |
| concepts[4].score | 0.5787174105644226 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q1172284 |
| concepts[4].display_name | Data set |
| concepts[5].id | https://openalex.org/C2781396290 |
| concepts[5].level | 3 |
| concepts[5].score | 0.43974053859710693 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q17051824 |
| concepts[5].display_name | Data publishing |
| concepts[6].id | https://openalex.org/C124101348 |
| concepts[6].level | 1 |
| concepts[6].score | 0.4211004972457886 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q172491 |
| concepts[6].display_name | Data mining |
| concepts[7].id | https://openalex.org/C119857082 |
| concepts[7].level | 1 |
| concepts[7].score | 0.418558806180954 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q2539 |
| concepts[7].display_name | Machine learning |
| concepts[8].id | https://openalex.org/C154945302 |
| concepts[8].level | 1 |
| concepts[8].score | 0.4095367193222046 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[8].display_name | Artificial intelligence |
| concepts[9].id | https://openalex.org/C80444323 |
| concepts[9].level | 1 |
| concepts[9].score | 0.3209807276725769 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q2878974 |
| concepts[9].display_name | Theoretical computer science |
| concepts[10].id | https://openalex.org/C151719136 |
| concepts[10].level | 2 |
| concepts[10].score | 0.24468189477920532 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q3972943 |
| concepts[10].display_name | Publishing |
| concepts[11].id | https://openalex.org/C199360897 |
| concepts[11].level | 1 |
| concepts[11].score | 0.06864294409751892 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q9143 |
| concepts[11].display_name | Programming language |
| concepts[12].id | https://openalex.org/C76155785 |
| concepts[12].level | 1 |
| concepts[12].score | 0.06157568097114563 |
| concepts[12].wikidata | https://www.wikidata.org/wiki/Q418 |
| concepts[12].display_name | Telecommunications |
| concepts[13].id | https://openalex.org/C17744445 |
| concepts[13].level | 0 |
| concepts[13].score | 0.0 |
| concepts[13].wikidata | https://www.wikidata.org/wiki/Q36442 |
| concepts[13].display_name | Political science |
| concepts[14].id | https://openalex.org/C199539241 |
| concepts[14].level | 1 |
| concepts[14].score | 0.0 |
| concepts[14].wikidata | https://www.wikidata.org/wiki/Q7748 |
| concepts[14].display_name | Law |
| keywords[0].id | https://openalex.org/keywords/adversarial-system |
| keywords[0].score | 0.8893705010414124 |
| keywords[0].display_name | Adversarial system |
| keywords[1].id | https://openalex.org/keywords/core |
| keywords[1].score | 0.8294442892074585 |
| keywords[1].display_name | Core (optical fiber) |
| keywords[2].id | https://openalex.org/keywords/computer-science |
| keywords[2].score | 0.7432308197021484 |
| keywords[2].display_name | Computer science |
| keywords[3].id | https://openalex.org/keywords/set |
| keywords[3].score | 0.7353855967521667 |
| keywords[3].display_name | Set (abstract data type) |
| keywords[4].id | https://openalex.org/keywords/data-set |
| keywords[4].score | 0.5787174105644226 |
| keywords[4].display_name | Data set |
| keywords[5].id | https://openalex.org/keywords/data-publishing |
| keywords[5].score | 0.43974053859710693 |
| keywords[5].display_name | Data publishing |
| keywords[6].id | https://openalex.org/keywords/data-mining |
| keywords[6].score | 0.4211004972457886 |
| keywords[6].display_name | Data mining |
| keywords[7].id | https://openalex.org/keywords/machine-learning |
| keywords[7].score | 0.418558806180954 |
| keywords[7].display_name | Machine learning |
| keywords[8].id | https://openalex.org/keywords/artificial-intelligence |
| keywords[8].score | 0.4095367193222046 |
| keywords[8].display_name | Artificial intelligence |
| keywords[9].id | https://openalex.org/keywords/theoretical-computer-science |
| keywords[9].score | 0.3209807276725769 |
| keywords[9].display_name | Theoretical computer science |
| keywords[10].id | https://openalex.org/keywords/publishing |
| keywords[10].score | 0.24468189477920532 |
| keywords[10].display_name | Publishing |
| keywords[11].id | https://openalex.org/keywords/programming-language |
| keywords[11].score | 0.06864294409751892 |
| keywords[11].display_name | Programming language |
| keywords[12].id | https://openalex.org/keywords/telecommunications |
| keywords[12].score | 0.06157568097114563 |
| keywords[12].display_name | Telecommunications |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:1910.10871 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/1910.10871 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/1910.10871 |
| locations[1].id | mag:2982176728 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | submittedVersion |
| locations[1].raw_type | |
| locations[1].license_id | |
| locations[1].is_accepted | False |
| locations[1].is_published | False |
| locations[1].raw_source_name | arXiv (Cornell University) |
| locations[1].landing_page_url | https://arxiv.org/pdf/1910.10871.pdf |
| locations[2].id | doi:10.48550/arxiv.1910.10871 |
| locations[2].is_oa | True |
| locations[2].source.id | https://openalex.org/S4306400194 |
| locations[2].source.issn | |
| locations[2].source.type | repository |
| locations[2].source.is_oa | True |
| locations[2].source.issn_l | |
| locations[2].source.is_core | False |
| locations[2].source.is_in_doaj | False |
| locations[2].source.display_name | arXiv (Cornell University) |
| locations[2].source.host_organization | https://openalex.org/I205783295 |
| locations[2].source.host_organization_name | Cornell University |
| locations[2].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[2].license | |
| locations[2].pdf_url | |
| locations[2].version | |
| locations[2].raw_type | article |
| locations[2].license_id | |
| locations[2].is_accepted | False |
| locations[2].is_published | |
| locations[2].raw_source_name | |
| locations[2].landing_page_url | https://doi.org/10.48550/arxiv.1910.10871 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5004675499 |
| authorships[0].author.orcid | https://orcid.org/0000-0003-0468-5986 |
| authorships[0].author.display_name | Benjamin Spector |
| authorships[0].countries | US |
| authorships[0].affiliations[0].institution_ids | https://openalex.org/I63966007 |
| authorships[0].affiliations[0].raw_affiliation_string | Massachusetts Institute Of Technology#TAB# |
| authorships[0].institutions[0].id | https://openalex.org/I63966007 |
| authorships[0].institutions[0].ror | https://ror.org/042nb2s44 |
| authorships[0].institutions[0].type | education |
| authorships[0].institutions[0].lineage | https://openalex.org/I63966007 |
| authorships[0].institutions[0].country_code | US |
| authorships[0].institutions[0].display_name | Massachusetts Institute of Technology |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Benjamin Spector |
| authorships[0].is_corresponding | False |
| authorships[0].raw_affiliation_strings | Massachusetts Institute Of Technology#TAB# |
| authorships[1].author.id | https://openalex.org/A5100666340 |
| authorships[1].author.orcid | |
| authorships[1].author.display_name | Ravi Kumar |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Ravi Kumar |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5068021191 |
| authorships[2].author.orcid | https://orcid.org/0000-0002-1611-9255 |
| authorships[2].author.display_name | Andrew Tomkins |
| authorships[2].author_position | last |
| authorships[2].raw_author_name | Andrew Tomkins |
| authorships[2].is_corresponding | False |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/1910.10871 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Preventing Adversarial Use of Datasets through Fair Core-Set Construction |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T10764 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9998999834060669 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1702 |
| primary_topic.subfield.display_name | Artificial Intelligence |
| primary_topic.display_name | Privacy-Preserving Technologies in Data |
| related_works | https://openalex.org/W3167910349, https://openalex.org/W2787198070, https://openalex.org/W2906169906, https://openalex.org/W3007228146, https://openalex.org/W2904885070, https://openalex.org/W3202166365, https://openalex.org/W3048511172, https://openalex.org/W2991272391, https://openalex.org/W3192357745, https://openalex.org/W2888124483, https://openalex.org/W3078442402, https://openalex.org/W3167022417, https://openalex.org/W3175557349, https://openalex.org/W3090502573, https://openalex.org/W3118709450, https://openalex.org/W2507062236, https://openalex.org/W2900598269, https://openalex.org/W2967880504, https://openalex.org/W2964018718, https://openalex.org/W2912012050 |
| cited_by_count | 0 |
| locations_count | 3 |
| best_oa_location.id | pmh:oai:arXiv.org:1910.10871 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/1910.10871 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/1910.10871 |
| primary_location.id | pmh:oai:arXiv.org:1910.10871 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/1910.10871 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/1910.10871 |
| publication_date | 2019-10-24 |
| publication_year | 2019 |
| referenced_works | https://openalex.org/W2162670686, https://openalex.org/W1978906111, https://openalex.org/W2777914285, https://openalex.org/W2913570153, https://openalex.org/W2911978475, https://openalex.org/W2908166511, https://openalex.org/W2155319834 |
| referenced_works_count | 7 |
| abstract_inverted_index.a | 7, 12, 20 |
| abstract_inverted_index.We | 0, 40 |
| abstract_inverted_index.by | 9 |
| abstract_inverted_index.of | 6, 16, 22 |
| abstract_inverted_index.on | 30, 37, 54 |
| abstract_inverted_index.The | 25 |
| abstract_inverted_index.and | 47, 50 |
| abstract_inverted_index.but | 33 |
| abstract_inverted_index.for | 43 |
| abstract_inverted_index.the | 3, 17, 23 |
| abstract_inverted_index.both | 44 |
| abstract_inverted_index.data | 18 |
| abstract_inverted_index.give | 41 |
| abstract_inverted_index.only | 11 |
| abstract_inverted_index.poor | 35 |
| abstract_inverted_index.data. | 55 |
| abstract_inverted_index.their | 52 |
| abstract_inverted_index.allows | 27 |
| abstract_inverted_index.chosen | 14 |
| abstract_inverted_index.forces | 34 |
| abstract_inverted_index.linear | 45 |
| abstract_inverted_index.models | 46 |
| abstract_inverted_index.neural | 48 |
| abstract_inverted_index.strong | 28 |
| abstract_inverted_index.subset | 21 |
| abstract_inverted_index.tasks, | 32 |
| abstract_inverted_index.tasks. | 39 |
| abstract_inverted_index.dataset | 8 |
| abstract_inverted_index.methods | 42 |
| abstract_inverted_index.primary | 31 |
| abstract_inverted_index.privacy | 4 |
| abstract_inverted_index.propose | 1 |
| abstract_inverted_index.core-set | 26 |
| abstract_inverted_index.efficacy | 53 |
| abstract_inverted_index.networks | 49 |
| abstract_inverted_index.unwanted | 38 |
| abstract_inverted_index.improving | 2 |
| abstract_inverted_index."core-set" | 15 |
| abstract_inverted_index.containing | 19 |
| abstract_inverted_index.instances. | 24 |
| abstract_inverted_index.properties | 5 |
| abstract_inverted_index.publishing | 10 |
| abstract_inverted_index.demonstrate | 51 |
| abstract_inverted_index.performance | 29, 36 |
| abstract_inverted_index.strategically | 13 |
| cited_by_percentile_year | |
| countries_distinct_count | 1 |
| institutions_distinct_count | 3 |
| citation_normalized_percentile | |
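
The `abstract_inverted_index.*` rows above store the abstract as a mapping from each word to the positions where it occurs. As a minimal sketch of how such a mapping can be turned back into running text, the Python below places every word at its positions and joins them in order. The function name `rebuild_abstract` and the `sample` dictionary are illustrative assumptions, not part of OpenAlex; `sample` is an excerpt of the rows above, truncated to positions 0 through 8 so that the example stays short.

```python
def rebuild_abstract(inverted_index: dict[str, list[int]]) -> str:
    """Rebuild text from a word -> list-of-positions mapping."""
    by_position: dict[int, str] = {}
    for word, positions in inverted_index.items():
        for pos in positions:
            by_position[pos] = word
    # Join the words in ascending position order.
    return " ".join(by_position[pos] for pos in sorted(by_position))


# Hypothetical excerpt: only positions 0-8 of the index are kept here.
sample = {
    "We": [0],
    "propose": [1],
    "improving": [2],
    "the": [3],
    "privacy": [4],
    "properties": [5],
    "of": [6],
    "a": [7],
    "dataset": [8],
}

print(rebuild_abstract(sample))
```

Run on the complete set of rows rather than this excerpt, the same function should reproduce the abstract quoted near the top of the page.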