Is this normal? A new projection pursuit index to assess a sample against a multivariate null distribution
2025 · Open Access · DOI: https://doi.org/10.48550/arxiv.2502.02397
Many data problems involve reference or normal conditions against which newly collected data are compared. This scenario arises in data collected during clinical trials to detect adverse events, or when measuring climate change against historical norms. The data are typically multivariate, and the normal ranges are often specified by a multivariate normal distribution. This paper develops methods to compare a new sample against the reference distribution using high-dimensional visualisation. It uses a projection pursuit guided tour to produce a sequence of low-dimensional projections steered towards those in which the new sample differs most from the reference. A new projection pursuit index is defined for this purpose. The tour visualisation also draws the projected ellipse corresponding to the reference distribution, which is computed analytically. The methods are implemented in the R package tourr.
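The abstract notes that the projected ellipse is computed analytically. As a minimal sketch of why that is possible, assuming only the standard result that a linear projection of a multivariate normal N(μ, Σ) through a basis A with orthonormal columns is again normal, N(Aᵀμ, AᵀΣA), the following Python computes the projected mean and covariance and checks whether a projected point falls inside the 95% ellipse. This is not the paper's index and not the tourr implementation; the function names `project_normal` and `mahalanobis_sq_2d` and the example numbers are hypothetical.

```python
import math
from typing import List, Tuple

Matrix = List[List[float]]


def transpose(m: Matrix) -> Matrix:
    return [list(row) for row in zip(*m)]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    bt = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in bt] for row in a]


def project_normal(mean: List[float], cov: Matrix, basis: Matrix) -> Tuple[List[float], Matrix]:
    """Project N(mean, cov) through a p x d basis with orthonormal columns.

    Returns (A^T mean, A^T cov A), the parameters of the projected normal.
    """
    at = transpose(basis)
    proj_mean = [sum(a * m for a, m in zip(row, mean)) for row in at]
    proj_cov = matmul(matmul(at, cov), basis)
    return proj_mean, proj_cov


def mahalanobis_sq_2d(y: List[float], mean: List[float], cov: Matrix) -> float:
    """Squared Mahalanobis distance in 2D, using the closed-form 2x2 inverse."""
    (a, b), (c, d) = cov
    det = a * d - b * c
    dx, dy = y[0] - mean[0], y[1] - mean[1]
    return (d * dx * dx - (b + c) * dx * dy + a * dy * dy) / det


# Hypothetical reference distribution in 3D (assumed values, for illustration only).
mean = [0.0, 0.0, 0.0]
cov = [[1.0, 0.0, 0.0], [0.0, 4.0, 0.0], [0.0, 0.0, 9.0]]
# Projection onto the first two coordinate axes (columns are orthonormal).
basis = [[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]]

proj_mean, proj_cov = project_normal(mean, cov, basis)

# A new observation, projected with the same basis.
x = [1.0, 2.0, 5.0]
y = [sum(a * xi for a, xi in zip(row, x)) for row in transpose(basis)]
d2 = mahalanobis_sq_2d(y, proj_mean, proj_cov)

# For 2 degrees of freedom the chi-square quantile has the closed form -2 ln(1 - level).
threshold = -2.0 * math.log(1.0 - 0.95)
inside = d2 <= threshold
```

In this example the observation lies well inside the ellipse in the chosen projection even though its third coordinate is large, which illustrates why a tour needs to be steered towards projections where the sample and reference differ.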
Raw OpenAlex JSON
- OpenAlex ID: https://openalex.org/W4407186850
- DOI: https://doi.org/10.48550/arxiv.2502.02397
- Title: Is this normal? A new projection pursuit index to assess a sample against a multivariate null distribution
- Type: preprint
- Language: en
- Publication year: 2025
- Publication date: 2025-02-04
- Authors: Annalisa Calvi, Ursula Laa, Dianne Cook
- Landing page: https://arxiv.org/abs/2502.02397
- PDF URL: https://arxiv.org/pdf/2502.02397
- Open access: Yes
- OA status: green
- OA URL: https://arxiv.org/pdf/2502.02397
- Concepts: Projection pursuit, Null (SQL), Multivariate statistics, Statistics, Index (typography), Mathematics, Null distribution, Projection (relational algebra), Sample (material), Distribution (mathematics), Computer science, Statistical hypothesis testing, Chemistry, Data mining, Mathematical analysis, Chromatography, Algorithm, Test statistic, World Wide Web
- Cited by: 0
- Related works (count): 10
Full payload
| Field | Value |
|---|---|
| id | https://openalex.org/W4407186850 |
| doi | https://doi.org/10.48550/arxiv.2502.02397 |
| ids.doi | https://doi.org/10.48550/arxiv.2502.02397 |
| ids.openalex | https://openalex.org/W4407186850 |
| fwci | |
| type | preprint |
| title | Is this normal? A new projection pursuit index to assess a sample against a multivariate null distribution |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T11871 |
| topics[0].field.id | https://openalex.org/fields/26 |
| topics[0].field.display_name | Mathematics |
| topics[0].score | 0.9879000186920166 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/2613 |
| topics[0].subfield.display_name | Statistics and Probability |
| topics[0].display_name | Advanced Statistical Methods and Models |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C118038509 |
| concepts[0].level | 2 |
| concepts[0].score | 0.8300400376319885 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q382970 |
| concepts[0].display_name | Projection pursuit |
| concepts[1].id | https://openalex.org/C203763787 |
| concepts[1].level | 2 |
| concepts[1].score | 0.7019880414009094 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q371029 |
| concepts[1].display_name | Null (SQL) |
| concepts[2].id | https://openalex.org/C161584116 |
| concepts[2].level | 2 |
| concepts[2].score | 0.6517233848571777 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q1952580 |
| concepts[2].display_name | Multivariate statistics |
| concepts[3].id | https://openalex.org/C105795698 |
| concepts[3].level | 1 |
| concepts[3].score | 0.5602944493293762 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q12483 |
| concepts[3].display_name | Statistics |
| concepts[4].id | https://openalex.org/C2777382242 |
| concepts[4].level | 2 |
| concepts[4].score | 0.5464680194854736 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q6017816 |
| concepts[4].display_name | Index (typography) |
| concepts[5].id | https://openalex.org/C33923547 |
| concepts[5].level | 0 |
| concepts[5].score | 0.5223004817962646 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q395 |
| concepts[5].display_name | Mathematics |
| concepts[6].id | https://openalex.org/C120639 |
| concepts[6].level | 4 |
| concepts[6].score | 0.49682167172431946 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q7068958 |
| concepts[6].display_name | Null distribution |
| concepts[7].id | https://openalex.org/C57493831 |
| concepts[7].level | 2 |
| concepts[7].score | 0.4765898883342743 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q3134666 |
| concepts[7].display_name | Projection (relational algebra) |
| concepts[8].id | https://openalex.org/C198531522 |
| concepts[8].level | 2 |
| concepts[8].score | 0.47068411111831665 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q485146 |
| concepts[8].display_name | Sample (material) |
| concepts[9].id | https://openalex.org/C110121322 |
| concepts[9].level | 2 |
| concepts[9].score | 0.4216478765010834 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q865811 |
| concepts[9].display_name | Distribution (mathematics) |
| concepts[10].id | https://openalex.org/C41008148 |
| concepts[10].level | 0 |
| concepts[10].score | 0.20045354962348938 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[10].display_name | Computer science |
| concepts[11].id | https://openalex.org/C87007009 |
| concepts[11].level | 2 |
| concepts[11].score | 0.16661512851715088 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q210832 |
| concepts[11].display_name | Statistical hypothesis testing |
| concepts[12].id | https://openalex.org/C185592680 |
| concepts[12].level | 0 |
| concepts[12].score | 0.13098356127738953 |
| concepts[12].wikidata | https://www.wikidata.org/wiki/Q2329 |
| concepts[12].display_name | Chemistry |
| concepts[13].id | https://openalex.org/C124101348 |
| concepts[13].level | 1 |
| concepts[13].score | 0.10683876276016235 |
| concepts[13].wikidata | https://www.wikidata.org/wiki/Q172491 |
| concepts[13].display_name | Data mining |
| concepts[14].id | https://openalex.org/C134306372 |
| concepts[14].level | 1 |
| concepts[14].score | 0.10669630765914917 |
| concepts[14].wikidata | https://www.wikidata.org/wiki/Q7754 |
| concepts[14].display_name | Mathematical analysis |
| concepts[15].id | https://openalex.org/C43617362 |
| concepts[15].level | 1 |
| concepts[15].score | 0.10164150595664978 |
| concepts[15].wikidata | https://www.wikidata.org/wiki/Q170050 |
| concepts[15].display_name | Chromatography |
| concepts[16].id | https://openalex.org/C11413529 |
| concepts[16].level | 1 |
| concepts[16].score | 0.09810498356819153 |
| concepts[16].wikidata | https://www.wikidata.org/wiki/Q8366 |
| concepts[16].display_name | Algorithm |
| concepts[17].id | https://openalex.org/C169857963 |
| concepts[17].level | 3 |
| concepts[17].score | 0.05703887343406677 |
| concepts[17].wikidata | https://www.wikidata.org/wiki/Q1461038 |
| concepts[17].display_name | Test statistic |
| concepts[18].id | https://openalex.org/C136764020 |
| concepts[18].level | 1 |
| concepts[18].score | 0.0 |
| concepts[18].wikidata | https://www.wikidata.org/wiki/Q466 |
| concepts[18].display_name | World Wide Web |
| keywords[0].id | https://openalex.org/keywords/projection-pursuit |
| keywords[0].score | 0.8300400376319885 |
| keywords[0].display_name | Projection pursuit |
| keywords[1].id | https://openalex.org/keywords/null |
| keywords[1].score | 0.7019880414009094 |
| keywords[1].display_name | Null (SQL) |
| keywords[2].id | https://openalex.org/keywords/multivariate-statistics |
| keywords[2].score | 0.6517233848571777 |
| keywords[2].display_name | Multivariate statistics |
| keywords[3].id | https://openalex.org/keywords/statistics |
| keywords[3].score | 0.5602944493293762 |
| keywords[3].display_name | Statistics |
| keywords[4].id | https://openalex.org/keywords/index |
| keywords[4].score | 0.5464680194854736 |
| keywords[4].display_name | Index (typography) |
| keywords[5].id | https://openalex.org/keywords/mathematics |
| keywords[5].score | 0.5223004817962646 |
| keywords[5].display_name | Mathematics |
| keywords[6].id | https://openalex.org/keywords/null-distribution |
| keywords[6].score | 0.49682167172431946 |
| keywords[6].display_name | Null distribution |
| keywords[7].id | https://openalex.org/keywords/projection |
| keywords[7].score | 0.4765898883342743 |
| keywords[7].display_name | Projection (relational algebra) |
| keywords[8].id | https://openalex.org/keywords/sample |
| keywords[8].score | 0.47068411111831665 |
| keywords[8].display_name | Sample (material) |
| keywords[9].id | https://openalex.org/keywords/distribution |
| keywords[9].score | 0.4216478765010834 |
| keywords[9].display_name | Distribution (mathematics) |
| keywords[10].id | https://openalex.org/keywords/computer-science |
| keywords[10].score | 0.20045354962348938 |
| keywords[10].display_name | Computer science |
| keywords[11].id | https://openalex.org/keywords/statistical-hypothesis-testing |
| keywords[11].score | 0.16661512851715088 |
| keywords[11].display_name | Statistical hypothesis testing |
| keywords[12].id | https://openalex.org/keywords/chemistry |
| keywords[12].score | 0.13098356127738953 |
| keywords[12].display_name | Chemistry |
| keywords[13].id | https://openalex.org/keywords/data-mining |
| keywords[13].score | 0.10683876276016235 |
| keywords[13].display_name | Data mining |
| keywords[14].id | https://openalex.org/keywords/mathematical-analysis |
| keywords[14].score | 0.10669630765914917 |
| keywords[14].display_name | Mathematical analysis |
| keywords[15].id | https://openalex.org/keywords/chromatography |
| keywords[15].score | 0.10164150595664978 |
| keywords[15].display_name | Chromatography |
| keywords[16].id | https://openalex.org/keywords/algorithm |
| keywords[16].score | 0.09810498356819153 |
| keywords[16].display_name | Algorithm |
| keywords[17].id | https://openalex.org/keywords/test-statistic |
| keywords[17].score | 0.05703887343406677 |
| keywords[17].display_name | Test statistic |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2502.02397 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2502.02397 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2502.02397 |
| locations[1].id | doi:10.48550/arxiv.2502.02397 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | cc-by |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article |
| locations[1].license_id | https://openalex.org/licenses/cc-by |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2502.02397 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5116167666 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | Annalisa Calvi |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Calvi, Annalisa |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5066847905 |
| authorships[1].author.orcid | https://orcid.org/0000-0002-0249-6439 |
| authorships[1].author.display_name | Ursula Laa |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Laa, Ursula |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5060227214 |
| authorships[2].author.orcid | https://orcid.org/0000-0002-3813-7155 |
| authorships[2].author.display_name | Dianne Cook |
| authorships[2].author_position | last |
| authorships[2].raw_author_name | Cook, Dianne |
| authorships[2].is_corresponding | False |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2502.02397 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Is this normal? A new projection pursuit index to assess a sample against a multivariate null distribution |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T11871 |
| primary_topic.field.id | https://openalex.org/fields/26 |
| primary_topic.field.display_name | Mathematics |
| primary_topic.score | 0.9879000186920166 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/2613 |
| primary_topic.subfield.display_name | Statistics and Probability |
| primary_topic.display_name | Advanced Statistical Methods and Models |
| related_works | https://openalex.org/W4253679632, https://openalex.org/W2312487961, https://openalex.org/W2371601355, https://openalex.org/W2367397236, https://openalex.org/W2373463565, https://openalex.org/W2329220454, https://openalex.org/W2354037709, https://openalex.org/W2788448464, https://openalex.org/W1772501953, https://openalex.org/W2352640762 |
| cited_by_count | 0 |
| locations_count | 2 |
| best_oa_location | identical to locations[0] (pmh:oai:arXiv.org:2502.02397) |
| primary_location | identical to locations[0] (pmh:oai:arXiv.org:2502.02397) |
| publication_date | 2025-02-04 |
| publication_year | 2025 |
| referenced_works_count | 0 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 3 |
| citation_normalized_percentile |