Model Based Co-Clustering:High Dimension and Estimation Challenges Article Swipe
Le coclustering par modélisation probabiliste, qui peut être vu comme une extension du clustering par modèle de mélange, permet une réduction à la fois du nombre de lignes (individus) et de colonnes (variables) d'un ensemble de données de façon très parcimonieuse tout en préservant l'interprétabilité des données réduites. De plus, il bénéficie de la riche théorie statistique des modèles probabilistes tant pour l’estimation que pour la sélection du modèle. C’est un domaine actif dans lequel de nombreux travaux récents ont produit de nouvelles avancées, d’un point vue théorique comme méthodologique et appliqué. Après une discussion sur ces avancées, deux messages principaux sont développés, étayés par du matériel de recherche spécifique : (1) le co-clustering nécessite des recherches plus approfondies pour résoudre certains problèmes d'estimation bien identifiés, et (2) le co-clustering est une approche très prometteuse pour le clustering dans le cadre de la (très) grande dimension, qui correspond à la tendance mondiale des données modernes. Travail en collaboration avec Christophe Biernacki et Julien Jacques (J Classif 40, 332–381 (2023))
Related Topics
- Type
- other
- Language
- fr
- Landing Page
- https://inria.hal.science/hal-04862826
- OA Status
- green
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W4406053446
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W4406053446Canonical identifier for this work in OpenAlex
- Title
-
Model Based Co-Clustering:High Dimension and Estimation ChallengesWork title
- Type
-
otherOpenAlex work type
- Language
-
frPrimary language
- Publication year
-
2024Year of publication
- Publication date
-
2024-03-11Full publication date if available
- Authors
-
Christine Keribin, Christophe Biernacki, Julien JacquesList of authors in order
- Landing page
-
https://inria.hal.science/hal-04862826Publisher landing page
- Open access
-
YesWhether a free full text is available
- OA status
-
greenOpen access status per OpenAlex
- OA URL
-
https://inria.hal.science/hal-04862826Direct OA link when available
- Concepts
-
Cluster analysis, Dimension (graph theory), Estimation, Computer science, Data mining, Mathematics, Artificial intelligence, Engineering, Combinatorics, Systems engineeringTop concepts (fields/topics) attached by OpenAlex
- Cited by
-
0Total citation count in OpenAlex
- Related works (count)
-
10Other works algorithmically related by OpenAlex
Full payload
| id | https://openalex.org/W4406053446 |
|---|---|
| doi | |
| ids.openalex | https://openalex.org/W4406053446 |
| fwci | 0.0 |
| type | other |
| title | Model Based Co-Clustering:High Dimension and Estimation Challenges |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T11901 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.6825000047683716 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1702 |
| topics[0].subfield.display_name | Artificial Intelligence |
| topics[0].display_name | Bayesian Methods and Mixture Models |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C73555534 |
| concepts[0].level | 2 |
| concepts[0].score | 0.647038459777832 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q622825 |
| concepts[0].display_name | Cluster analysis |
| concepts[1].id | https://openalex.org/C33676613 |
| concepts[1].level | 2 |
| concepts[1].score | 0.6424554586410522 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q13415176 |
| concepts[1].display_name | Dimension (graph theory) |
| concepts[2].id | https://openalex.org/C96250715 |
| concepts[2].level | 2 |
| concepts[2].score | 0.5986722111701965 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q965330 |
| concepts[2].display_name | Estimation |
| concepts[3].id | https://openalex.org/C41008148 |
| concepts[3].level | 0 |
| concepts[3].score | 0.4757261574268341 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[3].display_name | Computer science |
| concepts[4].id | https://openalex.org/C124101348 |
| concepts[4].level | 1 |
| concepts[4].score | 0.35630449652671814 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q172491 |
| concepts[4].display_name | Data mining |
| concepts[5].id | https://openalex.org/C33923547 |
| concepts[5].level | 0 |
| concepts[5].score | 0.30044782161712646 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q395 |
| concepts[5].display_name | Mathematics |
| concepts[6].id | https://openalex.org/C154945302 |
| concepts[6].level | 1 |
| concepts[6].score | 0.2679719924926758 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[6].display_name | Artificial intelligence |
| concepts[7].id | https://openalex.org/C127413603 |
| concepts[7].level | 0 |
| concepts[7].score | 0.1462286114692688 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q11023 |
| concepts[7].display_name | Engineering |
| concepts[8].id | https://openalex.org/C114614502 |
| concepts[8].level | 1 |
| concepts[8].score | 0.08487498760223389 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q76592 |
| concepts[8].display_name | Combinatorics |
| concepts[9].id | https://openalex.org/C201995342 |
| concepts[9].level | 1 |
| concepts[9].score | 0.0 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q682496 |
| concepts[9].display_name | Systems engineering |
| keywords[0].id | https://openalex.org/keywords/cluster-analysis |
| keywords[0].score | 0.647038459777832 |
| keywords[0].display_name | Cluster analysis |
| keywords[1].id | https://openalex.org/keywords/dimension |
| keywords[1].score | 0.6424554586410522 |
| keywords[1].display_name | Dimension (graph theory) |
| keywords[2].id | https://openalex.org/keywords/estimation |
| keywords[2].score | 0.5986722111701965 |
| keywords[2].display_name | Estimation |
| keywords[3].id | https://openalex.org/keywords/computer-science |
| keywords[3].score | 0.4757261574268341 |
| keywords[3].display_name | Computer science |
| keywords[4].id | https://openalex.org/keywords/data-mining |
| keywords[4].score | 0.35630449652671814 |
| keywords[4].display_name | Data mining |
| keywords[5].id | https://openalex.org/keywords/mathematics |
| keywords[5].score | 0.30044782161712646 |
| keywords[5].display_name | Mathematics |
| keywords[6].id | https://openalex.org/keywords/artificial-intelligence |
| keywords[6].score | 0.2679719924926758 |
| keywords[6].display_name | Artificial intelligence |
| keywords[7].id | https://openalex.org/keywords/engineering |
| keywords[7].score | 0.1462286114692688 |
| keywords[7].display_name | Engineering |
| keywords[8].id | https://openalex.org/keywords/combinatorics |
| keywords[8].score | 0.08487498760223389 |
| keywords[8].display_name | Combinatorics |
| language | fr |
| locations[0].id | pmh:oai:HAL:hal-04862826v1 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306402512 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | False |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | HAL (Le Centre pour la Communication Scientifique Directe) |
| locations[0].source.host_organization | https://openalex.org/I1294671590 |
| locations[0].source.host_organization_name | Centre National de la Recherche Scientifique |
| locations[0].source.host_organization_lineage | https://openalex.org/I1294671590 |
| locations[0].license | cc-by |
| locations[0].pdf_url | |
| locations[0].version | submittedVersion |
| locations[0].raw_type | Other publications |
| locations[0].license_id | https://openalex.org/licenses/cc-by |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | 2024 |
| locations[0].landing_page_url | https://inria.hal.science/hal-04862826 |
| locations[1].id | pmh:oai:lilloa.univ-lille.fr:20.500.12210/121116 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306402203 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | False |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | LillOA (Université de Lille (University Of Lille)) |
| locations[1].source.host_organization | https://openalex.org/I4210123514 |
| locations[1].source.host_organization_name | Centre d'Etudes en Civilisations, Langues et Littératures Etrangères |
| locations[1].source.host_organization_lineage | https://openalex.org/I4210123514 |
| locations[1].license | other-oa |
| locations[1].pdf_url | |
| locations[1].version | submittedVersion |
| locations[1].raw_type | info:eu-repo/semantics/conferenceObject |
| locations[1].license_id | https://openalex.org/licenses/other-oa |
| locations[1].is_accepted | False |
| locations[1].is_published | False |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | |
| authorships[0].author.id | https://openalex.org/A5067014139 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | Christine Keribin |
| authorships[0].countries | FR |
| authorships[0].affiliations[0].institution_ids | https://openalex.org/I54006703 |
| authorships[0].affiliations[0].raw_affiliation_string | CELESTE - Statistique mathématique et apprentissage (Bâtiment 307, 91405, Orsay cedex - France) |
| authorships[0].institutions[0].id | https://openalex.org/I54006703 |
| authorships[0].institutions[0].ror | https://ror.org/002zc3t08 |
| authorships[0].institutions[0].type | facility |
| authorships[0].institutions[0].lineage | https://openalex.org/I1294671590, https://openalex.org/I1294671590, https://openalex.org/I2279609970, https://openalex.org/I2746051580, https://openalex.org/I2800004676, https://openalex.org/I39804081, https://openalex.org/I4210148025, https://openalex.org/I54006703 |
| authorships[0].institutions[0].country_code | FR |
| authorships[0].institutions[0].display_name | Institut de Mécanique Céleste et de Calcul des Éphémérides |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Christine Keribin |
| authorships[0].is_corresponding | False |
| authorships[0].raw_affiliation_strings | CELESTE - Statistique mathématique et apprentissage (Bâtiment 307, 91405, Orsay cedex - France) |
| authorships[1].author.id | https://openalex.org/A5025879237 |
| authorships[1].author.orcid | https://orcid.org/0000-0002-8995-0568 |
| authorships[1].author.display_name | Christophe Biernacki |
| authorships[1].affiliations[0].raw_affiliation_string | MODAL - MOdel for Data Analysis and Learning (France) |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Christophe Biernacki |
| authorships[1].is_corresponding | False |
| authorships[1].raw_affiliation_strings | MODAL - MOdel for Data Analysis and Learning (France) |
| authorships[2].author.id | https://openalex.org/A5026999947 |
| authorships[2].author.orcid | https://orcid.org/0000-0003-4808-2781 |
| authorships[2].author.display_name | Julien Jacques |
| authorships[2].author_position | last |
| authorships[2].raw_author_name | Julien Jacques |
| authorships[2].is_corresponding | False |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://inria.hal.science/hal-04862826 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Model Based Co-Clustering:High Dimension and Estimation Challenges |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T04:12:42.849631 |
| primary_topic.id | https://openalex.org/T11901 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.6825000047683716 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1702 |
| primary_topic.subfield.display_name | Artificial Intelligence |
| primary_topic.display_name | Bayesian Methods and Mixture Models |
| related_works | https://openalex.org/W4391375266, https://openalex.org/W1979597421, https://openalex.org/W2007980826, https://openalex.org/W2061531152, https://openalex.org/W3002753104, https://openalex.org/W2077600819, https://openalex.org/W2142036596, https://openalex.org/W2072657027, https://openalex.org/W2962838298, https://openalex.org/W2600246793 |
| cited_by_count | 0 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:HAL:hal-04862826v1 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306402512 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | False |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | HAL (Le Centre pour la Communication Scientifique Directe) |
| best_oa_location.source.host_organization | https://openalex.org/I1294671590 |
| best_oa_location.source.host_organization_name | Centre National de la Recherche Scientifique |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I1294671590 |
| best_oa_location.license | cc-by |
| best_oa_location.pdf_url | |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | Other publications |
| best_oa_location.license_id | https://openalex.org/licenses/cc-by |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | 2024 |
| best_oa_location.landing_page_url | https://inria.hal.science/hal-04862826 |
| primary_location.id | pmh:oai:HAL:hal-04862826v1 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306402512 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | False |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | HAL (Le Centre pour la Communication Scientifique Directe) |
| primary_location.source.host_organization | https://openalex.org/I1294671590 |
| primary_location.source.host_organization_name | Centre National de la Recherche Scientifique |
| primary_location.source.host_organization_lineage | https://openalex.org/I1294671590 |
| primary_location.license | cc-by |
| primary_location.pdf_url | |
| primary_location.version | submittedVersion |
| primary_location.raw_type | Other publications |
| primary_location.license_id | https://openalex.org/licenses/cc-by |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | 2024 |
| primary_location.landing_page_url | https://inria.hal.science/hal-04862826 |
| publication_date | 2024-03-11 |
| publication_year | 2024 |
| referenced_works_count | 0 |
| abstract_inverted_index.: | 110 |
| abstract_inverted_index.(J | 164 |
| abstract_inverted_index.De | 48 |
| abstract_inverted_index.Le | 0 |
| abstract_inverted_index.de | 16, 26, 30, 35, 37, 52, 75, 81, 107, 141 |
| abstract_inverted_index.du | 12, 24, 67, 105 |
| abstract_inverted_index.en | 42, 156 |
| abstract_inverted_index.et | 29, 90, 126, 161 |
| abstract_inverted_index.il | 50 |
| abstract_inverted_index.la | 22, 53, 65, 142, 149 |
| abstract_inverted_index.le | 112, 128, 136, 139 |
| abstract_inverted_index.un | 70 |
| abstract_inverted_index.vu | 8 |
| abstract_inverted_index.à | 21, 148 |
| abstract_inverted_index.(1) | 111 |
| abstract_inverted_index.(2) | 127 |
| abstract_inverted_index.40, | 166 |
| abstract_inverted_index.ces | 96 |
| abstract_inverted_index.des | 45, 57, 115, 152 |
| abstract_inverted_index.est | 130 |
| abstract_inverted_index.ont | 79 |
| abstract_inverted_index.par | 2, 14, 104 |
| abstract_inverted_index.que | 63 |
| abstract_inverted_index.qui | 5, 146 |
| abstract_inverted_index.sur | 95 |
| abstract_inverted_index.une | 10, 19, 93, 131 |
| abstract_inverted_index.vue | 86 |
| abstract_inverted_index.avec | 158 |
| abstract_inverted_index.bien | 124 |
| abstract_inverted_index.d'un | 33 |
| abstract_inverted_index.dans | 73, 138 |
| abstract_inverted_index.deux | 98 |
| abstract_inverted_index.fois | 23 |
| abstract_inverted_index.peut | 6 |
| abstract_inverted_index.plus | 117 |
| abstract_inverted_index.pour | 61, 64, 119, 135 |
| abstract_inverted_index.sont | 101 |
| abstract_inverted_index.tant | 60 |
| abstract_inverted_index.tout | 41 |
| abstract_inverted_index.actif | 72 |
| abstract_inverted_index.cadre | 140 |
| abstract_inverted_index.comme | 9, 88 |
| abstract_inverted_index.plus, | 49 |
| abstract_inverted_index.point | 85 |
| abstract_inverted_index.riche | 54 |
| abstract_inverted_index.très | 39, 133 |
| abstract_inverted_index.être | 7 |
| abstract_inverted_index.Après | 92 |
| abstract_inverted_index.Julien | 162 |
| abstract_inverted_index.d’un | 84 |
| abstract_inverted_index.façon | 38 |
| abstract_inverted_index.grande | 144 |
| abstract_inverted_index.lequel | 74 |
| abstract_inverted_index.lignes | 27 |
| abstract_inverted_index.nombre | 25 |
| abstract_inverted_index.permet | 18 |
| abstract_inverted_index.(2023)) | 168 |
| abstract_inverted_index.(très) | 143 |
| abstract_inverted_index.Classif | 165 |
| abstract_inverted_index.C’est | 69 |
| abstract_inverted_index.Jacques | 163 |
| abstract_inverted_index.Travail | 155 |
| abstract_inverted_index.domaine | 71 |
| abstract_inverted_index.modèle | 15 |
| abstract_inverted_index.produit | 80 |
| abstract_inverted_index.travaux | 77 |
| abstract_inverted_index.approche | 132 |
| abstract_inverted_index.certains | 121 |
| abstract_inverted_index.colonnes | 31 |
| abstract_inverted_index.données | 36, 46, 153 |
| abstract_inverted_index.ensemble | 34 |
| abstract_inverted_index.messages | 99 |
| abstract_inverted_index.modèle. | 68 |
| abstract_inverted_index.modèles | 58 |
| abstract_inverted_index.mondiale | 151 |
| abstract_inverted_index.nombreux | 76 |
| abstract_inverted_index.récents | 78 |
| abstract_inverted_index.tendance | 150 |
| abstract_inverted_index.théorie | 55 |
| abstract_inverted_index.étayés | 103 |
| abstract_inverted_index.332–381 | 167 |
| abstract_inverted_index.Biernacki | 160 |
| abstract_inverted_index.extension | 11 |
| abstract_inverted_index.matériel | 106 |
| abstract_inverted_index.modernes. | 154 |
| abstract_inverted_index.mélange, | 17 |
| abstract_inverted_index.nouvelles | 82 |
| abstract_inverted_index.recherche | 108 |
| abstract_inverted_index.résoudre | 120 |
| abstract_inverted_index.Christophe | 159 |
| abstract_inverted_index.appliqué. | 91 |
| abstract_inverted_index.avancées, | 83, 97 |
| abstract_inverted_index.clustering | 13, 137 |
| abstract_inverted_index.correspond | 147 |
| abstract_inverted_index.dimension, | 145 |
| abstract_inverted_index.discussion | 94 |
| abstract_inverted_index.nécessite | 114 |
| abstract_inverted_index.principaux | 100 |
| abstract_inverted_index.problèmes | 122 |
| abstract_inverted_index.recherches | 116 |
| abstract_inverted_index.réduction | 20 |
| abstract_inverted_index.réduites. | 47 |
| abstract_inverted_index.sélection | 66 |
| abstract_inverted_index.théorique | 87 |
| abstract_inverted_index.(individus) | 28 |
| abstract_inverted_index.(variables) | 32 |
| abstract_inverted_index.bénéficie | 51 |
| abstract_inverted_index.prometteuse | 134 |
| abstract_inverted_index.préservant | 43 |
| abstract_inverted_index.spécifique | 109 |
| abstract_inverted_index.statistique | 56 |
| abstract_inverted_index.approfondies | 118 |
| abstract_inverted_index.coclustering | 1 |
| abstract_inverted_index.d'estimation | 123 |
| abstract_inverted_index.identifiés, | 125 |
| abstract_inverted_index.co-clustering | 113, 129 |
| abstract_inverted_index.collaboration | 157 |
| abstract_inverted_index.développés, | 102 |
| abstract_inverted_index.modélisation | 3 |
| abstract_inverted_index.parcimonieuse | 40 |
| abstract_inverted_index.probabiliste, | 4 |
| abstract_inverted_index.probabilistes | 59 |
| abstract_inverted_index.l’estimation | 62 |
| abstract_inverted_index.méthodologique | 89 |
| abstract_inverted_index.l'interprétabilité | 44 |
| cited_by_percentile_year | |
| countries_distinct_count | 1 |
| institutions_distinct_count | 3 |
| citation_normalized_percentile.value | 0.27967328 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | False |