GSelf-MapReduce: A Method for Enhancing Mapreduce Performance in Distributed Heterogeneous Data Centers
2024 · Open Access · DOI: https://doi.org/10.1109/access.2024.3487936
Big data are often stored close to the locations where they are generated, owing to the cost of data transfer. The stored data are then either moved to a single location for processing or processed where they reside. The literature offers several methods for processing data in distributed data centers. In this study, we present a new method for data processing called GSelf-MapReduce. In the proposed method, shuffling is performed among the heterogeneous data centers (DCs) that have completed their data processing. To calculate the data processing cost of the reduce function at each DC, a polynomial regression model was fitted to data obtained in the test environment, and the coefficients of this model were used in the decision process. The key/value pairs to be shuffled are distributed according to the cost of the DCs, considering their location. In addition, shuffling does not wait for all DCs to finish their jobs: DCs that have completed their jobs shuffle among themselves, so the keys are deduplicated between these DCs. As a result, the shuffling volume in the last phase and the total job completion time are reduced. The performance of the proposed method was compared with that of four different distributed data processing methods in the literature. In these comparisons, the proposed method generated 15% less shuffled data than the closest competing method.
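The cost-aware distribution step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`poly_cost`, `shuffle_shares`, `assign_keys`), the example coefficients, and the rule that each DC's share is inversely proportional to its predicted reduce cost are all assumptions made for illustration, and the location term mentioned in the abstract is left out.

```python
from typing import Dict, List, Sequence


def poly_cost(coeffs: Sequence[float], volume: float) -> float:
    """Evaluate c0 + c1*v + c2*v**2 + ... using Horner's rule.

    `coeffs` stands in for the fitted regression coefficients of one DC
    (hypothetical values; the paper's coefficients are not given here).
    """
    result = 0.0
    for c in reversed(coeffs):
        result = result * volume + c
    return result


def shuffle_shares(dc_coeffs: Dict[str, Sequence[float]], volume: float) -> Dict[str, float]:
    """Return each DC's share of the shuffle, summing to 1.

    Assumption: share is inversely proportional to the predicted cost,
    so cheaper DCs receive more key/value pairs.
    """
    inverse = {}
    for dc, coeffs in dc_coeffs.items():
        cost = poly_cost(coeffs, volume)
        if cost <= 0:
            raise ValueError(f"predicted cost for {dc} must be positive")
        inverse[dc] = 1.0 / cost
    total = sum(inverse.values())
    return {dc: weight / total for dc, weight in inverse.items()}


def assign_keys(keys: Sequence[str], shares: Dict[str, float]) -> Dict[str, List[str]]:
    """Split the sorted keys into contiguous slices sized by each DC's share."""
    ordered = sorted(keys)
    dcs = list(shares)
    assignment: Dict[str, List[str]] = {}
    start = 0
    cumulative = 0.0
    for i, dc in enumerate(dcs):
        cumulative += shares[dc]
        # The last DC takes whatever remains, so no key is dropped by rounding.
        end = len(ordered) if i == len(dcs) - 1 else round(cumulative * len(ordered))
        assignment[dc] = ordered[start:end]
        start = end
    return assignment


if __name__ == "__main__":
    coeffs = {"dc_a": [1.0, 0.0, 0.0], "dc_b": [3.0, 0.0, 0.0]}  # hypothetical
    shares = shuffle_shares(coeffs, volume=10.0)
    print(assign_keys([f"k{i}" for i in range(8)], shares))
```

A location-aware variant would fold a transfer-cost term into the weight before normalizing; the abstract does not say how that term is defined, so it is omitted here.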
- Type: article
- Language: en
- Landing Page: https://doi.org/10.1109/access.2024.3487936
- OA Status: gold
- Cited By: 1
- References: 31
- Related Works: 10
- OpenAlex ID: https://openalex.org/W4403863453
Raw OpenAlex JSON
- OpenAlex ID: https://openalex.org/W4403863453 (canonical identifier for this work in OpenAlex)
- DOI: https://doi.org/10.1109/access.2024.3487936 (Digital Object Identifier)
- Title: GSelf-MapReduce: A Method for Enhancing Mapreduce Performance in Distributed Heterogeneous Data Centers (work title)
- Type: article (OpenAlex work type)
- Language: en (primary language)
- Publication year: 2024 (year of publication)
- Publication date: 2024-01-01 (full publication date if available)
- Authors: Emin Sesen, Serdar KIRIŞOĞLU, Resul Kara (list of authors in order)
- Landing page: https://doi.org/10.1109/access.2024.3487936 (publisher landing page)
- Open access: Yes (whether a free full text is available)
- OA status: gold (open access status per OpenAlex)
- OA URL: https://doi.org/10.1109/access.2024.3487936 (direct OA link when available)
- Concepts: Computer science, Distributed database, Big data, Distributed computing, Parallel computing, Operating system (top concepts, fields, and topics attached by OpenAlex)
- Cited by: 1 (total citation count in OpenAlex)
- Citations by year (recent): 2025: 1 (per-year citation counts, last 5 years)
- References (count): 31 (number of works referenced by this work)
- Related works (count): 10 (other works algorithmically related by OpenAlex)
Full payload
| id | https://openalex.org/W4403863453 |
|---|---|
| doi | https://doi.org/10.1109/access.2024.3487936 |
| ids.doi | https://doi.org/10.1109/access.2024.3487936 |
| ids.openalex | https://openalex.org/W4403863453 |
| fwci | 1.52765026 |
| type | article |
| title | GSelf-MapReduce: A Method for Enhancing Mapreduce Performance in Distributed Heterogeneous Data Centers |
| biblio.issue | |
| biblio.volume | 12 |
| biblio.last_page | 159518 |
| biblio.first_page | 159503 |
| topics[0].id | https://openalex.org/T10101 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9975000023841858 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1710 |
| topics[0].subfield.display_name | Information Systems |
| topics[0].display_name | Cloud Computing and Resource Management |
| topics[1].id | https://openalex.org/T11478 |
| topics[1].field.id | https://openalex.org/fields/17 |
| topics[1].field.display_name | Computer Science |
| topics[1].score | 0.9865999817848206 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1705 |
| topics[1].subfield.display_name | Computer Networks and Communications |
| topics[1].display_name | Caching and Content Delivery |
| topics[2].id | https://openalex.org/T11181 |
| topics[2].field.id | https://openalex.org/fields/17 |
| topics[2].field.display_name | Computer Science |
| topics[2].score | 0.9839000105857849 |
| topics[2].domain.id | https://openalex.org/domains/3 |
| topics[2].domain.display_name | Physical Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1705 |
| topics[2].subfield.display_name | Computer Networks and Communications |
| topics[2].display_name | Advanced Data Storage Technologies |
| is_xpac | False |
| apc_list.value | 1850 |
| apc_list.currency | USD |
| apc_list.value_usd | 1850 |
| apc_paid.value | 1850 |
| apc_paid.currency | USD |
| apc_paid.value_usd | 1850 |
| concepts[0].id | https://openalex.org/C41008148 |
| concepts[0].level | 0 |
| concepts[0].score | 0.8551445007324219 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[0].display_name | Computer science |
| concepts[1].id | https://openalex.org/C70061542 |
| concepts[1].level | 2 |
| concepts[1].score | 0.5552809834480286 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q989016 |
| concepts[1].display_name | Distributed database |
| concepts[2].id | https://openalex.org/C75684735 |
| concepts[2].level | 2 |
| concepts[2].score | 0.5344260334968567 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q858810 |
| concepts[2].display_name | Big data |
| concepts[3].id | https://openalex.org/C120314980 |
| concepts[3].level | 1 |
| concepts[3].score | 0.4825710356235504 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q180634 |
| concepts[3].display_name | Distributed computing |
| concepts[4].id | https://openalex.org/C173608175 |
| concepts[4].level | 1 |
| concepts[4].score | 0.4760291874408722 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q232661 |
| concepts[4].display_name | Parallel computing |
| concepts[5].id | https://openalex.org/C111919701 |
| concepts[5].level | 1 |
| concepts[5].score | 0.24646461009979248 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q9135 |
| concepts[5].display_name | Operating system |
| keywords[0].id | https://openalex.org/keywords/computer-science |
| keywords[0].score | 0.8551445007324219 |
| keywords[0].display_name | Computer science |
| keywords[1].id | https://openalex.org/keywords/distributed-database |
| keywords[1].score | 0.5552809834480286 |
| keywords[1].display_name | Distributed database |
| keywords[2].id | https://openalex.org/keywords/big-data |
| keywords[2].score | 0.5344260334968567 |
| keywords[2].display_name | Big data |
| keywords[3].id | https://openalex.org/keywords/distributed-computing |
| keywords[3].score | 0.4825710356235504 |
| keywords[3].display_name | Distributed computing |
| keywords[4].id | https://openalex.org/keywords/parallel-computing |
| keywords[4].score | 0.4760291874408722 |
| keywords[4].display_name | Parallel computing |
| keywords[5].id | https://openalex.org/keywords/operating-system |
| keywords[5].score | 0.24646461009979248 |
| keywords[5].display_name | Operating system |
| language | en |
| locations[0].id | doi:10.1109/access.2024.3487936 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S2485537415 |
| locations[0].source.issn | 2169-3536 |
| locations[0].source.type | journal |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | 2169-3536 |
| locations[0].source.is_core | True |
| locations[0].source.is_in_doaj | True |
| locations[0].source.display_name | IEEE Access |
| locations[0].source.host_organization | https://openalex.org/P4310319808 |
| locations[0].source.host_organization_name | Institute of Electrical and Electronics Engineers |
| locations[0].source.host_organization_lineage | https://openalex.org/P4310319808 |
| locations[0].source.host_organization_lineage_names | Institute of Electrical and Electronics Engineers |
| locations[0].license | |
| locations[0].pdf_url | |
| locations[0].version | publishedVersion |
| locations[0].raw_type | journal-article |
| locations[0].license_id | |
| locations[0].is_accepted | True |
| locations[0].is_published | True |
| locations[0].raw_source_name | IEEE Access |
| locations[0].landing_page_url | https://doi.org/10.1109/access.2024.3487936 |
| locations[1].id | pmh:oai:doaj.org/article:145e39ce16b641459f9e62792605f0e2 |
| locations[1].is_oa | False |
| locations[1].source.id | https://openalex.org/S4306401280 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | False |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | DOAJ (DOAJ: Directory of Open Access Journals) |
| locations[1].source.host_organization | |
| locations[1].source.host_organization_name | |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | submittedVersion |
| locations[1].raw_type | article |
| locations[1].license_id | |
| locations[1].is_accepted | False |
| locations[1].is_published | False |
| locations[1].raw_source_name | IEEE Access, Vol 12, Pp 159503-159518 (2024) |
| locations[1].landing_page_url | https://doaj.org/article/145e39ce16b641459f9e62792605f0e2 |
| indexed_in | crossref, doaj |
| authorships[0].author.id | https://openalex.org/A5114443841 |
| authorships[0].author.orcid | https://orcid.org/0000-0002-1284-3916 |
| authorships[0].author.display_name | Emin Sesen |
| authorships[0].affiliations[0].raw_affiliation_string | Department of Computer Engineering, Düzce University, Düzce, Türkiye |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Emin Sesen |
| authorships[0].is_corresponding | False |
| authorships[0].raw_affiliation_strings | Department of Computer Engineering, Düzce University, Düzce, Türkiye |
| authorships[1].author.id | https://openalex.org/A5059290371 |
| authorships[1].author.orcid | https://orcid.org/0000-0002-4416-6657 |
| authorships[1].author.display_name | Serdar KIRIŞOĞLU |
| authorships[1].affiliations[0].raw_affiliation_string | Department of Computer Engineering, Düzce University, Düzce, Türkiye |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Serdar Kirisoglu |
| authorships[1].is_corresponding | False |
| authorships[1].raw_affiliation_strings | Department of Computer Engineering, Düzce University, Düzce, Türkiye |
| authorships[2].author.id | https://openalex.org/A5006841801 |
| authorships[2].author.orcid | https://orcid.org/0000-0001-8902-6837 |
| authorships[2].author.display_name | Resul Kara |
| authorships[2].affiliations[0].raw_affiliation_string | Department of Computer Engineering, Düzce University, Düzce, Türkiye |
| authorships[2].author_position | last |
| authorships[2].raw_author_name | Resul Kara |
| authorships[2].is_corresponding | False |
| authorships[2].raw_affiliation_strings | Department of Computer Engineering, Düzce University, Düzce, Türkiye |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://doi.org/10.1109/access.2024.3487936 |
| open_access.oa_status | gold |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | GSelf-MapReduce: A Method for Enhancing Mapreduce Performance in Distributed Heterogeneous Data Centers |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T03:46:38.306776 |
| primary_topic.id | https://openalex.org/T10101 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9975000023841858 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1710 |
| primary_topic.subfield.display_name | Information Systems |
| primary_topic.display_name | Cloud Computing and Resource Management |
| related_works | https://openalex.org/W4391375266, https://openalex.org/W2899084033, https://openalex.org/W2748952813, https://openalex.org/W4390608645, https://openalex.org/W4247566972, https://openalex.org/W4394895745, https://openalex.org/W2960264696, https://openalex.org/W3090563135, https://openalex.org/W2497432351, https://openalex.org/W4206777497 |
| cited_by_count | 1 |
| counts_by_year[0].year | 2025 |
| counts_by_year[0].cited_by_count | 1 |
| locations_count | 2 |
| best_oa_location.id | doi:10.1109/access.2024.3487936 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S2485537415 |
| best_oa_location.source.issn | 2169-3536 |
| best_oa_location.source.type | journal |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | 2169-3536 |
| best_oa_location.source.is_core | True |
| best_oa_location.source.is_in_doaj | True |
| best_oa_location.source.display_name | IEEE Access |
| best_oa_location.source.host_organization | https://openalex.org/P4310319808 |
| best_oa_location.source.host_organization_name | Institute of Electrical and Electronics Engineers |
| best_oa_location.source.host_organization_lineage | https://openalex.org/P4310319808 |
| best_oa_location.source.host_organization_lineage_names | Institute of Electrical and Electronics Engineers |
| best_oa_location.license | |
| best_oa_location.pdf_url | |
| best_oa_location.version | publishedVersion |
| best_oa_location.raw_type | journal-article |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | True |
| best_oa_location.is_published | True |
| best_oa_location.raw_source_name | IEEE Access |
| best_oa_location.landing_page_url | https://doi.org/10.1109/access.2024.3487936 |
| primary_location.id | doi:10.1109/access.2024.3487936 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S2485537415 |
| primary_location.source.issn | 2169-3536 |
| primary_location.source.type | journal |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | 2169-3536 |
| primary_location.source.is_core | True |
| primary_location.source.is_in_doaj | True |
| primary_location.source.display_name | IEEE Access |
| primary_location.source.host_organization | https://openalex.org/P4310319808 |
| primary_location.source.host_organization_name | Institute of Electrical and Electronics Engineers |
| primary_location.source.host_organization_lineage | https://openalex.org/P4310319808 |
| primary_location.source.host_organization_lineage_names | Institute of Electrical and Electronics Engineers |
| primary_location.license | |
| primary_location.pdf_url | |
| primary_location.version | publishedVersion |
| primary_location.raw_type | journal-article |
| primary_location.license_id | |
| primary_location.is_accepted | True |
| primary_location.is_published | True |
| primary_location.raw_source_name | IEEE Access |
| primary_location.landing_page_url | https://doi.org/10.1109/access.2024.3487936 |
| publication_date | 2024-01-01 |
| publication_year | 2024 |
| referenced_works | https://openalex.org/W2522626811, https://openalex.org/W2095178814, https://openalex.org/W2131715540, https://openalex.org/W1976226642, https://openalex.org/W2004255221, https://openalex.org/W3000407226, https://openalex.org/W4313291253, https://openalex.org/W1968324103, https://openalex.org/W2955087022, https://openalex.org/W2173213060, https://openalex.org/W3085985816, https://openalex.org/W2519041509, https://openalex.org/W2804455053, https://openalex.org/W2013658930, https://openalex.org/W2999588841, https://openalex.org/W1951344599, https://openalex.org/W4312050775, https://openalex.org/W3173985525, https://openalex.org/W2618014212, https://openalex.org/W2806259969, https://openalex.org/W4285147188, https://openalex.org/W3024796234, https://openalex.org/W1985482557, https://openalex.org/W2086392024, https://openalex.org/W2329835557, https://openalex.org/W2909224120, https://openalex.org/W2064349969, https://openalex.org/W3142038251, https://openalex.org/W2072129576, https://openalex.org/W2995564009, https://openalex.org/W1505837402 |
| referenced_works_count | 31 |
| abstract_inverted_index.a | 26, 58, 96, 207 |
| abstract_inverted_index.As | 206 |
| abstract_inverted_index.In | 36, 53, 66, 141 |
| abstract_inverted_index.To | 83 |
| abstract_inverted_index.at | 33 |
| abstract_inverted_index.be | 127 |
| abstract_inverted_index.in | 49, 106, 119, 174, 203 |
| abstract_inverted_index.is | 40, 71 |
| abstract_inverted_index.it | 39 |
| abstract_inverted_index.of | 17, 89, 93, 135, 188, 196 |
| abstract_inverted_index.or | 31 |
| abstract_inverted_index.to | 6, 14, 25, 42, 126, 132, 148 |
| abstract_inverted_index.we | 56 |
| abstract_inverted_index.15% | 212 |
| abstract_inverted_index.Big | 0 |
| abstract_inverted_index.DCs | 145, 154 |
| abstract_inverted_index.The | 123, 171, 186 |
| abstract_inverted_index.all | 144 |
| abstract_inverted_index.and | 110, 178 |
| abstract_inverted_index.are | 2, 11, 23, 129, 146, 166, 184 |
| abstract_inverted_index.for | 29, 46, 61, 152 |
| abstract_inverted_index.job | 151, 158, 181 |
| abstract_inverted_index.new | 59 |
| abstract_inverted_index.not | 143 |
| abstract_inverted_index.the | 7, 15, 37, 67, 80, 85, 90, 94, 103, 107, 111, 120, 133, 136, 164, 175, 179, 189, 204, 217 |
| abstract_inverted_index.was | 100, 192 |
| abstract_inverted_index.(DC) | 77 |
| abstract_inverted_index.DCs, | 95, 137 |
| abstract_inverted_index.DCs. | 170 |
| abstract_inverted_index.cost | 16, 88, 134 |
| abstract_inverted_index.data | 1, 18, 22, 48, 51, 62, 75, 86, 104, 200, 215 |
| abstract_inverted_index.find | 43 |
| abstract_inverted_index.four | 197 |
| abstract_inverted_index.from | 114 |
| abstract_inverted_index.keys | 165 |
| abstract_inverted_index.last | 176 |
| abstract_inverted_index.less | 213 |
| abstract_inverted_index.test | 108 |
| abstract_inverted_index.than | 216 |
| abstract_inverted_index.that | 34, 78, 155, 195 |
| abstract_inverted_index.they | 10 |
| abstract_inverted_index.this | 54, 115, 209 |
| abstract_inverted_index.time | 183 |
| abstract_inverted_index.used | 118 |
| abstract_inverted_index.were | 117 |
| abstract_inverted_index.with | 194 |
| abstract_inverted_index.work | 210 |
| abstract_inverted_index.These | 20 |
| abstract_inverted_index.Thus, | 163 |
| abstract_inverted_index.among | 73, 161 |
| abstract_inverted_index.close | 5 |
| abstract_inverted_index.model | 99, 116 |
| abstract_inverted_index.moved | 24 |
| abstract_inverted_index.often | 3 |
| abstract_inverted_index.owing | 13 |
| abstract_inverted_index.pairs | 125 |
| abstract_inverted_index.phase | 177 |
| abstract_inverted_index.their | 139, 150, 157 |
| abstract_inverted_index.these | 169 |
| abstract_inverted_index.total | 180 |
| abstract_inverted_index.using | 102 |
| abstract_inverted_index.where | 9 |
| abstract_inverted_index.work. | 219 |
| abstract_inverted_index.called | 64 |
| abstract_inverted_index.center | 76 |
| abstract_inverted_index.finish | 149 |
| abstract_inverted_index.method | 60, 191 |
| abstract_inverted_index.reduce | 91 |
| abstract_inverted_index.single | 27 |
| abstract_inverted_index.stored | 4, 21 |
| abstract_inverted_index.study, | 55 |
| abstract_inverted_index.volume | 173 |
| abstract_inverted_index.waited | 147 |
| abstract_inverted_index.between | 168 |
| abstract_inverted_index.closest | 218 |
| abstract_inverted_index.created | 101 |
| abstract_inverted_index.method, | 69 |
| abstract_inverted_index.methods | 45, 202 |
| abstract_inverted_index.perform | 159 |
| abstract_inverted_index.present | 57 |
| abstract_inverted_index.result, | 208 |
| abstract_inverted_index.centers. | 52 |
| abstract_inverted_index.compared | 193 |
| abstract_inverted_index.complete | 79, 156 |
| abstract_inverted_index.decision | 121 |
| abstract_inverted_index.function | 92 |
| abstract_inverted_index.location | 28 |
| abstract_inverted_index.obtained | 105, 113 |
| abstract_inverted_index.possible | 41 |
| abstract_inverted_index.process. | 82, 122 |
| abstract_inverted_index.proposed | 68, 190 |
| abstract_inverted_index.reduced. | 185 |
| abstract_inverted_index.shuffled | 128, 214 |
| abstract_inverted_index.according | 131 |
| abstract_inverted_index.addition, | 142 |
| abstract_inverted_index.calculate | 84 |
| abstract_inverted_index.different | 44, 198 |
| abstract_inverted_index.generates | 211 |
| abstract_inverted_index.key/value | 124 |
| abstract_inverted_index.location. | 35, 140 |
| abstract_inverted_index.locations | 8 |
| abstract_inverted_index.performed | 72 |
| abstract_inverted_index.processed | 32 |
| abstract_inverted_index.shuffling | 70, 160, 172 |
| abstract_inverted_index.transfer. | 19 |
| abstract_inverted_index.completion | 182 |
| abstract_inverted_index.generated, | 12 |
| abstract_inverted_index.polynomial | 97 |
| abstract_inverted_index.processing | 30, 47, 63, 87, 201 |
| abstract_inverted_index.regression | 98 |
| abstract_inverted_index.shuffling. | 153 |
| abstract_inverted_index.considering | 138 |
| abstract_inverted_index.distributed | 50, 130, 199 |
| abstract_inverted_index.literature, | 38 |
| abstract_inverted_index.literature. | 205 |
| abstract_inverted_index.performance | 187 |
| abstract_inverted_index.themselves. | 162 |
| abstract_inverted_index.coefficients | 112 |
| abstract_inverted_index.deduplicated | 167 |
| abstract_inverted_index.environment, | 109 |
| abstract_inverted_index.heterogeneous | 74 |
| abstract_inverted_index.data-processing | 81 |
| abstract_inverted_index.GSelf-MapReduce. | 65 |
| cited_by_percentile_year.max | 95 |
| cited_by_percentile_year.min | 91 |
| countries_distinct_count | 0 |
| institutions_distinct_count | 3 |
| citation_normalized_percentile.value | 0.83078603 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | True |
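
The `abstract_inverted_index.*` rows in the table above store the abstract as a map from each word to the positions where it occurs. A minimal Python sketch of turning such a map back into running text follows. The function name `rebuild_abstract` is hypothetical, and the sample covers only the first eight positions taken from the rows above. The sketch assumes the index has been loaded as a dictionary of word to list of integer positions, which is how it appears in the raw JSON rather than in the flattened table.

```python
from typing import Dict, List


def rebuild_abstract(inverted_index: Dict[str, List[int]]) -> str:
    """Rebuild plain text from a word -> positions inverted index."""
    by_position: Dict[int, str] = {}
    for word, positions in inverted_index.items():
        for pos in positions:
            by_position[pos] = word
    # Join the words in ascending position order.
    return " ".join(by_position[pos] for pos in sorted(by_position))


if __name__ == "__main__":
    # First eight positions only, copied from the table rows above.
    sample = {
        "Big": [0], "data": [1], "are": [2], "often": [3],
        "stored": [4], "close": [5], "to": [6], "the": [7],
    }
    print(rebuild_abstract(sample))
```

Sorting by position rather than relying on dictionary order matters because a word that occurs several times, such as `the`, is listed once with all of its positions.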