Shared-PIM: Enabling Concurrent Computation and Data Flow for Faster Processing-in-DRAM
2024 · Open Access · DOI: https://doi.org/10.48550/arxiv.2408.15489
Processing-in-Memory (PIM) enhances memory with computational capabilities, potentially solving energy and latency issues associated with data transfer between memory and processors. However, managing concurrent computation and data flow within the PIM architecture incurs significant latency and energy penalty for applications. This paper introduces Shared-PIM, an architecture for in-DRAM PIM that strategically allocates rows in memory banks, bolstered by memory peripherals, for concurrent processing and data movement. Shared-PIM enables simultaneous computation and data transfer within a memory bank. When compared to LISA, a state-of-the-art architecture that facilitates data transfers for in-DRAM PIM, Shared-PIM reduces data movement latency and energy by 5x and 1.2x respectively. Furthermore, when integrated to a state-of-the-art (SOTA) in-DRAM PIM architecture (pLUTo), Shared-PIM achieves 1.4x faster addition and multiplication, and thereby improves the performance of matrix multiplication (MM) tasks by 40%, polynomial multiplication (PMM) by 44%, and numeric number transfer (NTT) tasks by 31%. Moreover, for graph processing tasks like Breadth-First Search (BFS) and Depth-First Search (DFS), Shared-PIM achieves a 29% improvement in speed, all with an area overhead of just 7.16% compared to the baseline pLUTo.
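The "Nx" factors in the abstract can be converted into percentage reductions for easier comparison with the percentage figures quoted alongside them. The sketch below assumes that "reduces X by Nx" means the new value is the baseline divided by N; that reading is an assumption, not something the abstract states, and the function name `reduction_percent` is hypothetical.

```python
def reduction_percent(factor: float) -> float:
    """Percent reduction implied by an 'N x lower' factor.

    Assumes new_value = baseline / factor, so the reduction is 1 - 1/factor.
    """
    if factor <= 0:
        raise ValueError("factor must be positive")
    return (1.0 - 1.0 / factor) * 100.0


# Factors quoted in the abstract for data movement relative to LISA.
latency_factor = 5.0   # "5x" lower latency
energy_factor = 1.2    # "1.2x" lower energy

print(f"latency reduction: {reduction_percent(latency_factor):.1f}%")
print(f"energy reduction:  {reduction_percent(energy_factor):.1f}%")
```

Under this reading, a 5x factor corresponds to roughly an 80% reduction and a 1.2x factor to roughly a 17% reduction, which shows how different the two headline numbers are once placed on the same scale.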
Related Topics
- Type: preprint
- Language: en
- Landing Page: http://arxiv.org/abs/2408.15489, https://arxiv.org/pdf/2408.15489
- OA Status: green
- Related Works: 10
- OpenAlex ID: https://openalex.org/W4402705796
Raw OpenAlex JSON
- OpenAlex ID: https://openalex.org/W4402705796 (canonical identifier for this work in OpenAlex)
- DOI: https://doi.org/10.48550/arxiv.2408.15489 (Digital Object Identifier)
- Title: Shared-PIM: Enabling Concurrent Computation and Data Flow for Faster Processing-in-DRAM (work title)
- Type: preprint (OpenAlex work type)
- Language: en (primary language)
- Publication year: 2024 (year of publication)
- Publication date: 2024-08-28 (full publication date if available)
- Authors: Ahmed Mamdouh, Haoran Geng, Michael Niemier, Xiaobo Sharon Hu, Dayane Reis (list of authors in order)
- Landing page: https://arxiv.org/abs/2408.15489 (publisher landing page)
- PDF URL: https://arxiv.org/pdf/2408.15489 (direct link to full text PDF)
- Open access: Yes (whether a free full text is available)
- OA status: green (open access status per OpenAlex)
- OA URL: https://arxiv.org/pdf/2408.15489 (direct OA link when available)
- Concepts: Dram, Computer science, Parallel computing, Computation, Distributed computing, Embedded system, Computer architecture, Data flow diagram, Computer hardware, Programming language, Database (top concepts (fields/topics) attached by OpenAlex)
- Cited by: 0 (total citation count in OpenAlex)
- Related works (count): 10 (other works algorithmically related by OpenAlex)
Full payload
| Field | Value |
|---|---|
| id | https://openalex.org/W4402705796 |
| doi | https://doi.org/10.48550/arxiv.2408.15489 |
| ids.doi | https://doi.org/10.48550/arxiv.2408.15489 |
| ids.openalex | https://openalex.org/W4402705796 |
| fwci | 0.0 |
| type | preprint |
| title | Shared-PIM: Enabling Concurrent Computation and Data Flow for Faster Processing-in-DRAM |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T11181 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9933000206947327 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1705 |
| topics[0].subfield.display_name | Computer Networks and Communications |
| topics[0].display_name | Advanced Data Storage Technologies |
| topics[1].id | https://openalex.org/T10558 |
| topics[1].field.id | https://openalex.org/fields/22 |
| topics[1].field.display_name | Engineering |
| topics[1].score | 0.9793999791145325 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/2208 |
| topics[1].subfield.display_name | Electrical and Electronic Engineering |
| topics[1].display_name | Advancements in Semiconductor Devices and Circuit Design |
| topics[2].id | https://openalex.org/T10472 |
| topics[2].field.id | https://openalex.org/fields/22 |
| topics[2].field.display_name | Engineering |
| topics[2].score | 0.9715999960899353 |
| topics[2].domain.id | https://openalex.org/domains/3 |
| topics[2].domain.display_name | Physical Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/2208 |
| topics[2].subfield.display_name | Electrical and Electronic Engineering |
| topics[2].display_name | Semiconductor materials and devices |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C7366592 |
| concepts[0].level | 2 |
| concepts[0].score | 0.8469196557998657 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q1255620 |
| concepts[0].display_name | Dram |
| concepts[1].id | https://openalex.org/C41008148 |
| concepts[1].level | 0 |
| concepts[1].score | 0.7537375688552856 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[1].display_name | Computer science |
| concepts[2].id | https://openalex.org/C173608175 |
| concepts[2].level | 1 |
| concepts[2].score | 0.5976927280426025 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q232661 |
| concepts[2].display_name | Parallel computing |
| concepts[3].id | https://openalex.org/C45374587 |
| concepts[3].level | 2 |
| concepts[3].score | 0.5768355131149292 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q12525525 |
| concepts[3].display_name | Computation |
| concepts[4].id | https://openalex.org/C120314980 |
| concepts[4].level | 1 |
| concepts[4].score | 0.42876145243644714 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q180634 |
| concepts[4].display_name | Distributed computing |
| concepts[5].id | https://openalex.org/C149635348 |
| concepts[5].level | 1 |
| concepts[5].score | 0.42380592226982117 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q193040 |
| concepts[5].display_name | Embedded system |
| concepts[6].id | https://openalex.org/C118524514 |
| concepts[6].level | 1 |
| concepts[6].score | 0.4162021577358246 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q173212 |
| concepts[6].display_name | Computer architecture |
| concepts[7].id | https://openalex.org/C489000 |
| concepts[7].level | 2 |
| concepts[7].score | 0.41426488757133484 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q747385 |
| concepts[7].display_name | Data flow diagram |
| concepts[8].id | https://openalex.org/C9390403 |
| concepts[8].level | 1 |
| concepts[8].score | 0.27552151679992676 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q3966 |
| concepts[8].display_name | Computer hardware |
| concepts[9].id | https://openalex.org/C199360897 |
| concepts[9].level | 1 |
| concepts[9].score | 0.23916468024253845 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q9143 |
| concepts[9].display_name | Programming language |
| concepts[10].id | https://openalex.org/C77088390 |
| concepts[10].level | 1 |
| concepts[10].score | 0.17503142356872559 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q8513 |
| concepts[10].display_name | Database |
| keywords[0].id | https://openalex.org/keywords/dram |
| keywords[0].score | 0.8469196557998657 |
| keywords[0].display_name | Dram |
| keywords[1].id | https://openalex.org/keywords/computer-science |
| keywords[1].score | 0.7537375688552856 |
| keywords[1].display_name | Computer science |
| keywords[2].id | https://openalex.org/keywords/parallel-computing |
| keywords[2].score | 0.5976927280426025 |
| keywords[2].display_name | Parallel computing |
| keywords[3].id | https://openalex.org/keywords/computation |
| keywords[3].score | 0.5768355131149292 |
| keywords[3].display_name | Computation |
| keywords[4].id | https://openalex.org/keywords/distributed-computing |
| keywords[4].score | 0.42876145243644714 |
| keywords[4].display_name | Distributed computing |
| keywords[5].id | https://openalex.org/keywords/embedded-system |
| keywords[5].score | 0.42380592226982117 |
| keywords[5].display_name | Embedded system |
| keywords[6].id | https://openalex.org/keywords/computer-architecture |
| keywords[6].score | 0.4162021577358246 |
| keywords[6].display_name | Computer architecture |
| keywords[7].id | https://openalex.org/keywords/data-flow-diagram |
| keywords[7].score | 0.41426488757133484 |
| keywords[7].display_name | Data flow diagram |
| keywords[8].id | https://openalex.org/keywords/computer-hardware |
| keywords[8].score | 0.27552151679992676 |
| keywords[8].display_name | Computer hardware |
| keywords[9].id | https://openalex.org/keywords/programming-language |
| keywords[9].score | 0.23916468024253845 |
| keywords[9].display_name | Programming language |
| keywords[10].id | https://openalex.org/keywords/database |
| keywords[10].score | 0.17503142356872559 |
| keywords[10].display_name | Database |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2408.15489 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2408.15489 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2408.15489 |
| locations[1].id | doi:10.48550/arxiv.2408.15489 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | public-domain |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article-journal |
| locations[1].license_id | https://openalex.org/licenses/public-domain |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2408.15489 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5082215979 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | Ahmed Mamdouh |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Mamdouh, Ahmed |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5108009646 |
| authorships[1].author.orcid | https://orcid.org/0000-0001-5115-4446 |
| authorships[1].author.display_name | Haoran Geng |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Geng, Haoran |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5003032564 |
| authorships[2].author.orcid | https://orcid.org/0000-0001-7776-4306 |
| authorships[2].author.display_name | Michael Niemier |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Niemier, Michael |
| authorships[2].is_corresponding | False |
| authorships[3].author.id | https://openalex.org/A5100600905 |
| authorships[3].author.orcid | https://orcid.org/0000-0002-6636-9738 |
| authorships[3].author.display_name | Xiaobo Sharon Hu |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Hu, Xiaobo Sharon |
| authorships[3].is_corresponding | False |
| authorships[4].author.id | https://openalex.org/A5064528017 |
| authorships[4].author.orcid | https://orcid.org/0000-0002-8571-1308 |
| authorships[4].author.display_name | Dayane Reis |
| authorships[4].author_position | last |
| authorships[4].raw_author_name | Reis, Dayane |
| authorships[4].is_corresponding | False |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2408.15489 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Shared-PIM: Enabling Concurrent Computation and Data Flow for Faster Processing-in-DRAM |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T11181 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9933000206947327 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1705 |
| primary_topic.subfield.display_name | Computer Networks and Communications |
| primary_topic.display_name | Advanced Data Storage Technologies |
| related_works | https://openalex.org/W3120961607, https://openalex.org/W4401568740, https://openalex.org/W2098207691, https://openalex.org/W3148568549, https://openalex.org/W1648516568, https://openalex.org/W361036515, https://openalex.org/W2269474412, https://openalex.org/W4386903460, https://openalex.org/W4211178602, https://openalex.org/W2433923775 |
| cited_by_count | 0 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:arXiv.org:2408.15489 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2408.15489 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2408.15489 |
| primary_location.id | pmh:oai:arXiv.org:2408.15489 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/2408.15489 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2408.15489 |
| publication_date | 2024-08-28 |
| publication_year | 2024 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 74, 81, 107, 161 |
| abstract_inverted_index.5x | 99 |
| abstract_inverted_index.an | 44, 168 |
| abstract_inverted_index.by | 57, 98, 131, 136, 144 |
| abstract_inverted_index.in | 53, 164 |
| abstract_inverted_index.of | 126, 171 |
| abstract_inverted_index.to | 79, 106, 175 |
| abstract_inverted_index.29% | 162 |
| abstract_inverted_index.PIM | 30, 48, 111 |
| abstract_inverted_index.all | 166 |
| abstract_inverted_index.and | 10, 19, 25, 35, 63, 70, 96, 100, 119, 121, 138, 155 |
| abstract_inverted_index.for | 38, 46, 60, 88, 147 |
| abstract_inverted_index.the | 29, 124, 176 |
| abstract_inverted_index.(MM) | 129 |
| abstract_inverted_index.1.2x | 101 |
| abstract_inverted_index.1.4x | 116 |
| abstract_inverted_index.31%. | 145 |
| abstract_inverted_index.40%, | 132 |
| abstract_inverted_index.44%, | 137 |
| abstract_inverted_index.PIM, | 90 |
| abstract_inverted_index.This | 40 |
| abstract_inverted_index.When | 77 |
| abstract_inverted_index.area | 169 |
| abstract_inverted_index.data | 15, 26, 64, 71, 86, 93 |
| abstract_inverted_index.flow | 27 |
| abstract_inverted_index.just | 172 |
| abstract_inverted_index.like | 151 |
| abstract_inverted_index.rows | 52 |
| abstract_inverted_index.that | 49, 84 |
| abstract_inverted_index.when | 104 |
| abstract_inverted_index.with | 4, 14, 167 |
| abstract_inverted_index.(BFS) | 154 |
| abstract_inverted_index.(NTT) | 142 |
| abstract_inverted_index.(PIM) | 1 |
| abstract_inverted_index.(PMM) | 135 |
| abstract_inverted_index.7.16% | 173 |
| abstract_inverted_index.LISA, | 80 |
| abstract_inverted_index.bank. | 76 |
| abstract_inverted_index.graph | 148 |
| abstract_inverted_index.paper | 41 |
| abstract_inverted_index.tasks | 130, 143, 150 |
| abstract_inverted_index.(DFS), | 158 |
| abstract_inverted_index.(SOTA) | 109 |
| abstract_inverted_index.Search | 153, 157 |
| abstract_inverted_index.banks, | 55 |
| abstract_inverted_index.energy | 9, 36, 97 |
| abstract_inverted_index.faster | 117 |
| abstract_inverted_index.incurs | 32 |
| abstract_inverted_index.issues | 12 |
| abstract_inverted_index.matrix | 127 |
| abstract_inverted_index.memory | 3, 18, 54, 58, 75 |
| abstract_inverted_index.number | 140 |
| abstract_inverted_index.pLUTo. | 178 |
| abstract_inverted_index.speed, | 165 |
| abstract_inverted_index.within | 28, 73 |
| abstract_inverted_index.between | 17 |
| abstract_inverted_index.enables | 67 |
| abstract_inverted_index.in-DRAM | 47, 89, 110 |
| abstract_inverted_index.latency | 11, 34, 95 |
| abstract_inverted_index.numeric | 139 |
| abstract_inverted_index.penalty | 37 |
| abstract_inverted_index.reduces | 92 |
| abstract_inverted_index.solving | 8 |
| abstract_inverted_index.thereby | 122 |
| abstract_inverted_index.(pLUTo), | 113 |
| abstract_inverted_index.However, | 21 |
| abstract_inverted_index.achieves | 115, 160 |
| abstract_inverted_index.addition | 118 |
| abstract_inverted_index.baseline | 177 |
| abstract_inverted_index.compared | 78, 174 |
| abstract_inverted_index.enhances | 2 |
| abstract_inverted_index.improves | 123 |
| abstract_inverted_index.managing | 22 |
| abstract_inverted_index.movement | 94 |
| abstract_inverted_index.overhead | 170 |
| abstract_inverted_index.transfer | 16, 72, 141 |
| abstract_inverted_index.Moreover, | 146 |
| abstract_inverted_index.allocates | 51 |
| abstract_inverted_index.bolstered | 56 |
| abstract_inverted_index.movement. | 65 |
| abstract_inverted_index.transfers | 87 |
| abstract_inverted_index.Shared-PIM | 66, 91, 114, 159 |
| abstract_inverted_index.associated | 13 |
| abstract_inverted_index.concurrent | 23, 61 |
| abstract_inverted_index.integrated | 105 |
| abstract_inverted_index.introduces | 42 |
| abstract_inverted_index.polynomial | 133 |
| abstract_inverted_index.processing | 62, 149 |
| abstract_inverted_index.Depth-First | 156 |
| abstract_inverted_index.Shared-PIM, | 43 |
| abstract_inverted_index.computation | 24, 69 |
| abstract_inverted_index.facilitates | 85 |
| abstract_inverted_index.improvement | 163 |
| abstract_inverted_index.performance | 125 |
| abstract_inverted_index.potentially | 7 |
| abstract_inverted_index.processors. | 20 |
| abstract_inverted_index.significant | 33 |
| abstract_inverted_index.Furthermore, | 103 |
| abstract_inverted_index.architecture | 31, 45, 83, 112 |
| abstract_inverted_index.peripherals, | 59 |
| abstract_inverted_index.simultaneous | 68 |
| abstract_inverted_index.Breadth-First | 152 |
| abstract_inverted_index.applications. | 39 |
| abstract_inverted_index.capabilities, | 6 |
| abstract_inverted_index.computational | 5 |
| abstract_inverted_index.respectively. | 102 |
| abstract_inverted_index.strategically | 50 |
| abstract_inverted_index.multiplication | 128, 134 |
| abstract_inverted_index.multiplication, | 120 |
| abstract_inverted_index.state-of-the-art | 82, 108 |
| abstract_inverted_index.Processing-in-Memory | 0 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 5 |
| citation_normalized_percentile.value | 0.20465996 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | False |
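
The `abstract_inverted_index.*` rows in the table above store the abstract as a mapping from each token to the word positions where it occurs. A minimal sketch of turning such a mapping back into running text follows; the function name `rebuild_abstract` is hypothetical, and the sample dictionary holds only the first nine tokens from the table, with each position list truncated to the positions 0 through 8.

```python
from typing import Dict, List


def rebuild_abstract(inverted_index: Dict[str, List[int]]) -> str:
    """Rebuild text from a {token: [positions]} inverted index."""
    placed = []
    for token, positions in inverted_index.items():
        for pos in positions:
            placed.append((pos, token))
    # Sorting by position restores the original word order.
    placed.sort(key=lambda pair: pair[0])
    return " ".join(token for _, token in placed)


# Illustrative subset: tokens at positions 0..8 from the payload table.
sample_index = {
    "Processing-in-Memory": [0],
    "(PIM)": [1],
    "enhances": [2],
    "memory": [3],
    "with": [4],
    "computational": [5],
    "capabilities,": [6],
    "potentially": [7],
    "solving": [8],
}

print(rebuild_abstract(sample_index))
```

Applied to the full set of rows, the same procedure should reproduce the abstract shown near the top of the page, since every token carries all of its positions.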