Improving Predication Efficiency through Compaction/Restoration of SIMD Instructions Article Swipe
YOU?
·
· 2020
· Open Access
·
· DOI: https://doi.org/10.1109/hpca47549.2020.00064
Vector processors offer a wide range of unexplored opportunities to improve performance and energy efficiency. However, despite its potential, vector code generation and execution have significant challenges, the most relevant ones being control flow divergence. Most modern processors including SIMD extensions (such as AVX) rely on predication to support divergence control. In predicated codes, performance and energy consumption are usually insensitive to the number of true values in a predicated mask. This implies that the system efficiency becomes sub-optimal as vector length increases. In this paper we focus on SIMD extensions and propose a novel approach to improve execution efficiency in predicated SIMD instructions, the Compaction/Restoration (CR) technique. CR delays predicated SIMD instructions with inactive elements and compacts them with instances of the same instruction from different loop iterations to form an equivalent dense vector instruction, where, in the best case, all the elements are active. After executing such dense instructions, their results are restored to the original instructions. Our evaluation shows that CR improves performance by up to 25% and reduces dynamic energy consumption by up to 43% on real unmodified applications with predicated execution. Moreover, CR allows executing unmodified legacy code with short vector instructions (AVX-2) on newer architectures with wider vectors (AVX-512), achieving up to 56% performance benefits.
Related Topics
- Type
- article
- Language
- en
- Landing Page
- https://doi.org/10.1109/hpca47549.2020.00064
- OA Status
- green
- Cited By
- 7
- References
- 36
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W3016723581
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W3016723581Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.1109/hpca47549.2020.00064Digital Object Identifier
- Title
-
Improving Predication Efficiency through Compaction/Restoration of SIMD InstructionsWork title
- Type
-
articleOpenAlex work type
- Language
-
enPrimary language
- Publication year
-
2020Year of publication
- Publication date
-
2020-02-01Full publication date if available
- Authors
-
Adrián Barredo, Juan M. Cebrián, Miquel Moretó, Marc Casas, Mateo ValeroList of authors in order
- Landing page
-
https://doi.org/10.1109/hpca47549.2020.00064Publisher landing page
- Open access
-
YesWhether a free full text is available
- OA status
-
greenOpen access status per OpenAlex
- OA URL
-
https://hdl.handle.net/2117/341626Direct OA link when available
- Concepts
-
SIMD, Computer science, Parallel computing, Control flow, Code (set theory), Energy consumption, Instruction set, Code generation, Performance improvement, Programming language, Operating system, Electrical engineering, Key (lock), Economics, Operations management, Set (abstract data type), EngineeringTop concepts (fields/topics) attached by OpenAlex
- Cited by
-
7Total citation count in OpenAlex
- Citations by year (recent)
-
2023: 1, 2022: 3, 2021: 3Per-year citation counts (last 5 years)
- References (count)
-
36Number of works referenced by this work
- Related works (count)
-
10Other works algorithmically related by OpenAlex
Full payload
| id | https://openalex.org/W3016723581 |
|---|---|
| doi | https://doi.org/10.1109/hpca47549.2020.00064 |
| ids.doi | https://doi.org/10.1109/hpca47549.2020.00064 |
| ids.mag | 3016723581 |
| ids.openalex | https://openalex.org/W3016723581 |
| fwci | 1.75744791 |
| type | article |
| title | Improving Predication Efficiency through Compaction/Restoration of SIMD Instructions |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | 728 |
| biblio.first_page | 717 |
| topics[0].id | https://openalex.org/T10054 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 1.0 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1708 |
| topics[0].subfield.display_name | Hardware and Architecture |
| topics[0].display_name | Parallel Computing and Optimization Techniques |
| topics[1].id | https://openalex.org/T10904 |
| topics[1].field.id | https://openalex.org/fields/17 |
| topics[1].field.display_name | Computer Science |
| topics[1].score | 0.9976999759674072 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1708 |
| topics[1].subfield.display_name | Hardware and Architecture |
| topics[1].display_name | Embedded Systems Design Techniques |
| topics[2].id | https://openalex.org/T10363 |
| topics[2].field.id | https://openalex.org/fields/22 |
| topics[2].field.display_name | Engineering |
| topics[2].score | 0.9970999956130981 |
| topics[2].domain.id | https://openalex.org/domains/3 |
| topics[2].domain.display_name | Physical Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/2208 |
| topics[2].subfield.display_name | Electrical and Electronic Engineering |
| topics[2].display_name | Low-power high-performance VLSI design |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C150552126 |
| concepts[0].level | 2 |
| concepts[0].score | 0.9388108253479004 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q339387 |
| concepts[0].display_name | SIMD |
| concepts[1].id | https://openalex.org/C41008148 |
| concepts[1].level | 0 |
| concepts[1].score | 0.8075833916664124 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[1].display_name | Computer science |
| concepts[2].id | https://openalex.org/C173608175 |
| concepts[2].level | 1 |
| concepts[2].score | 0.7294371724128723 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q232661 |
| concepts[2].display_name | Parallel computing |
| concepts[3].id | https://openalex.org/C160191386 |
| concepts[3].level | 2 |
| concepts[3].score | 0.5677383542060852 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q868299 |
| concepts[3].display_name | Control flow |
| concepts[4].id | https://openalex.org/C2776760102 |
| concepts[4].level | 3 |
| concepts[4].score | 0.5474177598953247 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q5139990 |
| concepts[4].display_name | Code (set theory) |
| concepts[5].id | https://openalex.org/C2780165032 |
| concepts[5].level | 2 |
| concepts[5].score | 0.48767197132110596 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q16869822 |
| concepts[5].display_name | Energy consumption |
| concepts[6].id | https://openalex.org/C202491316 |
| concepts[6].level | 2 |
| concepts[6].score | 0.4443853497505188 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q272683 |
| concepts[6].display_name | Instruction set |
| concepts[7].id | https://openalex.org/C133162039 |
| concepts[7].level | 3 |
| concepts[7].score | 0.4377528429031372 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q1061077 |
| concepts[7].display_name | Code generation |
| concepts[8].id | https://openalex.org/C2778915421 |
| concepts[8].level | 2 |
| concepts[8].score | 0.43607646226882935 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q3643177 |
| concepts[8].display_name | Performance improvement |
| concepts[9].id | https://openalex.org/C199360897 |
| concepts[9].level | 1 |
| concepts[9].score | 0.1943715214729309 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q9143 |
| concepts[9].display_name | Programming language |
| concepts[10].id | https://openalex.org/C111919701 |
| concepts[10].level | 1 |
| concepts[10].score | 0.1796482801437378 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q9135 |
| concepts[10].display_name | Operating system |
| concepts[11].id | https://openalex.org/C119599485 |
| concepts[11].level | 1 |
| concepts[11].score | 0.07988634705543518 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q43035 |
| concepts[11].display_name | Electrical engineering |
| concepts[12].id | https://openalex.org/C26517878 |
| concepts[12].level | 2 |
| concepts[12].score | 0.0 |
| concepts[12].wikidata | https://www.wikidata.org/wiki/Q228039 |
| concepts[12].display_name | Key (lock) |
| concepts[13].id | https://openalex.org/C162324750 |
| concepts[13].level | 0 |
| concepts[13].score | 0.0 |
| concepts[13].wikidata | https://www.wikidata.org/wiki/Q8134 |
| concepts[13].display_name | Economics |
| concepts[14].id | https://openalex.org/C21547014 |
| concepts[14].level | 1 |
| concepts[14].score | 0.0 |
| concepts[14].wikidata | https://www.wikidata.org/wiki/Q1423657 |
| concepts[14].display_name | Operations management |
| concepts[15].id | https://openalex.org/C177264268 |
| concepts[15].level | 2 |
| concepts[15].score | 0.0 |
| concepts[15].wikidata | https://www.wikidata.org/wiki/Q1514741 |
| concepts[15].display_name | Set (abstract data type) |
| concepts[16].id | https://openalex.org/C127413603 |
| concepts[16].level | 0 |
| concepts[16].score | 0.0 |
| concepts[16].wikidata | https://www.wikidata.org/wiki/Q11023 |
| concepts[16].display_name | Engineering |
| keywords[0].id | https://openalex.org/keywords/simd |
| keywords[0].score | 0.9388108253479004 |
| keywords[0].display_name | SIMD |
| keywords[1].id | https://openalex.org/keywords/computer-science |
| keywords[1].score | 0.8075833916664124 |
| keywords[1].display_name | Computer science |
| keywords[2].id | https://openalex.org/keywords/parallel-computing |
| keywords[2].score | 0.7294371724128723 |
| keywords[2].display_name | Parallel computing |
| keywords[3].id | https://openalex.org/keywords/control-flow |
| keywords[3].score | 0.5677383542060852 |
| keywords[3].display_name | Control flow |
| keywords[4].id | https://openalex.org/keywords/code |
| keywords[4].score | 0.5474177598953247 |
| keywords[4].display_name | Code (set theory) |
| keywords[5].id | https://openalex.org/keywords/energy-consumption |
| keywords[5].score | 0.48767197132110596 |
| keywords[5].display_name | Energy consumption |
| keywords[6].id | https://openalex.org/keywords/instruction-set |
| keywords[6].score | 0.4443853497505188 |
| keywords[6].display_name | Instruction set |
| keywords[7].id | https://openalex.org/keywords/code-generation |
| keywords[7].score | 0.4377528429031372 |
| keywords[7].display_name | Code generation |
| keywords[8].id | https://openalex.org/keywords/performance-improvement |
| keywords[8].score | 0.43607646226882935 |
| keywords[8].display_name | Performance improvement |
| keywords[9].id | https://openalex.org/keywords/programming-language |
| keywords[9].score | 0.1943715214729309 |
| keywords[9].display_name | Programming language |
| keywords[10].id | https://openalex.org/keywords/operating-system |
| keywords[10].score | 0.1796482801437378 |
| keywords[10].display_name | Operating system |
| keywords[11].id | https://openalex.org/keywords/electrical-engineering |
| keywords[11].score | 0.07988634705543518 |
| keywords[11].display_name | Electrical engineering |
| language | en |
| locations[0].id | doi:10.1109/hpca47549.2020.00064 |
| locations[0].is_oa | False |
| locations[0].source | |
| locations[0].license | |
| locations[0].pdf_url | |
| locations[0].version | publishedVersion |
| locations[0].raw_type | proceedings-article |
| locations[0].license_id | |
| locations[0].is_accepted | True |
| locations[0].is_published | True |
| locations[0].raw_source_name | 2020 IEEE International Symposium on High Performance Computer Architecture (HPCA) |
| locations[0].landing_page_url | https://doi.org/10.1109/hpca47549.2020.00064 |
| locations[1].id | pmh:oai:upcommons.upc.edu:2117/341626 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400551 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | False |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | UPCommons (Polytechnic University of Catalonia) |
| locations[1].source.host_organization | https://openalex.org/I9617848 |
| locations[1].source.host_organization_name | Universitat Politècnica de Catalunya |
| locations[1].source.host_organization_lineage | https://openalex.org/I9617848 |
| locations[1].license | public-domain |
| locations[1].pdf_url | |
| locations[1].version | submittedVersion |
| locations[1].raw_type | info:eu-repo/semantics/submittedVersion |
| locations[1].license_id | https://openalex.org/licenses/public-domain |
| locations[1].is_accepted | False |
| locations[1].is_published | False |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | http://hdl.handle.net/2117/341626 |
| indexed_in | crossref |
| authorships[0].author.id | https://openalex.org/A5019112047 |
| authorships[0].author.orcid | https://orcid.org/0000-0001-9435-3234 |
| authorships[0].author.display_name | Adrián Barredo |
| authorships[0].countries | ES |
| authorships[0].affiliations[0].institution_ids | https://openalex.org/I2799803557, https://openalex.org/I9617848 |
| authorships[0].affiliations[0].raw_affiliation_string | Barcelona Supercomputing Center, Barcelona, Spain |
| authorships[0].institutions[0].id | https://openalex.org/I2799803557 |
| authorships[0].institutions[0].ror | https://ror.org/05sd8tv96 |
| authorships[0].institutions[0].type | facility |
| authorships[0].institutions[0].lineage | https://openalex.org/I2799803557, https://openalex.org/I9617848 |
| authorships[0].institutions[0].country_code | ES |
| authorships[0].institutions[0].display_name | Barcelona Supercomputing Center |
| authorships[0].institutions[1].id | https://openalex.org/I9617848 |
| authorships[0].institutions[1].ror | https://ror.org/03mb6wj31 |
| authorships[0].institutions[1].type | education |
| authorships[0].institutions[1].lineage | https://openalex.org/I9617848 |
| authorships[0].institutions[1].country_code | ES |
| authorships[0].institutions[1].display_name | Universitat Politècnica de Catalunya |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Adrian Barredo |
| authorships[0].is_corresponding | False |
| authorships[0].raw_affiliation_strings | Barcelona Supercomputing Center, Barcelona, Spain |
| authorships[1].author.id | https://openalex.org/A5029097110 |
| authorships[1].author.orcid | https://orcid.org/0000-0002-3731-9301 |
| authorships[1].author.display_name | Juan M. Cebrián |
| authorships[1].countries | ES |
| authorships[1].affiliations[0].institution_ids | https://openalex.org/I80180929 |
| authorships[1].affiliations[0].raw_affiliation_string | Universidad de Murcia |
| authorships[1].affiliations[1].institution_ids | https://openalex.org/I2799803557, https://openalex.org/I9617848 |
| authorships[1].affiliations[1].raw_affiliation_string | Barcelona Supercomputing Center |
| authorships[1].institutions[0].id | https://openalex.org/I2799803557 |
| authorships[1].institutions[0].ror | https://ror.org/05sd8tv96 |
| authorships[1].institutions[0].type | facility |
| authorships[1].institutions[0].lineage | https://openalex.org/I2799803557, https://openalex.org/I9617848 |
| authorships[1].institutions[0].country_code | ES |
| authorships[1].institutions[0].display_name | Barcelona Supercomputing Center |
| authorships[1].institutions[1].id | https://openalex.org/I80180929 |
| authorships[1].institutions[1].ror | https://ror.org/03p3aeb86 |
| authorships[1].institutions[1].type | education |
| authorships[1].institutions[1].lineage | https://openalex.org/I80180929 |
| authorships[1].institutions[1].country_code | ES |
| authorships[1].institutions[1].display_name | Universidad de Murcia |
| authorships[1].institutions[2].id | https://openalex.org/I9617848 |
| authorships[1].institutions[2].ror | https://ror.org/03mb6wj31 |
| authorships[1].institutions[2].type | education |
| authorships[1].institutions[2].lineage | https://openalex.org/I9617848 |
| authorships[1].institutions[2].country_code | ES |
| authorships[1].institutions[2].display_name | Universitat Politècnica de Catalunya |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Juan M. Cebrian |
| authorships[1].is_corresponding | False |
| authorships[1].raw_affiliation_strings | Barcelona Supercomputing Center, Universidad de Murcia |
| authorships[2].author.id | https://openalex.org/A5088729105 |
| authorships[2].author.orcid | https://orcid.org/0000-0002-9848-8758 |
| authorships[2].author.display_name | Miquel Moretó |
| authorships[2].countries | ES |
| authorships[2].affiliations[0].institution_ids | https://openalex.org/I2799803557, https://openalex.org/I9617848 |
| authorships[2].affiliations[0].raw_affiliation_string | Barcelona Supercomputing Center, Barcelona, Spain |
| authorships[2].institutions[0].id | https://openalex.org/I2799803557 |
| authorships[2].institutions[0].ror | https://ror.org/05sd8tv96 |
| authorships[2].institutions[0].type | facility |
| authorships[2].institutions[0].lineage | https://openalex.org/I2799803557, https://openalex.org/I9617848 |
| authorships[2].institutions[0].country_code | ES |
| authorships[2].institutions[0].display_name | Barcelona Supercomputing Center |
| authorships[2].institutions[1].id | https://openalex.org/I9617848 |
| authorships[2].institutions[1].ror | https://ror.org/03mb6wj31 |
| authorships[2].institutions[1].type | education |
| authorships[2].institutions[1].lineage | https://openalex.org/I9617848 |
| authorships[2].institutions[1].country_code | ES |
| authorships[2].institutions[1].display_name | Universitat Politècnica de Catalunya |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Miquel Moreto |
| authorships[2].is_corresponding | False |
| authorships[2].raw_affiliation_strings | Barcelona Supercomputing Center, Barcelona, Spain |
| authorships[3].author.id | https://openalex.org/A5044183257 |
| authorships[3].author.orcid | https://orcid.org/0000-0003-4564-2093 |
| authorships[3].author.display_name | Marc Casas |
| authorships[3].countries | ES |
| authorships[3].affiliations[0].institution_ids | https://openalex.org/I2799803557, https://openalex.org/I9617848 |
| authorships[3].affiliations[0].raw_affiliation_string | Barcelona Supercomputing Center |
| authorships[3].institutions[0].id | https://openalex.org/I2799803557 |
| authorships[3].institutions[0].ror | https://ror.org/05sd8tv96 |
| authorships[3].institutions[0].type | facility |
| authorships[3].institutions[0].lineage | https://openalex.org/I2799803557, https://openalex.org/I9617848 |
| authorships[3].institutions[0].country_code | ES |
| authorships[3].institutions[0].display_name | Barcelona Supercomputing Center |
| authorships[3].institutions[1].id | https://openalex.org/I9617848 |
| authorships[3].institutions[1].ror | https://ror.org/03mb6wj31 |
| authorships[3].institutions[1].type | education |
| authorships[3].institutions[1].lineage | https://openalex.org/I9617848 |
| authorships[3].institutions[1].country_code | ES |
| authorships[3].institutions[1].display_name | Universitat Politècnica de Catalunya |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Marc Casas |
| authorships[3].is_corresponding | False |
| authorships[3].raw_affiliation_strings | Barcelona Supercomputing Center |
| authorships[4].author.id | https://openalex.org/A5020844763 |
| authorships[4].author.orcid | https://orcid.org/0000-0003-2917-2482 |
| authorships[4].author.display_name | Mateo Valero |
| authorships[4].countries | ES |
| authorships[4].affiliations[0].institution_ids | https://openalex.org/I9617848 |
| authorships[4].affiliations[0].raw_affiliation_string | Universitat Politècnica de Catalunya |
| authorships[4].affiliations[1].institution_ids | https://openalex.org/I2799803557, https://openalex.org/I9617848 |
| authorships[4].affiliations[1].raw_affiliation_string | Barcelona Supercomputing Center |
| authorships[4].institutions[0].id | https://openalex.org/I2799803557 |
| authorships[4].institutions[0].ror | https://ror.org/05sd8tv96 |
| authorships[4].institutions[0].type | facility |
| authorships[4].institutions[0].lineage | https://openalex.org/I2799803557, https://openalex.org/I9617848 |
| authorships[4].institutions[0].country_code | ES |
| authorships[4].institutions[0].display_name | Barcelona Supercomputing Center |
| authorships[4].institutions[1].id | https://openalex.org/I9617848 |
| authorships[4].institutions[1].ror | https://ror.org/03mb6wj31 |
| authorships[4].institutions[1].type | education |
| authorships[4].institutions[1].lineage | https://openalex.org/I9617848 |
| authorships[4].institutions[1].country_code | ES |
| authorships[4].institutions[1].display_name | Universitat Politècnica de Catalunya |
| authorships[4].author_position | last |
| authorships[4].raw_author_name | Mateo Valero |
| authorships[4].is_corresponding | False |
| authorships[4].raw_affiliation_strings | Barcelona Supercomputing Center, Universitat Politècnica de Catalunya |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | http://hdl.handle.net/2117/341626 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Improving Predication Efficiency through Compaction/Restoration of SIMD Instructions |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T03:46:38.306776 |
| primary_topic.id | https://openalex.org/T10054 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 1.0 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1708 |
| primary_topic.subfield.display_name | Hardware and Architecture |
| primary_topic.display_name | Parallel Computing and Optimization Techniques |
| related_works | https://openalex.org/W2994245508, https://openalex.org/W2028931088, https://openalex.org/W4242172182, https://openalex.org/W2082875307, https://openalex.org/W4237780868, https://openalex.org/W4285302443, https://openalex.org/W4396938741, https://openalex.org/W2019451907, https://openalex.org/W2069096825, https://openalex.org/W4248145683 |
| cited_by_count | 7 |
| counts_by_year[0].year | 2023 |
| counts_by_year[0].cited_by_count | 1 |
| counts_by_year[1].year | 2022 |
| counts_by_year[1].cited_by_count | 3 |
| counts_by_year[2].year | 2021 |
| counts_by_year[2].cited_by_count | 3 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:upcommons.upc.edu:2117/341626 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400551 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | False |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | UPCommons (Polytechnic University of Catalonia) |
| best_oa_location.source.host_organization | https://openalex.org/I9617848 |
| best_oa_location.source.host_organization_name | Universitat Politècnica de Catalunya |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I9617848 |
| best_oa_location.license | public-domain |
| best_oa_location.pdf_url | |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | info:eu-repo/semantics/submittedVersion |
| best_oa_location.license_id | https://openalex.org/licenses/public-domain |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://hdl.handle.net/2117/341626 |
| primary_location.id | doi:10.1109/hpca47549.2020.00064 |
| primary_location.is_oa | False |
| primary_location.source | |
| primary_location.license | |
| primary_location.pdf_url | |
| primary_location.version | publishedVersion |
| primary_location.raw_type | proceedings-article |
| primary_location.license_id | |
| primary_location.is_accepted | True |
| primary_location.is_published | True |
| primary_location.raw_source_name | 2020 IEEE International Symposium on High Performance Computer Architecture (HPCA) |
| primary_location.landing_page_url | https://doi.org/10.1109/hpca47549.2020.00064 |
| publication_date | 2020-02-01 |
| publication_year | 2020 |
| referenced_works | https://openalex.org/W2613264175, https://openalex.org/W2144481293, https://openalex.org/W1998886328, https://openalex.org/W2278832452, https://openalex.org/W2124273985, https://openalex.org/W2475636809, https://openalex.org/W2120585153, https://openalex.org/W3145971282, https://openalex.org/W2236252626, https://openalex.org/W2085171150, https://openalex.org/W4255229080, https://openalex.org/W2170382128, https://openalex.org/W2167399819, https://openalex.org/W2001984859, https://openalex.org/W2155568054, https://openalex.org/W2081373884, https://openalex.org/W2963768428, https://openalex.org/W2757156483, https://openalex.org/W2013259664, https://openalex.org/W6759043259, https://openalex.org/W3148394109, https://openalex.org/W3140574618, https://openalex.org/W2029014329, https://openalex.org/W2135018441, https://openalex.org/W6987144925, https://openalex.org/W1981153984, https://openalex.org/W1973538724, https://openalex.org/W2066862978, https://openalex.org/W2097572763, https://openalex.org/W2012252449, https://openalex.org/W2108283760, https://openalex.org/W2913795363, https://openalex.org/W4386017381, https://openalex.org/W2168921806, https://openalex.org/W1492601037, https://openalex.org/W3123542955 |
| referenced_works_count | 36 |
| abstract_inverted_index.a | 3, 68, 93 |
| abstract_inverted_index.CR | 108, 163, 187 |
| abstract_inverted_index.In | 51, 83 |
| abstract_inverted_index.an | 131 |
| abstract_inverted_index.as | 42, 79 |
| abstract_inverted_index.by | 166, 175 |
| abstract_inverted_index.in | 67, 100, 137 |
| abstract_inverted_index.of | 6, 64, 121 |
| abstract_inverted_index.on | 45, 88, 179, 198 |
| abstract_inverted_index.to | 9, 47, 61, 96, 129, 155, 168, 177, 207 |
| abstract_inverted_index.up | 167, 176, 206 |
| abstract_inverted_index.we | 86 |
| abstract_inverted_index.25% | 169 |
| abstract_inverted_index.43% | 178 |
| abstract_inverted_index.56% | 208 |
| abstract_inverted_index.Our | 159 |
| abstract_inverted_index.all | 141 |
| abstract_inverted_index.and | 12, 22, 55, 91, 116, 170 |
| abstract_inverted_index.are | 58, 144, 153 |
| abstract_inverted_index.its | 17 |
| abstract_inverted_index.the | 27, 62, 74, 104, 122, 138, 142, 156 |
| abstract_inverted_index.(CR) | 106 |
| abstract_inverted_index.AVX) | 43 |
| abstract_inverted_index.Most | 35 |
| abstract_inverted_index.SIMD | 39, 89, 102, 111 |
| abstract_inverted_index.This | 71 |
| abstract_inverted_index.best | 139 |
| abstract_inverted_index.code | 20, 192 |
| abstract_inverted_index.flow | 33 |
| abstract_inverted_index.form | 130 |
| abstract_inverted_index.from | 125 |
| abstract_inverted_index.have | 24 |
| abstract_inverted_index.loop | 127 |
| abstract_inverted_index.most | 28 |
| abstract_inverted_index.ones | 30 |
| abstract_inverted_index.real | 180 |
| abstract_inverted_index.rely | 44 |
| abstract_inverted_index.same | 123 |
| abstract_inverted_index.such | 148 |
| abstract_inverted_index.that | 73, 162 |
| abstract_inverted_index.them | 118 |
| abstract_inverted_index.this | 84 |
| abstract_inverted_index.true | 65 |
| abstract_inverted_index.wide | 4 |
| abstract_inverted_index.with | 113, 119, 183, 193, 201 |
| abstract_inverted_index.(such | 41 |
| abstract_inverted_index.After | 146 |
| abstract_inverted_index.being | 31 |
| abstract_inverted_index.case, | 140 |
| abstract_inverted_index.dense | 133, 149 |
| abstract_inverted_index.focus | 87 |
| abstract_inverted_index.mask. | 70 |
| abstract_inverted_index.newer | 199 |
| abstract_inverted_index.novel | 94 |
| abstract_inverted_index.offer | 2 |
| abstract_inverted_index.paper | 85 |
| abstract_inverted_index.range | 5 |
| abstract_inverted_index.short | 194 |
| abstract_inverted_index.shows | 161 |
| abstract_inverted_index.their | 151 |
| abstract_inverted_index.wider | 202 |
| abstract_inverted_index.Vector | 0 |
| abstract_inverted_index.allows | 188 |
| abstract_inverted_index.codes, | 53 |
| abstract_inverted_index.delays | 109 |
| abstract_inverted_index.energy | 13, 56, 173 |
| abstract_inverted_index.legacy | 191 |
| abstract_inverted_index.length | 81 |
| abstract_inverted_index.modern | 36 |
| abstract_inverted_index.number | 63 |
| abstract_inverted_index.system | 75 |
| abstract_inverted_index.values | 66 |
| abstract_inverted_index.vector | 19, 80, 134, 195 |
| abstract_inverted_index.where, | 136 |
| abstract_inverted_index.(AVX-2) | 197 |
| abstract_inverted_index.active. | 145 |
| abstract_inverted_index.becomes | 77 |
| abstract_inverted_index.control | 32 |
| abstract_inverted_index.despite | 16 |
| abstract_inverted_index.dynamic | 172 |
| abstract_inverted_index.implies | 72 |
| abstract_inverted_index.improve | 10, 97 |
| abstract_inverted_index.propose | 92 |
| abstract_inverted_index.reduces | 171 |
| abstract_inverted_index.results | 152 |
| abstract_inverted_index.support | 48 |
| abstract_inverted_index.usually | 59 |
| abstract_inverted_index.vectors | 203 |
| abstract_inverted_index.However, | 15 |
| abstract_inverted_index.approach | 95 |
| abstract_inverted_index.compacts | 117 |
| abstract_inverted_index.control. | 50 |
| abstract_inverted_index.elements | 115, 143 |
| abstract_inverted_index.improves | 164 |
| abstract_inverted_index.inactive | 114 |
| abstract_inverted_index.original | 157 |
| abstract_inverted_index.relevant | 29 |
| abstract_inverted_index.restored | 154 |
| abstract_inverted_index.Moreover, | 186 |
| abstract_inverted_index.achieving | 205 |
| abstract_inverted_index.benefits. | 210 |
| abstract_inverted_index.different | 126 |
| abstract_inverted_index.executing | 147, 189 |
| abstract_inverted_index.execution | 23, 98 |
| abstract_inverted_index.including | 38 |
| abstract_inverted_index.instances | 120 |
| abstract_inverted_index.(AVX-512), | 204 |
| abstract_inverted_index.divergence | 49 |
| abstract_inverted_index.efficiency | 76, 99 |
| abstract_inverted_index.equivalent | 132 |
| abstract_inverted_index.evaluation | 160 |
| abstract_inverted_index.execution. | 185 |
| abstract_inverted_index.extensions | 40, 90 |
| abstract_inverted_index.generation | 21 |
| abstract_inverted_index.increases. | 82 |
| abstract_inverted_index.iterations | 128 |
| abstract_inverted_index.potential, | 18 |
| abstract_inverted_index.predicated | 52, 69, 101, 110, 184 |
| abstract_inverted_index.processors | 1, 37 |
| abstract_inverted_index.technique. | 107 |
| abstract_inverted_index.unexplored | 7 |
| abstract_inverted_index.unmodified | 181, 190 |
| abstract_inverted_index.challenges, | 26 |
| abstract_inverted_index.consumption | 57, 174 |
| abstract_inverted_index.divergence. | 34 |
| abstract_inverted_index.efficiency. | 14 |
| abstract_inverted_index.insensitive | 60 |
| abstract_inverted_index.instruction | 124 |
| abstract_inverted_index.performance | 11, 54, 165, 209 |
| abstract_inverted_index.predication | 46 |
| abstract_inverted_index.significant | 25 |
| abstract_inverted_index.sub-optimal | 78 |
| abstract_inverted_index.applications | 182 |
| abstract_inverted_index.instruction, | 135 |
| abstract_inverted_index.instructions | 112, 196 |
| abstract_inverted_index.architectures | 200 |
| abstract_inverted_index.instructions, | 103, 150 |
| abstract_inverted_index.instructions. | 158 |
| abstract_inverted_index.opportunities | 8 |
| abstract_inverted_index.Compaction/Restoration | 105 |
| cited_by_percentile_year.max | 97 |
| cited_by_percentile_year.min | 89 |
| countries_distinct_count | 1 |
| institutions_distinct_count | 5 |
| sustainable_development_goals[0].id | https://metadata.un.org/sdg/7 |
| sustainable_development_goals[0].score | 0.9200000166893005 |
| sustainable_development_goals[0].display_name | Affordable and clean energy |
| citation_normalized_percentile.value | 0.83479491 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | False |