High Performance Implementation of 3D Convolutional Neural Networks on a GPU
2017 · Open Access · DOI: https://doi.org/10.1155/2017/8348671
Convolutional neural networks have proven to be highly successful in applications such as image classification, object tracking, and many other tasks based on 2D inputs. Recently, researchers have started to apply convolutional neural networks to video classification, which involves 3D inputs and requires far more memory and computation. FFT-based methods can reduce the amount of computation, but generally at the cost of an increased memory requirement. The Winograd Minimal Filtering Algorithm (WMFA), on the other hand, can reduce the number of operations required and thus speed up the computation without increasing the required memory. This strategy was shown to be successful for 2D neural networks. We implement the algorithm for 3D convolutional neural networks, apply it to a popular 3D convolutional neural network used for video classification, and compare it with cuDNN. With our highly optimized implementation of the algorithm, we observe a twofold speedup over the cuDNN version for most of the 3D convolution layers of our test network.
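
The operation savings that the abstract attributes to Winograd minimal filtering can be illustrated with the smallest standard case, F(2,3), which produces two outputs of a 3-tap filter using 4 multiplications instead of 6. The sketch below is a plain-Python illustration only, not the authors' GPU implementation. The function names are hypothetical, and the assumption that the 1D transform is nested along three dimensions (giving a 2×2×2 output tile for a 3×3×3 filter) is made here for illustration and is not stated in this record.

```python
def direct_f23(d, g):
    """Two outputs of a 3-tap filter slid over four inputs: 6 multiplications."""
    return [
        d[0] * g[0] + d[1] * g[1] + d[2] * g[2],
        d[1] * g[0] + d[2] * g[1] + d[3] * g[2],
    ]


def winograd_f23(d, g):
    """The same two outputs via Winograd F(2,3): 4 multiplications."""
    # Filter transform: depends only on g, so it can be precomputed once.
    u = [g[0], (g[0] + g[1] + g[2]) / 2, (g[0] - g[1] + g[2]) / 2, g[2]]
    # Input transform: additions and subtractions only.
    v = [d[0] - d[2], d[1] + d[2], d[2] - d[1], d[1] - d[3]]
    # Element-wise products: the only 4 multiplications involving the input.
    m = [ui * vi for ui, vi in zip(u, v)]
    # Output transform: additions and subtractions only.
    return [m[0] + m[1] + m[2], m[1] - m[2] - m[3]]


def multiplications(dims):
    """Multiplications per output tile when F(2,3) is nested over `dims` axes.

    Assumption for illustration: 2 outputs and 3 taps per axis.
    Returns (direct, winograd).
    """
    return (2 * 3) ** dims, 4 ** dims


if __name__ == "__main__":
    d, g = [1, 2, 3, 4], [2, 4, 6]
    print(direct_f23(d, g), winograd_f23(d, g))
    print(multiplications(3))  # (216, 64)
```

Under that nesting assumption, a 3D tile needs 64 multiplications instead of 216, a theoretical reduction of about 3.4 times. The twofold speedup reported in the abstract is a measured result and also reflects the cost of the transforms and memory traffic, which this sketch does not model.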
Related Topics
- Type: article
- Language: en
- Landing Page: https://doi.org/10.1155/2017/8348671, http://downloads.hindawi.com/journals/cin/2017/8348671.pdf
- OA Status: hybrid
- Cited By: 30
- References: 7
- Related Works: 10
- OpenAlex ID: https://openalex.org/W2767899175
Raw OpenAlex JSON
- OpenAlex ID: https://openalex.org/W2767899175 (canonical identifier for this work in OpenAlex)
- DOI: https://doi.org/10.1155/2017/8348671 (Digital Object Identifier)
- Title: High Performance Implementation of 3D Convolutional Neural Networks on a GPU (work title)
- Type: article (OpenAlex work type)
- Language: en (primary language)
- Publication year: 2017 (year of publication)
- Publication date: 2017-01-01 (full publication date if available)
- Authors: Qiang Lan, Zelong Wang, Mei Wen, Chunyuan Zhang, Yijie Wang (list of authors in order)
- Landing page: https://doi.org/10.1155/2017/8348671 (publisher landing page)
- PDF URL: https://downloads.hindawi.com/journals/cin/2017/8348671.pdf (direct link to full text PDF)
- Open access: Yes (whether a free full text is available)
- OA status: hybrid (open access status per OpenAlex)
- OA URL: https://downloads.hindawi.com/journals/cin/2017/8348671.pdf (direct OA link when available)
- Concepts: Computer science, Convolutional neural network, Speedup, Computation, Convolution (computer science), Fast Fourier transform, Artificial intelligence, Artificial neural network, Auxiliary memory, Pattern recognition (psychology), Parallel computing, Algorithm, Computer hardware (top concepts, i.e. fields/topics, attached by OpenAlex)
- Cited by: 30 (total citation count in OpenAlex)
- Citations by year (recent): 2025: 3, 2024: 2, 2023: 2, 2022: 1, 2021: 9 (per-year citation counts, last 5 years)
- References (count): 7 (number of works referenced by this work)
- Related works (count): 10 (other works algorithmically related by OpenAlex)
Full payload
| Field | Value |
|---|---|
| id | https://openalex.org/W2767899175 |
| doi | https://doi.org/10.1155/2017/8348671 |
| ids.doi | https://doi.org/10.1155/2017/8348671 |
| ids.mag | 2767899175 |
| ids.pmid | https://pubmed.ncbi.nlm.nih.gov/29250109 |
| ids.openalex | https://openalex.org/W2767899175 |
| fwci | 1.65243288 |
| mesh[0].qualifier_ui | |
| mesh[0].descriptor_ui | D003196 |
| mesh[0].is_major_topic | True |
| mesh[0].qualifier_name | |
| mesh[0].descriptor_name | Computer Graphics |
| mesh[1].qualifier_ui | Q000379 |
| mesh[1].descriptor_ui | D007091 |
| mesh[1].is_major_topic | False |
| mesh[1].qualifier_name | methods |
| mesh[1].descriptor_name | Image Processing, Computer-Assisted |
| mesh[2].qualifier_ui | |
| mesh[2].descriptor_ui | D016571 |
| mesh[2].is_major_topic | True |
| mesh[2].qualifier_name | |
| mesh[2].descriptor_name | Neural Networks, Computer |
| mesh[3].qualifier_ui | |
| mesh[3].descriptor_ui | D014741 |
| mesh[3].is_major_topic | False |
| mesh[3].qualifier_name | |
| mesh[3].descriptor_name | Video Recording |
| mesh[4].qualifier_ui | |
| mesh[4].descriptor_ui | D003196 |
| mesh[4].is_major_topic | True |
| mesh[4].qualifier_name | |
| mesh[4].descriptor_name | Computer Graphics |
| mesh[5].qualifier_ui | Q000379 |
| mesh[5].descriptor_ui | D007091 |
| mesh[5].is_major_topic | False |
| mesh[5].qualifier_name | methods |
| mesh[5].descriptor_name | Image Processing, Computer-Assisted |
| mesh[6].qualifier_ui | |
| mesh[6].descriptor_ui | D016571 |
| mesh[6].is_major_topic | True |
| mesh[6].qualifier_name | |
| mesh[6].descriptor_name | Neural Networks, Computer |
| mesh[7].qualifier_ui | |
| mesh[7].descriptor_ui | D014741 |
| mesh[7].is_major_topic | False |
| mesh[7].qualifier_name | |
| mesh[7].descriptor_name | Video Recording |
| type | article |
| title | High Performance Implementation of 3D Convolutional Neural Networks on a GPU |
| awards[0].id | https://openalex.org/G8878789636 |
| awards[0].funder_id | https://openalex.org/F4320335777 |
| awards[0].display_name | |
| awards[0].funder_award_id | 20124307130004 |
| awards[0].funder_display_name | National Key Research and Development Program of China |
| awards[1].id | https://openalex.org/G3440238326 |
| awards[1].funder_id | https://openalex.org/F4320321001 |
| awards[1].display_name | |
| awards[1].funder_award_id | 2012AA012706 |
| awards[1].funder_display_name | National Natural Science Foundation of China |
| awards[2].id | https://openalex.org/G4728997783 |
| awards[2].funder_id | https://openalex.org/F4320335777 |
| awards[2].display_name | |
| awards[2].funder_award_id | 2016YFB1000401 |
| awards[2].funder_display_name | National Key Research and Development Program of China |
| awards[3].id | https://openalex.org/G1457086666 |
| awards[3].funder_id | https://openalex.org/F4320336024 |
| awards[3].display_name | |
| awards[3].funder_award_id | 61402504 |
| awards[3].funder_display_name | Specialized Research Fund for the Doctoral Program of Higher Education of China |
| awards[4].id | https://openalex.org/G2658364145 |
| awards[4].funder_id | https://openalex.org/F4320321001 |
| awards[4].display_name | |
| awards[4].funder_award_id | 2016YFB1000401 |
| awards[4].funder_display_name | National Natural Science Foundation of China |
| awards[5].id | https://openalex.org/G2918677723 |
| awards[5].funder_id | https://openalex.org/F4320335777 |
| awards[5].display_name | |
| awards[5].funder_award_id | 61272145 |
| awards[5].funder_display_name | National Key Research and Development Program of China |
| awards[6].id | https://openalex.org/G4182008862 |
| awards[6].funder_id | https://openalex.org/F4320336024 |
| awards[6].display_name | |
| awards[6].funder_award_id | 61272145 |
| awards[6].funder_display_name | Specialized Research Fund for the Doctoral Program of Higher Education of China |
| awards[7].id | https://openalex.org/G5431238508 |
| awards[7].funder_id | https://openalex.org/F4320321001 |
| awards[7].display_name | |
| awards[7].funder_award_id | 61402504 |
| awards[7].funder_display_name | National Natural Science Foundation of China |
| awards[8].id | https://openalex.org/G4723255270 |
| awards[8].funder_id | https://openalex.org/F4320321001 |
| awards[8].display_name | |
| awards[8].funder_award_id | 20124307130004 |
| awards[8].funder_display_name | National Natural Science Foundation of China |
| awards[9].id | https://openalex.org/G4179778637 |
| awards[9].funder_id | https://openalex.org/F4320335777 |
| awards[9].display_name | |
| awards[9].funder_award_id | 61402504 |
| awards[9].funder_display_name | National Key Research and Development Program of China |
| awards[10].id | https://openalex.org/G7528517500 |
| awards[10].funder_id | https://openalex.org/F4320321001 |
| awards[10].display_name | |
| awards[10].funder_award_id | 61272145 |
| awards[10].funder_display_name | National Natural Science Foundation of China |
| awards[11].id | https://openalex.org/G3842827335 |
| awards[11].funder_id | https://openalex.org/F4320321001 |
| awards[11].display_name | |
| awards[11].funder_award_id | 61502509 |
| awards[11].funder_display_name | National Natural Science Foundation of China |
| awards[12].id | https://openalex.org/G4533110181 |
| awards[12].funder_id | https://openalex.org/F4320336024 |
| awards[12].display_name | |
| awards[12].funder_award_id | 2012AA012706 |
| awards[12].funder_display_name | Specialized Research Fund for the Doctoral Program of Higher Education of China |
| awards[13].id | https://openalex.org/G8918358744 |
| awards[13].funder_id | https://openalex.org/F4320336024 |
| awards[13].display_name | |
| awards[13].funder_award_id | 2016YFB1000401 |
| awards[13].funder_display_name | Specialized Research Fund for the Doctoral Program of Higher Education of China |
| awards[14].id | https://openalex.org/G7699651490 |
| awards[14].funder_id | https://openalex.org/F4320336024 |
| awards[14].display_name | |
| awards[14].funder_award_id | 20124307130004 |
| awards[14].funder_display_name | Specialized Research Fund for the Doctoral Program of Higher Education of China |
| awards[15].id | https://openalex.org/G6673706365 |
| awards[15].funder_id | https://openalex.org/F4320335777 |
| awards[15].display_name | |
| awards[15].funder_award_id | 2012AA012706 |
| awards[15].funder_display_name | National Key Research and Development Program of China |
| awards[16].id | https://openalex.org/G5178501968 |
| awards[16].funder_id | https://openalex.org/F4320335777 |
| awards[16].display_name | |
| awards[16].funder_award_id | 61502509 |
| awards[16].funder_display_name | National Key Research and Development Program of China |
| awards[17].id | https://openalex.org/G5206713092 |
| awards[17].funder_id | https://openalex.org/F4320336024 |
| awards[17].display_name | |
| awards[17].funder_award_id | 61502509 |
| awards[17].funder_display_name | Specialized Research Fund for the Doctoral Program of Higher Education of China |
| biblio.issue | |
| biblio.volume | 2017 |
| biblio.last_page | 8 |
| biblio.first_page | 1 |
| topics[0].id | https://openalex.org/T10812 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9998999834060669 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1707 |
| topics[0].subfield.display_name | Computer Vision and Pattern Recognition |
| topics[0].display_name | Human Pose and Action Recognition |
| topics[1].id | https://openalex.org/T10531 |
| topics[1].field.id | https://openalex.org/fields/17 |
| topics[1].field.display_name | Computer Science |
| topics[1].score | 0.9995999932289124 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1707 |
| topics[1].subfield.display_name | Computer Vision and Pattern Recognition |
| topics[1].display_name | Advanced Vision and Imaging |
| topics[2].id | https://openalex.org/T10036 |
| topics[2].field.id | https://openalex.org/fields/17 |
| topics[2].field.display_name | Computer Science |
| topics[2].score | 0.9994999766349792 |
| topics[2].domain.id | https://openalex.org/domains/3 |
| topics[2].domain.display_name | Physical Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1707 |
| topics[2].subfield.display_name | Computer Vision and Pattern Recognition |
| topics[2].display_name | Advanced Neural Network Applications |
| funders[0].id | https://openalex.org/F4320321001 |
| funders[0].ror | https://ror.org/01h0zpd94 |
| funders[0].display_name | National Natural Science Foundation of China |
| funders[1].id | https://openalex.org/F4320335777 |
| funders[1].ror | |
| funders[1].display_name | National Key Research and Development Program of China |
| funders[2].id | https://openalex.org/F4320336024 |
| funders[2].ror | |
| funders[2].display_name | Specialized Research Fund for the Doctoral Program of Higher Education of China |
| is_xpac | False |
| apc_list.value | 2100 |
| apc_list.currency | USD |
| apc_list.value_usd | 2100 |
| apc_paid.value | 2100 |
| apc_paid.currency | USD |
| apc_paid.value_usd | 2100 |
| concepts[0].id | https://openalex.org/C41008148 |
| concepts[0].level | 0 |
| concepts[0].score | 0.8802144527435303 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[0].display_name | Computer science |
| concepts[1].id | https://openalex.org/C81363708 |
| concepts[1].level | 2 |
| concepts[1].score | 0.8785181045532227 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q17084460 |
| concepts[1].display_name | Convolutional neural network |
| concepts[2].id | https://openalex.org/C68339613 |
| concepts[2].level | 2 |
| concepts[2].score | 0.8572865724563599 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q1549489 |
| concepts[2].display_name | Speedup |
| concepts[3].id | https://openalex.org/C45374587 |
| concepts[3].level | 2 |
| concepts[3].score | 0.7273931503295898 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q12525525 |
| concepts[3].display_name | Computation |
| concepts[4].id | https://openalex.org/C45347329 |
| concepts[4].level | 3 |
| concepts[4].score | 0.669745683670044 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q5166604 |
| concepts[4].display_name | Convolution (computer science) |
| concepts[5].id | https://openalex.org/C75172450 |
| concepts[5].level | 2 |
| concepts[5].score | 0.4830530285835266 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q623950 |
| concepts[5].display_name | Fast Fourier transform |
| concepts[6].id | https://openalex.org/C154945302 |
| concepts[6].level | 1 |
| concepts[6].score | 0.46001195907592773 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[6].display_name | Artificial intelligence |
| concepts[7].id | https://openalex.org/C50644808 |
| concepts[7].level | 2 |
| concepts[7].score | 0.43547219038009644 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q192776 |
| concepts[7].display_name | Artificial neural network |
| concepts[8].id | https://openalex.org/C82687282 |
| concepts[8].level | 2 |
| concepts[8].score | 0.42212679982185364 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q66221 |
| concepts[8].display_name | Auxiliary memory |
| concepts[9].id | https://openalex.org/C153180895 |
| concepts[9].level | 2 |
| concepts[9].score | 0.4052404761314392 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q7148389 |
| concepts[9].display_name | Pattern recognition (psychology) |
| concepts[10].id | https://openalex.org/C173608175 |
| concepts[10].level | 1 |
| concepts[10].score | 0.37316036224365234 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q232661 |
| concepts[10].display_name | Parallel computing |
| concepts[11].id | https://openalex.org/C11413529 |
| concepts[11].level | 1 |
| concepts[11].score | 0.29743528366088867 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q8366 |
| concepts[11].display_name | Algorithm |
| concepts[12].id | https://openalex.org/C9390403 |
| concepts[12].level | 1 |
| concepts[12].score | 0.09286317229270935 |
| concepts[12].wikidata | https://www.wikidata.org/wiki/Q3966 |
| concepts[12].display_name | Computer hardware |
| keywords[0].id | https://openalex.org/keywords/computer-science |
| keywords[0].score | 0.8802144527435303 |
| keywords[0].display_name | Computer science |
| keywords[1].id | https://openalex.org/keywords/convolutional-neural-network |
| keywords[1].score | 0.8785181045532227 |
| keywords[1].display_name | Convolutional neural network |
| keywords[2].id | https://openalex.org/keywords/speedup |
| keywords[2].score | 0.8572865724563599 |
| keywords[2].display_name | Speedup |
| keywords[3].id | https://openalex.org/keywords/computation |
| keywords[3].score | 0.7273931503295898 |
| keywords[3].display_name | Computation |
| keywords[4].id | https://openalex.org/keywords/convolution |
| keywords[4].score | 0.669745683670044 |
| keywords[4].display_name | Convolution (computer science) |
| keywords[5].id | https://openalex.org/keywords/fast-fourier-transform |
| keywords[5].score | 0.4830530285835266 |
| keywords[5].display_name | Fast Fourier transform |
| keywords[6].id | https://openalex.org/keywords/artificial-intelligence |
| keywords[6].score | 0.46001195907592773 |
| keywords[6].display_name | Artificial intelligence |
| keywords[7].id | https://openalex.org/keywords/artificial-neural-network |
| keywords[7].score | 0.43547219038009644 |
| keywords[7].display_name | Artificial neural network |
| keywords[8].id | https://openalex.org/keywords/auxiliary-memory |
| keywords[8].score | 0.42212679982185364 |
| keywords[8].display_name | Auxiliary memory |
| keywords[9].id | https://openalex.org/keywords/pattern-recognition |
| keywords[9].score | 0.4052404761314392 |
| keywords[9].display_name | Pattern recognition (psychology) |
| keywords[10].id | https://openalex.org/keywords/parallel-computing |
| keywords[10].score | 0.37316036224365234 |
| keywords[10].display_name | Parallel computing |
| keywords[11].id | https://openalex.org/keywords/algorithm |
| keywords[11].score | 0.29743528366088867 |
| keywords[11].display_name | Algorithm |
| keywords[12].id | https://openalex.org/keywords/computer-hardware |
| keywords[12].score | 0.09286317229270935 |
| keywords[12].display_name | Computer hardware |
| language | en |
| locations[0].id | doi:10.1155/2017/8348671 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S72372694 |
| locations[0].source.issn | 1687-5265, 1687-5273 |
| locations[0].source.type | journal |
| locations[0].source.is_oa | False |
| locations[0].source.issn_l | 1687-5265 |
| locations[0].source.is_core | True |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | Computational Intelligence and Neuroscience |
| locations[0].source.host_organization | https://openalex.org/P4310319869 |
| locations[0].source.host_organization_name | Hindawi Publishing Corporation |
| locations[0].source.host_organization_lineage | https://openalex.org/P4310319869 |
| locations[0].source.host_organization_lineage_names | Hindawi Publishing Corporation |
| locations[0].license | cc-by |
| locations[0].pdf_url | http://downloads.hindawi.com/journals/cin/2017/8348671.pdf |
| locations[0].version | publishedVersion |
| locations[0].raw_type | journal-article |
| locations[0].license_id | https://openalex.org/licenses/cc-by |
| locations[0].is_accepted | True |
| locations[0].is_published | True |
| locations[0].raw_source_name | Computational Intelligence and Neuroscience |
| locations[0].landing_page_url | https://doi.org/10.1155/2017/8348671 |
| locations[1].id | pmid:29250109 |
| locations[1].is_oa | False |
| locations[1].source.id | https://openalex.org/S4306525036 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | False |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | PubMed |
| locations[1].source.host_organization | https://openalex.org/I1299303238 |
| locations[1].source.host_organization_name | National Institutes of Health |
| locations[1].source.host_organization_lineage | https://openalex.org/I1299303238 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | publishedVersion |
| locations[1].raw_type | |
| locations[1].license_id | |
| locations[1].is_accepted | True |
| locations[1].is_published | True |
| locations[1].raw_source_name | Computational intelligence and neuroscience |
| locations[1].landing_page_url | https://pubmed.ncbi.nlm.nih.gov/29250109 |
| locations[2].id | pmh:oai:doaj.org/article:bf020872a3c04909831fdc9ad286639d |
| locations[2].is_oa | False |
| locations[2].source.id | https://openalex.org/S4306401280 |
| locations[2].source.issn | |
| locations[2].source.type | repository |
| locations[2].source.is_oa | False |
| locations[2].source.issn_l | |
| locations[2].source.is_core | False |
| locations[2].source.is_in_doaj | False |
| locations[2].source.display_name | DOAJ (DOAJ: Directory of Open Access Journals) |
| locations[2].source.host_organization | |
| locations[2].source.host_organization_name | |
| locations[2].license | |
| locations[2].pdf_url | |
| locations[2].version | submittedVersion |
| locations[2].raw_type | article |
| locations[2].license_id | |
| locations[2].is_accepted | False |
| locations[2].is_published | False |
| locations[2].raw_source_name | Computational Intelligence and Neuroscience, Vol 2017 (2017) |
| locations[2].landing_page_url | https://doaj.org/article/bf020872a3c04909831fdc9ad286639d |
| locations[3].id | pmh:oai:europepmc.org:4644079 |
| locations[3].is_oa | True |
| locations[3].source.id | https://openalex.org/S4306400806 |
| locations[3].source.issn | |
| locations[3].source.type | repository |
| locations[3].source.is_oa | False |
| locations[3].source.issn_l | |
| locations[3].source.is_core | False |
| locations[3].source.is_in_doaj | False |
| locations[3].source.display_name | Europe PMC (PubMed Central) |
| locations[3].source.host_organization | https://openalex.org/I1303153112 |
| locations[3].source.host_organization_name | European Bioinformatics Institute |
| locations[3].source.host_organization_lineage | https://openalex.org/I1303153112 |
| locations[3].license | cc-by |
| locations[3].pdf_url | |
| locations[3].version | submittedVersion |
| locations[3].raw_type | Text |
| locations[3].license_id | https://openalex.org/licenses/cc-by |
| locations[3].is_accepted | False |
| locations[3].is_published | False |
| locations[3].raw_source_name | |
| locations[3].landing_page_url | https://www.ncbi.nlm.nih.gov/pmc/articles/5698830 |
| indexed_in | crossref, doaj, pubmed |
| authorships[0].author.id | https://openalex.org/A5101436439 |
| authorships[0].author.orcid | https://orcid.org/0000-0001-9667-3485 |
| authorships[0].author.display_name | Qiang Lan |
| authorships[0].countries | CN |
| authorships[0].affiliations[0].institution_ids | https://openalex.org/I170215575 |
| authorships[0].affiliations[0].raw_affiliation_string | College of Computer, National University of Defense Technology, Changsha 410073, China |
| authorships[0].affiliations[1].raw_affiliation_string | National Key Laboratory of Parallel and Distributed Processing, Changsha 410073, China |
| authorships[0].institutions[0].id | https://openalex.org/I170215575 |
| authorships[0].institutions[0].ror | https://ror.org/05d2yfz11 |
| authorships[0].institutions[0].type | education |
| authorships[0].institutions[0].lineage | https://openalex.org/I170215575 |
| authorships[0].institutions[0].country_code | CN |
| authorships[0].institutions[0].display_name | National University of Defense Technology |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Qiang Lan |
| authorships[0].is_corresponding | True |
| authorships[0].raw_affiliation_strings | College of Computer, National University of Defense Technology, Changsha 410073, China, National Key Laboratory of Parallel and Distributed Processing, Changsha 410073, China |
| authorships[1].author.id | https://openalex.org/A5034746334 |
| authorships[1].author.orcid | https://orcid.org/0000-0001-8517-6862 |
| authorships[1].author.display_name | Zelong Wang |
| authorships[1].countries | CN |
| authorships[1].affiliations[0].raw_affiliation_string | National Key Laboratory of Parallel and Distributed Processing, Changsha 410073, China |
| authorships[1].affiliations[1].institution_ids | https://openalex.org/I170215575 |
| authorships[1].affiliations[1].raw_affiliation_string | College of Computer, National University of Defense Technology, Changsha 410073, China |
| authorships[1].institutions[0].id | https://openalex.org/I170215575 |
| authorships[1].institutions[0].ror | https://ror.org/05d2yfz11 |
| authorships[1].institutions[0].type | education |
| authorships[1].institutions[0].lineage | https://openalex.org/I170215575 |
| authorships[1].institutions[0].country_code | CN |
| authorships[1].institutions[0].display_name | National University of Defense Technology |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Zelong Wang |
| authorships[1].is_corresponding | False |
| authorships[1].raw_affiliation_strings | College of Computer, National University of Defense Technology, Changsha 410073, China, National Key Laboratory of Parallel and Distributed Processing, Changsha 410073, China |
| authorships[2].author.id | https://openalex.org/A5101937502 |
| authorships[2].author.orcid | https://orcid.org/0000-0002-5875-3297 |
| authorships[2].author.display_name | Mei Wen |
| authorships[2].countries | CN |
| authorships[2].affiliations[0].raw_affiliation_string | National Key Laboratory of Parallel and Distributed Processing, Changsha 410073, China |
| authorships[2].affiliations[1].institution_ids | https://openalex.org/I170215575 |
| authorships[2].affiliations[1].raw_affiliation_string | College of Computer, National University of Defense Technology, Changsha 410073, China |
| authorships[2].institutions[0].id | https://openalex.org/I170215575 |
| authorships[2].institutions[0].ror | https://ror.org/05d2yfz11 |
| authorships[2].institutions[0].type | education |
| authorships[2].institutions[0].lineage | https://openalex.org/I170215575 |
| authorships[2].institutions[0].country_code | CN |
| authorships[2].institutions[0].display_name | National University of Defense Technology |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Mei Wen |
| authorships[2].is_corresponding | False |
| authorships[2].raw_affiliation_strings | College of Computer, National University of Defense Technology, Changsha 410073, China, National Key Laboratory of Parallel and Distributed Processing, Changsha 410073, China |
| authorships[3].author.id | https://openalex.org/A5100710936 |
| authorships[3].author.orcid | https://orcid.org/0000-0002-0944-2708 |
| authorships[3].author.display_name | Chunyuan Zhang |
| authorships[3].countries | CN |
| authorships[3].affiliations[0].institution_ids | https://openalex.org/I170215575 |
| authorships[3].affiliations[0].raw_affiliation_string | College of Computer, National University of Defense Technology, Changsha 410073, China |
| authorships[3].affiliations[1].raw_affiliation_string | National Key Laboratory of Parallel and Distributed Processing, Changsha 410073, China |
| authorships[3].institutions[0].id | https://openalex.org/I170215575 |
| authorships[3].institutions[0].ror | https://ror.org/05d2yfz11 |
| authorships[3].institutions[0].type | education |
| authorships[3].institutions[0].lineage | https://openalex.org/I170215575 |
| authorships[3].institutions[0].country_code | CN |
| authorships[3].institutions[0].display_name | National University of Defense Technology |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Chunyuan Zhang |
| authorships[3].is_corresponding | False |
| authorships[3].raw_affiliation_strings | College of Computer, National University of Defense Technology, Changsha 410073, China, National Key Laboratory of Parallel and Distributed Processing, Changsha 410073, China |
| authorships[4].author.id | https://openalex.org/A5100429826 |
| authorships[4].author.orcid | https://orcid.org/0000-0002-2913-4016 |
| authorships[4].author.display_name | Yijie Wang |
| authorships[4].countries | CN |
| authorships[4].affiliations[0].institution_ids | https://openalex.org/I170215575 |
| authorships[4].affiliations[0].raw_affiliation_string | College of Computer, National University of Defense Technology, Changsha 410073, China |
| authorships[4].affiliations[1].raw_affiliation_string | National Key Laboratory of Parallel and Distributed Processing, Changsha 410073, China |
| authorships[4].institutions[0].id | https://openalex.org/I170215575 |
| authorships[4].institutions[0].ror | https://ror.org/05d2yfz11 |
| authorships[4].institutions[0].type | education |
| authorships[4].institutions[0].lineage | https://openalex.org/I170215575 |
| authorships[4].institutions[0].country_code | CN |
| authorships[4].institutions[0].display_name | National University of Defense Technology |
| authorships[4].author_position | last |
| authorships[4].raw_author_name | Yijie Wang |
| authorships[4].is_corresponding | False |
| authorships[4].raw_affiliation_strings | College of Computer, National University of Defense Technology, Changsha 410073, China, National Key Laboratory of Parallel and Distributed Processing, Changsha 410073, China |
| has_content.pdf | True |
| has_content.grobid_xml | True |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | http://downloads.hindawi.com/journals/cin/2017/8348671.pdf |
| open_access.oa_status | hybrid |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | High Performance Implementation of 3D Convolutional Neural Networks on a GPU |
| has_fulltext | True |
| is_retracted | False |
| updated_date | 2025-11-06T03:46:38.306776 |
| primary_topic.id | https://openalex.org/T10812 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9998999834060669 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1707 |
| primary_topic.subfield.display_name | Computer Vision and Pattern Recognition |
| primary_topic.display_name | Human Pose and Action Recognition |
| related_works | https://openalex.org/W2058965144, https://openalex.org/W2164382479, https://openalex.org/W2146343568, https://openalex.org/W98480971, https://openalex.org/W2150291671, https://openalex.org/W2013643406, https://openalex.org/W2027972911, https://openalex.org/W2157978810, https://openalex.org/W3157543420, https://openalex.org/W2964954556 |
| cited_by_count | 30 |
| counts_by_year[0].year | 2025 |
| counts_by_year[0].cited_by_count | 3 |
| counts_by_year[1].year | 2024 |
| counts_by_year[1].cited_by_count | 2 |
| counts_by_year[2].year | 2023 |
| counts_by_year[2].cited_by_count | 2 |
| counts_by_year[3].year | 2022 |
| counts_by_year[3].cited_by_count | 1 |
| counts_by_year[4].year | 2021 |
| counts_by_year[4].cited_by_count | 9 |
| counts_by_year[5].year | 2020 |
| counts_by_year[5].cited_by_count | 5 |
| counts_by_year[6].year | 2019 |
| counts_by_year[6].cited_by_count | 4 |
| counts_by_year[7].year | 2018 |
| counts_by_year[7].cited_by_count | 4 |
| locations_count | 4 |
| best_oa_location.id | doi:10.1155/2017/8348671 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S72372694 |
| best_oa_location.source.issn | 1687-5265, 1687-5273 |
| best_oa_location.source.type | journal |
| best_oa_location.source.is_oa | False |
| best_oa_location.source.issn_l | 1687-5265 |
| best_oa_location.source.is_core | True |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | Computational Intelligence and Neuroscience |
| best_oa_location.source.host_organization | https://openalex.org/P4310319869 |
| best_oa_location.source.host_organization_name | Hindawi Publishing Corporation |
| best_oa_location.source.host_organization_lineage | https://openalex.org/P4310319869 |
| best_oa_location.source.host_organization_lineage_names | Hindawi Publishing Corporation |
| best_oa_location.license | cc-by |
| best_oa_location.pdf_url | http://downloads.hindawi.com/journals/cin/2017/8348671.pdf |
| best_oa_location.version | publishedVersion |
| best_oa_location.raw_type | journal-article |
| best_oa_location.license_id | https://openalex.org/licenses/cc-by |
| best_oa_location.is_accepted | True |
| best_oa_location.is_published | True |
| best_oa_location.raw_source_name | Computational Intelligence and Neuroscience |
| best_oa_location.landing_page_url | https://doi.org/10.1155/2017/8348671 |
| primary_location.id | doi:10.1155/2017/8348671 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S72372694 |
| primary_location.source.issn | 1687-5265, 1687-5273 |
| primary_location.source.type | journal |
| primary_location.source.is_oa | False |
| primary_location.source.issn_l | 1687-5265 |
| primary_location.source.is_core | True |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | Computational Intelligence and Neuroscience |
| primary_location.source.host_organization | https://openalex.org/P4310319869 |
| primary_location.source.host_organization_name | Hindawi Publishing Corporation |
| primary_location.source.host_organization_lineage | https://openalex.org/P4310319869 |
| primary_location.source.host_organization_lineage_names | Hindawi Publishing Corporation |
| primary_location.license | cc-by |
| primary_location.pdf_url | http://downloads.hindawi.com/journals/cin/2017/8348671.pdf |
| primary_location.version | publishedVersion |
| primary_location.raw_type | journal-article |
| primary_location.license_id | https://openalex.org/licenses/cc-by |
| primary_location.is_accepted | True |
| primary_location.is_published | True |
| primary_location.raw_source_name | Computational Intelligence and Neuroscience |
| primary_location.landing_page_url | https://doi.org/10.1155/2017/8348671 |
| publication_date | 2017-01-01 |
| publication_year | 2017 |
| referenced_works | https://openalex.org/W2168117308, https://openalex.org/W2144354855, https://openalex.org/W4255246772, https://openalex.org/W2519379752, https://openalex.org/W1983364832, https://openalex.org/W1005811612, https://openalex.org/W4231896027 |
| referenced_works_count | 7 |
| abstract_inverted_index.a | 39, 127, 154 |
| abstract_inverted_index.2D | 23, 111 |
| abstract_inverted_index.3D | 40, 119, 129, 161 |
| abstract_inverted_index.On | 74 |
| abstract_inverted_index.We | 114 |
| abstract_inverted_index.an | 70 |
| abstract_inverted_index.as | 12 |
| abstract_inverted_index.at | 66 |
| abstract_inverted_index.be | 6, 108 |
| abstract_inverted_index.in | 9 |
| abstract_inverted_index.is | 134 |
| abstract_inverted_index.it | 125, 141 |
| abstract_inverted_index.of | 47, 60, 69, 88, 149, 159, 164 |
| abstract_inverted_index.on | 22 |
| abstract_inverted_index.to | 5, 29, 34, 107, 126, 136, 142, 169 |
| abstract_inverted_index.up | 95 |
| abstract_inverted_index.we | 152 |
| abstract_inverted_index.FFT | 53 |
| abstract_inverted_index.For | 144 |
| abstract_inverted_index.and | 17, 42, 49, 91, 123, 139 |
| abstract_inverted_index.but | 62 |
| abstract_inverted_index.can | 56, 84, 93 |
| abstract_inverted_index.far | 44 |
| abstract_inverted_index.for | 110, 118, 157 |
| abstract_inverted_index.our | 145, 165 |
| abstract_inverted_index.the | 58, 67, 75, 78, 86, 96, 100, 116, 150, 160, 170 |
| abstract_inverted_index.was | 105 |
| abstract_inverted_index.This | 103 |
| abstract_inverted_index.cost | 68 |
| abstract_inverted_index.have | 3, 27 |
| abstract_inverted_index.many | 18 |
| abstract_inverted_index.more | 51 |
| abstract_inverted_index.most | 158 |
| abstract_inverted_index.much | 50 |
| abstract_inverted_index.such | 11 |
| abstract_inverted_index.test | 166 |
| abstract_inverted_index.this | 63 |
| abstract_inverted_index.thus | 92 |
| abstract_inverted_index.used | 135 |
| abstract_inverted_index.apply | 30, 124 |
| abstract_inverted_index.based | 21, 54 |
| abstract_inverted_index.comes | 65 |
| abstract_inverted_index.cuDNN | 171 |
| abstract_inverted_index.hand, | 77 |
| abstract_inverted_index.image | 13 |
| abstract_inverted_index.input | 41 |
| abstract_inverted_index.other | 19, 76 |
| abstract_inverted_index.shown | 106 |
| abstract_inverted_index.speed | 94 |
| abstract_inverted_index.tasks | 20 |
| abstract_inverted_index.video | 35 |
| abstract_inverted_index.which | 37, 133 |
| abstract_inverted_index.(WMFA) | 83 |
| abstract_inverted_index.amount | 59 |
| abstract_inverted_index.cuDNN. | 143 |
| abstract_inverted_index.highly | 7, 146 |
| abstract_inverted_index.larger | 45 |
| abstract_inverted_index.layers | 163 |
| abstract_inverted_index.memory | 48, 72 |
| abstract_inverted_index.neural | 1, 32, 112, 121, 131 |
| abstract_inverted_index.number | 87 |
| abstract_inverted_index.object | 15 |
| abstract_inverted_index.proven | 4 |
| abstract_inverted_index.reduce | 57, 85 |
| abstract_inverted_index.videos | 138 |
| abstract_inverted_index.Minimal | 80 |
| abstract_inverted_index.amounts | 46 |
| abstract_inverted_index.compare | 140 |
| abstract_inverted_index.inputs. | 24 |
| abstract_inverted_index.memory. | 102 |
| abstract_inverted_index.methods | 55 |
| abstract_inverted_index.network | 132, 167 |
| abstract_inverted_index.observe | 153 |
| abstract_inverted_index.popular | 128 |
| abstract_inverted_index.speedup | 156 |
| abstract_inverted_index.started | 28 |
| abstract_inverted_index.twofold | 155 |
| abstract_inverted_index.without | 98 |
| abstract_inverted_index.Winograd | 79 |
| abstract_inverted_index.classify | 137 |
| abstract_inverted_index.compared | 168 |
| abstract_inverted_index.networks | 2, 33, 122 |
| abstract_inverted_index.required | 90, 101 |
| abstract_inverted_index.requires | 43 |
| abstract_inverted_index.strategy | 104 |
| abstract_inverted_index.version. | 172 |
| abstract_inverted_index.Algorithm | 82 |
| abstract_inverted_index.Filtering | 81 |
| abstract_inverted_index.Recently, | 25 |
| abstract_inverted_index.algorithm | 117 |
| abstract_inverted_index.generally | 64 |
| abstract_inverted_index.implement | 115 |
| abstract_inverted_index.increased | 71 |
| abstract_inverted_index.networks. | 113 |
| abstract_inverted_index.optimized | 147 |
| abstract_inverted_index.tracking, | 16 |
| abstract_inverted_index.algorithm, | 151 |
| abstract_inverted_index.increasing | 99 |
| abstract_inverted_index.operations | 89 |
| abstract_inverted_index.successful | 8, 109 |
| abstract_inverted_index.constitutes | 38 |
| abstract_inverted_index.convolution | 162 |
| abstract_inverted_index.researchers | 26 |
| abstract_inverted_index.applications | 10 |
| abstract_inverted_index.computation, | 61, 97 |
| abstract_inverted_index.computation. | 52 |
| abstract_inverted_index.requirement. | 73 |
| abstract_inverted_index.Convolutional | 0 |
| abstract_inverted_index.convolutional | 31, 120, 130 |
| abstract_inverted_index.implementation | 148 |
| abstract_inverted_index.classification, | 14, 36 |
| cited_by_percentile_year.max | 99 |
| cited_by_percentile_year.min | 89 |
| corresponding_author_ids | https://openalex.org/A5101436439 |
| countries_distinct_count | 1 |
| institutions_distinct_count | 5 |
| corresponding_institution_ids | https://openalex.org/I170215575 |
| citation_normalized_percentile.value | 0.87615747 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | False |
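
The `abstract_inverted_index.*` rows above store the abstract as a map from each word to the zero-based positions where it occurs. As a minimal sketch, assuming the rows have been loaded into a plain Python dict, the text can be rebuilt by placing each word at its positions and joining them in order. The helper name `rebuild_abstract` and the `sample` dict are illustrative, not part of OpenAlex. The sample holds only the first nine positions from the table, with later occurrences of repeated words left out.

```python
def rebuild_abstract(inverted_index):
    """Rebuild running text from a word -> [positions] mapping."""
    positions = {}
    for word, indexes in inverted_index.items():
        for i in indexes:
            positions[i] = word  # each position holds exactly one word
    # Join the words in ascending position order.
    return " ".join(positions[i] for i in sorted(positions))


# Illustrative subset of the table above (positions 0 to 8 only).
sample = {
    "Convolutional": [0],
    "neural": [1],
    "networks": [2],
    "have": [3],
    "proven": [4],
    "to": [5],
    "be": [6],
    "highly": [7],
    "successful": [8],
}

print(rebuild_abstract(sample))
# prints: Convolutional neural networks have proven to be highly successful
```

Applied to the full set of `abstract_inverted_index` rows, the same function should reproduce the abstract shown at the top of the page, with punctuation staying attached to its word (for example `hand,` and `cuDNN.`).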