Tunhou Zhang
AutoRAC: Automated Processing-in-Memory Accelerator Design for Recommender Systems
The performance bottleneck of deep-learning-based recommender systems resides in their backbone Deep Neural Networks. By integrating Processing-In-Memory (PIM) architectures, researchers can reduce data movement and enhance energy efficien…
Towards Automated Model Design on Recommender Systems
The increasing popularity of deep learning models has created new opportunities for developing artificial intelligence–based recommender systems. Designing recommender systems using deep neural networks (DNNs) requires careful architecture…
Enhancing Performance and Scalability of Large-Scale Recommendation Systems with Jagged Flash Attention
The integration of hardware accelerators has significantly advanced the capabilities of modern recommendation systems, enabling the exploration of complex ranking paradigms previously deemed impractical. However, the GPU-based computati…
DistDNAS: Search Efficient Feature Interactions within 2 Hours
Search efficiency and serving efficiency are two major axes in building feature interactions and expediting the model development process in recommender systems. On large-scale benchmarks, searching for the optimal feature interaction desi…
Farthest Greedy Path Sampling for Two-shot Recommender Search
Weight-sharing Neural Architecture Search (WS-NAS) provides an efficient mechanism for developing end-to-end deep recommender models. However, in complex search spaces, distinguishing between superior and inferior architectures (or paths) …
LISSNAS: Locality-based Iterative Search Space Shrinkage for Neural Architecture Search
Search spaces hallmark the advancement of Neural Architecture Search (NAS). Large and complex search spaces with versatile building operators and structures provide more opportunities to brew promising architectures, yet pose severe challe…
PIDS: Joint Point Interaction-Dimension Search for 3D Point Cloud
The interaction and dimension of points are two important axes in designing point operators to serve hierarchical 3D models. Yet, these two axes are heterogeneous and challenging to fully explore. Existing works craft point operators under …
NASRec: Weight Sharing Neural Architecture Search for Recommender Systems
The rise of deep neural networks offers new opportunities in optimizing recommender systems. However, optimizing recommender systems using deep neural networks requires delicate architecture fabrication. We propose NASRec, a paradigm that …
Towards Collaborative Intelligence: Routability Estimation based on Decentralized Private Data
Applying machine learning (ML) in the design flow is a popular trend in EDA, with applications ranging from design quality prediction to optimization. Despite its promise, which has been demonstrated in both academic research and industrial…
NASGEM: Neural Architecture Search via Graph Embedding Method
Neural Architecture Search (NAS) automates and advances the design of neural networks. Estimator-based NAS has been proposed recently to model the relationship between architectures and their performance to enable scalable and flexible sea…
Automatic Routability Predictor Development Using Neural Architecture Search
The rise of machine learning technology inspires a boom of its applications in electronic design automation (EDA) and helps improve the degree of automation in chip designs. However, manually crafted machine learning models require extensi…
AutoShrink: A Topology-Aware NAS for Discovering Efficient Neural Architecture
Resource is an important constraint when deploying Deep Neural Networks (DNNs) on mobile and edge devices. Existing works commonly adopt the cell-based search approach, which limits the flexibility of network patterns in learned cell struc…
SwiftNet: Using Graph Propagation as Meta-knowledge to Search Highly Representative Neural Architectures
Designing neural architectures for edge devices is subject to constraints of accuracy, inference latency, and computational cost. Traditionally, researchers manually craft deep neural networks to meet the needs of mobile devices. Neural Ar…