


Yash Akhauri
2020 – today
- 2025
- [i16] Yuzong Chen, Xilai Dai, Chi-chih Chang, Yash Akhauri, Mohamed S. Abdelfattah: The Power of Negative Zero: Datatype Customization for Quantized Large Language Models. CoRR abs/2501.04052 (2025)
- [i15] Ahmed F. AbouElhamayed, Jordan Dotzel, Yash Akhauri, Chi-Chih Chang, Sameh Gobriel, J. Pablo Muñoz, Vui Seng Chua, Nilesh Jain, Mohamed S. Abdelfattah: SparAMX: Accelerating Compressed LLMs Token Generation on AMX-powered CPUs. CoRR abs/2502.12444 (2025)
- [i14] Yash Akhauri, Ahmed F. AbouElhamayed, Yifei Gao, Chi-Chih Chang, Nilesh Jain, Mohamed S. Abdelfattah: TokenButler: Token Importance is Predictable. CoRR abs/2503.07518 (2025)
- [i13] Chi-Chih Chang, Chien-Yu Lin, Yash Akhauri, Wei-Cheng Lin, Kai-Chiang Wu, Luis Ceze, Mohamed S. Abdelfattah: xKV: Cross-Layer SVD for KV-Cache Compression. CoRR abs/2503.18893 (2025)
- 2024
- [c8] Yash Akhauri, Ahmed F. AbouElhamayed, Jordan Dotzel, Zhiru Zhang, Alexander M. Rush, Safeen Huda, Mohamed S. Abdelfattah: ShadowLLM: Predictor-based Contextual Sparsity for Large Language Models. EMNLP 2024: 19154-19167
- [c7] Yash Akhauri, Mohamed S. Abdelfattah: Encodings for Prediction-based Neural Architecture Search. ICML 2024
- [c6] Yash Akhauri, Mohamed S. Abdelfattah: On Latency Predictors for Neural Architecture Search. MLSys 2024
- [i12] Yash Akhauri, Mohamed S. Abdelfattah: On Latency Predictors for Neural Architecture Search. CoRR abs/2403.02446 (2024)
- [i11] Yash Akhauri, Mohamed S. Abdelfattah: Encodings for Prediction-based Neural Architecture Search. CoRR abs/2403.02484 (2024)
- [i10] Jordan Dotzel, Yash Akhauri, Ahmed S. AbouElhamayed, Carly Jiang, Mohamed S. Abdelfattah, Zhiru Zhang: Radial Networks: Dynamic Layer Routing for High-Performance Large Language Models. CoRR abs/2404.04900 (2024)
- [i9] Yash Akhauri, Ahmed F. AbouElhamayed, Jordan Dotzel, Zhiru Zhang, Alexander M. Rush, Safeen Huda, Mohamed S. Abdelfattah: ShadowLLM: Predictor-based Contextual Sparsity for Large Language Models. CoRR abs/2406.16635 (2024)
- [i8] Yash Akhauri, Safeen Huda, Mohamed S. Abdelfattah: Attamba: Attending To Multi-Token States. CoRR abs/2411.17685 (2024)
- 2023
- [c5] Yash Akhauri, Mohamed S. Abdelfattah: Multi-Predict: Few Shot Predictors For Efficient Neural Architecture Search. AutoML 2023: 23/1-23
- [i7] Yash Akhauri, Mohamed S. Abdelfattah: Multi-Predict: Few Shot Predictors For Efficient Neural Architecture Search. CoRR abs/2306.02459 (2023)
- 2022
- [c4] Yash Akhauri, Juan Pablo Muñoz, Nilesh Jain, Ravi Iyer: EZNAS: Evolving Zero-Cost Proxies For Neural Architecture Scoring. NeurIPS 2022
- [i6] Yash Akhauri, J. Pablo Munoz, Nilesh Jain, Ravi Iyer: Evolving Zero Cost Proxies For Neural Architecture Scoring. CoRR abs/2209.07413 (2022)
- 2021
- [i5] Yash Akhauri, Adithya Niranjan, J. Pablo Muñoz, Suvadeep Banerjee, Abhijit Davare, Pasquale Cocchini, Anton A. Sorokin, Ravi Iyer, Nilesh Jain: RHNAS: Realizable Hardware and Neural Architecture Search. CoRR abs/2106.09180 (2021)
- [i4] J. Pablo Muñoz, Nikolay Lyalyushkin, Yash Akhauri, Anastasia Senina, Alexander Kozlov, Nilesh Jain: Enabling NAS with Automated Super-Network Generation. CoRR abs/2112.10878 (2021)
- 2020
- [c3] Yaman Umuroglu, Yash Akhauri, Nicholas J. Fraser, Michaela Blott: High-Throughput DNN Inference with LogicNets. FCCM 2020: 238
- [c2] Yaman Umuroglu, Yash Akhauri, Nicholas James Fraser, Michaela Blott: LogicNets: Co-Designed Neural Networks and Circuits for Extreme-Throughput Applications. FPL 2020: 291-297
- [i3] Yaman Umuroglu, Yash Akhauri, Nicholas J. Fraser, Michaela Blott: LogicNets: Co-Designed Neural Networks and Circuits for Extreme-Throughput Applications. CoRR abs/2004.03021 (2020)
- [i2] Yash Akhauri: Exposing Hardware Building Blocks to Machine Learning Frameworks. CoRR abs/2004.05898 (2020)
2010 – 2019
- 2019
- [c1] Yash Akhauri: HadaNets: Flexible Quantization Strategies for Neural Networks. CVPR Workshops 2019: 526-534
- [i1] Yash Akhauri: HadaNets: Flexible Quantization Strategies for Neural Networks. CoRR abs/1905.10759 (2019)
last updated on 2025-04-20 23:52 CEST by the dblp team
all metadata released as open data under CC0 1.0 license