ISCA 2024 Workshop MLArchSys Submissions
MoE-ERAS: Expert Residency Aware Selection
Abhimanyu Rajeshkumar Bambhaniya, Sashankh Chengavalli Kumar, Tushar Krishna
Published: 30 May 2024, Last Modified: 08 Jun 2024
MLArchSys 2024 OralPoster
Readers: Everyone
Fast DL-based Simulation with Microarchitecture Agnostic Traces and Instruction Embeddings
Santosh Pandey, Amir Yazdanbakhsh, Hang Liu
Published: 30 May 2024, Last Modified: 14 Jun 2024
MLArchSys 2024 OralPoster
Readers: Everyone
LLMServingSim: A Simulation Infrastructure for LLM Inference Serving Systems
Jaehong Cho, Minsu Kim, Hyunmin Choi, Jongse Park
Published: 30 May 2024, Last Modified: 08 Jun 2024
MLArchSys 2024 OralPoster
Readers: Everyone
Towards a Standardized Representation for Deep Learning Collective Algorithms
Jinsun Yoo, William Won, Meghan Cowan, Nan Jiang, Benjamin Klenk, Srinivas, Tushar Krishna
Published: 30 May 2024, Last Modified: 07 Jun 2024
MLArchSys 2024 OralPoster
Readers: Everyone
LayerDAG: A Layerwise Autoregressive Diffusion Model of Directed Acyclic Graphs for System
Mufei Li, Viraj Shitole, Eli Chien, Changhai Man, Zhaodong Wang, Srinivas, Ying Zhang, Tushar Krishna, Pan Li
Published: 30 May 2024, Last Modified: 17 Jun 2024
MLArchSys 2024 OralPoster
Readers: Everyone
Lightweight Vision Transformers for Low Energy Edge Inference
Shashank Nag, Logan Liberty, Aishwarya Sivakumar, Neeraja J Yadwadkar, Lizy Kurian John
Published: 30 May 2024, Last Modified: 16 Jun 2024
MLArchSys 2024 OralPoster
Readers: Everyone
PINCH: Accelerating Distributed GNN Training through In-Kernel Operation Using eBPF
Jianchang Su, Yifan Zhang, Wei Zhang
Published: 30 May 2024, Last Modified: 16 Jun 2024
MLArchSys 2024 OralPoster
Readers: Everyone
Misam: Using ML in Dataflow Selection of Sparse-Sparse Matrix Multiplication
Sanjali Yadav, Bahar Asgari
Published: 30 May 2024, Last Modified: 08 Jun 2024
MLArchSys 2024 OralPoster
Readers: Everyone
Peridot: Accelerating Out-of-Core GCN Data Reuse Pattern and Co-Design on GPU
Jayakody Arachchige Shakya Druvichapa, Jun Wang
Published: 30 May 2024, Last Modified: 08 Jun 2024
MLArchSys 2024 OralPoster
Readers: Everyone
FuseMax: Leveraging Extended Einsums to Optimize Attention Accelerator Design
Nandeeka Nayak, Xinrui Wu, Toluwanimi O. Odemuyiwa, Michael Pellauer, Joel Emer, Christopher W Fletcher
Published: 30 May 2024, Last Modified: 08 Jun 2024
MLArchSys 2024 OralPoster
Readers: Everyone
ConvBench: A Comprehensive Benchmark for 2D Convolution Primitive Evaluation
Lucas Fernando Alvarenga e Silva, Victor Ferrari, Rafael Cardoso Fernandes Sousa, Marcio Pereira, Guido Araujo
Published: 30 May 2024, Last Modified: 16 Jun 2024
MLArchSys 2024 OralPoster
Readers: Everyone
Allegro: GPU Simulation Acceleration for Machine Learning Workloads
Euijun Chung, Seonjin Na, Hyesoon Kim
Published: 30 May 2024, Last Modified: 11 Jun 2024
MLArchSys 2024 OralPoster
Readers: Everyone
Fine-grained Trace-driven Performance Modeling and Simulation for Large-scale ML Training
Mingyu Liang, Hiwot Tadese Kassa, Wenyin Fu, Brian Coutinho, Louis Feng, Christina Delimitrou
Published: 30 May 2024, Last Modified: 23 Jun 2024
MLArchSys 2024 OralPoster
Readers: Everyone
FedRepre: An Efficient and Scalable Federated Learning Framework with Client Representative Mechanism and Specialized Server Architecture
Yitu Wang, Minxue Tang, Hanqiu Chen, Shiyu Li, Qilin Zheng, Cong Guo, Andrew Chang, Callie Hao, Hai Li, Yiran Chen
Published: 30 May 2024, Last Modified: 17 Jun 2024
MLArchSys 2024 OralPoster
Readers: Everyone