March 24, 2025

“ADOR: A Design Exploration Framework for LLM Serving with Enhanced Latency and Throughput,” IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2025
