| Year | Rank | Type | Title / Venue / Authors |
|---|---|---|---|
| 2026 | A* | conf | HPCA |
| 2026 | J | jnl | CoRR |
| 2025 | A* | conf | MICRO |
| 2025 | J | jnl | CoRR |
| 2024 | J | jnl | IEEE Access |
| 2024 | — | conf | ICTC |
| 2024 | — | conf | ASPLOS (3) |
| 2024 | A* | conf | InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management. OSDI |
| 2024 | J | jnl | InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management. CoRR |
| 2024 | A* | conf | ISCA |
| 2024 | J | jnl | CoRR |
| 2023 | — | conf | ICTC |
| 2023 | A* | conf | ISCA |
| 2023 | — | conf | ICTC |
| 2022 | — | conf | ICTC |
| 2021 | — | conf | ICTC |
| 2017 | C | conf | APNOMS |