knox revised this Gist.
1 file changed, 6 insertions
vLLM-readme-2.bib (file created)
@@ -0,0 +1,6 @@
+@inproceedings{kwon2023efficient,
+  title={Efficient Memory Management for Large Language Model Serving with PagedAttention},
+  author={Woosuk Kwon and Zhuohan Li and Siyuan Zhuang and Ying Sheng and Lianmin Zheng and Cody Hao Yu and Joseph E. Gonzalez and Hao Zhang and Ion Stoica},
+  booktitle={Proceedings of the ACM SIGOPS 29th Symposium on Operating Systems Principles},
+  year={2023}
+}