EUNO.NEWS EUNO.NEWS
  • All (20292) +229
  • AI (3103) +13
  • DevOps (906) +6
  • Software (10480) +161
  • IT (5755) +49
  • Education (48)
  • Notice
  • All (20292) +229
    • AI (3103) +13
    • DevOps (906) +6
    • Software (10480) +161
    • IT (5755) +49
    • Education (48)
  • Notice
  • All (20292) +229
  • AI (3103) +13
  • DevOps (906) +6
  • Software (10480) +161
  • IT (5755) +49
  • Education (48)
  • Notice
Sources Tags Search
한국어 English 中文
  • 1个月前 · ai

    [Paper] AugServe:自适应请求调度用于增强大型语言模型推理服务

    随着带有外部工具的增强型大型语言模型(LLMs)在网页应用中日益流行,提升增强型 LLM 推理服务的效率……

    #LLM serving #adaptive scheduling #dynamic batching #inference optimization #augmented LLM
EUNO.NEWS
RSS GitHub © 2026