EUNO.NEWS
  • All (2379) +221
  • AI (548) +19
  • DevOps (142) +2
  • Software (998) +131
  • IT (686) +68
  • Education (5) +1
  • Notice
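  • 1 week ago · ai

    [Paper] DSD: A Distributed Speculative Decoding Solution for Edge‑Cloud Agile Large Model Serving

    Large language model (LLM) inference often suffers from high decoding latency and limited scalability in heterogeneous edge‑cloud environments. Existing…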

    #speculative decoding #LLM serving #edge‑cloud inference #distributed inference #adaptive window control
RSS GitHub © 2025