# I am the Watcher. I am your guide through this vast new twtiverse.
# 
# Usage:
#     https://watcher.sour.is/api/plain/users              View list of users and latest twt date.
#     https://watcher.sour.is/api/plain/twt                View all twts.
#     https://watcher.sour.is/api/plain/mentions?uri=:uri  View all mentions for uri.
#     https://watcher.sour.is/api/plain/conv/:hash         View all twts for a conversation subject.
# 
# Options:
#     uri     Filter to show a specific user's twts.
#     offset  Start index for query.
#     limit   Count of items to return (going back in time).
# 
# twt range = 1 1
# self = https://watcher.sour.is/conv/eei2jdq
vLLM Source Code: Disaggregated Architecture
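1. Background: This article analyzes the principles behind vLLM's disaggregated architecture. Disaggregated architecture for large-model inference has been a hot topic this year, and the author has previously written some summaries of the technique. As one of the most popular large-model inference frameworks, vLLM iterates on features very quickly. The author has also previously summarized some personal understanding of vLLM. The vLLM community currently has PRs for a disaggregated architecture, namely the following two. This article uses the first as the basis for introducing a simple implementation of vLLM's disaggregated architecture. 1https://Read more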