Cassandra: Enabling Reasoning LLMs at Edge via Self-Speculative Decoding

Posted by chrsw | 2 hours ago | 0 comments