Open source SwiftInfer from the Colossal-AI team: a TensorRT-based implementation of the StreamingLLM algorithm
Colossal-ai team is open source Swiftlnfer, a TensorRT-based implementation of the StreamingLLM algorithm. The StreamingLLM algorithm addresses the challenge that ...