Bingyang Wu
Bingyang Wu
About Me
Publications
Light
Dark
Automatic
Zili Zhang
Latest
RLHFuse: Efficient RLHF Training for Large Language Models with Inter- and Intra-Stage Fusion
dLoRA: Dynamically Orchestrating Requests and Adapters for LoRA LLM Serving
Fast Distributed Inference Serving for Large Language Models
Transparent GPU Sharing in Container Clouds for Deep Learning Workloads
Cite
×