Bingyang Wu
Bingyang Wu
About Me
Publications
Light
Dark
Automatic
3
RLHFuse: Efficient RLHF Training for Large Language Models with Inter- and Intra-Stage Fusion
Reinforcement Learning from Human Feedback (RLHF) enhances the alignment between LLMs and human preference. The workflow of RLHF …
Yinmin Zhong
,
Zili Zhang
,
Bingyang Wu
,
Shengyu Liu
,
Yukun Chen
,
Changyi Wan
,
Hanpeng Hu
,
Lei Xia
,
Ranchen Ming
,
Yibo Zhu
,
Xin Jin
PDF
Cite
A Survey of Resource-efficient LLM and Multimodal Foundation Models
Large foundation models, including large language models (LLMs), vision transformers (ViTs), diffusion, and LLM-based multimodal …
Mengwei Xu
,
Wangsong Yin
,
Dongqi Cai
,
Rongjie Yi
,
Daliang Xu
,
Qipeng Wang
,
Bingyang Wu
,
Yihao Zhao
,
Chen Yang
,
Shihe Wang
,
Qiyang Zhang
,
Zhenyan Lu
,
Li Zhang
,
Shangguang Wang
,
Yuanchun Li
,
Yunxin Liu
,
Xin Jin
,
Xuanzhe Liu
PDF
Cite
DOI
Fast Distributed Inference Serving for Large Language Models
Large language models (LLMs) power a new generation of interactive AI applications exemplified by ChatGPT. The interactive nature of …
Bingyang Wu
,
Yinmin Zhong
,
Zili Zhang
,
Gang Huang
,
Xuanzhe Liu
,
Xin Jin
PDF
Cite
DOI
Cite
×