About Me
I am a Senior Research Scientist at Salesforce Research. Prior to that, I obtained my Ph.D. degree at The Chinese University of Hong Kong, under the supervision of Prof. Michael R. Lyu and Prof. Irwin King.
I have rich research experience in both academia and industry with top-tier AI publications at NeurIPS, ACL, EMNLP, NAACL, FSE, IJCAI, and ICASSP and working experience at leading AI research labs such as Microsoft Research Asia, Tencent AI, Amazon AWS AI, and Salesforce Research.
My research interest focuses on large language model (LLM) pretraining for code. My passion is developing AI-powered software solutions to improve programmers’ productivity.
See my Google scholar for more details. My Email: wangyue2714@gmail.com.
Research Interests
- In general: Deep Learning, Machine Learning for Code, Natural Language Processing, Multimodal Learning
- LLMs for Code: Language Model Pretraining for Code, Code Understanding and Generation
- Multimodal Learning: Vision-Language Pretraining, Visual Dialog, Cross-media Understanding
Publications
- Yue Wang*, Hung Le*, Akhilesh Deepak Gotmare, Nghi D.Q. Bui, Junnan Li, Steven Hoi. “CodeT5+: Open Code Large Language Models for Code Understanding and Generation.”, (preprint 2023). [paper][code][blog] (* equal contribution)
- Weishi Wang*, Yue Wang*, Shafiq Joty, Steven Hoi. “RAP-Gen: Retrieval-Augmented Patch Generation with CodeT5 for Automatic Program Repair.”, (FSE 2023). [paper] (* equal contribution)
- Nghi D.Q. Bui, Hung Le, Yue Wang, Junnan Li, Akhilesh Deepak Gotmare, Steven Hoi. CODETF: One-stop Transformer Library for State-of-the-art Code LLMs. (preprint 2023) [paper][code]
- Hung Le*, Yue Wang*, Akhilesh Deepak Gotmare, Silvio Savarese, Steven Hoi. “CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning”, (NeurIPS 2022) [paper][code][blog][media] (* equal contribution)
- Nghi D. Q. Bui, Yue Wang, Steven Hoi. Detect-Localize-Repair: A Unified Framework for Learning to Debug with CodeT5. (EMNLP2022 Findings) [paper]
- Yue Wang, Weishi Wang, Shafiq Joty, and Steven C.H. Hoi. “CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation”, (EMNLP 2021) [paper][code][blog][media][slide][poster]
- Yue Wang, Cuong Hoang, and Marcello Federico. “Towards Modeling the Style of Translators in Neural Machine Translation”. (NAACL 2021) [paper]
- Surafel M Lakew, Marcello Federico, Yue Wang, Cuong Hoang, Yogesh Virkar, Roberto Barra-Chicote, Robert Enyedi. “Machine Translation Verbosity Control for Automatic Dubbing”. (ICASSP 2021) [paper]
- Yue Wang, Shafiq Joty, Michael R. Lyu, Irwin King, Caiming Xiong, and Steven C.H. Hoi. VD-BERT: A Unified Vision and Dialog Transformer with BERT. (EMNLP 2020, our VD-BERT has been ranked No.1 in the Visual Dialog leaderboard from 01/2020-05/2020).[paper][code][slide][zhihu][video]
- Yue Wang, Jing Li, Michael Lyu and Irwin King. Cross-Media Keyphrase Prediction: A Unified Framework with Multi-Modality Multi-Head Attention and Image Wordings. (EMNLP2020)[paper][code][slide][video]
- Yue Wang, Jing Li, Hou Pong Chan, Irwin King, Michael R. Lyu, Shuming Shi. Topic-Aware Neural Keyphrase Generation for Social Media Language. Florence, Italy (ACL 2019). [paper][code][poster]
- Yue Wang, Jing Li, Irwin King, Michael R. Lyu, Shuming Shi. Microblog Hashtag Generation via Encoding Conversation Contexts. Minneapolis, USA (NAACL-HLT 2019, Oral). [paper][code][slide][video]
- Jian Li, Yue Wang, Michael R. Lyu, Irwin King. Code Completion with Neural Attention and Pointer Networks. Stockholm, Sweden (IJCAI 2018, Oral). [paper][slide]
Professtional Activities
- Journal Reviewer: TKDE, TIST, TOSEM
- Conference Reviewer: EMNLP2019, AAAI2019, AAAI2020, COLING2020, ICONIP2020, ACL2021, EMNLP2021, ARR 2022
- Conference Subreviewer: NIPS2017, SIGIR2017, EMNLP2018, IJCAI2018, NAACL2019, ACL2019, ACL2020, EMNLP2020
- Attended Conferences: ICSE2018, International PhD Forum 2018, NAACL2019, ACL2019, EMNLP2020, EMNLP2021, NeurIPS 2022
Education
Experience
- Summer 2015, Research Intern, System Group@Microsoft Research Asia, Beijing. Mentor: Cheng Chen
- Spring 2016, Visiting Student, Department of Electrical and Computer Engineering@Brigham Young University, Utah, USA. Advisor: Prof. Brent Nelson
- Summer 2017, Project Collaborator, 2012 Lab@Huawei, Shenzhen
- Summer 2018, Research Intern, NLP Center@Tencent AI Lab, Shenzhen. Mentor: Prof. Jing Li
- Summer 2019, Research Intern, Salesforce Research, Singapore. Mentor: Prof. Steven Hoi and Prof. Shafiq Rayhan Joty
- Summer 2020, Applied Scientist Intern, Amazon AWS AI Lab, California, USA. Mentor: Marcello Federico and Hoang Cuong
Selected Awards
- 2013, National Scholarship awarded by The Ministry of Education, China (Top 1)
- 2014, 2015, National Endeavor Fellowship awarded by The Ministry of Education, China (Top 3)
- 2015, Honorable Mention in Mathematical Contest in Modeling
- 2015, Third Prize in the 13th Guangdong Collegiate Programming Contest (ACM-ICPC GDCPC’2015)
- 2016, Outstanding Graduate Student of Sun Yat-sen University (Top 3)
- 2016-2020, CUHK Postgraduate Student Scholarship
Hobbies
- Playing badminton/tennis/table tennis, hiking/cycling/swimming, drone video making
- Volunteer activities: I co-founded Batou volunteer association (Shantou, China) in summer 2013. We offer free tutoring to high school students each summer in my hometown. More than 1000 students have been served until now.
Some Useful Links