My name is Qiyuan Chen (陈启源) and I am currently a Ph.D. student at the College of Computer Science and Technology, Zhejiang University, co-supervised by Prof. Jian Wu and Dr. Jintai Chen. Before that, I got my B.Sc. degree from the School of Mathematics and Statistics, Central China Normal University, under the guidance of Prof. Bo Li and Dr. Haitong Yang.
My current research interests primarily include: Multi Modal Learning (MM), Machine Learning (ML) and Natural Language Processing (NLP). Specifically,
- Multi Modal Learning and its Applications in Medical
- Theories and Methods in Semi-Supervised Learning
- Tabular Data Prediction and Tabular Reasoning
- Modules and Further Applications in RAG systems
Feel free to contact me if your research lies within these or related areas!
And I work really closely with Qian Shao, Xuming Hu, Hongsen Huang and Zepeng Li.
In my free time, I really enjoy developing interesting websites and tools using Java (sometimes Python with FastAPI) and Vue.js. You can check out my projects on my GitHub if you’re interested.
I also enjoy digital devices (mechanical keyboards/NAS, etc.), badminton and chess in my spare time.
🔥 News
-
2024.3, One paper accepted by NAACL 2024.
-
2023.11, I am honored to have been awarded the Principal’s Scholarship.
📝 Publications
(*: Equal contribution; $\dagger$: Corresponding author(s))
2024
- Mind’s Mirror: Distilling Self-Evaluation Capability and Comprehensive Thinking from Large Language Models [NLP]; Weize Liu, Guocong Li, Kai Zhang, Bang Du, Qiyuan Chen, Xuming Hu, Hongxia Xu, Jintai Chen, Jian Wu; NAACL; 2024.
💻 Projects
1. Luotuo: An Instruction-following Chinese Language model, LoRA tuning on LLaMA (Founder)
This project has already received over 3.5k stars.
This project contains multiple sub-projects. For more sub-projects, please view the project homepage.
-
LuotuoEmbedding: Generative Text Embedding Model distilled from OpenAI API
-
LuotuoQA: Better Conversational Question Answering Model with Answer Completion
2. ChatInterview (Team Lead, Main Contributor)
You can check out the DEMO here.
This is a web-based tool that utilizes Large Language Models (LLMs) to help job seekers practice for interviews using state-of-the-art technologies such as RAG and Agent.
As the team lead, I spearheaded both the architectural design and the development of backend algorithms.
NOTE: Due to privacy considerations, this project will be fully open-sourced in the second half of 2024.
3. GPU Server Helper (Still in Developing, Founder)
This is a web-based tool designed for viewing and managing GPU server resources, suitable for laboratories or research institutions to manage and schedule large numbers of GPU servers.
I did all the coding for this project by myself, using FastAPI and Vue3.
🏅 Honors and Awards
- 2023.11, Principal’s Scholarship (Undergraduate) (ONLY 10 people in the whole school each year)
- 2021.11, National First Prize in the China Undergraduate Mathematical Contest in Model (CUMCM)
📖 Educations
-
2024.09 - 2029.06 (expected), Ph.D. in Artificial Intelligence, College of Computer Science and Technology, Zhejiang University. Supervised by Prof. Jian Wu and Dr. Jintai Chen.
-
2020.09 - 2024.06, B.Sc. in Statistics, School of Mathematics and Statistics, Central China Normal University. Advised by Prof. Bo Li and Dr. Haitong Yang.
🎒 Visiting and Internship
🔎 Reviews
- Review for Conferences: ACL, EMNLP
- Review for Journals: