About Me
I am pursuing a bachelor’s degree at The Hong Kong University of Science and Technology (HKUST), double majoring in Mathematics and Computer Science. I am currently working on Conformal Prediction and Selective Inference, where I am fortunate to be advised by Prof. Ying Jin at the Wharton School of the University of Pennsylvania. Additionally, I am exploring Conformal Prediction with Latent Structure under the supervision of Prof. Xinzhou Guo in the Mathematics Department at HKUST. Before that, I was a research assistant working on Cognitive Science and Intention Understanding at the KnowComp Group at HKUST, where I have been fortunate to be advised by Prof. Yangqiu Song. I visited the School of Mathematics at ETH Zurich as a university exchange student during the spring of 2025.
Under Review
[1] Multi-Distribution Robust Conformal Prediction [Code] [Paper]
[2] SessionIntentBench: Multi-task Inter-Session Intention-Shift Modeling Benchmark for E-commerce Customer Behavior Understanding [Code] [Paper]
Research Experience
Sep2025-Present Capstone Project at the Mathematics Department, HKUST, adviced by Prof. Xinzhou Guo
- Conformal prediction with latent structure, leveraging effective latent cluster modeling at both the covariate and residual levels; has already demonstrated strong empirical performance compared to existing methods such as PCP (Posterior Conformal Prediction). [preliminary results]
May2025-Present Research Assistant at Wharton, adviced by Prof. Ying Jin
- Developed a novel conformal prediction method addressing both classification and regression problems in a multi-source environment, where data are drawn from heterogeneous distributions. Established conformal set with high efficiency and tight worst-case coverage under both marginal and conditional settings.
Apr2024-Feb2025 Research Assistant at KnowComp Group, adviced by Prof. Yangqiu Song
- Proposed new intention modeling frameworks for underexplored session interactions:
- developed a paradigm that is easy to deploy, automate and scale
- Implemented the whole benchmarking / dataset generation pipeline:
- data cleaning, preprocessing
- prompt engineering: Zero-shot, Few-shots, COT
- L(V)LM generations: feature extraction, aligned generation, aligned verification
- data labeling stage: annotation protocol writing, testing and evaluation
- model evaluation: LLM, LVLM / open-source models (mostly Transformer models), proprietary API models
- model finetuning: LLM, LVLM finetuning via Llama-Factory
- structural codes: coding the logics and structures that enable above functionalities
Oct2024-Dec2024 Research Project adviced by Prof. Can Yang
Feb2024-May2024 Research Project adviced by Prof. Qifeng Chen
Industry Experience
Jan2024-Jan2024 Research Analyst Intern at Jihai Investment (Hedge Fund)
Dec2023-Jan2024 Investment Banking Intern at Zhe Shang Securities
Jun2023-Aug2023 Software Developer Intern at Hundsun (world top 50 Fintech)
Projects
[1] Gaussian Process Boosting Theories and Simulation Studies
[2] Deep Convolutional GAN and CycleGAN for Image Processing and Generation [Code]
Awards
Aug2024 China Construction Bank Asia Scholarships (top 3 first prize winning awardee)
May2024 The S.S. Chern Class Scholarship (top 20 math student researchers)
Apr2024 Chiaphua Industries Limited Scholarships (top 1% tier invitation and final awardee)
Oct2023 University’s Scholarship Scheme for Continuing UG Student (top 1% tier)
Jun2022 National College Entrance Examination (GaoKao) (top 0.5% among 360,000 participants)
Contact
Reach me at yyangfd [at] connect [dot] ust [dot] hk for potential collaborations or visit my GitHub / LinkedIn home page for more information.
