Education
- Ph.D. Candidate in Computer Science
- Virginia Tech, Blacksburg, VA
- Sep 2024 - Present
- M.S. in Data Analytics Engineering
- Northeastern University, Seattle, WA
- Sep 2021 - May 2023
- M.S. in Actuarial Science
- Columbia University, New York, NY
- Sep 2019 - Dec 2020
- B.A. in Mathematics
- Rutgers University, New Brunswick, NJ
- Sep 2015 - May 2019
Research Interests
- LLM agents & tool use
- Multimodal modeling
- RAG & knowledge-grounded reasoning
- Trustworthiness & explainable AI
Publications
- BTW: A Non-Parametric Variance Stabilization Framework for Multimodal Model Integration
- Jun Hou, Le Wang, Xuan Wang
- EMNLP 2025
- Explainable AI for Clinical Outcome Prediction: A Survey of Clinician Perceptions and Preferences
- Jun Hou, Lucy Lu Wang
- AMIA 2025 Informatics Summit
Experience
- Graduate Research Assistant
- Virginia Tech, Blacksburg, VA
- Sep 2024 - Present
- Advisor: Dr. Xuan Wang
- Building personalized LLM agents for healthcare, centered on a patient-centric temporal EHR graph to improve personalization accuracy and clinical relevance.
- Integrating the graph with retrieval-augmented LLM reasoning for prediction and explanation.
- Adopted a multimodal mixture-of-experts architecture for efficient and comprehensive clinical prediction tasks using the curated MIMIC-IV dataset, integrating time series, text, imaging, and signal data.
- Teaching Assistant (Applied Natural Language Processing)
- Northeastern University, Seattle, WA
- Sep 2023 - Dec 2023
- Held office hours, graded student projects and assignments, and collaborated with the professor on course design.
- Insurance-Linked Securities Co-op
- Amundi US, Boston, MA
- Jul 2022 - Dec 2022
- Created an interactive Shiny dashboard to enable user-friendly SQL database management with automated procedures.
- Research Assistant
- Columbia University, New York, NY
- May 2020 - Dec 2020
- Analyzed and predicted mortality trends using queries against a Snowflake database.
- Developed a determination model for mortality improvement trends using smoothing and interpolation.
Research Projects
- Survey for Validation of Explainable AI (XAI) in Clinical NLP
- Advisor: Dr. Lucy Lu Wang
- Jun 2023 - Mar 2024
- Conducted a comprehensive review of XAI methods used in clinical NLP and designed a questionnaire for clinical practitioners.
- Implemented XAI techniques, including Local Interpretable Model-Agnostic Explanations (LIME) and self-attention visualization, on UMLS-BERT and BioBERT models.
- Fine-tuned a BERT-based clinical outcome classification model for semantic textual similarity tasks.
- Used few-shot in-context learning with the GPT API to generate free-text rationales that enhance classification explanations.
- ESG Sentiment Analysis with Financial NLP
- Jan 2023 - Apr 2023
- Analyzed ESG news headlines to discern sentiment, topics, and entity relationships.
Technical Skills
- Programming & Frameworks: Python, PyTorch, R, Java
- Database & Data Tools: SQL, Data Analysis Toolkits