Education

  • Ph.D. Candidate in Computer Science
    • Virginia Tech, Blacksburg, VA
    • Sep 2024 - Present
  • M.S. in Data Analytics Engineering
    • Northeastern University, Seattle, WA
    • Sep 2021 - May 2023
  • M.S. in Actuarial Science
    • Columbia University, New York, NY
    • Sep 2019 - Dec 2020
  • B.A. in Mathematics
    • Rutgers University, New Brunswick, NJ
    • Sep 2015 - May 2019

Research Interests

  • LLM agents & tool use
  • Multimodal Modeling
  • RAG & knowledge-grounded reasoning
  • Trustworthiness & Explainable AI

Publications

  • BTW: A Non-Parametric Variance Stabilization Framework for Multimodal Model Integration
    • Jun Hou, Le Wang, Xuan Wang
    • EMNLP 2025
  • Explainable AI for Clinical Outcome Prediction: A Survey of Clinician Perceptions and Preferences
    • Jun Hou, Lucy Lu Wang
    • In AMIA 2025 Informatics Summit

Experience

  • Graduate Research Assistant
    • Virginia Tech, Blacksburg, VA
    • Sep 2024 - Present
    • Advisor: Dr. Xuan Wang
      • Building personalized LLM agents for healthcare, centered on a patient-centric temporal EHR graph to improve personalization accuracy and clinical relevance.
      • Integrating the graph with retrieval-augmented LLM reasoning for prediction and explanation.
      • Adopted a multimodal mixture-of-experts architecture for efficient and comprehensive clinical prediction tasks using the curated MIMIC-IV dataset, integrating time series, text, imaging, and signal data.
  • Teaching Assistant (Applied Natural Language Processing)
    • Northeastern University, Seattle, WA
    • Sep 2023 - Dec 2023
    • Held office hours, graded student projects and assignments, and collaborated with the professor on course design.
  • Insurance Linked Security Co-Op
    • Amundi US, Boston, MA
    • Jul 2022 - Dec 2022
    • Created an interactive Shiny dashboard to enable user-friendly SQL database management with automated procedures.
  • Research Assistant
    • Columbia University, New York, NY
    • May 2020 - Dec 2020
      • Analyzed and predicted mortality data trends using queries from the SnowFlake Database.
      • Developed a determination model for Mortality Improvement Trends with smoothing and interpolation.

Research Projects

  • Survey for Validation of Explainable AI (XAI) in Clinical NLP
    • Advisor: Dr. Lucy Lu Wang
    • Jun 2023 - Mar 2024
      • Conducted a comprehensive review of XAI methods used in clinical NLP and designed a questionnaire for clinical practitioners.
      • Implemented XAI techniques, including Local Interpretable Model-Agnostic Explanations (LIME) and self-attention visualization on ULMS-BERT and Bio-BERT models.
      • Fine-tuned a BERT-based clinical outcome classification model for semantic textual similarity tasks.
      • Utilized Few-shot In-Context Learning with the GPT API to generate free-text rationales to enhance classification explanations.
  • ESG Sentiment Analysis with Financial NLP
    • Jan 2023 - Apr 2023
      • Analyzed ESG news headlines to discern sentiment, topics, and entity relationships.

Technical Skills

  • Programming & Frameworks: Python, PyTorch, R, Java
  • Database & Data Tools: SQL, Data Analysis Toolkits