CV

Education

Ph.D. in Information Science (Human-Computer Interaction), Cornell University, Aug. 2021 - Dec. 2026
B.A. in Linguistics (Dean’s List), University of California, Berkeley, Aug. 2016 - Dec. 2019

Selected Research

Conversational Dynamics & Multimodal Prediction in Computer-Mediated Communication, Jan. 2022 - Aug. 2024
- Multimodal cues as externalizations of predictive state: Established that prosody, gaze, and facial expression function as externalizations of speakers’ and listeners’ real-time predictive models — when these channels are progressively suppressed (video → audio → transcript), word prediction degrades systematically in both accuracy and sensitivity to affect (COLING 2025; Proc. of the 31st Int’l Conference on Computational Linguistics, pp. 7949–7962)
- Novel computational measure of prediction: Developed an LLM-based method to approximate human next-word prediction in real-time speech, replacing traditional neuroscience paradigms with a scalable corpus-and-LM approach for quantifying prediction failure in conversational turn-taking
- Differential cost of lean channels for non-native speakers: Showed that L2 listeners are disproportionately affected by the loss of multimodal cues — limited L2 exposure and negative affect compound the cost of lean channels, leaving NNS with higher processing load and less reliable prediction than native speakers under the same conditions (CogSci 2025; eScholarship)
- Statistical modeling pipeline: Built reproducible analysis pipelines for lab-collected behavioral data combining GLMMs (lme4, lmerTest), LASSO-regularized regression, and multimodal behavioral coding (ELAN, Praat) across in-lab studies (cumulative N = 300+)
- Theoretical contribution: Identified which cognitive mechanisms break down in which CMC modalities — providing the empirical foundation for XPLAIN’s intervention design
AI-Mediated Communication as Cognitive Scaffolds — XPLAIN, Aug. 2024 - Present
- Theory-driven system vision: Designed a real-time AI scaffold for non-native speakers in online meetings, grounded in the prediction-breakdown findings above; targets cognitive bottlenecks (semantic prediction failure, temporal pressure, comprehension/production trade-offs) at specific stages of speech processing (CHI EA 2026, CSCW 2025, CUI 2026 under review)
- Novel cognitive-fluency metric: Developed a psycholinguistic measure that synchronizes four multimodal behavioral channels — speech disfluencies, gaze shifts, mouse clicks, and facial expressions — with specific speech-processing stages, enabling real-time inference of emerging cognitive trouble
- Phase 1 — Concept validation via three Wizard-of-Oz studies: Designed and ran three lab-based WoZ studies in which a confederate simulated the AI agent’s anticipatory cues, rephrasing, and timing adjustments to validate intervention type, timing, and individual fit:
  - Baseline scaffolding study (N = 38): Evaluated three core scaffolds — lexical clarifications, idea/content suggestions, and topic summaries — in dyadic virtual meetings; demonstrated improvements in 30–80% perceived communicative efficiency, participation, and inclusivity, moderated by individual differences in language proficiency, personality, and prior AI experience (short WIP at CSCW 2025; full manuscript in preparation)
  - Personalization study (N = 40): Tested context-adaptive interventions calibrated to user-level cognitive dimensions; companion latent class analysis on qualitative interview data (n = 29) identified eight two-class groupings across thematic domains (e.g., AI usage and trust, communication strategies, feature-specific evaluations), revealing that user profiles reflect domain-specific adaptation patterns rather than a global typology (CogSci 2026, accepted)
  - Translation study (N = 27): Examined how L1- vs. L2-cued translation scaffolds support real-time turn-taking under domain-specific lexical demand, integrating prediction theory, cognitive load theory, and language-dependent memory theory (manuscript in preparation)
- Phase 2 — Real-time evaluation pipeline for naturalistic conversation: Engineered an evaluation pipeline that ingests synchronized multimodal streams (speech, gaze, mouse-click, facial expression) during live online-meeting conversations and outputs real-time estimates of users’ performance and cognitive states (emerging prediction trouble, processing load, repair initiation); modeled intervention effects with event-centered GLMMs and interaction-state shifts via latent transition models; cumulative 100+ customer-discovery interviews
- Listener perception (pre-registered, in-field): Designed a pre-registered two-phase blind/informed listener-perception study (OSF) measuring listeners’ implicit detection of AI-assisted conversational speech, acoustic cue attribution, and the moderating effect of prior AI-knowledge — testing whether the same multimodal traces that enable AI scaffolding (disfluency, timing, prosody, visible ease/strain) trigger negative attributions of competence and authenticity from native evaluators, surfacing a “double bind” for non-native speakers using AI support
- Translation to product & impact: Selected for the Cornell eLab Spring 2025 cohort with $5,000 prototype-build funding; identified educational and workplace use cases for bridging knowledge gaps and reducing communication-driven social friction
Sustaining Public Goods via Prosocial Behavior, Citizens & Technology Lab (PI: J. Nathan Matias), Cornell University, Jan. 2025 - Present
- Lead statistician on parallel pre-registered field experiments across four Wikipedia language communities (Arabic, German, Persian, Polish; N = 15,558 editors), quantifying the causal effect of peer-to-peer gratitude on upstream reciprocity and sustained volunteer participation
- Built end-to-end causal inference pipelines in R: power analyses, intent-to-treat (ITT) and complier-average causal effect (CACE) estimation, multilevel models with cluster-robust standard errors, sensitivity analyses, and cross-community meta-analytic pooling
- Designed reproducible analysis & visualization workflow handling longitudinal behavioral data (edits, contributions, retention) over multi-month observation windows
- Translated statistical findings into actionable strategies for online community health and social sustainability; manuscript under review at PNAS
Multilingual FrameNet Project, International Computer Science Institute, Aug. 2019 - Dec. 2019
- Studied structures of English lexical databases, ran model trainings on semantic relations, and refined labelings for multilingual alignment

Professional Experience

Founder & Product Lead, XPLAIN (Proactive Meeting AI), Aug. 2024 - Present
- Architected a real-time AI-MC system utilizing LLM-based cognitive scaffolding to mitigate miscommunication in online meetings
- Recruited & directed a 19-person cross-functional research team (UX, Engineering, Data)
- Secured traction via venture showcases & demos via Cornell eLab incubator
Content Strategist (Trust and Safety), ByteDance Inc., Dec. 2020 - Jun. 2021
- Architected a new content moderation framework for sensitive word detection AI models across TikTok, Helo, Fictum (>500M users)
- Established HITL data-labeling standards and QA evaluation metrics for AI safety models
Linguist Intern - Ontology in NLU, Facebook, July - Aug. 2019
- Built the first Mandarin NLU model for Facebook AI Assistant chatbot with culture-specific syntactic and semantic adaptations

Technical Skills

Interactive AI Systems & Prototyping: Wizard-of-Oz concept validation, real-time multimodal evaluation pipelines (synchronized speech, gaze, mouse, facial expression streams) for naturalistic conversation, web-based experiment platforms, full-stack MVP development
AI/LLM Methods: Prompt & context engineering, in-context learning, LLM-based behavioral simulation, surprisal & next-word-prediction extraction from LMs, speech transcription (Whisper), LSTM/sequence-model fine-tuning (PyTorch, CANDOR multimodal corpus), AI agent task design & benchmarking, human-in-the-loop (HITL) evaluation protocols
Quantitative & Statistical Modeling: GLMMs / multilevel models, causal inference (field experiments, parallel design), latent class & latent transition analysis, regression with regularization (LASSO), time-series prediction, event-centered analyses
Qualitative & Mixed-Methods Research: Semi-structured interviews, inductive thematic coding, contextual inquiry, focus groups, multimodal behavioral coding/annotation, customer-discovery interviews (100+), worker co-design
Research Methods: Experimental design, user studies (80+ in-lab), surveys, usability testing, prototyping
Programming & Tools: Python (pandas, NumPy, scikit-learn, statsmodels, OpenAI/Anthropic APIs), R (tidyverse, lme4, lmerTest), SQL, JsPsych, PsychoPy, Praat, ELAN, Qualtrics, Atlas.ti, Figma, Miro, Git/GitHub, LaTeX, JupyterLab

Languages

Mandarin Chinese — Native
English — Fluent
German — B2 (Goethe-Institut PASCH scholar, 2015)

Publications

User Heterogeneity in AI-Mediated Communication: Extending Cognitive Theories via Latent Class Analysis

He, W. P. & Fussell, S. R. (2026). "User Heterogeneity in AI-Mediated Communication: Extending Cognitive Theories via Latent Class Analysis." Proceedings of the 48th Annual Meeting of the Cognitive Science Society (CogSci 2026). Rio de Janeiro, Brazil. (Accepted)

XPLAIN: A Proactive Scaffold Across Speech Processing Stages—Supporting Non-Native Speakers in Real-Time AI-Mediated Turn-Taking

He, W. P. & Fussell, S. R. (2026). "XPLAIN: A Proactive Scaffold Across Speech Processing Stages—Supporting Non-Native Speakers in Real-Time AI-Mediated Turn-Taking." Proceedings of the ACM Conference on Conversational User Interfaces (CUI 2026). (Under review)

Disfluency as a Window into Cognitive Mediation: Psycholinguistic Metrics for Evaluating AI-Integrated Spoken Communication

He, W. P. (2026). "Disfluency as a Window into Cognitive Mediation: Psycholinguistic Metrics for Evaluating AI-Integrated Spoken Communication." Extended Abstracts of the 2026 CHI Conference on Human Factors in Computing Systems. Barcelona, Spain.

Proactivity in Scaffolding Comprehension and Production in Real-Time Turn-Taking: A Case Study of Bridging Communication Gaps for Non-Native Speakers

He, W. P. & Fussell, S. R. (2025). "Proactivity in Scaffolding Comprehension and Production in Real-Time Turn-Taking: A Case Study of Bridging Communication Gaps for Non-Native Speakers." Companion Publication of the 28th ACM Conference on Computer-Supported Cooperative Work and Social Computing (CSCW Companion '25). Bergen, Norway. ACM.

Difference in the Cognitive Mechanism of Predictive Processing in Computer-Mediated Communication: A Comparison Study of L2 Speakers

He, W. P. & Fussell, S. R. (2025). "Difference in the Cognitive Mechanism of Predictive Processing in Computer-Mediated Communication: A Comparison Study of L2 Speakers." Proceedings of the 47th Annual Meeting of the Cognitive Science Society (CogSci 2025). San Francisco, CA.

Exploring Content Predictability in Turn-taking Through Different Computer-Mediated Communications

He, W., MacDonald, C. C., Yoo, Y., Eizayaga, M., Shim, R., Katreczko, L. D., & Fussell, S. R. (2025). "Exploring Content Predictability in Turn-taking Through Different Computer-Mediated Communications." Proceedings of the 31st International Conference on Computational Linguistics (COLING 2025), pages 7949–7962. Abu Dhabi, UAE. Association for Computational Linguistics.

How Thanking Peers Sustains Volunteer Participation in Public Goods: Parallel Field Experiments in Four Wikipedia Language Communities

Matias, J. N., Kamin, J., Al-Kashif, R., He, W. P., Klein, M., & Pennington, E. "How Thanking Peers Sustains Volunteer Participation in Public Goods: Parallel Field Experiments in Four Wikipedia Language Communities." Proc Natl Acad Sci USA (PNAS). (Under review)

Manuscripts In Preparation

He, W. P. & Fussell, S. R. Cognitive Trade-offs Between Speech Disfluencies and Active Engagement from Proactive AI-Mediated Scaffolds in Turn-taking.
He, W. P. & Fussell, S. R. Context-Adaptive Translations for Optimized Processing under Time-Pressured AI-Mediated Turn-Taking for Non-Native Speakers.
He, W. P. & Fussell, S. R. Deploying Cognitive Measures as Personalization Metrics for Proactive AI-Mediated Scaffolds in Turn-taking: A Mixed-Methods Study.

Talks & Presentations

Disfluency as a Window into Cognitive Mediation: Psycholinguistic Metrics for Evaluating AI-Integrated Spoken Communication

April 13, 2026

Conference Presentation at CHI Conference on Human Factors in Computing Systems (CHI EA '26), Barcelona, Spain

Teaching

Teaching Assistant, Cornell University (2022–2025):

INFO 2951 Introduction to Data Science — Spr 2025
INFO 6310 Behavior and Information Technology — Fall 2024
INFO 4450 Computer-Mediated Communication — Spr 2024
INFO 4430 Teams and Technology — Fall 2023
PSYCH 2250 Psychology of Language — Spr 2023
PSYCH 1101 Introduction to Psychology — Fall 2022
HD 4380 Language and Thought — Spr & Fall 2022

Guest Lectures:

INFO 4450 — Content Prediction in the Fluency of Computer-Mediated Communication (Spr 2024)
INFO 6310 — Mechanisms of Content Prediction in Computer-Mediated Communication (Fall 2024)

Student Mentorship (INFO 4900 Independent Study): Coached 25 undergrads and 14 MEng students in research execution and professional development.

Grants & Awards

Entrepreneurship Lab (eLab) Start-Up Fund, Cornell University, 2025 — $5,000
Research & Travel Grant, Cognitive Science Program & Cornell University, 2022, 2024, 2025 — $5,000
SAGE Fellowship, Cornell University, 2021-2023 — ~$65,000/year
Linguistics Society of America Summer Grant, UC Berkeley, 2019 — $1,500
International PASCH Junior Scholarship, Goethe-Institut, Germany, 2015 — $15,000

Service & Leadership

Journal Club Co-Lead, Cognitive Science of Language Lab (PI: Prof. Morten H. Christiansen), Cornell University, 2021 - 2023 — Led weekly discussions on cutting-edge research in psycholinguistics, computational linguistics, and language cognition for two academic years
W.E. Cornell, 2023-24 Cohort — Guided entrepreneurship & leadership program for women in STEM
Vice President, Graduate and Professional Women’s Network, Cornell University, Aug. 2022 - Dec. 2024
Community Engagement Researcher, Psychology, Cornell University, May 2022 - Dec. 2022
Student Mentorship: Cumulatively coached 25 undergrads and 14 engineering masters via Independent Study program

Wanqing (Psyche) He

CV

Education

Selected Research

Professional Experience

Technical Skills

Languages

Publications

User Heterogeneity in AI-Mediated Communication: Extending Cognitive Theories via Latent Class Analysis

XPLAIN: A Proactive Scaffold Across Speech Processing Stages—Supporting Non-Native Speakers in Real-Time AI-Mediated Turn-Taking

Disfluency as a Window into Cognitive Mediation: Psycholinguistic Metrics for Evaluating AI-Integrated Spoken Communication

Proactivity in Scaffolding Comprehension and Production in Real-Time Turn-Taking: A Case Study of Bridging Communication Gaps for Non-Native Speakers

Difference in the Cognitive Mechanism of Predictive Processing in Computer-Mediated Communication: A Comparison Study of L2 Speakers

Exploring Content Predictability in Turn-taking Through Different Computer-Mediated Communications

How Thanking Peers Sustains Volunteer Participation in Public Goods: Parallel Field Experiments in Four Wikipedia Language Communities

Manuscripts In Preparation

Talks & Presentations

Disfluency as a Window into Cognitive Mediation: Psycholinguistic Metrics for Evaluating AI-Integrated Spoken Communication

Proactivity in Scaffolding Comprehension and Production in Real-Time Turn-Taking

Difference in the Cognitive Mechanism of Predictive Processing in Computer-Mediated Communication

Exploring Content Predictability in Turn-taking Through Different Computer-Mediated Communications

Statistical Chunking in Reading: The Modulating Effect of High-Frequency Multi-Word Chunks on Reading Performance

To Chunk or Not to Chunk: Statistical Learning of High-Frequency Word-Marker Pairs

Verbal Tone Interactions in Iquito: An Optimality-Theoretic Analysis

Modeling Tone System: The Role of Dispersion Theory in Optimization

Infant Walking, Social Communicative Abilities, and Vocabulary Increases

Teaching

Grants & Awards

Service & Leadership