Bingsheng "Arthur" Yao


profile-square.jpeg

- I am on the job market this year looking for tenured-track assistant professor position.
- I am seeking self-motivated research assistants who are interested in working with me on NLP/HCI research projects. Please email (b [dot] yao [at] northeastern [dot] edu) me with your CV, your research interest, and tell me if there’s any of my previous papers you are interested in (and why).

About Me

I am a postdoc associate at Northeastern University (PI: Prof. Dakuo Wang). Before joining Northeastern, I got Ph.D. in Computer Science from Rensselaer Polytechnic Institute (Advisor: Prof. Jim Hendler). My interdisciplinary background in Natural Language Processing (NLP) and Human-Computer Interaction (HCI) enables me to design use-inspired AI systems that responsibly support real-world stakeholders’ collaboration, empowering human-AI collaboration. Specifically, I propose human-centered NLP, a framework comprising three critical pillars: (i) uncovering real-world stakeholders’ collaboration workflow, (ii) developing domain-specific NLP models in low-resource environments, and (iii) designing human-centered systems into stakeholders’ workflow.

Research

I. Uncovering Real-World Stakeholders’ Needs and Challenges

Medical and Healthcare
[In Submission] Challenges and AI Potential for Post-Treatment Cancer Patient-Provider Communication
[CHI ‘23] Human-AI Collaboration in Sepsis Early Diagnosis

Children Education
[CSCW ‘24] Parent’s Need for Children Storytelling

Interdisciplinary Team Collaboration
[In Submission] Interdisciplinary Team Collaboration Using Activity Theory

II. Developing Domain-Specific NLP Models in Low-Resource Environments

Domain-Adaptation Methodologies
[NAACL ‘24] In-Context Sampling for Robust Domain-Adaptive LLMs
[EMNLP ‘23] Active Learning Framework with Human Natural Language Explanations

Medical and Healthcare
[IMWUT ‘24] Mental-LLM: Comprehensive Exploration of Domain-Specific LLM for Mental Health Prediction

Children Education
[EMNLP ‘24] StorySparkQA Dataset for Story-Based Real-World Knowledge
[ACL ‘22] Expert-Annotated FairytaleQA Dataset
[ACL ‘22] QA-Pair Generation Pipeline for Children Reading Comprehension


III. Embedding Human-Centered Systems into Stakeholders’ Workflow

Medical and Healthcare
[IMWUT ‘24] Talk2Care: LLM-Based Voice Assistant for Older Adults-Provider Communication
[In Submission] AI-Based Multi-Modal Remote Patient Monitoring and Risk Prediction for Cancer Treatment-Induced Cardiotoxicity

Children Education
[CHI ‘22] StoryBuddy: Interactive Storytelling Chatbot System

Service

I have served on program committees for various top conferences and journals:

Area Chair

ACL ARR (Since Jun. ‘24)
CHI ‘25, CSCW ‘25

Organizing Committee

CHI ‘24 Special Interest Group - Human-Centered Privacy Research in the Age of Large Language Models
CSCW ‘24 Workshop - Challenges and Opportunities of LLM-Based Synthetic Personae and Data in HCI

Reviewer

Conferences: ACL ARR (Aug. ‘23, Oct. ‘23, Dec. ‘23), EMNLP ‘23, NAACL ‘24, CHI ‘24, IUI ‘24, IMWUT ‘24
Journals: Natural Human Behavior, IJHCI, IJHCS

Note

My apologies for the inconvenience that the publication list and CV on my website are usually not up-to-date – I’m trying, but honestly this is more time consuming than writing grant proposals. Please refer to my Google Scholar for latest update.

The best way to reach out is through emails b [dot] yao [at] northeastern [dot] edu.


News

2024.10 Our paper StorySparkQA Dataset with Real-World Knowledge for Children Education was accepted to EMNLP 2024
2024.09 Our paper Secret Use of Large Language Models was accepted to CSCW 2025
2024.08 Our paper Parent’s Needs for Preschoolers’ Storytelling and Reading Activities was accepted to CSCW 2024
2024.07 Our paper Early Sepsis Prediction with Uncertainty Quantification and Active Sensing was accepted to KDD 2024
2024.04 Our paper LLM-based Voice Assistant for Remote Communication between Older Adults and Care Providers was accepted to IMWUT 2024
2024.03 First-authored paper In-Context Sampling Strategy for Reliable LLM Prompting was accepted to NAACL 2024 Findings
2024.03 Guest talk at USC titled “Bridging AI Research and Real-world Scenarios”. Thanks Prof. Yao Du for the invitation!
2024.02 I am joining Prof. Dakuo Wang’s Human-Centered AI Lab at Northeastern University as a postdoc associate!
2024.01 Two of our papers, Human-AI Collaboration in Sepsis Diagnosis and User’s Sensitive Disclosure with LLM were accepted to CHI 2024
2024.01 I passed the Ph.D. dissertation defense. My deepest gratitude to all those who supported and helped me, especially Prof. Jim Hendler and Prof. Dakuo Wang
2023.12 Our paper Mental-LLM was accepted to IMWUT 2024
2023.10 Our paper, Discourse Framework for Science Journalism, was accepted to EMNLP 2023, and another first-authored paper, Active Learning Empowered by Natural Language Explanations, was accepted to EMNLP 2023 Findings
2023.07 First-authored paper Objective Evaluation of Human Explanations was accepted to ACL 2023 for Oral Presentation
2022.05 Two first-authored papers, QA-Pair Generation for Story Books and FairytaleQA Dataset were accepted to ACL 2022
2022.04 Our paper StoryBuddy was accepted to CHI 2022
2021.09 Our paper Narrative Open-Domain QA Techniques was accepted to TACL (2021) 9
2020.03 Our paper Trust in AutoML was accepted to IUI 2020

Publications

2024

  1. Exploring Parent’s Needs for Children-Centered AI to Support Preschoolers’ Storytelling and Reading Activities
    Yuling Sun, Jiali Liu, Bingsheng Yao , Jiaju Chen, Dakuo Wang, and 4 more authors
    arXiv preprint arXiv:2401.13804, 2024
  2. Who Changed the Destiny of Rural Students, and How?: Unpacking ICT-Mediated Remote Education in Rural China
    Yuling Sun, Xiuqi Zhu, Xiaomu Zhou, Bingsheng Yao , Kai Zhang, and 3 more authors
    arXiv preprint arXiv:2401.13799, 2024

2023

  1. More Samples or More Prompt Inputs? Exploring Effective In-Context Sampling for LLM Few-Shot Prompt Engineering
    Bingsheng Yao , Guiming Chen, Ruishi Zou, Yuxuan Lu, Jiachen Li, and 4 more authors
    arXiv preprint arXiv:2311.09782, 2023
  2. Human Still Wins over LLM: An Empirical Study of Active Learning on Domain-Specific Annotation Tasks
    Yuxuan Lu, Bingsheng Yao , Shao Zhang, Yun Wang, Peng Zhang, and 3 more authors
    arXiv preprint arXiv:2311.09825, 2023
  3. FairytaleCQA: Integrating a Commonsense Knowledge Graph into Children’s Storybook Narratives
    Jiaju Chen, Yuxuan Lu, Shao Zhang, Bingsheng Yao , Yuanzhe Dong, and 5 more authors
    arXiv preprint arXiv:2311.09756, 2023
  4. " Mango Mango, How to Let The Lettuce Dry Without A Spinner?”: Exploring User Perceptions of Using An LLM-Based Conversational Assistant Toward Cooking Partner
    Szeyi Chan, Jiachen Li, Bingsheng Yao , Amama Mahmood, Chien-Ming Huang, and 3 more authors
    arXiv preprint arXiv:2310.05853, 2023
  5. LLM-Powered Conversational Voice Assistants: Interaction Patterns, Opportunities, Challenges, and Design Guidelines
    Amama Mahmood, Junxiang Wang, Bingsheng YaoDakuo Wang, and Chien-Ming Huang
    arXiv preprint arXiv:2309.13879, 2023
  6. Talk2Care: Facilitating Asynchronous Patient-Provider Communication with Large-Language-Model
    Ziqi Yang, Xuhai Xu, Bingsheng Yao , Shao Zhang, Ethan Rogers, and 4 more authors
    arXiv preprint arXiv:2309.09357, 2023
  7. " It’s a Fair Game”, or Is It? Examining How Users Navigate Disclosure Risks and Benefits When Using LLM-Based Conversational Agents
    Zhiping Zhang, Michelle Jia, Bingsheng Yao , Sauvik Das, Ada Lerner, and 3 more authors
    arXiv preprint arXiv:2309.11653, 2023
  8. Rethinking Human-AI Collaboration in Complex Medical Decision Making: A Case Study in Sepsis Diagnosis
    Shao Zhang, Jianing Yu, Xuhai Xu, Changchang Yin, Yuxuan Lu, and 6 more authors
    arXiv preprint arXiv:2309.12368, 2023
  9. Mental-LLM: Leveraging Large Language Models for Mental Health Prediction via Online Text Data
    Xuhai Xu, Bingsheng Yao , Yuanzhe Dong, Saadia Gabriel, Hong Yu, and 4 more authors
    arXiv preprint arXiv:2307.14385, 2023
  10. Beyond Labels: Empowering Human Annotators with Natural Language Explanations through a Novel Active-Learning Architecture
    Bingsheng YaoIshan JindalLucian PopaYannis Katsis, Sayan Ghosh, and 6 more authors
    In Findings of the Association for Computational Linguistics: EMNLP 2023, Dec 2023
  11. Are Human Explanations Always Helpful? Towards Objective Evaluation of Human Natural Language Explanations
    Bingsheng YaoPrithviraj SenLucian PopaJames Hendler, and Dakuo Wang
    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jul 2023
  12. ‘Don’t Get Too Technical with Me’: A Discourse Structure-Based Framework for Automatic Science Journalism
    Ronald Cardenas, Bingsheng YaoDakuo Wang, and Yufang Hou
    In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Dec 2023

2022

  1. It is AI’s Turn to Ask Humans a Question: Question-Answer Pair Generation for Children’s Story Books
    Bingsheng Yao*Dakuo Wang*Tongshuang Wu, Zheng Zhang, Toby Li, and 2 more authors
    In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), May 2022
  2. Fantastic Questions and Where to Find Them: FairytaleQA – An Authentic Dataset for Narrative Comprehension
    Ying Xu*Dakuo Wang*Mo Yu*, Daniel Ritchie*, Bingsheng Yao* , and 13 more authors
    In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), May 2022
  3. StoryBuddy: A Human-AI Collaborative Chatbot for Parent-Child Interactive Storytelling with Flexible Parental Involvement
    Zheng Zhang*, Ying Xu*, Yanhao Wang, Bingsheng Yao , Daniel Ritchie, and 4 more authors
    In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems, May 2022
  4. Nece: Narrative Event Chain Extraction Toolkit
    Guangxuan Xu*, Paulina Toro Isaza*, Moshi Li*, Akintoye Oloko, Bingsheng Yao , and 5 more authors
    arXiv preprint arXiv:2208.08063, May 2022
  5. GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
    Sebastian Gehrmann, Abhik Bhattacharjee, Abinaya Mahendiran, Alex Wang, Alexandros Papangelis, and 72 more authors
    In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Dec 2022
  6. A Corpus for Commonsense Inference in Story Cloze Test
    Bingsheng Yao , Ethan Joseph, Julian Lioanag, and Mei Si
    In Proceedings of the Thirteenth Language Resources and Evaluation Conference, Jun 2022
  7. Efficient Long Sequence Encoding via Synchronization
    Xiangyang MouMo YuBingsheng Yao , and Lifu Huang
    arXiv preprint arXiv:2203.07644, Jun 2022

2021

  1. Narrative Question Answering with Cutting-Edge Open-Domain QA Techniques: A Comprehensive Study
    Xiangyang Mou*, Chenghao Yang*, Mo Yu*Bingsheng Yao , Xiaoxiao Guo, and 2 more authors
    Transactions of the Association for Computational Linguistics, Jun 2021
  2. Building a Storytelling Conversational Agent Through Parent-AI Collaboration
    Zheng Zhang*, Ying Xu*, Yanhao Wang, Tongshuang WuBingsheng Yao , and 4 more authors
    Jun 2021

2020

  1. Frustratingly Hard Evidence Retrieval For QA Over Books
    Xiangyang MouMo YuBingsheng Yao , Chenghao Yang, Xiaoxiao Guo, and 2 more authors
    arXiv preprint arXiv:2007.09878, Jun 2020
  2. Trust in AutoML: Exploring Information Needs for Establishing Trust in Automated Machine Learning Systems
    Jaimie Drozdal, Justin Weisz, Dakuo Wang, Gaurav Dass, Bingsheng Yao , and 4 more authors
    In Proceedings of the 25th International Conference on Intelligent User Interfaces, Jun 2020