基于灵溪数据集的论文被顶会AAAI 2025录用

Author:TICC编辑部
Click:53
Date:2024年12月10日

a9c4bd6dfaac70e46994ff7a2a43edc1.png

MDD-5k: A New Diagnostic Conversation Dataset for Mental Disorders Synthesized via Neuro-Symbolic LLM Agents



The clinical diagnosis of most mental disorders primarily relies on the conversations between psychiatrist and patient. The creation of such diagnostic conversation datasets is promising to boost the AI mental healthcare community. However, directly collecting the conversations in real diagnosis scenarios is near impossible due to stringent privacy and ethical considerations. To address this issue, we seek to synthesize diagnostic conversation by exploiting anonymous patient cases that are easier to access. Specifically, we design a neuro-symbolic multi-agent framework for synthesizing the diagnostic conversation of mental disorders with large language models. It takes patient case as input and is capable of generating multiple diverse conversations with one single patient case. The framework basically involves the interaction between a doctor agent and a patient agent, and achieves text generation under symbolic control via a dynamic diagnosis tree from a tool agent. By applying the proposed framework, we develop the largest Chinese mental disorders diagnosis dataset MDD-5k, which is built upon 1000 cleaned real patient cases by cooperating with a pioneering psychiatric hospital, and contains 5000 high-quality long conversations with diagnosis results as labels. To the best of our knowledge, it's also the first labelled Chinese mental disorders diagnosis dataset. Human evaluation demonstrates the proposed MDD-5k dataset successfully simulates human-like diagnostic process of mental disorders.

c2174040d961ac7f46be04684f565f91.png


由盛大 Theta 殷聪驰、李峰、张澍、邵骏、姜迅,以及 TCCI 人工智能与精神健康前沿实验室研究员陈剑华共同发表的研究论文 《MDD-5k: A New Diagnostic Conversation Dataset for Mental Disorders Synthesized via Neuro-Symbolic LLM Agents》 成功入选第 39 届 AAAI 人工智能国际会议 (AAAI-25)。这是一项全球人工智能领域的顶级盛会,将于 2025 年 2 月 25 日至 3 月 4 日在美国费城举行。

本论文提出了一种基于大语言模型的多智能体框架,依据匿名精神疾病患者的病例信息,通过构建动态诊断树模拟人类医生的诊断过程,合成了高质量精神科疾病的诊断对话,以解决实际诊断场景中数据获取的隐私和伦理挑战。我们开发了全球最大的中文精神科疾病诊断数据集 MDD-5k,包含 5000 例医生与患者之间的多轮对话。这一成果为人工智能在精神健康领域的研究与应用提供了全新工具和实践基础。

Contact Us
Please complete and submit the inquiry form, and we will get back to you within 24 business hours.
* Name
* Phone
* Email Address
* Company/Institution Name
* Contact Address
* Client need
* CAPTCHA Code :

Please carefully review our Privacy Policy. We collect your personal information solely to establish contact and provide better services. By checking the box, you confirm that you have read and agree to the terms and conditions outlined in the Privacy Policy.


We use cookies to personalize and enhance your browsing experience on our website By clicking "Accept all cookies", you consent to the use of cookies You can read our Cookie Policy for more information.