
Bihai Datasets

Lingxi Datasets

Building the Data Foundation for Human Health
Upholding the philanthropic mission of the Tianqiao and Chrissy Chen Institute (TCCI), we are creating a high-quality
data-sharing platform in the field of human health. Our goal is to empower scientific research and AI innovation
through data and ultimately benefit humanity.
01. Target Users
Researchers, AI engineers, data scientists, technical developers, and industry professionals.
02. Core Product
High-quality, scarce clinical datasets related to human health, particularly brain health.
03. Understanding Needs
Provide customized data processing and value-added services for data providers, offer one-stop solutions for data consumers, and enable data collection and analysis applications for technology partners.
04. Comprehensive Advantages
A professional, interdisciplinary research and data team with extensive experience in scientific research and data, delivering comprehensive solutions.
Premium Datasets
We offer high-quality datasets, either self-built or authorized by partners, to support your research and development efficiently.



01. Lingxi: Depression & Anxiety Speech Consultation Dataset
4,500 clinical patient consultation dialogues with speech and text.
Matched with complete medical records, including diagnostic labels, medical history, medication, and psychiatric examinations.
Covers depression, anxiety, bipolar disorder, and sleep disorders.
Supports next-generation large language model training, diagnostic AI development, and empathetic AI models.
02. Galaxy: Aging Cohort from Economically Developed Chinese Regions
Over 4,000 participants aged 50 and above, with 10 years of complete follow-up and an ongoing 15-year study.
350 variables, 20,000 total data entries, and 12,000 ml of biological samples.
Covers lifestyle, chronic diseases, neuropsychological tests, biomarkers, etc.
Applicable for cognitive impairment prediction, disease progression analysis, and algorithm development.
03. Bihai: Intracranial Epilepsy Dataset
High-sampling-rate iEEG data from 100 epilepsy patients.
Average continuous recording duration of 14 days per patient.
Includes detailed electrode potentials, expert-annotated events, medical history, and follow-up information.
Data Services
We provide comprehensive data collection, cleaning, annotation, and analysis services for partners. By combining advanced algorithms with expert annotation, we help partners obtain high-quality, multimodal brain science data and ensure its accuracy and integrity. These end-to-end data service capabilities enable partners to tackle complex challenges in brain science and accelerate research progress and technological innovation.
Access Data Services
Technical Collaboration
We provide advanced technical tools that help you acquire high-quality data quickly and conveniently and build superior brain science datasets. With our customized technical solutions, you can collect, process, and analyze data efficiently and reach your research goals sooner.
Inquire About Technical Collaboration
Supporting Diverse Applications
We are committed to supporting partners across various application scenarios to ensure seamless integration of scientific research with practical applications.
By unlocking the value of data, we accelerate scientific progress and drive the translation of research outcomes.
Driving Scientific Research
We leverage high-quality life science data and advanced technology tools to advance in-depth exploration and breakthroughs in life sciences.
Supporting Technological Innovation
We apply scientific data to improve human health, develop new technologies, and support drug discovery, accelerating technological innovation and product evolution through digital intelligence.
Training AI Models
With extensive scientific data and research expertise, we support the development and training of foundational AI models in life sciences, fostering a new AI-driven research paradigm.
To foster scientific collaboration and information sharing, we partner with top global hospitals, universities, and research institutions, using data as the foundation to drive breakthroughs and innovations in life sciences.







