The details of SympGAN can be found in our Knowledge-Based Systems paper. Please CITE our paper when SympGAN is useful for your research.

title   = {SympGAN: A systematic knowledge integration system for symptom–gene associations network},
author  = {Lu, Kezhi and Yang, Kuo and Sun, Hailong and Zhang, Qian and Zheng, Qiguang and Xu, Kuan and Chen, Jianxin and Zhou, Xuezhong},
journal = {Knowledge-Based Systems},
volume  = {276},
pages   = {110752},
year    = {2023},

    SympGAN is a high-quality symptom-gene associations networks curated from the integrated relationships from both disease and drug perspectives. The construction of SympGAN filled in the gap and built millions of associations among symptoms, genotypes, diseases and drugs, which has a great prospect to promote precision medicine and health care.

    SympGAN curated a currently most complete set of 12,560 normal symptom terminologies and a high-quality symptom-gene associations with 401,126 records between 3,144 symptoms and 15,625 genes for the first time by manual efforts, Named Entity Recognition filtering with RoBERTa and computational methods from large-scale integrated symptom-related biomedical relationships from multiple databases. The biomedical relationships contains symptom-disease relationships (184,648 records between 2,504 symptoms and 15,317 diseases), symptom-drug relationships (61,731 records between 965 symptoms and 1,159 drugs), drug-target associations (14,115 records between 5,372 drugs and 2,586 targets) and phenotype-genotype associations (404,566 records between 18,928 phenotypes and 19,331 genes).

Symptoms and signs

Symptom-gene relationships 401,126
Disease phenotypes

Disease-gene relationships 404,566
Therapeutic drugs

Symptom-disease relationships 184,648

Symptom-drug relationships 61,731