科研进展
FPLS-DC:基于距离协方差的函数型偏最小二乘方法在影像遗传学中的应用 (潘文亮与合作者)
发布时间:2024-06-21 |来源:

Imaging genetics integrates imaging and genetic techniques to examine how genetic variations influence the function and structure of organs like the brain or heart, providing insights into their impact on behavior and disease phenotypes. The use of organ-wide imaging endophenotypes has increasingly been used to identify potential genes associated with complex disorders. However, analyzing organ-wide imaging data alongside genetic data presents two significant challenges: high dimensionality and complex relationships. To address these challenges, we propose a novel, nonlinear inference framework designed to partially mitigate these issues.

Results

We propose a functional partial least squares through distance covariance (FPLS-DC) framework for efficient genome wide analyses of imaging phenotypes. It consists of two components. The first component utilizes the FPLS-derived base functions to reduce image dimensionality while screening genetic markers. The second component maximizes the distance correlation between genetic markers and projected imaging data, which is a linear combination of the FPLS-basis functions, using simulated annealing algorithm. In addition, we proposed an iterative FPLS-DC method based on FPLS-DC framework, which effectively overcomes the influence of inter-gene correlation on inference analysis. We efficiently approximate the null distribution of test statistics using a gamma approximation. Compared to existing methods, FPLS-DC offers computational and statistical efficiency for handling large-scale imaging genetics. In real-world applications, our method successfully detected genetic variants associated with the hippocampus, demonstrating its value as a statistical toolbox for imaging genetic studies.

Availability and implementation

The FPLS-DC method we propose opens up new research avenues and offers valuable insights for analyzing functional and high-dimensional data. In addition, it serves as a useful tool for scientific analysis in practical applications within the field of imaging genetics research. The R package FPLS-DC is available in Github: https://github.com/BIG-S2/FPLSDC.

Publication:

Bioinformatics (Volume: 40, Issue: 4, April 2024)

https://doi.org/10.1093/bioinformatics/btae173

Author:

Wenliang Pan

Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China

Email: panwliang@amss.ac.cn

Yue Shan

Departments of Biostatistics, Statistics, Genetics, and Computer Science and Biomedical Research Imaging Center, University of North Carolina, Chapel Hill, NC 27599, USA

Chuang Li

Department of Statistical Science, School of Mathematics, Sun Yat-sen University

, Guangzhou, China

Shuai Huang

Departments of Biostatistics, Statistics, Genetics, and Computer Science and Biomedical Research Imaging Center, University of North Carolina, Chapel Hill, NC 27599, USA

Tengfei Li

Departments of Radiology and Biomedical Research Imaging Center, University of North Carolina, Chapel Hill, NC 27599, USA

Yun Li

Departments of Biostatistics, Statistics, Genetics, and Computer Science and Biomedical Research Imaging Center, University of North Carolina, Chapel Hill, NC 27599, USA

Hongtu Zhu

Departments of Biostatistics, Statistics, Genetics, and Computer Science and Biomedical Research Imaging Center, University of North Carolina, Chapel Hill, NC 27599, USA

Departments of Radiology and Biomedical Research Imaging Center, University of North Carolina, Chapel Hill, NC 27599, USA



附件下载:

    联系我们
    参考
    相关文章