Schizophrenia Prediction Using Integrated Imaging Genomic Networks
Volume 2, Issue 3, Page No 702–710, 2017
Adv. Sci. Technol. Eng. Syst. J. 2(3), 702–710 (2017);
DOI: 10.25046/aj020390
Keywords: Classification, Data Integration, Network Fusion, Imaging Genomic Data
In order to increase the diagnosis accuracy of schizophrenia (SCZ) disease, it is essential to comprehensively employ complementary information from multiple types of data. It is well known that a network is a general method for analyzing relationships between patients, with its nodes representing patients and its edges showing relationships between them. In this study, we constructed a fused network using three types of data including genetic, epigenetic and neuroimaging data from a study of schizophrenia patients. We developed a network-based prediction approach taking advantage of the whole network of patients rather than just individual clusters in the network. The majority neighborhood of a node in the network was exploited as a feature for discriminating SCZ from healthy controls. Compared with other 9 graph-based label prediction methods, our network-fusion based label prediction method shows the best performance according to the prediction accuracy. The prediction power of our proposed method was also tested under different parameters settings and an optimal parameter was found for achieving the best performance. The method is also computationally efficient and can be extended to identify other clinical outcomes.
1 Introduction
This paper is an extension of work originally presented in the IEEE International Conference on Bioinformatics and Biomedicine (BIBM 2016) [1].
Currently, there is a lack of objective ways for the diagnosis of schizophrenia (SCZ) [2]. It is generally diagnosed according to criteria set in either the American Psychiatric Associations fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM 5), or the World Health Organizations International Statistical Classification of Diseases and Related Health Problems (ICD-10). These criteria use the self-reported experiences of the patients and report abnormalities in their behaviors, followed by a clinical assessment by a mental health professional. However, symptoms associated with schizophrenia occur along a continuum in the population and must reach a certain severity before a diagnosis is made [3]. Although in psychiatry as in all of general medicine there is an irreducible element of the subjective [4], we should try our best to reduce the subjective part of medical and psychiatry practice. This goal is well recognized by the National Institutes of Mental Health as well [5].
Like many other complex diseases, SCZ is caused by a combination of genetic, biological, and environmental factors, creating a difficult challenge for diagnosis and defining subtypes. In addition, SCZ is a highly heritable disorder [6]. The current belief is that there are a number of genes that contribute to susceptibility or pathology of SCZ, but none of them exhibits full responsibility for the disease. It is believed that SCZ is caused by a number of genetic and environmental factors [7].
Recently, high-throughput studies are rapidly accumulating a wealth of ’omics’-scale data, which enable comprehensive monitoring of a biological system. Meanwhile, there are remarkable medical images accumulated in hospitals, such as images from functional magnetic resonance imaging (fMRI). fMRI is a fairly new tool that has been used to measure brain activation utilising the dependency of the magnetic properties of haemoglobin on the amount of oxygen it carries. Blood-oxygen-level-dependent (BOLD) signals measure the alterations in cerebral blood flow that mark functional brain activity [8]. The intrinsic BOLD contrast makes fMRI an invaluable noninvasive instrument for the investigation of the underlying physiological disturbances that lead to manifest psychiatric disorders. It is anticipated that successful application and analysis of fMRI in neurological disorders can be used to characterise and diagnose mental illnesses such as Alzheimers disease, SCZ, bipolar disorder, mild traumatic brain injury and addiction. It is a valuable way to combine genetic, epigenetic and imaging data for the diagnosis of SCZ disease. Many studies have demonstrated that the integration of fMRI and SNP information will give a more comprehensive analysis of schizophrenia. For example, Yang et al. [9] proposed a hybrid machine learning method to identify schizophrenia patients from healthy controls by combining genetic and fMRI data, and this method achieved better classification accuracy than using either data alone. Castro et al. [10] developed a multiple kernel learning approach that employed both the phase and the magnitude of complex-valued fMRI data for the classification, showing improved classification accuracy compared with using only the magnitude of fMRI data. Cao et al. proposed a sparse representation clustering (SRC) model [11, 12] to simultaneously select SNPs and fMRI voxels as biomarkers for schizophrenia followed by a classifier such as support vector machine (SVM). The integrative model was tested on 20 SCZ patients and 20 healthy controls, demonstrating that the complementary information from both fMRI and SNP data can be utilized to improve diagnosis [13].
The above mentioned SCZ studies, however, only used two types of data: SNPs and fMRI data. DNA methylation is one of several epigenetic mechanisms and can be used by cells to control gene expression. There are a number of mechanisms controlling gene expression in eukaryotes, but DNA methylation is a frequently used epigenetic signaling tool, which can fix genes in the “off “ position [14]. In addition, the integration methods used in the above mentioned research directly used features extracted from data as predictors. Alternatively, network-based classification methods mainly employ the topological features as predictors. With the rapid development of systems biology, network-based classification or prediction methods have been widely used [15, 16, 17, 18, 19, 20, 21].
In our study, we combined three types of data including SNPs, DNA methylation and fMRI to construct networks, which were subsequently be fused for diagnosis of SCZ. Specifically, for each type of data, we first constructed a patient network, called a single network. Thus for three types of data, we constructed three single networks. Then we fused two or three single patient networks into one subject network. As a result, we created four fused networks: three fused networks from the combinations of two data types and one fused network from three data types. Finally, we employed the neighborhood majority of the nodes in the network to predict unknown labels, that is, identifying whether a subject is a schizophrenia patient.
2 Data Preparation
In this study, participant recruitment and data collection were conducted at the Mind Research Network. Three types of data (SNP, DNA methylation and fMRI) were collected from 208 subjects including 96 schizophrenia patients (age: 34 ± 11, 22 females) and 112 healthy controls (age: 32 ± 11, 44 females). All of them were provided written informed consents. Healthy participants were free of any medical, neurological or psychiatric illnesses and had no history of substance abuse. By the clinical interview of patients for DSM IV-TR Disorders or the Comprehensive Assessment of Symptoms and History, patients met criteria for DSM-IV-TR schizophrenia. Antipsychotic history was collected as part of the psychiatric assessment. After a series of quality controls, we selected 184 subjects, including 80 SZ cases (age: 34 ± 11, 20 females and 60 males) and 104 healthy controls (age:32 ± 11, 38 females and 66 males). After pre-processing, 27,508 DNA methylation sites, 41,s236 fMRI voxels and 722,177 SNP loci were obtained for the subsequent biomarker selections [22, 23, 24, 25, 26, 27].
2.1 SNPs data collection and preprocessing
A blood sample was obtained for each participant and DNA was extracted. Genotyping for all participants was performed at the Mind Research Network using the Illumina Infinium HumanOmni1-Quad assay covering 1,140,419 SNP loci. Bead Studio was used to make the final genotype calls. Next, the PLINK software
package(http://pngu.mgh.harvard.edu/∼purcell/plink) was used to perform a series of standard quality control procedures, resulting in the final dataset spanning 722,177 SNP loci. Each SNP was categorized into three clusters based on their genotype and was represented with discrete numbers: 0 for ‘BB‘(no minor allele), 1 for ‘AB‘ (one minor allele) and 2 for ‘AA‘ (two minor alleles).
2.2 DNA methylation data collection and preprocessing
DNA from blood samples was assessed by the Illumina Infinium Methylation27 Assay. A methylation value, beta (β), represents the ratio of the methylated probe intensity to the total probe intensity. A series of quality controls (QC) on the beta values were applied to remove bad samples and probes, such as 1) Beta value QC: Change any beta value to NaN, if p>0.05. 2) Bad sample/ bad marker removing: Samples with >5% of missing (NaN) values; markers with >5% of missing (NaN) values. This resulted in the identification of good methylation data from 224 subjects, 27,508 markers (some have missing values <5%). After QC, we used the K nearest neighbor (KNN) method [28]to impute for the missing values.
2.3 fMRI data collection and preprocessing
The fMRI data were collected during a sensorimotor task, a block-design motor response to auditory stimulation. During the on-block, 200msec tones were presented with a 500 msec stimulus onset asynchrony (SOA). A total of 16 different tones were presented in each on-block, with frequency ranging from 236 Hz to 1318 Hz. The fMRI images were acquired on Siemens 3T Trio Scanners and a 1.5T Sonata with echo-planar imaging (EPI) sequences using the following parameters (TR = 2000 msec, TE = 30 msec (3.0T)/40 msec (1.5T), field of view = 22cm, slice thickness = 4mm, 1mm skip, 27 slices, acquisition matrix = 64 × 64, flip angle =90◦.). Data were pre-processed in SPM5 (http://www.fil.ion.ucl.ac.uk/spm) and were realigned, spatially normalized and resliced to 3×3× 3mm3, smoothed with a 10×10×10mm3 Gaussian kernel to reduce spatial noise, and analyzed by multiple regression considering the stimulus and their temporal derivatives plus an intercept term as regressors. Finally the stimulus-on versus stimulus-off contrast images were extracted with 53×63×46 voxels and all the voxels with missing measurements were excluded.
3 Methods
3.1 Overview
We employed a network-based semi-supervised learning (SSL) scheme, which improves the predictive power by using unlabeled data [29, 30, 31, 32]. It is compuationally efficient and the learning time depends nearly linearly on the number of network edges. Also, the accuracy is comparable with other methods such as kernel-based methods with a longer learning time [18, 33]. Moreover, the network structure can facilitate the interpretation of gene-gene interactions and/or region- region connections in the brain [34, 35, 36], which is one of the advantages of network-based SSL.
Based on a fused network, our label prediction method mainly consists of three procedures: network construction, network fusion and label prediction using network-derived features.
3.2 Network Construction
Based on the features selected from SNPs, DNA methylation and fMRI data, a subject network can be constructed. Suppose we have n samples and m measurements (e.g., DNA methylation). A subject similarity network is represented as a graph G = (V,E). The vertices V correspond to the subjects {x1,x2,…,xn} and the strengths of the edges E are the weighted value of the similarity between subjects. The edge weights are represented by an n × n similarity matrixW, with each Wij indicating the similarity between subjects xi and xj. ρ(xi,xj) is represented as the Euclidean distance between subjects xi and xj. In order to calculate edge strength, a Gaussian function was taken on the Euclidean distance between subjects:
Wij = exp(− σi,x2j)2), if i ∼ j (1) ρ(x
0. otherwise
0
If i is in j s K-nearest-neighborhood and vice versa, nodes i,j can be connected by an edge. In our study, we constructed three individual networks for SNPs, DNA methylation, and fMRI data.
3.3 Network Fusion
We fuse multiple networks into one network, using mean fusion in our study. Multiple networks, denoted by Gm = (V m,Em,wm),m = 1,2,3,…,N, are constructed from multiple types of data. They have the same network nodes and the edge strength in the fused network is the mean value of edge strength of all individual networks.
G = (V ,E,w), with V = ∪mV m,E = ∪mEm,
N and w(i,j) = 1 Xwm(i,j). (2)
N
m=1
Using this graph fusion method, we fused two or three single patient networks into one subject network. As a result, we created four fused networks: three fused networks from the combinations of two data types and one fused network for three data types.
3.4 Label Prediction Based on Neighborhood Majority
A network is a map of interactions, with the links measuring the association between objects or subunits. A labeled subject is marked either by 0 or 1, representing two possible labels: healthy or disease, respectively. The edges have an essential role in influencing the propagation between the subjects to predict the true label of the unknown subject. We assumed that the labels of two subjects are more likely to be the same if the two subjects are more closely related to each other. Therefore, the labels can be predicted based on the similarities between subjects with their genomic profiles and/or imaging features. Edges in network-based SSL represent the similarities between subjects that can be extracted from different data including SNPs, DNA methylation and fMRI data
(Figure).
Network-based classification methods can be classified into two types [21]: direct annotation schemes, which infer the class label of a node based on its connections in the network, and module-assisted schemes, which first identify modules of related nodes and then annotate each module based on the known labels of its members. Neighborhood counting is a simple and direct method for label prediction, which determines the label of a subject based on the known label of subjects lying in its immediate neighborhood. Here, we used neighborhood majority to predict the unknown labels. We first calculated the neighborhood majority of a query subject. Then, the subject is assigned the same type of label as the one of its immediate neighbors with greatest neighborhood majority (Figure).
For a query subject i, the majority of its immediate neighbors can be defined as:
Nk
X
Mk = Wij (3)
j=1
where k represents subject class index and Nk is the number of subjects in class k. Then the label of subject i is defined as:
y(i) = y(arg maxMk) (4) k
4 Results and Discussions
We employed a network-based classification method to predict whether new subjects are schizophrenia patients. After preprocessing the three types of collected data: SNPs, DNA methylation and function MRI data, we chose 184 subjects with all three types of data. We used 5-fold cross validation to evaluate our prediction method. A two-step feature selection method was applied to the labeled data sets. At first, for SNPs and DNA methylation data sets, we preselected the features based on the genes in KEGG pathways and this procedure yielded 14,875 SNPs and 6,935 DNA methylation sites. Then, for three types of data, we utilized the t-test to select the significantly differential features (p≤0.01) between normal and disease status. The preselected features were used to construct a subject-subject network. A single-type network was built for each data type. That is, three single-type networks were generated. Next, a network fusion method was used to build fused networks from two or three single-type networks. Thus we can get four fused networks: three fused networks from the pairwise combinations of two data types and one fused network from three data types. In our previous study [37], it has been shown that the technique of data fusion will improve the classification performance of network based classifiers. Therefore, in this paper, we focus on the study of the fused network from three types of data. Based on the fused network, we utilized nodes’ features in a network to predict the label of a testing subject. This process is illustrated in Figure. Our prediction method is called MMN, which is a majority-neighborhood-based classification method by mean fusion.
We also compared the performance of our model with that of several other network based classification methods, with fixed parameter settings (σ2 = 510,andK = 20). We compared 10 network based methods, whose differences consist in the procedure of network fusion and the algorithm of classification. For network fusion, two alternative methods are the mean fusion as used by our model, and Similarity Network Fusion (SNF) [38]. For classification procedure, we used five different classification methods: majority-neighborhood based classification (MN), which is adopted by our model, spectral clustering (SPC) [39, 40], support vector machine (SVM) [41], local and global consistency (LGC) [32], and label propagation (LP) [42]. SPC is a module-assisted classification approach based on spectral clustering, which is efficient in capturing global structure of a graph [43]. SVM utilizes 11 node features extracted from the network for label classification, and these 11 node features are: betweenness centrality, closeness centrality, degree centrality, Bonacich Power centrality scores, the (Harary) graph centrality, information centrality, the load centrality, the vertex prestige scores, the stress centrality, and clustering coefficients [44]. LGC utilizes both the local and the global consistency of a new patient in a patient-patient network for classification. LP predicts the clinical outcome of the new patient based on a network structure called label propagation. Thus, we have 10 different models, which are MMN (our model), SMN, MSPC, SSPC, MSVM, SSVM, MLGC, SLGC, MLP, and SLP, respectively. To evaluate the performances of these 10 network-based label prediction methods, we computed some widely used metrics, e.g., true positive rate (TPR), true negative rate (TNR), negative predictive value (NPV), false positive rate (FPR), false negative rate (FNR), false discovery rate (FDR), accuracy (ACC), and F1 (F1 = 2TP/2TP+FP+FN) of each method. The classification performances of all these models are listed in Table 1.
From these results it is evident that our model, MMN performs significantly better than other network based classification methods, in terms of some key metrics, e.g. ACC, FDR, etc. Also, it seems that those mean fusion based methods (MLP, MLGC, MMN, MSPC, MSVM) works slightly better than thef corresponding SNF based methods (SLP, SLGC, SMN, SSPC, SSVM), which demonstrates the advantage of mean fusion over SNF. In summary, mean fusion is more suitable than SNF for classification, and MMN works significantly better than other graph-based
Table 1: The performances of different graph-based label prediction methods
Table 2: The performances of our label prediction method MMN with different σ2(K = 20)
|
methods.
At first, we want to compare two different network fusion methods: mean fusion and SNF [38]. For each classification approach, we used both network fusion methods. So according to the network fusion, the 10 prediction methods implemented in our study are all grouped into pairwise comparisons. For example, in SLP and MLP, the first letter S in the method symbol SLP denotes SNF fusion method and the first M in the method symbol MLP means mean fusion approach. It is the same for other pairs of prediction methods:
Table 3: The performances of our label prediction method MMN with different K(σ2 = 510)
|
SLGC and MLGC, SMN and MMN, SSPC and MSPC, SSVM and MSVM. Table 1 shows the performances of different prediction algorithms between two network fusion methods. As can be seen in Table 2, for the prediction methods based on LP and LGC, there are no differences in performances between two network fusion approaches. For SSPC and MSPC, the only one group of module-assisted prediction methods, their accuracies are close to each other and neither got good performance. The results are similar to another pair of prediction methods: SSVM and MSVM. The major difference between SPC-based pair is that the mean-fusion-based method MSVM obtained higher accuracy than SNF-based method SSVM. However, the comparison results between SMN and MMN differ greatly from other graph-based prediction method pairs. For each of the performance metrics, MMN performs better than SMN, especially the ACC. Actually, the network fusion method used in this study uses uniform weight, i.e., equal weight for each of multiple individual networks. In the future, we will try to develop a weighted mean fusion method, which can assign an optimal weight for each individual network.
Above all, our prediction method MMN performs the best among all those graph-based label prediction methods and it is computationally efficient.
When computing the edge strength for constructing networks, there are two parameters σ,K related to the prediction accuracy. When selecting an optimal σ, we fixed the parameter K (K=20) and evaluated the performances with σ2 ranging from 100 to 950. The results are shown in Table 2 and we can see that when σ2 is too small or too large, the prediction accuracy decreases and MMN performs best with σ2 = 510. When selecting an optimal K, σ2 was fixed at 510 with K changing in the range 7, 9, 12, 18, 20, 37 corresponding to 1/25, 1/20, 1/15, 1/10, 1/9, 1/5 of the number of the samples. As shown in Table 3, our methods achieves the best accuracy with K = 20, which is about 1/10 of whole sample (nodes) size.
5 Conclusions
We combined SNPs, DNA methylation and fMRI data into a single comprehensive network using a simple network fusion approach. A network-based label prediction method was applied to the network for predicting schizophrenia patient. Compared with other 9 graph-based label prediction approaches, our prediction method shows the best performance. However, our network fusion method used a uniform weight based combination of each network and in the future we will find an optimal weight based method to improve the prediction accuracy. In addition, Kim et al.[19] incorporated genomic knowledge when integrating multi-omics data for predicting the clinical outcome of cancer, which improved the predictive power. Therefore, we will also incorporate other prior knowledge into the construction of patient connection networks.
Conflict of Interest The authors declare no conflict of interest.
Acknowledgment The work are partly supported by the grants: the NIH (R01 GM109068, R01 MH104680, and R01MH107354) and NSF (#1539067). China Postdoctoral Science Foundation Grant, No. 2015M580352.
- S.-P. Deng, D. Lin, V. D. Calhoun, and Y.-P. Wang, “Diagnosing schizophrenia by integrating genomic and imaging data through network fusion,” in Bioinformatics and Biomedicine (BIBM), 2016 IEEE International Conference on. IEEE, 2016, pp. 1307–1313.
- A. P. Association et al., Diagnostic and statistical manual of mental disorders (DSM-5®). American Psychiatric Pub, 2013.
- M. M. Picchioni and R. M. Murray, “Schizophrenia,” BMJ, vol. 335, no. 7610, 2007.
- R. Pies, “How objective are psychiatric diagnoses,” Psychiatry, vol. 4, no. 10, 2007.
- T. Insel, B. Cuthbert, M. Garvey, R. Heinssen, D. S. Pine, K. Quinn, C. Sanislow, and P. Wang, “Research domain criteria (rdoc): toward a new classification framework for research on mental disorders,” American Journal of Psychiatry, vol. 167, no. 7, pp. 748–751, 2010.
- S. Ripke, B. M. Neale, A. Corvin, J. T. Walters, K.- H. Farh, P. A. Holmans, P. Lee, B. Bulik-Sullivan, D. A. Collier, H. Huang et al., “Biological insights from 108 schizophrenia-associated genetic loci,” Nature, vol. 511, no. 7510, p. 421, 2014.
- A. P. M. M. S. Owen, Schizophrenia. London, England: Lancet, 2016.
- S. Ogawa, T.-M. Lee, A. R. Kay, and D. W. Tank, “Brain magnetic resonance imaging with contrast dependent on blood oxygenation,” Proceedings of the National Academy of Sciences, vol. 87, no. 24, pp. 9868–9872, 1990.
- H. Yang, J. Liu, J. Sui, G. Pearlson, and V. D. Calhoun, “A hybrid machine learning method for fusing fmri and genetic data: combining both improves classification of schizophrenia,” Frontiers in human neuroscience, vol. 4, p. 192, 2010.
- E. Castro, V. Gómez-Verdejo, M. Martínez Ramón, K. A. Kiehl, and V. D. Calhoun, “A multiple kernel learning approach to perform classification of groups from complex-valued fmri data analysis: Application to schizophrenia,” NeuroImage, vol. 87, pp. 1–17, 2014.
- H. Cao, J. Duan, D. Lin, V. Calhoun, and Y.- P. Wang, “Integrating fmri and snp data for biomarker identification for schizophrenia with a sparse representation based variable selection method,” BMC medical genomics, vol. 6, no. Suppl 3, p. S2, 2013.
- H. Cao, D. Lin, J. Duan, Y.-P. Wang, and V. Calhoun, “Bio marker identification for diagnosis of schizophrenia with integrated analysis of fmri and snps,” in Bioinformatics and Biomedicine (BIBM), 2012 IEEE International Conference on. IEEE, 2012, pp. 1–6.
- D. Lin, H. Cao, Y.-P. Wang, and V. Calhoun, “Classification of schizophrenia patients with combined analysis of snp and fmri data based on sparse representation,” in Bioinformatics and Biomedicine (BIBM), 2011 IEEE International Conference on. IEEE, 2011, pp. 394–397.
- S. Robertson, “What is dna methylation?” new Medical. [Online]. Available: http://www.news-medical.net/life-sciences/What-is-DNA-Methylation.aspx
- T. Shlomi, M. N. Cabili, M. J. Herrgrd, B. Ø. Palsson, and E. Ruppin, “Network-based prediction of human tissue-specific metabolism,” Nature biotechnology, vol. 26, no. 9, pp. 1003–1010, 2008.
- H.-Y. Chuang, E. Lee, Y.-T. Liu, D. Lee, and T. Ideker, “Network-based classification of breast cancer metastasis,” Molecular systems biology, vol. 3, no. 1, p. 140, 2007.
- L. Q. Uddin, K. Supekar, C. J. Lynch, A. Khouzam, J. Phillips, C. Feinstein, S. Ryali, and V. Menon, “Salience network–based classification and prediction of symptom severity in children with autism,” JAMA psychiatry, vol. 70, no. 8, pp. 869–879, 2013.
- K. Tsuda, H. Shin, and B. Scholkopf, “Fast protein classification with multiple networks,” Bioinformatics, vol. 21, no. suppl 2, pp. ii59–ii65, 2005.
- D. Kim, J.-G. Joung, K.-A. Sohn, H. Shin, Y. R. Park, M. D. Ritchie, and J. H. Kim, “Knowledge boosting: a graph-based integration approach with multi-omics data and genomic knowledge for cancer clinical outcome prediction,” Journal of the American Medical Informatics Association, vol. 22, no. 1, pp. 109–120, 2015.
- D. Kim, H. Shin, Y. S. Song, and J. H. Kim, “Synergistic effect of different levels of genomic data for cancer clinical outcome prediction,” Journal of biomedical informatics, vol. 45, no. 6, pp. 1191–1198, 2012.
- R. Sharan, I. Ulitsky, and R. Shamir, “Networkbased prediction of protein function,” Molecular systems biology, vol. 3, no. 1, p. 88, 2007.
- H. Cao, J. Duan, D. Lin, Y. Y. Shugart, V. Calhoun, and Y.-P. Wang, “Sparse representation based biomarker selection for schizophrenia with integrated analysis of fmri and snps,” Neuroimage, vol. 102, pp. 220–228, 2014.
- D. Lin, J. Zhang, J. Li, H. He, H.-W. Deng, and Y.-P. Wang, “Integrative analysis of multiple diverse omics datasets by sparse group multitask regression,” Multi-omic Data Integration, p. 126, 2015.
- R. L. Gollub, J. M. Shoemaker, M. D. King, T. White, S. Ehrlich, S. R. Sponheim, V. P. Clark, J. A. Turner, B. A. Mueller, V. Magnotta et al., “The mcic collection: a shared repository of multimodal, multi-site brain image data from a clinical investigation of schizophrenia,” Neuroinformatics, vol. 11, no. 3, pp. 367–388, 2013.
- J. Liu, J. Chen, S. Ehrlich, E. Walton, T. White, N. Perrone-Bizzozero, J. Bustillo, J. A. Turner, and V. D. Calhoun, “Methylation patterns in whole blood correlate with symptoms in schizophrenia patients,” Schizophrenia bulletin, vol. 40, no. 4, pp. 769–776, 2014.
- J. Liu, M. Morgan, K. Hutchison, and V. D. Calhoun, “A study of the influence of sex on genome wide methylation,” PloS one, vol. 5, no. 4, p. e10028, 2010.
- E. Walton, J. Liu, J. Hass, T. White, M. Scholz, V. Roessner, R. Gollub, V. D. Calhoun, and S. Ehrlich, “Mb-comt promoter dna methylation is associated with working-memory processing in schizophrenia patients and healthy controls,” epigenetics, vol. 9, no. 8, pp. 1101–1107, 2014.
- N. S. Altman, “An introduction to kernel and nearest-neighbor nonparametric regression,” The American Statistician, vol. 46, no. 3, pp. 175–185, 1992.
- O. Chapelle, J. Weston, and B. Scholkopf, “Cluster kernels for semi-supervised learning,” in Advances in neural information processing systems, 2002, pp. 585–592.
- X. Zhu, Z. Ghahramani, J. Lafferty et al., “Semisupervised learning using gaussian fields and harmonic functions,” in ICML, vol. 3, 2003, pp. 912–919.
- M. Belkin, I. Matveeva, and P. Niyogi, “Regularization and semi-supervised learning on large graphs,” in International Conference on Computational Learning Theory. Springer, 2004, pp. 624–638.
- D. Zhou, O. Bousquet, T. N. Lal, J. Weston, and B. Scholkopf, “Learning with local and global consistency,” Advances in neural information processing systems, vol. 16, no. 16, pp. 321–328, 2004.
- H. Shin, K. Tsuda, B. Scholkopf, A. Zien et al., “Prediction of protein function from networks,” in Semi supervised learning. MIT press, 2006, pp. 361–376.
- P. T. Spellman, G. Sherlock, M. Q. Zhang, V. R. Iyer, K. Anders, M. B. Eisen, P. O. Brown, D. Botstein, and B. Futcher, “Comprehensive identification of cell cycle–regulated genes of the yeast saccharomyces cerevisiae by microarray hybridization,” Molecular biology of the cell, vol. 9, no. 12, pp. 3273–3297, 1998.
- E. Segal, M. Shapira, A. Regev, D. Pe’er, D. Botstein, D. Koller, and N. Friedman, “Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data,” Nature genetics, vol. 34, no. 2, pp. 166–176, 2003.
- J. H. Ohn, J. Kim, and J. H. Kim, “Genomic characterization of perturbation sensitivity,” Bioinformatics, vol. 23, no. 13, pp. i354–i358, 2007.
- S.-P. Deng, D. Lin, V. D. Calhoun, and Y.-P. Wang, “Predicting schizophrenia by fusing networks from snps, dna methylation and fmri data,” in Engineering in Medicine and Biology Society (EMBC), 2016 IEEE 38th Annual International Conference of the. IEEE, 2016, pp. 1447–1450.
- B. Wang, A. M. Mezlini, F. Demir, M. Fiume, Z. Tu, M. Brudno, B. Haibe-Kains, and A. Goldenberg, “Similarity network fusion for aggregating data types on a genomic scale,” Nature methods, vol. 11, no. 3, pp. 333–337, 2014.
- A. Y. Ng, M. I. Jordan, Y. Weiss et al., “On spectral clustering: Analysis and an algorithm,” Advances in neural information processing systems, vol. 2, pp. 849–856, 2002.
- Y.-C. Wei and C.-K. Cheng, “Towards efficient hierarchical designs by ratio cut partitioning,” in Computer Aided Design, 1989. ICCAD-89. Digest of Technical Papers., 1989 IEEE International Conference on. IEEE, 1989, pp. 298–301.
- C. Cortes and V. Vapnik, “Support-vector networks,” Machine learning, vol. 20, no. 3, pp. 273–297, 1995.
- X. Zhu and Z. Ghahramani, “Learning from labeled and unlabeled data with label propagation,” Citeseer, Tech. Rep., 2002.
- U. Von Luxburg, “A tutorial on spectral clustering,” Statistics and computing, vol. 17, no. 4, pp. 395–416, 2007.
- A. W. Wolfe, “Social network analysis: Methods and applications,” American Ethnologist, vol. 24, no. 1, pp. 219–220, 1997.
- Surapol Vorapatratorn, Nontawat Thongsibsong, "AI-Based Photography Assessment System using Convolutional Neural Networks", Advances in Science, Technology and Engineering Systems Journal, vol. 10, no. 2, pp. 28–34, 2025. doi: 10.25046/aj100203
- Joshua Carberry, Haiping Xu, "GPT-Enhanced Hierarchical Deep Learning Model for Automated ICD Coding", Advances in Science, Technology and Engineering Systems Journal, vol. 9, no. 4, pp. 21–34, 2024. doi: 10.25046/aj090404
- Pui Ching Wong, Shahrum Shah Abdullah, Mohd Ibrahim Shapiai, "Double-Enhanced Convolutional Neural Network for Multi-Stage Classification of Alzheimer’s Disease", Advances in Science, Technology and Engineering Systems Journal, vol. 9, no. 2, pp. 09–16, 2024. doi: 10.25046/aj090202
- Sutham Satthamsakul, Ari Kuswantori, Witsarut Sriratana, Worapong Tangsrirat, Taweepol Suesut, "Landmarking Technique for Improving YOLOv4 Fish Recognition in Various Background Conditions", Advances in Science, Technology and Engineering Systems Journal, vol. 8, no. 3, pp. 100–107, 2023. doi: 10.25046/aj080312
- Sathyabama Kaliyapillai, Saruladha Krishnamurthy, Thiagarajan Murugasamy, "An Ensemble of Voting- based Deep Learning Models with Regularization Functions for Sleep Stage Classification", Advances in Science, Technology and Engineering Systems Journal, vol. 8, no. 1, pp. 84–94, 2023. doi: 10.25046/aj080110
- Fatima-Zahra Elbouni, Aziza EL Ouaazizi, "Birds Images Prediction with Watson Visual Recognition Services from IBM-Cloud and Conventional Neural Network", Advances in Science, Technology and Engineering Systems Journal, vol. 7, no. 6, pp. 181–188, 2022. doi: 10.25046/aj070619
- Bougar Marieme, Ziyati El Houssaine, "Analysis Methods and Classification Algorithms with a Novel Sentiment Classification for Arabic Text using the Lexicon-Based Approach", Advances in Science, Technology and Engineering Systems Journal, vol. 7, no. 3, pp. 12–18, 2022. doi: 10.25046/aj070302
- Mostafa Yacoub, Amira Rezk, Mohamed Senousy, "Enhancing Decision Trees for Data Stream Mining", Advances in Science, Technology and Engineering Systems Journal, vol. 6, no. 5, pp. 330–334, 2021. doi: 10.25046/aj060537
- Valerii Dmitrienko, Serhii Leonov, Aleksandr Zakovorotniy, "New Neural Networks for the Affinity Functions of Binary Images with Binary and Bipolar Components Determining", Advances in Science, Technology and Engineering Systems Journal, vol. 6, no. 4, pp. 91–99, 2021. doi: 10.25046/aj060411
- Susanto Kumar Ghosh, Mohammad Rafiqul Islam, "Convolutional Neural Network Based on HOG Feature for Bird Species Detection and Classification", Advances in Science, Technology and Engineering Systems Journal, vol. 6, no. 2, pp. 733–745, 2021. doi: 10.25046/aj060285
- Alisson Steffens Henrique, Anita Maria da Rocha Fernandes, Rodrigo Lyra, Valderi Reis Quietinho Leithardt, Sérgio D. Correia, Paul Crocker, Rudimar Luis Scaranto Dazzi, "Classifying Garments from Fashion-MNIST Dataset Through CNNs", Advances in Science, Technology and Engineering Systems Journal, vol. 6, no. 1, pp. 989–994, 2021. doi: 10.25046/aj0601109
- Mandlenkosi Shezi, Abejide Ade-Ibijola, "Deaf Chat: A Speech-to-Text Communication Aid for Hearing Deficiency", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 5, pp. 826–833, 2020. doi: 10.25046/aj0505100
- Mohammed Hamim, Ismail El Moudden, Hicham Moutachaouik, Mustapha Hain, "Gene Selection for Cancer Classification: A New Hybrid Filter-C5.0 Approach for Breast Cancer Risk Prediction", Advances in Science, Technology and Engineering Systems Journal, vol. 6, no. 1, pp. 871–878, 2021. doi: 10.25046/aj060196
- Reem Bayari, Ameur Bensefia, "Text Mining Techniques for Cyberbullying Detection: State of the Art", Advances in Science, Technology and Engineering Systems Journal, vol. 6, no. 1, pp. 783–790, 2021. doi: 10.25046/aj060187
- Anass Barodi, Abderrahim Bajit, Taoufiq El Harrouti, Ahmed Tamtaoui, Mohammed Benbrahim, "An Enhanced Artificial Intelligence-Based Approach Applied to Vehicular Traffic Signs Detection and Road Safety Enhancement", Advances in Science, Technology and Engineering Systems Journal, vol. 6, no. 1, pp. 672–683, 2021. doi: 10.25046/aj060173
- Inna Valieva, Iurii Voitenko, Mats Björkman, Johan Åkerberg, Mikael Ekström, "Multiple Machine Learning Algorithms Comparison for Modulation Type Classification Based on Instantaneous Values of the Time Domain Signal and Time Series Statistics Derived from Wavelet Transform", Advances in Science, Technology and Engineering Systems Journal, vol. 6, no. 1, pp. 658–671, 2021. doi: 10.25046/aj060172
- Hendro Arieyanto, Andry Chowanda, "Classification of Wing Chun Basic Hand Movement using Virtual Reality for Wing Chun Training Simulation System", Advances in Science, Technology and Engineering Systems Journal, vol. 6, no. 1, pp. 250–256, 2021. doi: 10.25046/aj060128
- Ndiatenda Ndou, Ritesh Ajoodha, Ashwini Jadhav, "A Case Study to Enhance Student Support Initiatives Through Forecasting Student Success in Higher-Education", Advances in Science, Technology and Engineering Systems Journal, vol. 6, no. 1, pp. 230–241, 2021. doi: 10.25046/aj060126
- Arwa Alshamsi, Reem Bayari, Said Salloum, "Sentiment Analysis in English Texts", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 6, pp. 1683–1689, 2020. doi: 10.25046/aj0506200
- Rewan Kumar Dahal, Ganesh Bhattarai, Dipendra Karki, "Determinants of Technological and Innovation Performance of the Nepalese Cellular Telecommunications Industry from the Customers’ Perspective", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 6, pp. 1013–1020, 2020. doi: 10.25046/aj0506122
- Jinwon Cheon, Sunwoong Choi, "Hand Gesture Classification using Inaudible Sound with Ensemble Method", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 6, pp. 967–971, 2020. doi: 10.25046/aj0506115
- Hao Tuan Huynh, Nghia Duong-Trung, Dinh Quoc Truong, Hiep Xuan Huynh, "Vietnamese Text Classification with TextRank and Jaccard Similarity Coefficient", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 6, pp. 363–369, 2020. doi: 10.25046/aj050644
- Munaf Salim Najim Al-Din, "Real-Time Identification and Classification of Driving Maneuvers using Smartphone", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 6, pp. 193–205, 2020. doi: 10.25046/aj050623
- Fei Gao, Jiangjiang Liu, "Effective Segmented Face Recognition (SFR) for IoT", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 6, pp. 36–44, 2020. doi: 10.25046/aj050605
- Daniyar Nurseitov, Kairat Bostanbekov, Maksat Kanatov, Anel Alimova, Abdelrahman Abdallah, Galymzhan Abdimanap, "Classification of Handwritten Names of Cities and Handwritten Text Recognition using Various Deep Learning Models", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 5, pp. 934–943, 2020. doi: 10.25046/aj0505114
- Gökalp Çınarer, Bülent Gürsel Emiroğlu, Recep Sinan Arslan, Ahmet Haşim Yurttakal, "Brain Tumor Classification Using Deep Neural Network", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 5, pp. 765–769, 2020. doi: 10.25046/aj050593
- Lana Abdulrazaq Abdullah, Muzhir Shaban Al-Ani, "CNN-LSTM Based Model for ECG Arrhythmias and Myocardial Infarction Classification", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 5, pp. 601–606, 2020. doi: 10.25046/aj050573
- Alami Hamza, Noureddine En-Nahnahi, Said El Alaoui Ouatik, "Contextual Word Representation and Deep Neural Networks-based Method for Arabic Question Classification", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 5, pp. 478–484, 2020. doi: 10.25046/aj050559
- Nittaya Kerdprasop, Kittisak Kerdprasop, Paradee Chuaybamroong, "Economic and Environmental Analysis of Life Expectancy in China and India: A Data Driven Approach", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 5, pp. 308–313, 2020. doi: 10.25046/aj050539
- Martin Marinov, Alexander Efremov, "Four-Dimensional Sparse Data Structures for Representing Text Data", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 5, pp. 154–166, 2020. doi: 10.25046/aj050521
- Tran Thanh Dien, Nguyen Thanh-Hai, Nguyen Thai-Nghe, "Deep Learning Approach for Automatic Topic Classification in an Online Submission System", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 4, pp. 700–709, 2020. doi: 10.25046/aj050483
- Panida Lorwongtrakool, Phayung Meesad, "Correlation-Based Incremental Learning Network for Gas Sensors Drift Compensation Classification", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 4, pp. 660–666, 2020. doi: 10.25046/aj050479
- Anouar Bachar, Noureddine El Makhfi, Omar EL Bannay, "Machine Learning for Network Intrusion Detection Based on SVM Binary Classification Model", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 4, pp. 638–644, 2020. doi: 10.25046/aj050476
- Roberta Avanzato, Francesco Beritelli, "A CNN-based Differential Image Processing Approach for Rainfall Classification", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 4, pp. 438–444, 2020. doi: 10.25046/aj050452
- Rizki Jaka Maulana, Gede Putra Kusuma, "Malware Classification Based on System Call Sequences Using Deep Learning", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 4, pp. 207–216, 2020. doi: 10.25046/aj050426
- Krina B. Gabani, Mayuri A. Mehta, Stephanie Noronha, "Racial Categorization Methods: A Survey", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 3, pp. 388–401, 2020. doi: 10.25046/aj050350
- Jan Sikora, David Fojtík, "Classification of Timber Load on Trucks", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 2, pp. 683–687, 2020. doi: 10.25046/aj050284
- Bokyoon Na, Geoffrey C Fox, "Object Classifications by Image Super-Resolution Preprocessing for Convolutional Neural Networks", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 2, pp. 476–483, 2020. doi: 10.25046/aj050261
- Lenin G. Falconi, Maria Perez, Wilbert G. Aguilar, Aura Conci, "Transfer Learning and Fine Tuning in Breast Mammogram Abnormalities Classification on CBIS-DDSM Database", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 2, pp. 154–165, 2020. doi: 10.25046/aj050220
- Michael Wenceslaus Putong, Suharjito, "Classification Model of Contact Center Customers Emails Using Machine Learning", Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 1, pp. 174–182, 2020. doi: 10.25046/aj050123
- Bui Thanh Hung, "Integrating Diacritics Restoration and Question Classification into Vietnamese Question Answering System", Advances in Science, Technology and Engineering Systems Journal, vol. 4, no. 5, pp. 207–212, 2019. doi: 10.25046/aj040526
- Michael Santacroce, Daniel Koranek, Rashmi Jha, "Detecting Malicious Assembly using Convolutional, Recurrent Neural Networks", Advances in Science, Technology and Engineering Systems Journal, vol. 4, no. 5, pp. 46–52, 2019. doi: 10.25046/aj040506
- Antonio Fuduli, Pierangelo Veltri, Eugenio Vocaturo, Ester Zumpano, "Melanoma detection using color and texture features in computer vision systems", Advances in Science, Technology and Engineering Systems Journal, vol. 4, no. 5, pp. 16–22, 2019. doi: 10.25046/aj040502
- Tlija Amira, Istrate Dan, Badii Atta, Gattoufi Said, Bennani Az-eddine, Wegrzyn-Wolska Katarzyna, "Stress Level Classification Using Heart Rate Variability", Advances in Science, Technology and Engineering Systems Journal, vol. 4, no. 3, pp. 38–46, 2019. doi: 10.25046/aj040306
- Abba Suganda Girsang, Andi Setiadi Manalu, Ko-Wei Huang, "Feature Selection for Musical Genre Classification Using a Genetic Algorithm", Advances in Science, Technology and Engineering Systems Journal, vol. 4, no. 2, pp. 162–169, 2019. doi: 10.25046/aj040221
- Bayan AlSaaidah, Waleed Al-Nuaimy, Mohammed Rasoul Al-Hadidi, Iain Young, "Zebrafish Larvae Classification based on Decision Tree Model: A Comparative Analysis", Advances in Science, Technology and Engineering Systems Journal, vol. 3, no. 4, pp. 347–353, 2018. doi: 10.25046/aj030435
- Sehla Loussaief, Afef Abdelkrim, "Machine Learning framework for image classification", Advances in Science, Technology and Engineering Systems Journal, vol. 3, no. 1, pp. 1–10, 2018. doi: 10.25046/aj030101
- Ruijian Zhang, Deren Li, "Applying Machine Learning and High Performance Computing to Water Quality Assessment and Prediction", Advances in Science, Technology and Engineering Systems Journal, vol. 2, no. 6, pp. 285–289, 2017. doi: 10.25046/aj020635
- Mohamed El Beqqal, Mostafa Azizi, "Review on security issues in RFID systems", Advances in Science, Technology and Engineering Systems Journal, vol. 2, no. 6, pp. 194–202, 2017. doi: 10.25046/aj020624
- Muhammad Asif Manzoor, Yasser Morgan, "Support Vector Machine based Vehicle Make and Model Recognition System", Advances in Science, Technology and Engineering Systems Journal, vol. 2, no. 3, pp. 1080–1085, 2017. doi: 10.25046/aj0203137
- Mohamed Salim El Bazzi, Driss Mammass, Abdelatif Ennaji, Taher Zaki, "Features based approach for indexation and representation of unstructured Arabic documents", Advances in Science, Technology and Engineering Systems Journal, vol. 2, no. 3, pp. 900–905, 2017. doi: 10.25046/aj0203112
- Zulfiqar Ali, Waseem Shahzad, "Performance Evaluation of Associative Classifiers in Perspective of Discretization Methods", Advances in Science, Technology and Engineering Systems Journal, vol. 2, no. 3, pp. 845–854, 2017. doi: 10.25046/aj0203105
- Turgay Yalcin, Muammer Ozdemir, "Computational Intelligence Methods for Identifying Voltage Sag in Smart Grid", Advances in Science, Technology and Engineering Systems Journal, vol. 2, no. 3, pp. 412–419, 2017. doi: 10.25046/aj020353
- Adewale Opeoluwa Ogunde, Ajibola Rasaq Olanbo, "A Web-Based Decision Support System for Evaluating Soil Suitability for Cassava Cultivation", Advances in Science, Technology and Engineering Systems Journal, vol. 2, no. 1, pp. 42–50, 2017. doi: 10.25046/aj020105
- Hocine Chebi, Dalila Acheli, Mohamed Kesraoui, "Dynamic detection of abnormalities in video analysis of crowd behavior with DBSCAN and neural networks", Advances in Science, Technology and Engineering Systems Journal, vol. 1, no. 5, pp. 56–63, 2016. doi: 10.25046/aj010510
