Title: | Machine Learning Assisted Network Inference |
---|---|
Description: | Find dark genes. These genes are often disregarded due to no detected mutation or differential expression, but are important in coordinating the functionality in cancer networks. |
Authors: | Mehrab Ghanat Bari |
Maintainer: | Mehrab Ghanat Bari <[email protected]> |
License: | GPL-3 |
Version: | 1.0 |
Built: | 2025-02-14 04:25:58 UTC |
Source: | https://github.com/cran/malani |
Returns accuracy performance of all genes. G support vector machine (SVM) classifiers trained using G different data matrixes, are used to predict labels in test data. Models are ranked based on prediction performances.
Gsvmod(dat.train, lab.train, dat.test, lab.test)
Gsvmod(dat.train, lab.train, dat.test, lab.test)
dat.train |
Train data with G features and (k-1)*S/k samples. Parameter k comes from cross-validation scheme and is specified by user (default is 2). |
lab.train |
Class labels for train data. |
dat.test |
Test data with G features and S/k samples. |
lab.test |
Class labels for test data. |
Accuracy scores for models. Each model represents one gene.
Train G-1 SVM models in k-fold
cross validation scheme to select initial genes list.
intGenes(dat, grp, nfolds.out = 2, top.per = 0.05)
intGenes(dat, grp, nfolds.out = 2, top.per = 0.05)
dat |
Original gene expression data matrix with G rows (number of genes) and S column (number of samples). |
grp |
Class labels. |
nfolds.out |
Outer cross validation number (default is 2). |
top.per |
All genes are ranked based on their models performance and |
Selected initial genes.
data(malanidata) int <- intGenes(dat,grp) print(int$top.genes)
data(malanidata) int <- intGenes(dat,grp) print(int$top.genes)
A numeric matrix G*S contains gene expressions data. G are the genes (rows) and S are the samples (columns).
malanidata
malanidata
A matrix of numeric values, 100 genes, 20 samples and class labels.
data(malanidata)
data(malanidata)
Combine each gene in initial set with all genes in the original set. Top npair
pairs are selected to construct the Q
matrix.
pairmod(X, LX, theta, npair = 10)
pairmod(X, LX, theta, npair = 10)
X |
Original gene expression data matrix. With G rows (number of genes) and S column (number of samples). |
LX |
Class labels. |
theta |
Initial gene set. |
npair |
Given a gene in initial set, top |
Best (npair*G/20
) performing pairs.