About MVIAeval | Construction of MVIAeval | Usage of MVIAeval |
---|---|---|
● Usage |
Missing value imputation is important for microarray data analyses because microarray data with missing values would significantly degrade the performance of the downstream analyses. Although many microarray missing value imputation algorithms have been developed, an objective and comprehensive performance comparison framework is still lacking. Therefore, in our previous paper (Chiu et al. 2013), we proposed a framework which can perform a comprehensive performance comparison of different existing algorithms. Our performance comparison framework can also be applied to evaluate the performance of a newly developed algorithm. However, constructing our framework is not an easy task for the interested researchers. To save researchers time and effort, here we present an easy-to-use web tool named MVIAeval (Missing Value Imputation Algorithm evaluator) which implements our performance comparison framework.
In MVIAeval, we collected 20 benchmark microarray datasets of different species and different types.
GEO Dataset | Dim | Type | Organism | Title |
---|---|---|---|---|
GDS3323 | 45101*6 | Non-time Series | Mus musculus | Na+/H+ exchanger 3 deficiency effect on the colon |
GDS3215 | 12625*6 | Non-time Series | Homo sapiens | 13-cis retinoic acid effect on SEB-1 sebocyte cell line |
GDS3485 | 45011*6 | Non-time Series | Mus musculus | Zinc transporter SLC39A13 deficiency effect on chondrocytes |
GDS3476 | 45011*6 | Non-time Series | Mus musculus | NF-E2-related factor 2 Nrf2 activation effect on the liver |
GDS3197 | 45101*6 | Non-time Series | Mus musculus | Transcriptional coactivator PGC-1beta hypomorphic mutation effect on the liver |
GDS3149 | 45101*6 | Non-time Series | Mus musculus | Suppressor of cytokine signaling 3 deficiency effect on the regenerating liver |
GDS2107 | 15923*6 | Non-time Series | Rattus norvegicus | Long-term ethanol consumption effect on pancreas |
GDS3464 | 15617*6 | Non-time Series | Danio rerio | SPT5 mutant embryos |
GDS3426 | 23015*6 | Non-time Series | Staphylococcus epidermidis | Staphylococcus epidermidis SarZ mutant |
GDS3421 | 10208*6 | Non-time Series | Escherichia coli | Frag1 cells response to ionic and non-ionic hyperosmotic stress |
GDS3360 | 22575*8 | Time Series | Homo sapiens | Chlamydia pneumoniae infection effect on HL epithelial cells: time course |
GDS2863 | 31099*6 | Time Series | Rattus norvegicus | Tienilic acid effect on the liver: time course |
GDS5057 | 34760*8 | Time Series | Mus musculus | Mepenzolate bromide effect on lung: time course |
GDS5055 | 45307*10 | Time Series | Mus musculus | Histone demethylase KDM1A deficiency effect on 3T3-L1 preadipocytes: time course |
GDS3428 | 22283*9 | Time Series | Homo sapiens | Immature dendritic cell response to butanol fraction of Echinacea purpurea: time course |
GDS4484 | 45101*8 | Time Series | Mus musculus | Cerebellar neuronal cell response to thyroid hormone: time course |
GDS3785 | 17589*8 | Time Series | Homo sapiens | Osteoarthritic chondrocytes and healthy mesenchymal stem cell during chondrogenic differentiation: time course |
GDS3930 | 8799*9 | Time Series | Rattus norvegicus | Bone morphogenic protein effect on cultured sympathetic neurons: time course |
GDS4321 | 10208*8 | Time Series | Escherichia coli | Escherichia coli O157:H7 response to cinnamaldehyde: time course |
GDS3032 | 22277*8 | Time Series | Homo sapiens | Quercetin effect on intestinal cell differentiation in vitro: time course |
In addition, we implemented 12 existing algorithms including two global approach algorithms and 10 local approach algorithms.
Algorithm | Category | Year of Published | Reference |
---|---|---|---|
SVD | Gobal | 2001 | [ Troyanskaya et al. 2001 ] |
BPCA | Gobal | 2003 | [ Oba et al. 2003 ] |
KNN | Local | 2001 | [ Troyanskaya et al. 2001 ] |
SKNN | Local | 2004 | [ Kim et al. 2004 ] |
IKNN | Local | 2007 | [ Brás et al. 2007 ] |
LS | Local | 2004 | [ Bø et al. 2004 ] |
LLS | Local | 2005 | [Kim et al. 2005 ] |
ILLS | Local | 2006 | [Cai et al. 2006 ] |
SLLS | Local | 2008 | [Zhang et al. 2008 ] |
Shrinkage LLS | Local | 2013 | [Wang et al. 2013 ] |
Shrinkage SLLS | Local | 2013 | [Wang et al. 2013 ] |
Shrinkage ILLS | Local | 2013 | [Wang et al. 2013 ] |
Performance Index | Benchmark Datasets | Ranking of USER Using ORS | Ranking of USER Using ONS | Detail |
---|---|---|---|---|
1/NRMSE | Five Time Series: GDS3360,GDS2863,GDS5057,GDS5055,GDS3428 | 5 | 6 | Detail |
Five Non-time Series: GDS3323,GDS3215,GDS3485,GDS3476,GDS3197 | 6 | 6 | Detail | |
CPP | Five Time Series: GDS3360,GDS2863,GDS5057,GDS5055,GDS3428 | 7 | 9 | Detail |
Five Non-time Series: GDS3323,GDS3215,GDS3485,GDS3476,GDS3197 | 11 | 8 | Detail | |
BLCI | Five Time Series: GDS3360,GDS2863,GDS5057,GDS5055,GDS3428 | 3 | 4 | Detail |
Five Non-time Series: GDS3323,GDS3215,GDS3485,GDS3476,GDS3197 | 7 | 7 | Detail | |
1/NRMSE+CPP+BLCI | Five Time Series: GDS3360,GDS2863,GDS5057,GDS5055,GDS3428 | 6 | 7 | Detail |
Five Non-time Series: GDS3323,GDS3215,GDS3485,GDS3476,GDS3197 | 6 | 6 | Detail |