File format of a standard data matrix
Updated by Fengfeng
Zhou, 2017-05-28
Most of the data mining algorithms accept two TEXT files as inputs
One file has the data matrix, and the columns are separated by either TAB (file suffix: tsv) or comma (file suffix: csv):
|
|
SampleID1 |
SampleID2 |
SampleID3 |
SampleID4 |
Feature1 |
value11 |
value12 |
value13 |
value14 |
Feature2 |
value21 |
value22 |
value23 |
value24 |
Feature3 |
value31 |
value32 |
value33 |
value34 |
|
|
And the class labels of the samples are defined in a separate file.
|
|
Class |
Other annotations |
SampleID1 |
P |
value1 |
SampleID2 |
N |
value2 |
SampleID3 |
N |
value3 |
|
|