File format of a standard data matrix

Updated by Fengfeng Zhou, 2017-05-28

Most data mining algorithms accept two plain-text files as input.

The first file holds the data matrix, with one feature per row and one sample per column. The columns are separated by either a TAB (file suffix: tsv) or a comma (file suffix: csv):

  SampleID1 SampleID2 SampleID3 SampleID4
Feature1 value11 value12 value13 value14
Feature2 value21 value22 value23 value24
Feature3 value31 value32 value33 value34
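
A minimal Python sketch of reading such a matrix, using only the standard csv module. The function name read_data_matrix, the sample and feature names in the example string, and the assumption that every value is numeric are illustrative and not part of the format description. It also assumes the header row starts with an empty cell above the feature names, as in the layout above.

```python
import csv
import io


def read_data_matrix(handle, delimiter="\t"):
    """Parse a feature-by-sample matrix.

    Returns (sample_ids, feature_ids, rows), where rows[i][j] is the
    value of feature i in sample j.
    """
    reader = csv.reader(handle, delimiter=delimiter)
    header = next(reader)
    # Assumption: the first header cell is the empty corner cell.
    sample_ids = header[1:]
    feature_ids, rows = [], []
    for record in reader:
        if not record:  # skip blank lines
            continue
        feature_ids.append(record[0])
        # Assumption: all values are numeric.
        rows.append([float(value) for value in record[1:]])
    return sample_ids, feature_ids, rows


# Hypothetical example content. A real file would be opened with
# open(path, newline="") and delimiter="," for a csv file.
example = (
    "\tS1\tS2\tS3\n"
    "F1\t1.0\t2.0\t3.0\n"
    "F2\t4.0\t5.0\t6.0\n"
)
sample_ids, feature_ids, rows = read_data_matrix(io.StringIO(example))
```

Many learning libraries expect samples as rows, so the parsed rows may need to be transposed before use, for example with list(zip(*rows)).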

The second file defines the class label of each sample, one sample per row, and may carry other annotation columns:

  Class Other annotations
SampleID1 P value1
SampleID2 N value2
SampleID3 N value3
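
A minimal Python sketch of reading the class label file and ordering the labels to match the columns of the data matrix. The function name read_class_labels, the "Batch" annotation column, and the sample names are hypothetical. The sketch assumes the class label sits in the second column.

```python
import csv
import io


def read_class_labels(handle, delimiter="\t"):
    """Map each sample ID (first column) to its class label (second column)."""
    reader = csv.reader(handle, delimiter=delimiter)
    next(reader)  # skip the header row
    labels = {}
    for record in reader:
        if len(record) < 2:  # skip blank or incomplete lines
            continue
        labels[record[0]] = record[1]
    return labels


# Hypothetical example content; "Batch" stands in for any other annotation.
example = (
    "\tClass\tBatch\n"
    "S1\tP\ta\n"
    "S2\tN\tb\n"
    "S3\tN\ta\n"
)
labels = read_class_labels(io.StringIO(example))

# Order the labels like the matrix columns and report unlabeled samples.
matrix_samples = ["S1", "S2", "S3", "S4"]
ordered = [labels.get(sample) for sample in matrix_samples]
missing = [sample for sample in matrix_samples if sample not in labels]
```

Checking for missing samples before training is a cheap safeguard, since the two files are maintained separately and can drift apart.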