Input data format
Input data files should be only txt file (not Excel, html or anything elshe!)
You can copy- paste this set into the program as:
The position of Y-values in both tables is:
The default way is (A), with dependent variable (Y) following independent variables (X1-X5). In the case (B) you should use REVERSED=1. The first line of data should indicate number of rows (data entries) that are available in the data for the training data set.
Suppose, you want to use 2 last rows as a test set. This can be done by :
The program will know that there are two data set. The first one will be used for training (and in general, always the first) and the second one to test the algorithm performance. Up to 10 sets can be added in the same way and only the first set will be used to train the program.
If you do not know the target values of the test set, the first line should be changed to:
If data sets can contains names of data entries, this should be indicated by NAMES=1. An example of the same data set with names is:
You can also see that there is no requirement for alignment of data in columns. The data can be separated with any number of tabs and spaces.See FAQ if you have questions. How to cite this applet? Are you looking for a new job in chemoinformatics?
Copyright 2001 -- 2016 http://www.vcclab.org. All rights reserved.