Download molegro virtual docker
Transcript
13 Data Analyzer page 182/321 Figure 125: Creating a subset using a random selection of records. Create Subset Using 'Subset' Column Subsets can also be created from the subset identifiers listed in the 'subset' column (if available) using the Create Subset using 'Subset' Column... menu invoked from the Preparation menu. This option can be used to create subsets based on a clustering of a given dataset since the cluster association for each record is provided in the 'subset' column. From the sub-menu it is possible to select how the new subset should be created. The options available are identical to the ones described in the 'Creating Subsets From Selected Rows' section. Figure 126: Creating a subset using 'subset' column. It is possible to choose the number of records to extract for each subset identifier that should be part of the new subset. The maximum number of records that can be extracted for each subset identifier corresponds to the number of records of the subset identifier with the lowest number of records (to ensure that the same number of records are extracted for each subset identifier). The new subset containing the randomly selected records is created when pressing the OK button. 13.12 Dataset Scaling and Normalization Numerical columns can be scaled or normalized using the Scale and molegro virtual docker – user manual