Package org.apache.mahout.classifier.df.data

Class Summary
Data Holds a list of vectors and their corresponding Dataset.
DataConverter Converts String to Instance using a Dataset
DataLoader Converts the input data to a Vector Array using the information given by the Dataset.
Generates for each line a Vector that contains :
double parsed value for NUMERICAL attributes int value for CATEGORICAL and LABEL attributes
adds an IGNORED first attribute that will contain a unique id for each instance, which is the line number of the instance in the input data
Dataset Contains information about the attributes.
DataUtils Helper methods that deals with data lists and arrays of values
DescriptorUtils Contains various methods that deal with descriptor strings
Instance Represents one data instance.
 

Enum Summary
Dataset.Attribute Attributes type
 

Exception Summary
DescriptorException Exception thrown when parsing a descriptor
 



Copyright © 2008–2014 The Apache Software Foundation. All rights reserved.