Package weka.core.converters
Class ArffLoader
java.lang.Object
weka.core.converters.AbstractLoader
weka.core.converters.AbstractFileLoader
weka.core.converters.ArffLoader
- All Implemented Interfaces:
Serializable,BatchConverter,FileSourcedConverter,IncrementalConverter,Loader,URLSourcedLoader,EnvironmentHandler,RevisionHandler
public class ArffLoader
extends AbstractFileLoader
implements BatchConverter, IncrementalConverter, URLSourcedLoader
Reads a source that is in arff (attribute relation
file format) format.
- Version:
- $Revision: 14534 $
- Author:
- Mark Hall (mhall@cs.waikato.ac.nz), FracPete (fracpete at waikato dot ac dot nz)
- See Also:
-
Nested Class Summary
Nested ClassesModifier and TypeClassDescriptionstatic classReads data from an ARFF file, either in incremental or batch mode.Nested classes/interfaces inherited from interface weka.core.converters.Loader
Loader.StructureNotReadyException -
Field Summary
FieldsFields inherited from interface weka.core.converters.Loader
BATCH, INCREMENTAL, NONE -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionReturn the full data set.Returns a description of the file type.Get the file extension used for arff filesString[]Gets all the file extensions used for this type of filegetNextInstance(Instances structure) Read the data set incrementally---get the next instance in the data set or returns null if there are no more instances to get.booleanGet whether to retain the values of string attributes in memory (in the header) when reading incrementally.Returns the revision string.Determines and returns (if possible) the structure (internally the header) of the data set as an empty set of instances.Returns a string describing this Loaderstatic voidMain method.voidreset()Resets the Loader ready to read a new data set or the same data set again.Tool tip text for this propertyget the File specified as the sourceReturn the current urlvoidsets the source FilevoidsetRetainStringVals(boolean retain) Set whether to retain the values of string attributes in memory (in the header) when reading incrementally.voidsetSource(InputStream in) Resets the Loader object and sets the source of the data set to be the supplied InputStream.voidResets the Loader object and sets the source of the data set to be the supplied url.voidSet the url to load fromMethods inherited from class weka.core.converters.AbstractFileLoader
getUseRelativePath, runFileLoader, setEnvironment, setSource, setUseRelativePath, useRelativePathTipTextMethods inherited from class weka.core.converters.AbstractLoader
setRetrieval
-
Field Details
-
FILE_EXTENSION
the file extension -
FILE_EXTENSION_COMPRESSED
-
-
Constructor Details
-
ArffLoader
public ArffLoader()
-
-
Method Details
-
globalInfo
Returns a string describing this Loader- Returns:
- a description of the Loader suitable for displaying in the explorer/experimenter gui
-
retainStringValsTipText
Tool tip text for this property- Returns:
- the tool tip for this property
-
setRetainStringVals
public void setRetainStringVals(boolean retain) Set whether to retain the values of string attributes in memory (in the header) when reading incrementally.- Parameters:
retain- true if string values are to be retained in memory when reading incrementally
-
getRetainStringVals
public boolean getRetainStringVals()Get whether to retain the values of string attributes in memory (in the header) when reading incrementally.- Returns:
- true if string values are to be retained in memory when reading incrementally
-
getFileExtension
Get the file extension used for arff files- Specified by:
getFileExtensionin interfaceFileSourcedConverter- Returns:
- the file extension
-
getFileExtensions
Gets all the file extensions used for this type of file- Specified by:
getFileExtensionsin interfaceFileSourcedConverter- Returns:
- the file extensions
-
getFileDescription
Returns a description of the file type.- Specified by:
getFileDescriptionin interfaceFileSourcedConverter- Returns:
- a short file description
-
reset
Resets the Loader ready to read a new data set or the same data set again.- Specified by:
resetin interfaceLoader- Overrides:
resetin classAbstractFileLoader- Throws:
IOException- if something goes wrong
-
setSource
Resets the Loader object and sets the source of the data set to be the supplied url.- Parameters:
url- the source url.- Throws:
IOException- if an error occurs
-
retrieveFile
get the File specified as the source- Specified by:
retrieveFilein interfaceFileSourcedConverter- Overrides:
retrieveFilein classAbstractFileLoader- Returns:
- the source file
-
setFile
sets the source File- Specified by:
setFilein interfaceFileSourcedConverter- Overrides:
setFilein classAbstractFileLoader- Parameters:
file- the source file- Throws:
IOException- if an error occurs
-
setURL
Set the url to load from- Specified by:
setURLin interfaceURLSourcedLoader- Parameters:
url- the url to load from- Throws:
IOException- if the url can't be set.
-
retrieveURL
Return the current url- Specified by:
retrieveURLin interfaceURLSourcedLoader- Returns:
- the current url
-
setSource
Resets the Loader object and sets the source of the data set to be the supplied InputStream.- Specified by:
setSourcein interfaceLoader- Overrides:
setSourcein classAbstractLoader- Parameters:
in- the source InputStream.- Throws:
IOException- always thrown.
-
getStructure
Determines and returns (if possible) the structure (internally the header) of the data set as an empty set of instances.- Specified by:
getStructurein interfaceLoader- Specified by:
getStructurein classAbstractLoader- Returns:
- the structure of the data set as an empty set of Instances
- Throws:
IOException- if an error occurs
-
getDataSet
Return the full data set. If the structure hasn't yet been determined by a call to getStructure then method should do so before processing the rest of the data set.- Specified by:
getDataSetin interfaceLoader- Specified by:
getDataSetin classAbstractLoader- Returns:
- the structure of the data set as an empty set of Instances
- Throws:
IOException- if there is no source or parsing fails
-
getNextInstance
Read the data set incrementally---get the next instance in the data set or returns null if there are no more instances to get. If the structure hasn't yet been determined by a call to getStructure then method should do so before returning the next instance in the data set.- Specified by:
getNextInstancein interfaceLoader- Specified by:
getNextInstancein classAbstractLoader- Parameters:
structure- the dataset header information, will get updated in case of string or relational attributes- Returns:
- the next instance in the data set as an Instance object or null if there are no more instances to be read
- Throws:
IOException- if there is an error during parsing
-
getRevision
Returns the revision string.- Specified by:
getRevisionin interfaceRevisionHandler- Returns:
- the revision
-
main
Main method.- Parameters:
args- should contain the name of an input file.
-