public class CombineFileSplit extends java.lang.Object implements InputSplit
FileSplit,
CombineFileSplit * class does not represent a split of a file, but a split of input files
into smaller sets. A split may contain blocks from different file but all
the blocks in the same split are probably local to some rack RecordReader's,
with reading one record per file.FileSplit,
CombineFileInputFormat| Constructor and Description |
|---|
CombineFileSplit()
default constructor
|
CombineFileSplit(CombineFileSplit old)
Copy constructor
|
CombineFileSplit(JobConf job,
org.apache.hadoop.fs.Path[] files,
long[] lengths) |
CombineFileSplit(JobConf job,
org.apache.hadoop.fs.Path[] files,
long[] start,
long[] lengths,
java.lang.String[] locations) |
| Modifier and Type | Method and Description |
|---|---|
JobConf |
getJob() |
long |
getLength()
Get the total number of bytes in the data of the
InputSplit. |
long |
getLength(int i)
Returns the length of the ith Path
|
long[] |
getLengths()
Returns an array containing the lengths of the files in the split
|
java.lang.String[] |
getLocations()
Returns all the Paths where this input-split resides
|
int |
getNumPaths()
Returns the number of Paths in the split
|
long |
getOffset(int i)
Returns the start offset of the ith Path
|
org.apache.hadoop.fs.Path |
getPath(int i)
Returns the ith Path
|
org.apache.hadoop.fs.Path[] |
getPaths()
Returns all the Paths in the split
|
long[] |
getStartOffsets()
Returns an array containing the startoffsets of the files in the split
|
void |
readFields(java.io.DataInput in) |
java.lang.String |
toString() |
void |
write(java.io.DataOutput out) |
public CombineFileSplit()
public CombineFileSplit(JobConf job, org.apache.hadoop.fs.Path[] files, long[] start, long[] lengths, java.lang.String[] locations)
public CombineFileSplit(JobConf job, org.apache.hadoop.fs.Path[] files, long[] lengths)
public CombineFileSplit(CombineFileSplit old) throws java.io.IOException
java.io.IOExceptionpublic JobConf getJob()
public long getLength()
InputSplitInputSplit.getLength in interface InputSplitpublic long[] getStartOffsets()
public long[] getLengths()
public long getOffset(int i)
public long getLength(int i)
public int getNumPaths()
public org.apache.hadoop.fs.Path getPath(int i)
public org.apache.hadoop.fs.Path[] getPaths()
public java.lang.String[] getLocations()
throws java.io.IOException
getLocations in interface InputSplitInputSplit is
located as an array of Strings.java.io.IOExceptionpublic void readFields(java.io.DataInput in)
throws java.io.IOException
readFields in interface org.apache.hadoop.io.Writablejava.io.IOExceptionpublic void write(java.io.DataOutput out)
throws java.io.IOException
write in interface org.apache.hadoop.io.Writablejava.io.IOExceptionpublic java.lang.String toString()
toString in class java.lang.ObjectCopyright © 2009 The Apache Software Foundation